Rigor

Seriously, what does that even mean? Anything?

and

Sep 13, 2021

If you’re involved in academia in any way, you’ve heard the term rigor. It’s a constant companion in any discussion of grades, assessments, teaching methods, curriculum and course design.

Let’s flip this intro, though: Before you read any farther, what does the word “rigor” mean to you?

Seriously: Take a minute. Actually write down what “rigor” means to you in a classroom setting, what it implies, maybe some examples of what you would consider rigorous or not. Then keep reading once you’re done.

What does rigor mean?

So, what did you write down?

After some extensive asking around, here are the words we most often heard used to describe rigorous courses: “difficult”, “challenging”, “strict”, “high standards”, “C average”, “bell curve”, “gatekeeping”.

If you’re not a fan of anecdata, here are some better-cited examples:

In Specifications Grading, a book that inspired many to rethink assessments, Linda Nilson uses rigor to mean “high academic standards”.
A collection of sentiments that many will recognize, wrapped up nicely by EdGlossary: “instruction, schoolwork, learning experiences, and educational expectations that are academically, intellectually, and personally challenging.”
In “Academic Rigor: A Comprehensive Definition” (written for Quality Matters, a nonprofit focused on measuring and guaranteeing course quality), Andria Foote Schwegler defines academic rigor as: “intentionally crafted and sequenced learning activities and interactions that are supported by research and provide students the opportunity to create and demonstrate their own understanding or interpretation of information and support it with evidence”. There’s a lot going on there.
Dictionaries cover a lot of ground with rigor. Ignoring non-academic meanings1, some relevant definitions include: “the quality of being extremely thorough, exhaustive, or accurate”; “strict precision”; “the quality of being unyielding or inflexible”; “scrupulous or inflexible accuracy or adherence”.
Robert even wrote about this way back in 2008: rigor is “thoroughness, carefulness, and right understanding of the material being learned”, and a rigorous course “examines details, insists on diligent and scrupulous study and performance, and doesn’t settle for a mild or informal contact with the key ideas”. (RT: I will have to think about whether I still agree with this.)

But by far the most common definition for rigor is: none. That is, most articles, books, and random internet conversations about rigor leave the term completely undefined. Rigor seems to be placed beyond definition, left up to the audience to interpret and recognize. You don’t need a definition for rigor — you know it when you see it, and it’s either there or it isn’t.

Or that’s what we’re told, at least. Authors and speakers often refer to the vaguely sinister specter of “lack of rigor” in modern education, a dog-whistle that covers all of the ground from kids-these-days have it too easy and I suffered back in my day and so you must too, to your course doesn’t cover my favorite pet topic, and therefore isn’t good enough. A recent example is “Upholding Rigor at Pandemic U”. If the word “rigor” weren’t in the title, you might not realize that this is what the article is supposedly about; the word “rigor” only appears twice outside the title, both times in the context of “upholding standards of rigor” and without clearly indicating what those standards are — or what rigor itself is. We are apparently supposed to simply know what rigor is, intuitively.

(That article coins the eye-rolling phrase “grace and compassion police” referring to those “who insist faculty shouldn’t demand very much from students”. There’s a lot of gatekeeping going on in this article.)

This is, ultimately, the problem: Rigor is a wildly overloaded word. It means something different to each person, and even instructors with many shared educational values are likely to have different definitions. When two or more people are gathered to talk about rigor, there too shall be ambiguity. These discussions are inevitably surrounded by unexamined assumptions, biases, and cultural baggage.

What can we say?

So, what can we do about this overloaded word? Perhaps the simplest solution is best: Don’t use “rigor” at all. Too much is tied up in the word; it’s a red herring and distraction and a vehicle for our biases.

Instead, let’s pull apart some of the knotted threads that form “rigor” and see what we can say about them. In particular, what concrete things can we say about how different grading systems approach the issues that seem to be indirectly addressed by “rigor”?

In the rest of this article, we’ll take a look at what we can say about the academic standards in assessment systems that embrace the four key pillars.

Clearly defined standards with marks that indicate progress

When student work is evaluated against clearly defined standards, there’s something that doesn’t happen: Comparison to other students. Criterion-referenced assessment gives a clearer and more consistent meaning to grades. This meaning is linked to clear criteria and doesn’t vary depending on how other students perform (aka norm-referenced assessment).

In other words, these two pillars lead to grades that are more meaningful and directly reflect student progress.

Despite the words “rigor” and “standards” often showing up near each other (like in that phrase, “standards of rigor”), actual standards — criteria for what constitute acceptable work on a task, clearly spelled out and accessible to the student — aren’t often a part of traditional grading. Instead, “standards” is often just a proxy for “grade frequencies”. Did your class have “too many” A grades? Not rigorous enough. Was the distribution bell shaped with the mean around C or C+? That’s more like it. To make this happen, instructors judge students against each other by “curving” grades, limiting the number of A’s, or using other wildly inequitable procedures that muddle the meaning of those grades.

Expecting student grades to fit a bell curve is simply not based in reality,2 and enforcing this through curving is the opposite of holding students to a high standard — it’s holding them to an arbitrary standard over which they have no control.

A reasonable objection here is that we haven’t actually said what or how high those standards actually are. Clearly, if we hold students to fluffy and light standards, like a meringue but with less academic meaning, then grading based on those standards isn’t going to fit anybody’s idea of rigor. And indeed, we want to make sure our courses challenge students intellectually and that a high grade is the result of working hard to provide authentic evidence of real learning. Choosing standards, and determining what should be involved in meeting a standard, is a place where instructors can have productive discussion.

But when grading with standards, instructors often overcorrect and expect perfection from students. This inevitably involves aspects of a student’s work that are not central to the idea being assessed, possibly including things like arithmetic, copy errors, or writing style.

If a standard is clear, it must specify what matters and, implicitly or explicitly, what doesn’t matter. And in most cases, what matters is not “everything”. When a student is writing a solution to a math problem, misspelled words or poor punctuation might matter (if communication quality is part of the standard) — but probably not. As long as the solution is understandable, we can judge whether it meets the standard separately from whether it is well communicated. And we probably should.

But wait! If we’re leaving out some parts of a student’s work — such as not “removing points” for spelling or arithmetic errors — doesn’t that mean we’re lowering standards, decreasing rigor? Only if those things are part of the standards being assessed. But then, they should also be part of what is taught in the class. Things that matter should be spelled out clearly in the standards or specifications.

In the end, the clearly defined standards must necessarily ignore some aspects of student work. Standards should make it clear which of these matter, and which don’t, and these will depend on the course in question. This can be a difficult, but essential, part of creating standards for your own classes. What matters in a lower-level course — for example, attention to algebraic detail — may not be as important in a later course with a different focus.

Helpful feedback and reattempts without penalty

The kinds of grading systems we talk about here also critically feature helpful feedback and the ability to revise, resubmit, or reattempt without penalty. Those two pillars look like they reflect lower standards. If we give students lots of feedback, and then give them chances to reattempt their work without penalty, what’s to prevent every student from earning full credit on everything?

Nothing! And, that’s a good thing!3 These two pillars are the core of the feedback loop that makes learning work. In the end, helpful feedback and unpenalized reattempts lead to greater learning, as opposed to one-and-done assessment that incentivizes students to focus only on the grade and flush away content from the brain once it’s been tested.

What if the feedback “gives away” too much of the solution? Then go ahead and give it! But to earn credit, ask the student to solve a similar problem from scratch, and give a detailed explanation of their reasoning. In larger and more involved assignments, feedback allows students to begin a significant revision process. That revision goes far beyond the level of detail that the feedback could give, producing original work that reflects the feedback, but is still generated by the student.

In all cases, some sort of metacognition — such as a reflective cover page on a reattempt — is important to ensure that students have reflected on their learning process. This is something that rarely happens in more traditional approaches to grading.

The goal of these two pillars is for grades to represent a student’s ultimate level of understanding. This reduces confounding factors, like whether a student was feeling ill on the day of a test, whether they were distracted by a personal crisis, whether the room was too cold, and so on. This approach acknowledges that different people learn at different paces and can grow in their understanding. If we really believe this — if we really care about treating students like human beings who can succeed in our classes — then “high academic standards” must have room for feedback loops.

An interesting consequence here: Many people who use this kind of grading notice that the number of A’s and B’s in their class increases. “Grade inflation” is often cited as a consequence of slipping academic standards and decreasing rigor. But here, it’s actually the opposite: By holding students to high standards and allowing reattempts without penalty, we remove grade penalties that aren’t related to learning. Having only one chance to demonstrate understanding penalizes students for not performing on the instructor’s schedule. Partial credit doesn’t make up for this: It still fundamentally represents a one-and-done approach.

When we insist on concrete evidence of learning, every ounce of those high grades can be traced back to an explicit piece of work that meets high standards. When grading with feedback and reattempts, students are no longer permanently penalized when they fail an early test. Instead, if they work to improve their understanding, their grade reflects that fully.

(For more, check out what Robert wrote a few weeks ago in “Specifications grading and ‘help’”.)

Instead of rigor…

We’re not saying that college courses shouldn’t be rigorous. We are saying that the word rigor itself has no inherent meaning and is therefore powerless to describe the kind of learning environment we want. In fact, we — David and Robert, and likely most others in on “Team Alternative Grading” — probably want the same kind of learning environment that many on “Team Traditional Grading” want: an environment where students are pushed and challenged to grow, engage deeply with difficult ideas, and show us in clear terms that they’ve met the challenge. In order to reach this goal, we have to start by using real words.

Rigor is not one of those. It is essentially a buzzword that just happened to appear in academic discourse decades ahead of all the other ones we currently deal with. So let’s stop using it. Instead, we have another word we’d like to propose as a replacement. To find out what that is, tune in next week.

Click here to receive Grading for Growth in your inbox, every Monday.

You don’t have to put your tongue too far into your cheek to see how some of the others could apply: “A condition that makes life difficult, challenging, or uncomfortable” “… often with copious sweating”.

I later wrote much more about this in: Abundance and Scarcity.

https://gradingforgrowth.com/p/abundance-and-scarcityIf you’re unsure about this, try saying “It’s a bad thing that all students have the chance to succeed in my class” out loud.

Grading for Growth