Exam grades are predicted to plummet this year, according to reports, after the Government indicated a wish to return to pre-pandemic levels. UK schools have seen rampant grade inflation since 2020, when formal exams were replaced by teacher assessment — a measure that is both more subjective and carries obvious incentives to err on the side of generosity.
Now Ofqual, the Government’s exam standards regulator, has told examiners to aim for approximately the proportion of top grades achieved in 2019. The result is expected to be 100,000 fewer A and A* grades.
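A back-of-envelope check on that figure, with assumed numbers rather than anything from the reports (a sketch in Python; the entry count and grade shares below are approximations, not sourced data):

entries = 800_000      # approximate annual A-level entries (assumption)
share_peak = 0.36      # approximate A/A* share after the inflation (assumption)
share_2019 = 0.25      # approximate pre-pandemic A/A* share (assumption)
print(f"{entries * (share_peak - share_2019):,.0f} fewer top grades")
# ~88,000 on these assumed figures, in the same region as the reported 100,000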
This is hard luck for any young person sitting exams this year, who will face a higher bar for the same grades than those a mere 12 months older. But I also wonder what the tutelary effect on this academic cohort will be of seeing the lie given so starkly to the supposed objectivity of the main measure of their academic success. How would it feel to see a purportedly detached evaluation mechanism, with significant implications for their future prospects, casually revealed as a political football — or even to experience the downgrade of their own life chances as a consequence?
Of course, this isn’t to say that exam results are wholly political and arbitrary, or that there’s no relation at all between study and grade. But, in principle at least, if what they measured was genuinely a “standard” of attainment, one would expect results to fluctuate more over the years. Moreover, the proportion of top grades would sometimes go down as well as up, without any need for prompting from the standards quango.
But, as is well-documented, this doesn’t happen. Instead, nigh on everyone, from exam boards to teachers to students to education ministers themselves, has an interest in seeing the line go up — and so it does, except when the quango says it shouldn’t.
This has given exam statistics a post-truth edge at least since I was at school, which is some time ago now. But I can’t imagine how dispiriting it would be as a school leaver now, to be confronted with the fact that an apparently objective metric with wide-ranging consequences for one’s individual life is significantly governed by the consequences of its contribution to an abstract measure called “exam data”.
It’s hilariously naive to say so now, but when I was at school I believed that there was a fixed threshold for each grading. The first time I became aware that overall results were improving year on year, it was a rather discombobulating experience.
This used to be the standard. Top 10% got the top grade, next 10% got the next etc.
But how is that fair? Some years would require you to score 90% to be in the top 10% and acquire the top grade, whereas fluctuations the following year may mean you only need to score 80% for a top grade. You could end up with students a year apart where one scored higher than the other but received a lower grade.
Yes. And objectively it would be fairer than what we have now with top marks given out to average students who are over-prepped for exams.
…it was fair because it didn’t assume that everybody was getting smarter every year, because there is no empirical evidence that we are…but it knew that the quality of exam setting WOULD definitely vary…because exams are set by people, with all kinds of quirks and foibles…
…so the aim was to place people in the right percentile within their age cohort, confident that the top ten percent each year would be about as smart as those who went before, and those who followed after.
Your example actually shows that it worked exactly as it should have done…in year 1, the top 10% got 90% because the exam was comparatively easy…the next year the top 10% only got 80%, because the exam was harder…but you still identified the top 10%, and accurately placed all the candidates according to comparative ability within their cohort.
The marks don’t matter, and in fact we were not even told what they were beyond a pretty wide range…but we did know where we stood against our peers…and how fitted we were for academic life…as did Universities, Colleges and Employers…
How on earth is that to be achieved with everybody getting 3 A*s?
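To make the norm-referencing described in this thread concrete, here is a minimal sketch (Python; the cohorts and the neat decile cut-offs are invented for illustration, not how any exam board actually set boundaries):

def grade(score, cohort):
    # Rank within the cohort decides the grade, not the raw mark itself.
    pct = 100 * sum(s <= score for s in cohort) / len(cohort)
    return "A" if pct >= 90 else "B" if pct >= 80 else "C" if pct >= 70 else "D"

easy_year = [55, 62, 70, 78, 85, 88, 90, 91, 93, 97]  # easy paper: an A needs 97
hard_year = [30, 41, 48, 52, 60, 63, 66, 70, 74, 80]  # hard paper: an A needs 80
print(grade(97, easy_year), grade(80, hard_year))     # A A: same rank, same grade
print(grade(88, easy_year), grade(88, hard_year))     # D A: same mark, different grade

The same raw mark lands in different grades depending on the cohort, while the top of each cohort earns the top grade whatever raw score the paper happened to demand, which is exactly the 90%/80% behaviour described above.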
But that doesn’t answer my question. You could have somebody who outperforms another who is in the year below, but end up with a lower grade simply because of the achievement of their peers in the same age group. Exams should tell you how good you are against everybody else, not just compare you to others who happen to be the same age
No they shouldn’t. That’s nonsense. Common sense and experience tell us that achievement varies hardly at all between years anyway. And there really is little need to “measure” that in any case – and if you do, you’ll likely just be measuring noise.
RSF’s comment explains perfectly the main purpose of the exam grades. And it’s not “fairness”. Whatever that means. It’s objective ranking of students by competence in order to allocate places at the next level of education in a “fair” and objective way.
Statistically you will not have such large fluctuations between years if the exams are competently designed (as they used to be). I think this is an imaginary problem. It certainly was one in the past – and should be again.
The goal of the grading should be to sort the students into % achievement buckets and not to attempt to measure some absolute achievement level. The grades are principally to help decide who qualifies for which place at the next level of education. Inflate all the grades and you’re no longer able to do that quite so “fairly”.
In any case, the goal is to be objective and not “fair” (which can mean whatever people want it to mean and is never defined).
At many UK universities, more than 40% of undergraduates leave with a first class degree. It’s absurd.
And 50 years ago some of that 40% would have struggled to get good enough O levels to go on to 6th form education
As someone who has taught in secondary education and been a teacher trainer I can honestly say that exams and testing (as they exist in their current form) say nothing worthwhile about a pupil beyond whether they can memorize packets of information or formulate an answer in appropriate ‘exam’ language. In short, the process has become mindless. The education system needs a serious rethink and parents need to be more involved in educating their own children i.e. don’t leave it wholly up to school teachers.
Yes to parents being more involved: so give us an option to receive the funding that goes to the system for each child.
All teacher assessment and coursework-based assessment now lies face down in the dust – the LLMs are here. I should also mention that the chances of detecting what was generated by software versus created by the candidate are pretty much zero; output of that sort cannot be watermarked. So all existing forms of assessment are now moot – and different skillsets now arise and will be successful – for a while. And a while after that, nothing any human does will hold any value, because the algorithms will outdo anything any humans (individually or as a group) are capable of. If you don’t believe me, look at AlphaFold. And my pessimistic timeline for all this is: all of it, this decade, probably in under five years.
Welcome, you have entered the eye of the whirlwind.
How we adapt as individuals (and therefore as a species) to the threats we’re creating will depend on wisdom that can’t be taught, only acquired over a long period of time. The current education/exam systems are woefully inadequate in preparing young people with a baseline from which to start, especially where, as MH points out, the system is manipulated and can lead to early disillusionment. But therein may lie the beginnings of wisdom.
There’s an interesting juxtaposition between the “flexible” employee required by corporations in today’s Plastic Woman article and the kind of self-realisation required to flourish in “the eye of the whirlwind”, or at least avoid being blown away.
‘….So all existing forms of assessment are now moot – and different skillsets now arise and will be successful – for a while.’ Yes, Prashant. Universities – if my local one is typical – now allow ‘take home’ exams done over a day with no restriction on Internet use or other recourse to outside sources. The argument given is that this replicates the real-world skill of using sources such as the Internet in producing a paper or report. However, as you say, this is a different skillset, and deprives assessors of the chance to find out how much a candidate actually knows, and whether the structure of an answer is one they have developed themselves as against one provided by the Internet or an AI bot. As you also say, this will only be successful ‘for a while’. Nations whose young people are assessed more rigorously, and whose memories and analytical skills have not been dulled by reliance on computer-provided texts, will overtake us, and the limitations of our school-leavers and graduates will be even more painfully exposed than at present.
Instructors give them a whole day to fill out a take-home exam? Surely this is a waste of time when AI will take care of the exam in seconds. Perhaps the faculty are factoring in the time it takes students to travel to and from the university.
Perceptive as usual PK. What do you think the effect will be on universities when it becomes impossible in most cases to distinguish student work from AI generated text and answers? Will there be any way to assess whether students have learned anything? I can imagine examinations in some STEM subjects might tell the tale if they are done by hand without computers nearby. As for the humanities, how can proficiency be tested without essays and persuasive papers? The classics are too good for the Academy; it doesn’t deserve them, so it doesn’t concern me if Lit Schools at universities suffer or close, but I do wonder what to be on the look-out for as things play out. Any ideas?
As things stand, there is a rabbit-in-the-headlights reaction across many, many sectors, Education not least, and I expect chaos to ensue in short order.
The established model of how Education is delivered to young people was already under assault from the ease of getting away with internet-based plagiarism (except at the edges), and this was combatted by software like Turnitin etc. for coursework, but the LLMs make it pretty much impossible to detect copying. Even in STEM, for example in software courses, it is routine to ask candidates to create a large program as a project across many weeks, but the LLMs can now of course do most of the heavy lifting. I’m guessing that the providers of the most capable LLMs will now come under pressure to put watermarks in their output words, such that checksums across sections of the output might allow black-box software to know what is software-generated. But even assuming this is technically possible, in truth it is trivially easy to solicit completely unique output from the LLMs to any question you care to ask them, and then manually tweak the output so that the checksums are invalid. Also, the proliferating explosion of very capable open-source LLMs means the genie is out of the bottle.
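On the checksum point specifically, a toy sketch (Python; the five-word window and truncated hash are invented for illustration, not any real watermarking scheme) of why a single manual tweak defeats checksums keyed on local context:

import hashlib

def window_hashes(text, window=5):
    # Hash every run of `window` consecutive words.
    words = text.split()
    return [hashlib.sha256(" ".join(words[i:i + window]).encode()).hexdigest()[:8]
            for i in range(len(words) - window + 1)]

original = "the quick brown fox jumps over the lazy dog tonight"
tweaked = "the quick brown fox leaps over the lazy dog tonight"
changed = sum(a != b for a, b in zip(window_hashes(original), window_hashes(tweaked)))
print(changed, "of", len(window_hashes(original)), "window hashes changed")  # 5 of 6

Every window that contains the edited word now mismatches, so a detector relying on checksums of that sort stops firing after a one-word edit.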
It should be very obvious that the entire sector is now broken and should be rebuilt differently from the ground up, with an eye not on what is there now but on what’s coming. What I instead expect is that they will tinker at every edge they can find, with the left and right coming up with solutions to match their political tastes. For example, I expect the right to push for a lot less coursework and a lot more closed-book and verbal examinations. I expect the left to be dead against this and instead push for regulation of the tech around LLMs, no doubt with calls to nationalise Microsoft and Google.
Regardless, I have no doubt that the top end of governance for Education across the world, in both government and the sector itself, is frantically debating how to react to the LLMs. (And if they aren’t, then of course the whole lot of them should be taken round the back of the bicycle sheds and shot.) What no one (outside of techies in AI research) seems to be projecting is the trajectory of capabilities gain. If the current round of LLMs can do what they do now, and literally tens of billions is being poured into AI research right now, what does everyone think the next LLM but three will be capable of?
Thanks for the detailed reply.
There are deep implications to grading, how it should be used, and the responsibilities it entails.
As a principle, teachers should teach and should not be in charge of credentialling students or awarding diplomas, due to an obvious conflict of interest.
Then how examiners should grade students is another topic for debate, but the success thresholds should be somewhat similar per year, location, school …
A complex but important topic indeed.
Yes, but what is the solution? The latest problem started with COVID; are we supposed to carry on like everything is fine?
Since when was it the job of the Government to direct how many people should be awarded which exam grades?
In the blue corner, there is an alliance of those who abominate above all else the very thought of the schools attended by 94 per cent of the population, and those who will never be satisfied with anything short of confirmation that they had been cheated of the glittering lives to which their obvious genius ought to have entitled them.
And over in the shocking (not pale) pink corner, there are those by whom working-class pupils are twice as likely to be predicted an E grade, and by whom black pupils’ grades are staggeringly under-predicted, with only 39 per cent of predictions turning out to have been correct, while boys are also endemically ill-served.
OMG! So huge bureaucracies continue to exist because the people controlling the money get what they want?
I am just astounded!
Wait, sorry, it reminds me of my schooling, and that was a very long time ago.
It’s long past time to give the money to the parents and have schools compete for their funding.