Categories: NewsEducation

Marking essays by algorithm rewards the wrong skills

<h2><em>This academic is concerned that ACARA’s decision to use algorithmic marking systems to grade NAPLAN essays will train students to follow a formula instead of focussing on producing a sound argument&period;<&sol;em><&sol;h2>&NewLine;<p>Picture this&colon; you have written an essay&period; You researched the topic and carefully constructed your argument&period; You submit your essay online and receive your grade within seconds&period; But how can anyone read&comma; comprehend and judge your essay that quickly&quest;<&sol;p>&NewLine;<p>Well&comma; the answer is no one can&period; Your essay was marked by a computer&period; Would you trust the mark you received&quest; Would you approach your next essay with the same effort and care&quest;<&sol;p>&NewLine;<p>These are <a href&equals;"http&colon;&sol;&sol;www&period;smh&period;com&period;au&sol;comment&sol;smh-editorial&sol;naplan-robomarking-plan-does-not-compute-20171012-gyzpl4&period;html">the questions<&sol;a> that parents&comma; teachers and unions are asking about automated essay scoring &lpar;AES&rpar;&period; The Australian Curriculum&comma; Assessment and Reporting Authority &lpar;ACARA&rpar; proposes to use this program to grade essays&comma; like persuasive writing questions&comma; in its NAPLAN standardised testing scheme for primary and secondary schools&period;<&sol;p>&NewLine;<p>ACARA has <a href&equals;"https&colon;&sol;&sol;www&period;acara&period;edu&period;au&sol;news-and-media&sol;news-details&quest;section&equals;201710120459&num;201710120459">defended its decision<&sol;a> <a href&equals;"http&colon;&sol;&sol;nap&period;edu&period;au&sol;&lowbar;resources&sol;20151130&lowbar;ACARA&lowbar;research&lowbar;paper&lowbar;on&lowbar;online&lowbar;automated&lowbar;scoring&period;pdf">and suggested<&sol;a> that computer-based marking can match or even surpass the consistency of human markers&period;<&sol;p>&NewLine;<p>In my view&comma; this misses the point&period; Computers are unable to genuinely read and understand what a text is about&period; A good argument has little worth when marks are awarded by a structural comparison with other texts and not by judging its ideas&period;<&sol;p>&NewLine;<p>More importantly though&comma; we risk encouraging the writing of text that follows &OpenCurlyDoubleQuote;the script” but essentially says nothing of worth&period; In other words&comma; the writing of &OpenCurlyDoubleQuote;bullshit”&period;<&sol;p>&NewLine;<h2>How does algorithmic marking work&quest;<&sol;h2>&NewLine;<p>It’s not entirely clear how AES functions&comma; but let’s assume&comma; in line with <a href&equals;"https&colon;&sol;&sol;www&period;itnews&period;com&period;au&sol;news&sol;how-australia-plans-to-mark-naplan-with-cognitive-computing-403322">previous announcements<&sol;a>&comma; that it employs a form of <a href&equals;"https&colon;&sol;&sol;research&period;googleblog&period;com&sol;2015&sol;06&sol;inceptionism-going-deeper-into-neural&period;html">machine-learning<&sol;a>&period;<&sol;p>&NewLine;<p>Here’s how that could work&colon; a machine-learning algorithm &OpenCurlyDoubleQuote;learns” from a pool of training data – in this case&comma; <a href&equals;"https&colon;&sol;&sol;www&period;nap&period;edu&period;au&sol;docs&sol;default-source&sol;default-document-library&sol;aes-fact-sheet&period;pdf&quest;sfvrsn&equals;2">it may be<&sol;a> &OpenCurlyDoubleQuote;trained using more than 1&comma;000 NAPLAN writing tests scored by human markers”&period;<&sol;p>&NewLine;<p>But it generally does not learn the criteria by which humans mark essays&period; Rather&comma; machine learning consists of multiple layers of so-called &OpenCurlyDoubleQuote;artificial neurons”&period; These are statistical values that are gradually adjusted during the training period to associate certain inputs &lpar;structural text patterns&comma; vocabulary&comma; key words&comma; semantic structure&comma; paragraphing and sentence length&rpar; with certain outputs &lpar;high grades or low grades&rpar;&period;<&sol;p>&NewLine;<p>When marking a new essay&comma; the algorithm makes a statistical inference by comparing the text with learned patterns and eventually matches it with a grade&period; Yet the algorithm cannot explain why this inference was reached&period;<&sol;p>&NewLine;<p>Importantly&comma; high grades are awarded to papers that show the structural features of highly persuasive writing – papers that follow the &OpenCurlyDoubleQuote;persuasion rulebook”&comma; so to speak&period;<&sol;p>&NewLine;<figure class&equals;"align-center "><img src&equals;"https&colon;&sol;&sol;images&period;theconversation&period;com&sol;files&sol;191137&sol;original&sol;file-20171019-1088-1luuo92&period;jpg&quest;ixlib&equals;rb-1&period;1&period;0&amp&semi;q&equals;45&amp&semi;auto&equals;format&amp&semi;w&equals;754&amp&semi;fit&equals;clip" alt&equals;"" &sol;><figcaption><span class&equals;"caption">A teacher points at a board during a lesson at Stafford State School in Brisbane&comma; Wednesday&comma; Aug&period; 5&comma; 2015&period;<&sol;span> <span class&equals;"attribution"><a class&equals;"source" href&equals;"http&colon;&sol;&sol;one&period;aap&period;com&period;au&sol;&num;&sol;search&sol;naplan&quest;q&equals;&percnt;7B&percnt;22pageSize&percnt;22&colon;25&comma;&percnt;22pageNumber&percnt;22&colon;2&percnt;7D">AAP Image&sol;Dan Peled<&sol;a><&sol;span><&sol;figcaption><&sol;figure>&NewLine;<h2>Rewarding bullshit<&sol;h2>&NewLine;<p>Are the <a href&equals;"http&colon;&sol;&sol;nap&period;edu&period;au&sol;&lowbar;resources&sol;20151130&lowbar;ACARA&lowbar;research&lowbar;paper&lowbar;on&lowbar;online&lowbar;automated&lowbar;scoring&period;pdf">claims by ACARA<&sol;a> that algorithmic marking can match the consistency of human markers wrong&quest; Probably not&comma; but that’s not the issue&period;<&sol;p>&NewLine;<p>It’s possible that machine-learning could reliably award higher grades for those papers that follow the structural script for persuasive writing&period; And it might indeed do this with higher consistency than human markers&period; Examples from other fields show this – for instance&comma; in the <a href&equals;"https&colon;&sol;&sol;www&period;newyorker&period;com&sol;magazine&sol;2017&sol;04&sol;03&sol;ai-versus-md">classification of images in medical diagnosis<&sol;a>&period; It will certainly be quicker and cheaper&period;<&sol;p>&NewLine;<p>But it will not matter what a text is <em>about<&sol;em>&colon; whether the argument is ethical&comma; offensive or outright nonsensical&comma; whether it conveys any coherent ideas or whether it speaks effectively to the intended audience&period;<&sol;p>&NewLine;<p>The only thing that matters is that the text has the right structural patterns&period; In essence&comma; algorithmic marking might reward the writing of &OpenCurlyDoubleQuote;bullshit” – text written with little regard for the subject matter and solely to fulfil the algorithm’s criteria&period;<&sol;p>&NewLine;<p>Not simply lying&comma; analysts use &OpenCurlyDoubleQuote;bullshit” to describe empty talk or meaningless jargon&period; Princeton philosopher Harry Frankfurt argues that <a href&equals;"https&colon;&sol;&sol;www&period;stoa&period;org&period;uk&sol;topics&sol;bullshit&sol;pdf&sol;on-bullshit&period;pdf">talking bullshit<&sol;a> may actually be worse than lying&comma; because the lie at least reaffirms the truth&colon;<&sol;p>&NewLine;<blockquote>&NewLine;<p>It is impossible for someone to lie unless he thinks he knows the truth&period; Producing bullshit requires no such conviction&period; A person who lies is thereby responding to the truth&comma; and he is to that extent respectful of it … For the bullshitter&comma; however&comma; all these bets are off&colon; he is neither on the side of the true nor on the side of the false&period; His eye is not on the facts at all&comma; as the eyes of the honest man and of the liar are&comma; except insofar as they may be pertinent to his interest in getting away with what he says&period;<&sol;p>&NewLine;<p>Unlike humans&comma; algorithms are incapable of truly understanding when something is nonsense rather than genuine ideas and argumentation&period; It doesn’t know whether a text has any worth or relationship to our world at all&period;<&sol;p>&NewLine;<&sol;blockquote>&NewLine;<p>That’s why algorithmic marking&comma; whether in NAPLAN or otherwise&comma; risks rewarding the writing of bullshit&period;<&sol;p>&NewLine;<h2>Encouraging the wrong thing<&sol;h2>&NewLine;<p>Our politics&comma; businesses and media are already flooded with empty arguments and jargon&period; Let’s not reward the skill of writing it&period;<&sol;p>&NewLine;<blockquote>&NewLine;<p>Any application of algorithmic decision-making creates feedback loops&period; It influences future behaviour by rewarding and foregrounding some aspects of human practice and backgrounding others&period;<&sol;p>&NewLine;<&sol;blockquote>&NewLine;<p>This is particularly the case when incentives are tied to the outcomes of algorithmic decision-making&period; In the case of NAPLAN&comma; we know that the government rewards schools that score highly&period; As a result&comma; there is already an entire industry geared towards &OpenCurlyDoubleQuote;cracking the script” of NAPLAN in order to secure high marks&period;<&sol;p>&NewLine;<p>Imagine what happens when students realise that genuine ideas and valid arguments are not rewarded by the algorithm&period;<&sol;p>&NewLine;<p><img class&equals;"alignleft size-full wp-image-5426" src&equals;"https&colon;&sol;&sol;www&period;school-news&period;com&period;au&sol;wp-content&sol;uploads&sol;2016&sol;10&sol;creative-commons&period;png" alt&equals;"creative-commons" width&equals;"88" height&equals;"31" &sol;>This article was written by Kai Riemer&comma; Professor of Information Technology and Organisation&comma; University of Sydney&period; The piece first appeared on <em><a href&equals;"https&colon;&sol;&sol;theconversation&period;com&sol;why-marking-essays-by-algorithm-risks-rewarding-the-writing-of-bullshit-85910">The Conversation&period;<&sol;a><&sol;em><&sol;p>&NewLine;

Explore our latest issue...
Kai Riemer

Recent Posts

Are you teaching out of field? Your input is needed

A study investigating the realities of out-of-field teachers is seeking participants for groundbreaking research.

7 days ago

New resources to support media literacy teaching

The resources are designed to support teachers to make sure all students are engaged in…

7 days ago

Understanding tic disorders: What every school should know

Tic disorders are far more common than many people realise, and are often misrepresented in…

7 days ago

The modern library: More than a book storeroom

The school library has long been a place of discovery, reflection, and learning. But as…

7 days ago

Build a strong school community to prevent bullying

Is your school an inclusive community that empowers students to recognise bullying and to stand…

7 days ago

Government school enrolments at 10-year low

Performance indicators for the education and VET sectors have just been released with some encouraging…

2 weeks ago

This website uses cookies.