Tools designed to detect whether academic writing has been generated by artificial intelligence “inherently discriminate” against non-native English speakers, a study has found.
Researchers tested the performance of seven widely used detectors on 91 essays that had been written by Chinese students as part of their Test of English as a Foreign Language (Toefl) exams. More than half were incorrectly labelled as “AI-generated”, equivalent to an average false-positive rate of 61.3 per cent.
The study, published by Stanford University academics Weixin Liang, Mert Yuksekgonul, Yining Mao, Eric Wu and James Zou in the journal, also analysed the detectors’ performance when presented with 88 eighth-grade essays written by American students and found that these were accurately classified.
“The design of many GPT detectors inherently discriminates against non-native authors, particularly those exhibiting restricted linguistic diversity and word choice,” the authors conclude, adding that they believe the findings emphasise “the need for increased focus on the fairness and robustness of GPT detectors”.
ChatGPT’s emergence late last year sparked the launch of several AI writing detectors, all claiming high degrees of accuracy. Major players such as Turnitin have vied with start-ups and apps created by students to become the go-to detector used by universities concerned about whether students are using AI to cheat in tests.
Detectors use “text perplexity” to spot AI-generated text, the study explains, meaning that they predict what will be the next word in a sentence, mirroring the methods used by the text generators themselves. If words are easy to predict, text perplexity is low and AI is more likely to have been used; if the next word is hard to predict, text perplexity will be high.
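As a rough illustration of the idea, and not the method used by any particular detector, perplexity can be computed as the exponential of the average negative log-probability that a language model assigns to each word that actually appears. The sketch below assumes those per-word probabilities are already available; the function names, the threshold of 10 and the example probabilities are all hypothetical and are not taken from the study.

```python
import math


def perplexity(token_probs):
    """Perplexity of a text, given the probability a language model
    assigned to each word that actually came next."""
    if not token_probs:
        raise ValueError("need at least one token probability")
    # Average negative log-probability per word, then exponentiate.
    avg_neg_log = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(avg_neg_log)


def flag_as_ai(token_probs, threshold=10.0):
    """Hypothetical decision rule: flag text whose perplexity is below
    a threshold. The threshold value is an assumption for illustration."""
    return perplexity(token_probs) < threshold


# Made-up probabilities for illustration only (not data from the study).
predictable = [0.5, 0.4, 0.6, 0.5]     # common, easy-to-guess wording
surprising = [0.05, 0.02, 0.1, 0.04]   # rarer, harder-to-guess wording

print(round(perplexity(predictable), 2), flag_as_ai(predictable))
print(round(perplexity(surprising), 2), flag_as_ai(surprising))
```

Under a rule of this kind, writing that relies on common, predictable wording scores a low perplexity and is flagged, whoever wrote it.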
Because non-native speakers often have a smaller vocabulary and “exhibit less linguistic variability”, they are more likely to be inadvertently penalised, the study finds.
The authors were also able to fool the detectors by prompting ChatGPT to self-edit its text by adding “more literary language” and therefore increasing the text perplexity. This caused detection rates to “plummet to near-zero”.
“The implications of GPT detectors for non-native writers are serious, and we need to think through them to avoid situations of discrimination,” the study concludes.
Potential repercussions include researchers from non-English-speaking countries being excluded from academic conferences or journals that prohibit the use of GPT, it warns.
“Non-native students bear more risks of false accusations of cheating, which can be detrimental to a student’s academic career and psychological well-being,” the paper adds. “Even if the accusation is revoked later, the student’s reputation is already damaged.”
Non-native speakers might also “ironically” be forced to turn to ChatGPT to develop their writing, the study suggests, because it can be used to “refine their vocabulary and linguistic diversity to sound more native”.
In light of the findings, the authors said it was “crucial” that “more robust and equitable methods” be developed by the companies creating AI detectors and that their use in educational settings be curtailed until then.
“Even for native English speakers, linguistic variation across different socioeconomic backgrounds could potentially subject certain groups to a disproportionately higher risk of false accusations,” they warn.
Detectors should not use a “one-size-fits-all approach” and instead be designed in collaboration with users and be benchmarked against diverse writing samples “that reflect the heterogeneity of users”, it adds.
They should also be subjected to “rigorous evaluation”, and users should be made better aware of their potential flaws.








