Abstract
It has become increasingly common to endorse and implement anonymous grading as a way of promoting fairness or equality of opportunity in the classroom. The American Philosophical Association currently recommends anonymous grading, as do the Canadian Philosophical Association, the British Philosophical Association, the Society for Women in Philosophy, and Minorities and Philosophy. Despite its increasing prevalence, the practice has received surprisingly little attention in applied ethics. This paper begins filling this gap. I start by clarifying the ‘Standard Argument’ from fairness or equality of opportunity for anonymous grading. I then argue that the Standard Argument fails to support current models for anonymous grading. The issue: current models for anonymous grading are practically self-undermining if the aim is to reduce violations of equality of opportunity or fairness. So, if we want to address bias in grading because we care about equality of opportunity or fairness, we need to change how we do it.
Similar content being viewed by others
Data Availability
This manuscript has no associated data.
Notes
More specifically, names and any other markers that would allow the grader to identify the author.
Interestingly, although anonymous grading is used in other fields, professional academic associations outside of philosophy rarely endorse the practice explicitly. Although much of what I say here will be broadly relevant, I focus on academic philosophy because the professional bodies in the field (a) explicitly recommend anonymous grading and (b) tend to recommend it on common diversity, equity, and inclusion grounds. It is this kind of argument that I’m interested in here.
A notable exception is Saul (2013), who recommends anonymous grading as part of a broader package of recommendations for addressing implicit bias and stereotype threat in philosophy.
It’s worth noting that although there is little published work in philosophy on anonymous grading, the issue has received some attention in influential online forums for academic philosophy; for example, Daily Nous (Weinberg 2015), New APPS (Novaes 2013), and Feminist Philosophers (Anonymous 2012; Saul 2016).
There are other possible formalizations. I choose this one only because it is structurally simple. I expect the argument I make here to apply to any way of precisifying this kind of reasoning.
This kind of argument is sometimes offered in defense of anonymizing procedures in other contexts. See Section 4.
Although, she notes that the success of anonymizing will be context dependent (Saul 2013, 52).
Thank-you to an anonymous reviewer for this point. Note that this kind of favoritism is sometimes called ‘Halo Bias’, see: Malouff et al. (2014).
The effect was small-medium in significance and heterogeneous. Some studies found that members of the targeted groups received lower grades than their peers, some found that they received higher grades.
Interestingly, recent research suggests that this point about (e.g.) ‘brilliance’ no longer tracks to differences in letters of recommendation (Bernstein et al. 2022).
In line with clarifications in Section 1, I give only the equality of opportunity version of the argument here.
We can set aside the question of whether anonymous grading is too costly with respect some other value or is too practically onerous, or so on, because I argue that premise 2* is false on equality of opportunity grounds. If I’m right, the argument fails without appealing to these further considerations.
While it’s true that for any particular author and assignment anonymizing is a binary issue, this is often not true at the class level. By implementing anonymous grading, we may anonymize (say) fourteen out of twenty students. One might wonder whether this is better from an equality of opportunity perspective than not anonymizing at all––after all, some students were successfully anonymized. It’s worth saying a few things about this scenario. First, a situation in which some, but not all, students are anonymized produces inequalities of opportunity with respect to being anonymized itself––which might harm the anonymized or the non-anonymized depending on whether being known to the instructor is a benefit or a harm in context. Second, presumably, the students who are most likely to be de-anonymized are those who reach out for help with their work. If, as seems likely, those students tend to be members of groups that have been disadvantaged by the education system, then it looks likely that the anonymous/non-anonymous line will indicate social group membership––which is a serious concern if we’re implementing anonymous grading to reduce the effects of pernicious social biases. Finally, even if some anonymizing is better than no anonymizing, we should still ask whether we can do better with respect to equality of opportunity. This is my main concern here.
Or potentially fails to anonymize. The issue here isn’t just failure, but contingency of success.
We might wonder whether failed anonymizing is nonetheless useful in reducing the perception of bias and thus creating better teacher-student relationships. Some research suggests this isn’t the case (Pitt and Winstone 2018).
In discussing drafts of this paper, some have thought it unlikely that anyone would go to these lengths to ensure anonymized grading, given the obvious costs of doing so. There are two things I want to say about this. First, note these are the actual grading conditions that obtain at (at least) some institutions in the UK where maintaining anonymous grading is sometimes a matter of university policy. Second, I suspect that this is an intuition in favor of my general point: the inability to support students in these ways would be a very bad consequence of ensuring successful anonymous grading.
Indeed, even putting concerns about recognizing students aside, Whitelegg (2002) found that anonymous grading reduces the quality of feedback that students receive and increases the distance between teachers and students.
Thanks to David O’Brien and Fritz McDonald for pushing me on this.
Regents exams are New York State’s state-level standardized exams.
Thanks to Julia Markovits for suggesting this challenge.
There are different kinds of grading contracts. On the ‘labor-based’ grading contract students automatically get a B– (say) if they complete all coursework to a minimal standard. They achieve higher grades by completing additional tasks that would develop their (or anyone’s) learning merely by completion, e.g., redrafting a paper at a shorter length, peer reviews, summarizing additional readings, and so on (Inoue 2019). Each task is worth some number of points and the more tasks completed, the higher the eventual grade. This works well for writing development classes but may be difficult to operationalize for courses that involve specific content knowledge. See also: Elbow and Danielewicz (2008).
Thanks to an audience member at the central APA meeting in 2023 for suggesting this.
Thanks to Shaun Nichols for this suggestion.
References
American Philosophical Association (n.d.a) 5 ways to be more inclusive. https://cdn.ymaws.com/www.apaonline.org/resource/resmgr/diversity/updated_5_ways_to_be_more_in.pdf. Accessed 13 Apr 2023a
American Philosophical Association (n.d.b) Diversity in Philosophy Bibliography. https://cdn.ymaws.com/www.apaonline.org/resource/resmgr/diversity/2022_diversity_in_philosophy.pdf. Accessed 1 Sept 2023b
Anonymous (2012) Anonymous Grading: Why and How. Feminist Philosophers (blog), 27 February 2012. https://feministphilosophers.wordpress.com/2012/02/27/anonymous-grading-why-and-how/. Accessed 13 Apr 2023
Arneson R (2015) Equality of opportunity. In: Zalta E (ed) The Stanford Encyclopedia of Philosophy, Summer 2015 Edition. https://plato.stanford.edu/archives/sum2015/entries/equal-opportunity/. Accessed 23 Apr 2023
Baird J (1998) What’s in a name? Experiments with blind marking in A-level examinations. Educ Res 40(2):191–202. https://doi.org/10.1080/0013188980400207
Bernstein RH, Macy MW, Williams WM, Cameron CJ, Williams-Ceci SC, Ceci SJ (2022) Assessing gender Bias in Particle Physics and Social Science Recommendations for Academic Jobs. Social Sci 11(2):74. https://doi.org/10.3390/socsci11020074
British Philosophical Association and Society for Women in Philosophy UK (2014) BPA/SWIP Good Practice Scheme General Guidance. https://bpa.ac.uk/wp-content/uploads/2021/09/All-GP-docs.pdf. Accessed 13 Apr 2023
Bygren M (2020) Biased grades? Changes in Grading after a blinding of Examinations Reform. Assess Evaluation High Educ 45(2):292–303. https://doi.org/10.1080/02602938.2019.1638885
Cardichon L, Darling-Hammond L, Yang M, Scott C, Shields P, Burns D (2020) Inequitable opportunity to learn: Student access to certified and experienced teachers. Learning and Policy Institute. https://learningpolicyinstitute.org/product/crdc-teacher-access-report. Accessed 13 Apr 2023
Clotfelter C, Ladd HF, Vigdor J, Wheeler J (2006) High poverty schools and the distribution of teachers and principles. Working Paper. National Center for the Analysis of Longitudinal Data in Education Research (CALDER), Urban Institute. https://caldercenter.org/sites/default/files/1001057_High_Poverty.pdf. Accessed 13 Apr 2023
Elbow P, Danielewicz J (2008) A unilateral grading contract to improve learning and teaching. English Department Faculty Publication Series, University of Massachusetts—Amherst. https://scholarworks.umass.edu/eng_faculty_pubs/3/. Accessed 25 Aug 2023
ExcelinEd (2018) College and career pathways: Equity and access. ExcelinEd Civil Rights Data Collection Analysis. https://www.excelined.org/wp-content/uploads/2018/10/ExcelinEd.Report.CollegeCareerPathways.CRDCAnalysis.2018.pdf. Accessed 25 Aug 2023
Goldin C, Rouse C (2000) Orchestrating impartiality: the impact of ‘Blind’ auditions on female musicians. Am Econ Rev 90(4):715–741. https://doi.org/10.1257/aer.90.4.715
Greenwald A, McGhee DE, Schwartz JLK (1998) Measuring individual differences in implicit cognition: the implicit association test. J Personal Soc Psychol 74(6):1464–1480. https://doi.org/10.1037/0022-3514.74.6.1464
Hinton DP, Higson H (2017) A large-scale examination of the effectiveness of anonymous marking in reducing group performance differences in higher education assessment. PLoS ONE 12(8):e0182711. https://doi.org/10.1371/journal.pone.0182711
Hussar B, Zhang J, Hein S, Wang K, Roberts S, Cui J, Smith M, Mann FB, Barmer A, Dilig A (2020) The condition of education 2020. NCES 2020 – 144. Institute of Education Sciences: National Center for Education Statistics, U.S. Department of Education. https://nces.ed.gov/pubs2020/2020144.pdf. Accessed 25 Aug 2023
Inoue A (2019) Labor-based grading contracts: building equity and inclusion in the compassionate writing classroom. University Press of Colorado, Boulder
Jansson J, Tyrefors B (2018) Gender grading bias at Stockholm University: quasi-experimental evidence from an anonymous grading reform. IFN Working Paper No 1226. Reseach Institute of Industrial Economics (IFN), Stockholm. https://doi.org/10.2139/ssrn.3334460
King EB, Mendoza SA, Madera JM, Hebl MR, Knight JL (2006) What’s in a name? A multiracial investigation of the role of occupational stereotypes in selection decisions. J Appl Soc Psychol 36:1145–1159. https://doi.org/10.1111/j.0021-9029.2006.00035.x
Lebuda I, Karwowski M (2013) Tell me your name and i’ll tell you how creative your work is: author’s name and gender as factors influencing assessment of products’ creativity in four different domains. Creat Res J 25(1):137–142. https://doi.org/10.1080/10400419.2013.752297
Malouff JM, Thorsteinsson EB (2016) Bias in grading: a meta-analysis of experimental research findings. Aust J Educ 60(3):245–256. https://doi.org/10.1177/0004944116664618
Malouff JM, Stein SJ, Bothma LN, Coulter K, Emmerton AJ (2014) Preventing Halo Bias in grading the work of University students. Cogent Psychol 1(1):988937. https://doi.org/10.1080/23311908.2014.988937
Meynell L, Equity Committee of the Canadian Philosophical Association (2018) Good equity practices. Canadian Philosophical Association. https://www.acpcpa.ca/articles/good-equity-practices. Accessed 13 Apr 2023
Minorities and Philosophy (n.d.) Resources (ed) Map for the Gap: Minorities and Philosophy. http://www.mapforthegap.com/resources.html. Accessed 13 Apr 2023
Minorities and Philosophy, Daoust L, Duford R, Li B, Mathie Smith C, Thompson MK (n.d.) Best practices for the inclusive philosophy classroom, grading methods. https://phildiversity.weebly.com/grading-methods.html. Accessed 13 Apr 2023
Newstead SE, Dennis I (1990) Blind marking and sex bias in student assessment. Assess Eval High Educ 15(2):132–139. https://doi.org/10.1080/0260293900150204
Novaes CD (2013) On anonymous grading. New APPS: Art, politics, philosophy, science (blog), 2 May 2013. https://www.newappsblog.com/2013/05/on-anonymous-grading.html. Accessed 13 Apr 2023
Oswald FL, Mitchell G, Blanton H, Jaccard J, Tetlock PE (2013) Predicting ethnic and racial discrimination: a meta-analysis of IAT Criterion Studies. J Personal Soc Psychol 105(2):171–192. https://doi.org/10.1037/a0032734
Payne K, Niemi L, Doris J (2018) How to think about ‘implicit bias. Scientific American. https://www.scientificamerican.com/article/how-to-think-about-implicit-bias/. Accessed 25 Aug 2023
Pitt E, Winstone N (2018) The impact of anonymous marking on students’ perceptions of fairness, feedback and relationships with lecturers. Assess Eval High Educ 43(7):1183–1193. https://doi.org/10.1080/02602938.2018.1437594
Quinn DM (2020) Experimental evidence on teachers’ racial bias in student evaluation: the role of grading scales. Educ Eval Policy Anal 42(3):375–392. https://doi.org/10.3102/0162373720932188
Sadker M, Sadker D (1994) Failing at fairness: how America’s schools cheat girls. C. Scribner’s Sons, New York
Saul J (2013) Implicit bias, stereotype threat, and women in philosophy. In: Jenkins F, Hutchison K (eds) Women in philosophy: what needs to change? Oxford University Press, Oxford
Saul J (2016) Another reason for anonymous marking. Feminist Philosophers (blog), January 5, 2016. https://feministphilosophers.wordpress.com/2016/01/05/another-reason-for-anonymous-marking/. Accessed 25 Aug 2023
Saul J, Beebee H, Joint BPA/, SWIP Committee for Women in Philosophy (2011) Women in philosophy in the UK: a report by the British Philosophical Association and the Society for Women in Philosophy UK. British Philosophical Association and the Society for Women in Philosophy UK. https://bpa.ac.uk/wp-content/uploads/2018/11/BPA_Report_Women_In_Philosophy.pdf. Accessed 13 Apr 2023
Schimmack U (2021) The implicit association test: a method in search of a construct. Perspect Psychol Sci 16(2):396–414. https://doi.org/10.1177/1745691619863798
Steinpreis RE, Anders KA, Ritzke D (1999) The impact of gender on the review of the curricula vitae of job applicants and tenure candidates: a national empirical study. Sex Roles 41(7/8):509–528. https://doi.org/10.1023/A:1018839203698
Storage D, Charlesworth TES, Banaji MR, Cimpian A (2020) Adults and children implicitly associate brilliance with men more than women. J Exp Soc Psychol 90:104020. https://doi.org/10.1016/j.jesp.2020.104020
Ushomirsky N, Williams D (2015) Funding gaps 2015: too many states still spend less on educating students who need the most. Washington, DC: The Education Trust. https://edtrust.org/wp-content/uploads/2014/09/FundingGaps2015_TheEducationTrust1.pdf. Accessed 13 Apr 2023
Weinberg J (2015) Is anonymous grading really better? Daily Nous (blog). December 15, 2015. https://dailynous.com/2015/12/15/is-anonymous-grading-really-better/. Accessed 13 Apr 2023
Whitelegg D (2002) Breaking the feedback loop: problems with anonymous assessment. Planet 5(1):7–8. https://doi.org/10.11120/plan.2002.00050007
Acknowledgements
Particular thanks to Adriene Takaoka for getting me thinking about the value of anonymous grading. Thank you also to Julia Markovits, Harry Brighouse, Emad Atiq, Kate Manne, Dave Shoemaker, Shaun Nichols, Rachana Kamtekar, Scott MacDonald, David O’Brien, Geoff Weiss, Hannah Winckler-Olick, Timothy Kwiatek, Fran Fairbairn, Alicia Patterson, Joseph Orttung, Gus Turyn, and David Mehrle for discussing this issue with me and for comments on earlier versions. Finally, thank you to two anonymous reviewers at this journal and to audiences at the American Association of Philosophy Teachers Teaching Hub in 2021, the Northeast Philosophy of Education Society annual meeting in 2022, the American Philosophical Association central meeting in 2023, Cornell University, Hampden-Sydney College, and Colgate University for helpful questions, comments, and suggestions.
Author information
Authors and Affiliations
Contributions
Libby Southgate is the sole author/contributor of this piece.
Corresponding author
Ethics declarations
Ethical Approval
N/A.
Informed Consent
N/A.
Statement Regarding Research Involving Human participants and/or Animals
N/A.
Conflict of Interest
None.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Southgate, L. Rethinking Anonymous Grading. Ethic Theory Moral Prac 27, 237–251 (2024). https://doi.org/10.1007/s10677-023-10415-y
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10677-023-10415-y