Examining the assumptions of AI hiring assessments and their impact on job seekers’ autonomy over self-representation

Aizenberg, Evgeni; Dennis, Matthew J.; van den Hoven, Jeroen

doi:10.1007/s00146-023-01783-1

Examining the assumptions of AI hiring assessments and their impact on job seekers’ autonomy over self-representation

Open Forum
Open access
Published: 21 October 2023

(2023)
Cite this article

Download PDF

You have full access to this open access article

AI & SOCIETY Aims and scope Submit manuscript

Examining the assumptions of AI hiring assessments and their impact on job seekers’ autonomy over self-representation

Download PDF

Evgeni Aizenberg ORCID: orcid.org/0000-0003-0755-0374^1,2^nAff3,
Matthew J. Dennis⁴ &
Jeroen van den Hoven^1,5

1959 Accesses
5 Altmetric
Explore all metrics

Abstract

In this paper, we examine the epistemological and ontological assumptions algorithmic hiring assessments make about job seekers’ attributes (e.g., competencies, skills, abilities) and the ethical implications of these assumptions. Given that both traditional psychometric hiring assessments and algorithmic assessments share a common set of underlying assumptions from the psychometric paradigm, we turn to literature that has examined the merits and limitations of these assumptions, gathering insights across multiple disciplines and several decades. Our exploration leads us to conclude that algorithmic hiring assessments are incompatible with attributes whose meanings are context-dependent and socially constructed. Such attributes call instead for assessment paradigms that offer space for negotiation of meanings between the job seeker and the employer. We argue that in addition to questioning the validity of algorithmic hiring assessments, this raises an often overlooked ethical impact on job seekers’ autonomy over self-representation: their ability to directly represent their identity, lived experiences, and aspirations. Infringement on this autonomy constitutes an infringement on job seekers’ dignity. We suggest beginning to address these issues through epistemological and ethical reflection regarding the choice of assessment paradigm, the means to implement it, and the ethical impacts of these choices. This entails a transdisciplinary effort that would involve job seekers, hiring managers, recruiters, and other professionals and researchers. Combined with a socio-technical design perspective, this may help generate new ideas regarding appropriate roles for human-to-human and human–technology interactions in the hiring process.

Reskilling and Upskilling the Future-ready Workforce for Industry 4.0 and Beyond

Article 13 July 2022

The rise of artificial intelligence – understanding the AI identity threat at the workplace

Article Open access 05 October 2021

The Participation of People with Disabilities in the Workplace Across the Employment Cycle: Employer Concerns and Research Evidence

Article Open access 22 January 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The use of artificial intelligence (AI) algorithms for assessment of job seekers’ attributes and job fit is becoming more common across different industries (Bogen and Rieke 2018; Crawford et al. 2019). These assessments come in a variety of forms, for example automated online interviews and games, that apply machine learning to map candidates’ responses and behaviors to a wide range of attributes, such as willingness to learn, relationship building, generosity, and decision making (Mondragon et al. 2021; Pymetrics, n.d.). One big vendor of such assessments, HireVue, refers to measured attributes as “competencies,” “cognitive ability,” “skills,” “personality traits,” and “emotional intelligence” (Mondragon et al. 2021: 17). Other vendors, like pymetrics and Harver, describe the job seeker attributes they assess as “soft skills” and “cognitive and emotional attributes” (Harver, n.d.; Pymetrics, n.d.). Given the variety of terms vendors use to describe what kind of attributes are measured, in this paper we will refer to these more generically as job seekers’ attributes. From these attributes, some algorithms then make a further inference about job fit. HireVue, for example, defines job fit as “the optimal combination of personality traits, cognitive ability, and competency areas for a target set of job roles” (Mondragon et al. 2021: 5).

Vendors of algorithmic assessments promise a range of benefits that are highly attractive to employers (Li et al. 2021). One common benefit claimed by vendors is that their assessments can save many hours of tedious and costly work, allowing recruiters and hiring managers to focus their efforts on interviewing the best candidates for the job (Harver, n.d.; HireVue, n.d.; Modern Hire, n.d.). Furthermore, vendors often claim that their algorithms make assessments that substantially reduce or even fully eliminate individual prejudice and systematic bias, contributing to fairness, diversity, and inclusion in the hiring process (Drage and Mackereth 2022; Raghavan et al. 2020).

However, the use of algorithmic hiring assessments and the bold claims made by their vendors have come under increasing ethical, legal, and scientific scrutiny. The ethical and legal aspects that seem to have received most attention in academic literature thus far are algorithmic bias and discrimination (Ajunwa 2021; Bogen and Rieke 2018; Hunkenschroer and Luetge 2022; Raghavan et al. 2020; Sánchez-Monedero et al. 2020). While these issues are very important, additional and fundamental concerns have been raised about the validity of these assessments and the claims being made. Some industrial and organizational (I–O) psychologists have expressed concerns about the scarcity of available evidence supporting the validity, reliability, and fairness of these tools (Gonzalez et al. 2019; Tippins et al. 2021). In fact, a recent study auditing two personality-assessing algorithms used in hiring concluded that both tools failed to exhibit sufficient reliability and therefore cannot be considered as valid assessments (Rhea et al. 2022). Furthermore, some algorithmic assessments measure features for which there is no established and scrutinized theory relating them to job seeker attributes or job performance (e.g., features like tone of voice and facial expressions) (Ajunwa 2021; Hinkle 2021; Stark and Hutson 2021; Tippins et al. 2021). Sloane et al. (2022) have argued that it is therefore critical to step back and examine the assumptions underlying the use of AI algorithms in the hiring process.

In this paper, we contribute to this effort by conceptually analyzing the epistemological and ontological assumptions algorithmic hiring assessments make about job seekers’ attributes and the ethical implications of these assumptions. Hiring assessments can be viewed as tools for producing knowledge about the job seeker, with the goal of informing the hiring decision-making process. As knowledge production tools, they embody certain epistemological and ontological assumptions that together constitute a knowledge production paradigm. But what is that paradigm and what are the assumptions it makes? Which meanings of job seekers’ attributes are discoverable through this paradigm and which are missed? Which alternative paradigms can be considered? And, what are the ethical implications of these paradigm choices? We begin our conceptual analysis, by observing that both traditional hiring assessments and AI-based hiring assessments share a common set of underlying epistemological and ontological assumptions from the psychometric paradigm. Based on desk research, we then turn to literature that has examined the merits and limitations of these assumptions, gathering insights across multiple disciplines and several decades. By doing so, we invite the readers to join us on a journey of connecting the dots between insights from the past and questions posed about technologies of today.

Our exploration leads us to conclude that algorithmic hiring assessments are incompatible with attributes whose meanings are context-dependent and socially constructed. Such attributes call instead for assessment paradigms that offer space for negotiation of meanings between the job seeker and the employer. We argue that in addition to questioning the validity of algorithmic hiring assessments, this raises an often overlooked ethical impact on job seekers’ autonomy over self-representation: their ability to directly represent their identity, lived experiences, and aspirations. In our view, this key aspect of human dignity deserves as much attention within AI ethics as the more prominent topics of non-discrimination (fairness), explainability, and privacy.

We conclude with a suggestion to begin addressing these issues through epistemological and ethical reflection regarding the choice of assessment paradigm, the means to implement it, and the ethical impacts of these choices. This entails a transdisciplinary effort that would involve job seekers, hiring managers, recruiters, assessment experts, design researchers, ethicists of technology, and other professionals who collectively with stakeholders study the work context, identify and navigate epistemological and ethical tensions, and co-design the assessment process. Combined with a socio-technical design perspective, this may help generate new ideas regarding appropriate roles for human-to-human and human–technology interactions in the hiring process.

2 What are the assumptions?

Algorithmic hiring assessments produce knowledge about the job seeker by measuring the job seeker’s responses to certain stimuli (e.g., measuring speech features in response to questions in an automated interview, or measuring game behavior in a gamified assessment). This may sound as a very abstract description that ignores the technical details of what is measured, how it is measured, and how the measurements are processed. But this description actually represents an epistemological choice of how knowledge is produced: measurement of responses to stimuli. On this basic level of analysis, algorithmic assessments share a common epistemology with traditional psychometric hiring assessments. It is certainly true that the types of stimuli, measurements, and the technical methods of mapping those measurements to knowledge in the form of assessment scores can be very different when comparing traditional assessments and AI-based assessments (Liem et al. 2018). To be clear, here we want to focus on the underlying epistemological and ontological assumptions of the psychometric paradigm, which are shared by both traditional and algorithmic hiring assessments (Rhea et al. 2022). This shared set of underlying assumptions implies that their merits and limitations—which have been explored in academic literature over multiple decades—also apply to algorithmic hiring assessments. Therefore, we now turn to some of this literature to examine what are the assumptions these assessments make about job seekers’ attributes, which meanings of job seekers’ attributes are discoverable through such assessments, and which meanings are missed.

A key assumption under the psychometric assessment theory is that “the person’s knowledge, attitude, skill, or other measured attribute is a steady state; that is, we assume that any differences among scores earned by an individual on different occasions of measurement are due to one or more sources of error, and not to systematic changes in the individual due to maturation or learning” (Shavelson and Webb 1991: 1). There are a number of aspects worthwhile to highlight more explicitly here. Based on elaborate examination of this matter by Delandshere and Petrosky (1998) and Govaerts and Van der Vleuten (2013), we point out that psychometric assessment theory assumes that:

1)
The person’s measured attribute is stable across time and contexts.
2)
There exists a true attribute value to be measured, i.e., the true score.
3)
Variability in measurements of a person’s attribute across time and contexts is due to measurement error (noise).
4)
The assessed attribute can be meaningfully represented by a numerical scale.

Some of the rationale behind the stability across time and contexts assumption is that an individual’s scores should be similar if they were assessed on multiple occasions, and that scores should not be affected by factors like day or time of assessment, the equipment used, and context, unless these factors are job relevant (Tippins et al. 2021). In this sense, the true performance value (the true score) is an idealization that refers to the average score a person would receive if tested under all possible acceptable conditions (Shavelson and Webb 1991).

But which meanings of job seekers’ attributes are discoverable through such knowledge production paradigm and which are missed? The assumption that there exists a single, true attribute value to be measured implies that there is a universally true answer to the question of how much of that attribute (e.g., competency, skill, ability) does the job seeker possess, independent of context and time. Under this assumption, it is plausible that the assessed attribute can be meaningfully represented by a numerical scale. As Delandshere and Petrosky (1998: 23) point out, “numerical ratings are useful in representing occurrences of simple and discrete behaviors that manifest themselves consistently across individuals, contexts, and time and where the correspondence between the assignment of ratings and the observed behaviors is more obvious.” It is fairly straightforward to see how, for example, the skill of lifting a specific weight or running a certain distance in a given time, can be numerically rated based on how close the job seeker’s performance is to the target value. In these cases, there is a clear correspondence between numerical ratings that measure weight, distance, or time, on the one hand, and the observed performances and their meaning, on the other. And within some restricted period of time, variability across multiple performances and contexts can be seen as variations around some true level of ability.

Let us now consider a different type of attributes, for example teamwork and creativity. In many contexts, the meanings of these attributes are not reducible to simple, discrete behaviors that are consistent across individuals, contexts, and time. On the contrary, we might be looking for aspects of job performance that are unique to that individual and whose meaning is a product of an individual’s interaction with others (e.g., colleagues and clients) in a specific work setting and socio-cultural context (Govaerts and Van der Vleuten 2013). The way a person expresses teamwork or creativity can vary in different contexts. There is then no single true value for creativity and teamwork that is stable across contexts and time, but rather a plurality of expressions whose meanings are constructed by an individual and the people they interact with in the surrounding context. Note that for such attributes, variability across contexts and time should not be dismissed as an “error” or “noise”.^{Footnote 1} Instead, it is integral to appreciating the individual and the specific ways they can contribute to a job (Delandshere and Petrosky 1998; Govaerts and Van der Vleuten 2013).

Attributes whose meanings are context-dependent and socially constructed are incompatible with a knowledge production paradigm that measures responses to stimuli in search of a single true answer. The assumption that there is one true answer eliminates the possibility of multiple true answers. Production of knowledge through measurement of responses to stimuli does not leave room for negotiation of meanings among people. Which alternative paradigms of knowledge production could be helpful here? As argued by a number of authors, social constructionist and interpretivist paradigms are well suited to capture such plurality, context-dependent nuance, and offer space for negotiation of meanings (Govaerts and Van der Vleuten 2013; Pratt and Bonaccio 2016; Tafreshi et al. 2016). This often entails qualitative research aimed at understanding what, how, and why individuals are doing or have done in a particular context. Importantly, in this process the employer would make an effort to view the world from the perspective of the job seeker (Bryman 1984). Interpretive assessment “focuses on participants’ own perspectives in conceptualizing and reconstructing their experiences and world view” (Gipps 1999: 371). In turn, this provides the job seeker the ability to engage in direct representation and storytelling about their lived experiences and aspirations.

Despite the incompatibility of the psychometric paradigm with attributes whose meanings are context-dependent and socially constructed, both traditional and AI-based hiring assessments apply this paradigm to measure such attributes. Doing so, restricts and reduces the meaning of the assessed attribute to a non-negotiable numerical scale. In effect, the meaning of the attribute becomes defined by the assessment (Lantolf and Frawley 1988; Vollmer 1981). Delandshere and Petrosky (1998: 17) point out that this circular thinking has been a common characteristic of psychometric assessments: “[c]onstructs were not defined theoretically, but instead on the basis of the tests that served to measure them and on the statistical methods used to analyze the scores they yielded.” This has root in the widely accepted definition of measurement in psychology, viewing measurement as “the assignment of numerals to objects or events according to rules” (Stevens 1946: 677). As Tafreshi et al. (2016: 238) explain, such a flexible definition “leaves no room for questioning the adequacy of those numbers for capturing the nature of psychological attributes.” Michell (1997, 2000, 2003) has argued that it has become common practice in psychology to quantify psychological attributes without presenting evidence that these attributes have quantitative properties, but rather merely presuming that they do (Tafreshi et al. 2016).

Given the incompatibility of the psychometric paradigm with attributes whose meanings are context-dependent and socially constructed, we question the validity of algorithmic hiring assessments as tools for producing knowledge about such job seeker attributes. An unreflective application of these assessments can result in distortion and loss of crucial information about job seekers. This not only puts into question the validity of algorithmic hiring assessments, but also raises an often overlooked ethical impact on job seekers’ autonomy and dignity, which we explore below.

3 Ethical impact: autonomy over self-representation

Algorithmic hiring assessments impose a reductionist, non-negotiable conception of job seekers’ attributes. This knowledge production paradigm eliminates the possibility for the job seeker and the employer to negotiate an aligned knowledge representation about what, how, and why the job seeker is doing or has done in a particular context, and how that informs their suitability for the job. The job seeker is denied of what Bernard Williams (1973: 236) called an “effort at identification: that [a person] should not be regarded as the surface to which a certain label can be applied, but one should try to see the world (including the label) from [the person’s] point of view.” We suggest that what is at stake here is a key aspect of the job seeker’s dignity and autonomy: their ability to act as a direct representative and storyteller of their identity, lived experiences, and aspirations, while acting as active constructors of the representations through which others view them. Building upon related concepts, we refer to this aspect of human dignity as autonomy over self-representation.

Halbertal (2015) has discussed the notion of control over self-representation as a key dimension of human dignity, referring to a person’s autonomy to represent themselves to the world the way they wish to. This kind of autonomy is often at stake in the context of privacy, specifically an individual’s control over what private information about them is shared with others. The common concern invoked in this context is exposure of private information that the person did not wish to share, undermining their standing as a social agent (Velleman 2005). The complementary aspect of self-representation we are focusing on here involves the things an individual considers essential to share or display to represent their identity, lived experiences, aspirations, etc. (Risam 2018). These are aspects of the self that cannot be fully known or understood by outside observation or measurement because these would fail to engage with the individual’s own perspective on their life (Manders-Huits and Van den Hoven 2008). Therefore, we see autonomy over self-representation as not only the ability to choose what to share or display to the world, but also the ability of a person to engage with the world to construct and negotiate the representations through which others view them.

Ultimately, hiring decisions are based on a representation of the job seeker and their attributes. These representations may, for example, involve some information about the job seeker’s past experiences, knowledge, skills, and aspirations. Let us examine how such representations are constructed in two contrasting scenarios: a traditional job interview in which the job seeker (Paul) and the employer (Michelle) meet face to face (Fig. 1) and an AI-based interview in which Paul records answers to questions posed by an algorithmic assessment (Fig. 2). The presented scenarios are quite schematic, but we believe they can help appreciate the impacts of different paradigms of assessment on autonomy over self-representation on a more intuitive level.

3.1 Scenario 1: face-to-face interview

During a face-to-face interview, Michelle asks Paul what is his vision on healthy and productive teamwork. Paul replies by sharing a recent experience in which several of his team members became ill during the COVID-19 pandemic, and how he and his colleagues navigated a few tense months of work pressure by filling in for each other while collectively struggling to keep up with the workload. While Paul has no guarantee that Michelle interprets this experience the same way he does, the face-to-face interaction provides Paul with room for feedback and negotiation. He can semantically negotiate with Michelle and strive to align the mutually perceived meaning of what he wishes to communicate: the significance of that experience to him, and how it has influenced his approach to working in teams. Note that although perfect alignment in meaning may be difficult or even impossible to achieve, Paul is acting as a direct representative of himself and is actively negotiating with Michelle the meaning of his experiences.

3.2 Scenario 2: AI-based interview

In the AI-based interview, Paul records answers to questions posed to him by an algorithmic assessment. After the recording is submitted, the algorithm scans the video for Paul’s facial expressions, choice of words, and voice tone. Based on these measured features, the algorithm computes numerical scores for teamwork, willingness to learn, and conscientiousness, which are later presented to Michelle along with scores of other candidates. During the interview, Paul has no indication how the algorithm interprets his answers. Although he recounts the pandemic-related experience and its significance to his approach to teamwork, the algorithm is actually not capable of interpreting that information in a way that can capture its qualitative richness, context, and meaning. Thus, an experience Paul considers essential to his self-representation and suitability for the job is missing from the representation produced by the algorithm. In fact, Michelle may never hear Paul’s story, unless she explicitly decides to view his recording. But for that to happen, the scores the algorithm assigned to Paul need to be high enough so that he is among the top-ranked candidates.

Although schematic, these contrasting scenarios provide some intuition on how the choice of assessment paradigm affects job seekers’ autonomy over self-representation. While in both cases Paul has the ability to share his story, the AI-based interview does not provide Paul with any possibility to negotiate the numerical representation the algorithm constructs and the meanings it communicates to Michelle. This is in stark contrast to the face-to-face interview where Paul and Michelle actively negotiate meanings between each other. The teamwork score presented to Michelle reduces Paul’s lived experiences to a single number void of qualitatively nuanced storytelling he tried to communicate. Furthermore, the scores may obscure the fact that the job seeker is a dynamic person who engages in self-improvement and evolves over time in ways that are not deterministic (Govaerts and Van der Vleuten 2013; Manders-Huits and Van den Hoven 2008). Each of these factors poses a real risk that the job seeker will be judged based on what the algorithm says they are capable of, rather than what the job seeker says they are capable of or plan to do, an example of what has been informatively labeled as “data determinism” (Ramirez 2013).

4 Implications for research and practice

In this paper, we sought to contribute to the effort of examining the knowledge production assumptions of algorithmic hiring assessments and their ethical implications. We believe that a key takeaway from this initial exploration is that job seekers’ attributes whose meanings are context-dependent and socially constructed are incompatible with an assessment paradigm that searches for a single true answer based on measurement of responses to stimuli. Such attributes call for assessment paradigms that offer space for negotiation of meanings between the job seeker and the employer. The absence of ability to negotiate meanings brings us to recognize that what is at stake is not only the validity of AI-based assessments but also their ethical impact on job seekers’ autonomy over self-representation, a key dimension of human dignity. The ability of the job seeker to act as a direct representative and storyteller of their identity, lived experiences, and aspirations is critical for arguing why they believe they are a good candidate for a job. And this autonomy is especially important for making their case about attributes whose meanings are context-dependent and socially constructed. Respect for job seekers’ autonomy over self-representation calls upon employers to make an effort to see the world from the job seekers’ perspective.

While we focused on assumptions algorithmic hiring assessments make about job seekers’ attributes, we did not address epistemological assumptions that underpin the mapping of measurements to scores. However, it is important to note that these mappings also have profound impact on autonomy over self-representation through the inherent comparison of the job seeker to the population sample these algorithms were trained on. This imposes a conception under which the job seeker is de-individualized (Vedder 1999): What matters is not the unique aspects of their attributes, but how the quantified aspects of their attributes compare to the “competent candidates” in the population sample the algorithm was trained on.

Looking for a moment beyond the hiring domain, we believe that autonomy over self-representation is an overlooked ethical issue that deserves as much attention within AI ethics as the more prominent topics of non-discrimination (fairness), explainability, and privacy. AI algorithms are being applied to produce knowledge about people in other high-stakes domains of life, such as healthcare, policing, finance, and welfare. These algorithms construct representations of individuals through which others view them and make decisions that affect their lives. As explored here, this can impose a reductionist, non-negotiable conception of the person, taking away their ability to construct the representations through which others view them. Furthermore, the quantitative and automated nature of algorithms can falsely create an impression of objectivity regarding the representations they produce. Therefore, we believe there is an urgent need to take a step back to reflect on what is an appropriate knowledge production paradigm for a given context, the means to implement it, and what are the ethical implications of these choices, including impact on individuals’ autonomy over self-representation. Reflection on knowledge production assumptions is also relevant for other constructs AI algorithms attempt to measure, for example fairness. Such reflection would complement the exploration of theoretical understandings and measurement assumptions about fairness discussed by Jacobs and Wallach (2021) by examining whether measurement itself is epistemologically compatible with contextual meanings of fairness, or whether a different paradigm for producing knowledge is warranted. In the remainder of this section, we share some initial thoughts on how these serious issues could be addressed in the design of hiring assessments going forward.

We suggest to engage in epistemological and ethical reflection regarding the choice of knowledge production paradigm before a decision is made on pursuing an algorithmic approach to assessment. This reflection would seek to contextually investigate the kind of questions we asked at the beginning of this paper:

1)
Which meanings of job seekers’ attributes are discoverable through a given paradigm and which are missed?
2)
Which alternative paradigms can be considered?
3)
What are the ethical implications of these paradigm choices?
4)
Which means of assessment are compatible with the preferred paradigm?

One of the critical first steps in this reflection would be to align the understanding of contextual meanings of various attributes among job seekers, hiring managers, recruiters, and designers and developers of assessments. This requires a qualitatively rich empirical investigation that together with these stakeholders reveals the often implicit assumptions of what attributes a given job entails and which qualities characterize a good job candidate. Such investigation would not suffice itself with establishing that “teamwork” is an important attribute, but would engage stakeholders to further specify what teamwork entails in the context of that job through concrete examples, storytelling, and deliberation among stakeholders. In this process of making the implicit explicit, the investigation may reveal areas where stakeholders wrongly assumed to have consensus among each other when, in fact, significant misalignments in meanings are present (Van den Broek et al. 2019). It is crucial to emphasize the importance of involving job seekers in this process. As early studies on autonomy over self-representation in hiring illustrate, it is this kind of direct engagement with job seekers that brings to the surface specific needs that can inform design choices (Ter Haar Romenij 2020; Van der Ploeg 2021).

Having established the contextual meanings of job seekers’ attributes, it is possible to begin exploring which assessment paradigms are compatible with capturing these meanings and how to implement them. At this step, it is essential to keep an open mind regarding what the roles of humans and algorithms might be, avoiding technical solutionism (Morozov 2013; Selbst et al. 2019) and assuming upfront that algorithms must be part of the assessment process. If the empirical investigation indicates that the meanings of a specific attribute align with the assumptions of the psychometric paradigm (i.e., existence of a single true value, variations over context and time being measurement error, etc.), then the use of psychometric measurement can be considered as possible means of assessment for that attribute. On the other hand, an attribute whose meanings are context-dependent and socially constructed would call for assessment paradigms that offer space for negotiation of meanings. This entails interaction between the job seeker and the employer. Such interaction can take the form of a face-to-face conversation, but it does not have to be the only meaningful way to achieve it. Furthermore, human-to-human interaction alone is not itself a guarantee that the assessment is not reductionist, as humans are certainly prone to making unfounded assumptions about each other in ways that can harm autonomy over self-representation. This human-to-human interaction needs to be embedded within a larger system with organizational process, policy, and training for hiring managers and recruiters, which collectively act in support of job seekers’ autonomy over self-representation. In fact, digital technology (not necessarily AI, but possibly as well) may be able to support and facilitate such interactions in innovative and meaningful ways. But note that this entails a very different role for digital technology compared to the dominant narratives marketed by vendors of AI-based hiring assessments. Instead of replacing human-to-human interaction with automated assessments, the focus of the technology would be supporting human-to-human interaction. This highlights the need for a socio-technical design perspective that jointly considers human-to-human and human–technology interactions.

Carrying out the outlined reflection process and empirical investigation is not an exercise that a single profession can engage in alone. It requires integrating ways of knowing brought by different stakeholders, professional fields, and academic disciplines—a transdisciplinary team effort. Such an effort would involve job seekers, hiring managers, recruiters, assessment experts, design researchers, ethicists of technology, and other professionals who collectively with stakeholders study the work context, identify and navigate epistemological and ethical tensions, and co-design the assessment process. Van der Bijl-Brouwer (2022: 9) identifies “epistemic intelligence, worldview awareness, power literacy, and reflexive and dialogic skills” as important competencies for engaging in transdisciplinary work.

One may point out that even with sincere intentions and efforts of all stakeholders to design for autonomy over self-representation, there is a fundamental tension between the needs of job seekers and the resources of employers. Even with support of digital technologies, it is likely that designing for autonomy over self-representation would require employers to invest more human effort, time, and budget into the hiring process. For jobs where the volume of applicants is high, this tension is likely to be especially pronounced. While limitations to job seekers’ autonomy are likely to occur in this balancing act, any derogation of job seekers’ autonomy over self-representation requires a strong justification and scrutiny. Based on the exploration presented in this paper, we argue that both scientifically and morally it is unacceptable to use hiring assessments that impose a reductionist, non-negotiable view on job seekers’ attributes whose meanings are context-dependent and socially constructed. Resolving these larger-scale tensions is a serious challenge that takes us beyond the scope of this paper. These dilemmas tie into systemic questions concerning the labor market, the economy, and ultimately political choices a society makes. However, an explicit recognition of these tensions, as well as the fact that it is often not possible to simply “tech” our way out of them, is a first step toward an honest conversation that could give rise to practical improvements.

5 Conclusion

The knowledge production paradigm behind algorithmic hiring assessments assumes that job seekers’ attributes are stable over time and contexts, and that they can be assessed by measuring job seekers’ responses to various types of stimuli. Our exploration of past insights on the merits and limitations of these psychometric assumptions led us to conclude that they are incompatible with job seekers’ attributes whose meanings are context-dependent and socially constructed. Such attributes call instead for assessment paradigms that offer space for negotiation of meanings between the job seeker and the employer. We have argued that in addition to questioning the validity of algorithmic hiring assessments, this raises an often overlooked ethical impact on job seekers’ autonomy over self-representation: their ability to directly represent their identity, lived experiences, and aspirations. Algorithmic hiring assessments undermine job seekers’ ability to construct and negotiate the representations through which the employer views them. This infringement on autonomy over self-representation constitutes an infringement on job seekers’ human dignity.

We suggest beginning to address these issues through epistemological and ethical reflection regarding the choice of assessment paradigm, the means to implement it, and the ethical impacts of these choices. This entails a transdisciplinary effort that would involve job seekers, hiring managers, recruiters, and other professionals and researchers. In this process, it is essential to keep an open mind about the possible roles of humans and technology and not assume upfront that algorithms must be part of the assessment process. Combined with a socio-technical design perspective, this may help generate new ideas regarding appropriate roles for human-to-human and human–technology interactions in the hiring process, which may differ substantially from the dominant narratives of today.

Data availability

Data sharing is not applicable to this article as no datasets were generated or analyzed during the current study.

Notes

For a spirited defense of the attractiveness of AI systems for reducing undesirable variability in human judgement, see Daniel Kahneman et al.’s recent book: Noise: A Flaw in Human Judgment (2021).

References

Ajunwa I (2021) Automated video interviewing as the new phrenology. 3889454, SSRN Scholarly Paper. Rochester, NY. https://papers.ssrn.com/abstract=3889454. Accessed 8 May 2023
Bogen M, Rieke A (2018) Help wanted: an exploration of hiring algorithms, equity, and bias. Upturn. Accessed Oct 21 2021. https://www.upturn.org/static/reports/2018/hiring-algorithms/files/Upturn%20--%20Help%20Wanted%20-%20An%20Exploration%20of%20Hiring%20Algorithms,%20Equity%20and%20Bias.pdf
Bryman A (1984) The debate about quantitative and qualitative research: a question of method or epistemology? Br J Sociol 35(1):75–92
Article Google Scholar
Crawford K, Dobbe R, Dryer T et al (2019) AI Now 2019 report. AI Now Institute, New York. Accessed Jan 7 2020. https://ainowinstitute.org/AI_Now_2019_Report.html
Delandshere G, Petrosky AR (1998) Assessment of complex performances: limitations of key measurement assumptions. Educ Res 27(2):14–24. https://doi.org/10.3102/0013189X027002014
Article Google Scholar
Drage E, Mackereth K (2022) Does AI debias recruitment? Race, gender, and AI’s “eradication of difference.” Philos Technol 35(4):89. https://doi.org/10.1007/s13347-022-00543-1
Article Google Scholar
Gipps C (1999) Chapter 10: socio-cultural aspects of assessment. Rev Res Educ 24(1):355–392. https://doi.org/10.3102/0091732X024001355
Gonzalez M, Capman J, Oswald F et al (2019) “Where’s the I-O?” Artificial intelligence and machine learning in talent management systems. Person Assess Decis. https://doi.org/10.25035/pad.2019.03.005
Govaerts M, Van der Vleuten CP (2013) Validity in work-based assessment: expanding our horizons. Med Educ 47(12):1164–1174. https://doi.org/10.1111/medu.12289
Article Google Scholar
Halbertal M (2015) Three concepts of human dignity. https://youtu.be/FyEvREFZVvc. Accessed 8 Aug 2019
Harver (n.d.) Gamified behavioral assessments. https://harver.com/gamified-assessments/. Accessed 5 May 2023
Hinkle C (2021) The modern lie detector: AI-powered affect screening and the Employee Polygraph Protection Act (EPPA). Georgetown Law J 109(5). https://www.law.georgetown.edu/georgetown-law-journal/in-print/volume-109/volume-109-issue-5-april-2021/the-modern-lie-detector-ai-powered-affect-screening-and-the-employee-polygraph-protection-act-eppa/. Accessed 10 June 2021
HireVue (n.d.) Assessment software for candidates | Hirevue hiring platform. https://www.hirevue.com/platform/assessment-software. Accessed 2 May 2023
Hunkenschroer AL, Luetge C (2022) Ethics of AI-enabled recruiting and selection: a review and research agenda. J Bus Ethics 178(4):977–1007. https://doi.org/10.1007/s10551-022-05049-6
Article Google Scholar
Jacobs AZ, Wallach H (2021) Measurement and fairness. In: Proceedings of the 2021 ACM conference on fairness, accountability, and transparency, New York, NY, USA, maart 2021. FAccT ’21. Association for Computing Machinery, pp 375–385. https://doi.org/10.1145/3442188.3445901
Kahneman D, Sibony O, Sunstein CR (2021) Noise: a flaw in human judgment, 1st edn. Little Brown Spark, New York
Google Scholar
Lantolf JP, Frawley W (1988) Proficiency: understanding the construct. Stud Second Lang Acquisit 10(2):181–195. https://doi.org/10.1017/S0272263100007300
Article Google Scholar
Li L, Lassiter T, Oh J et al (2021) Algorithmic hiring in practice: recruiter and HR professional’s perspectives on AI use in hiring. In: Proceedings of the 2021 AAAI/ACM conference on AI, ethics, and society, New York, NY, USA, 21 July 2021, pp 166–176. AIES ’21. Association for Computing Machinery. https://doi.org/10.1145/3461702.3462531
Liem CCS, Langer M, Demetriou A et al (2018) Psychology meets machine learning: interdisciplinary perspectives on algorithmic job candidate screening. In: Escalante HJ, Escalera S, Guyon I et al (eds) Explainable and interpretable models in computer vision and machine learning. The springer series on challenges in machine learning. Springer International Publishing, Cham, pp 197–253. https://doi.org/10.1007/978-3-319-98131-4_9
Manders-Huits N, Van den Hoven J (2008) Moral identification in identity management systems. In: Fischer-Hübner S, Duquenoy P, Zuccato A et al (eds) The future of identity in the information society. Boston, MA, 2008. IFIP —The International Federation for Information Processing. Springer US, pp 77–91. https://doi.org/10.1007/978-0-387-79026-8_6.
Michell J (1997) Quantitative science and the definition of measurement in psychology. Br J Psychol 88(3):355–383. https://doi.org/10.1111/j.2044-8295.1997.tb02641.x
Article Google Scholar
Michell J (2000) Normal science, pathological science and psychometrics. Theory Psychol 10(5):639–667. https://doi.org/10.1177/0959354300105004
Article Google Scholar
Michell J (2003) The quantitative imperative: positivism, naive realism and the place of qualitative methods in psychology. Theory Psychol 13(1):5–31. https://doi.org/10.1177/0959354303013001758
Article Google Scholar
Modern Hire (n.d.) Automated interview scoring | AI interviews. https://modernhire.com/platform/automated-interview-scoring/. Accessed 2 May 2023
Mondragon N, Liff J, Leutner K et al. (2021) Assessments overview and implementation. HireVue white paper. HireVue. https://webapi.hirevue.com/wp-content/uploads/2021/11/2021_10_TechnicalAssessmentsAssessmentsOverviewImplement-FINAL.pdf?_ga=2.153488568.146031070.1660916500-1151829273.1660916499. Accessed 27 Sept 2022
Morozov E (2013) To save everything, click here: technology, solutionism, and the urge to fix problems that don’t exist. Penguin UK
Pratt MG, Bonaccio S (2016) Qualitative research in I-O psychology: maps, myths, and moving forward. Ind Organ Psychol 9(4):693–715. https://doi.org/10.1017/iop.2016.92
Article Google Scholar
Pymetrics (n.d.) Soft skills assessment testing—pymetrics. https://www.pymetrics.ai/assessments. Accessed 27 Sept 2022
Raghavan M, Barocas S, Kleinberg J, et al. (2020) Mitigating bias in algorithmic hiring: evaluating claims and practices. In: Proceedings of the 2020 conference on fairness, accountability, and transparency, New York, NY, USA, 27 January 2020, pp. 469–481. FAT* ’20. Association for Computing Machinery. https://doi.org/10.1145/3351095.3372828
Ramirez E (2013) The privacy challenges of big data: a view from the Lifeguard’s Chair. Aspen, Colorado. https://www.ftc.gov/news-events/news/speeches/privacy-challenges-big-data-view-lifeguards-chair. Accessed 13 Oct 2022
Rhea AK, Markey K, D’Arinzo L et al (2022) An external stability audit framework to test the validity of personality prediction in AI hiring. Data Min Knowl Discov. https://doi.org/10.1007/s10618-022-00861-0
Article Google Scholar
Risam R (2018) Now you see them: self-representation and the refugee selfie. Popul Commun 16(1):58–71. https://doi.org/10.1080/15405702.2017.1413191
Article Google Scholar
Sánchez-Monedero J, Dencik L and Edwards L (2020) What does it mean to ‘solve’ the problem of discrimination in hiring? Social, technical and legal perspectives from the UK on automated hiring systems. In: Proceedings of the 2020 conference on fairness, accountability, and transparency, New York, NY, USA, 27 January 2020, pp 458–468. FAT* ’20. Association for Computing Machinery. https://doi.org/10.1145/3351095.3372849
Selbst AD, Boyd D, Friedler SA et al. (2019) Fairness and abstraction in sociotechnical systems. In: Proceedings of the conference on fairness, accountability, and transparency—FAT* ’19, New York, NY, 2019, pp 59–68. https://doi.org/10.1145/3287560.3287598
Shavelson RJ, Webb NM (1991) Generalizability theory: a primer. Sage Publications, Inc.
Sloane M, Moss E, Chowdhury R (2022) A Silicon Valley love triangle: hiring algorithms, pseudo-science, and the quest for auditability. Patterns. https://doi.org/10.1016/j.patter.2021.100425
Article Google Scholar
Stark L, Hutson J (2021) Physiognomic artificial intelligence. 3927300, SSRN Scholarly Paper. Rochester, NY. https://doi.org/10.2139/ssrn.3927300
Stevens SS (1946) On the theory of scales of measurement. Science 103(2684):677–680. https://doi.org/10.1126/science.103.2684.677
Article MATH Google Scholar
Tafreshi D, Slaney KL, Neufeld SD (2016) Quantification in psychology: critical analysis of an unreflective practice. J Theor Philos Psychol 36(4):233–249. https://doi.org/10.1037/teo0000048
Article Google Scholar
Ter Haar Romenij J (2020) Empowering academic graduate job search: the design and validation of a task-based vacancy platform. Delft University of Technology. http://resolver.tudelft.nl/uuid:a4c9d854-6905-4cd4-a335-4fdb4767d225. Accessed 27 Oct 2021
Tippins N, Oswald F, McPhail S (2021) Scientific, legal, and ethical concerns about AI-based personnel selection tools: a call to action. Person Assess Decis. https://doi.org/10.25035/pad.2021.02.001
Article Google Scholar
Van den Broek E, Sergeeva A, Huysman M (2019) Hiring algorithms: an ethnography of fairness in practice. In: ICIS 2019 Proceedings. https://aisel.aisnet.org/icis2019/future_of_work/future_work/6
Van der Bijl-Brouwer M (2022) Design, one piece of the puzzle: a conceptual and practical perspective on transdisciplinary design. In: Lockton D, Lenzi S, Hekkert P et al (eds) DRS2022. Bilbao, Spain. https://doi.org/10.21606/drs.2022.402
Van der Ploeg D (2021) The meaning in hiring: the potential loss of self-representation in AI hiring video interview systems. Delft University of Technology. http://resolver.tudelft.nl/uuid:98459ea5-fc0a-498e-a6d9-0615b938442a. Accessed 27 Oct 2021
Vedder A (1999) KDD: the challenge to individualism. Ethics Inf Technol 1(4):275–281. https://doi.org/10.1023/A:1010016102284
Article Google Scholar
Velleman JD (2005) The genesis of shame. Philos Public Affairs 30(1):27–52. https://doi.org/10.1111/j.1088-4963.2001.00027.x
Article Google Scholar
Vollmer HJ (1981) Why are we interested in ‘general language proficiency’? In: Alderson JC, Hughes A (eds) Issues in language testing. The British Council, London, pp 152–175. https://eric.ed.gov/?id=ED258440
Williams B (1973) Problems of the self: philosophical papers 1956–1972. Cambridge University Press, Cambridge. https://doi.org/10.1017/CBO9780511621253
Book Google Scholar

Download references

Acknowledgements

The artworks used in this paper are adaptations of https://pixabay.com/illustrations/avatars-ethnic-diverse-5615507/. https://pixabay.com/illustrations/online-web-statistics-data-3539412/. We would like to thank the anonymous reviewers for their valuable feedback.

Funding

The work of Matthew J. Dennis was supported by the research program Ethics of Socially Disruptive Technologies, which is funded through the Gravitation program of the Dutch Ministry of Education, Culture, and Science and the Netherlands Organization for Scientific Research (NWO grant number 024.004.031).

Author information

Evgeni Aizenberg
Present address: Human Centered Design Group, Department of Design, Production and Management, University of Twente, Enschede, The Netherlands

Authors and Affiliations

AiTech Interdisciplinary Research Program on Meaningful Human Control Over AI, Delft University of Technology, Delft, The Netherlands
Evgeni Aizenberg & Jeroen van den Hoven
Department of Intelligent Systems, Delft University of Technology, Delft, The Netherlands
Evgeni Aizenberg
Philosophy and Ethics Group, Department of Industrial Engineering and Innovation Sciences, Eindhoven University of Technology, Eindhoven, The Netherlands
Matthew J. Dennis
Department of Values, Technology and Innovation, Delft University of Technology, Delft, The Netherlands
Jeroen van den Hoven

Authors

Evgeni Aizenberg
View author publications
You can also search for this author in PubMed Google Scholar
Matthew J. Dennis
View author publications
You can also search for this author in PubMed Google Scholar
Jeroen van den Hoven
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Evgeni Aizenberg.

Ethics declarations

Conflict of interest

The authors have no competing interests to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Aizenberg, E., Dennis, M.J. & van den Hoven, J. Examining the assumptions of AI hiring assessments and their impact on job seekers’ autonomy over self-representation. AI & Soc (2023). https://doi.org/10.1007/s00146-023-01783-1

Download citation

Received: 05 December 2022
Accepted: 12 September 2023
Published: 21 October 2023
DOI: https://doi.org/10.1007/s00146-023-01783-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Examining the assumptions of AI hiring assessments and their impact on job seekers’ autonomy over self-representation

Abstract

Similar content being viewed by others

Reskilling and Upskilling the Future-ready Workforce for Industry 4.0 and Beyond

The rise of artificial intelligence – understanding the AI identity threat at the workplace

The Participation of People with Disabilities in the Workplace Across the Employment Cycle: Employer Concerns and Research Evidence

1 Introduction

2 What are the assumptions?

3 Ethical impact: autonomy over self-representation

3.1 Scenario 1: face-to-face interview

3.2 Scenario 2: AI-based interview

4 Implications for research and practice

5 Conclusion

Data availability

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Examining the assumptions of AI hiring assessments and their impact on job seekers’ autonomy over self-representation

Abstract

Similar content being viewed by others

Reskilling and Upskilling the Future-ready Workforce for Industry 4.0 and Beyond

The rise of artificial intelligence – understanding the AI identity threat at the workplace

The Participation of People with Disabilities in the Workplace Across the Employment Cycle: Employer Concerns and Research Evidence

1 Introduction

2 What are the assumptions?

3 Ethical impact: autonomy over self-representation

3.1 Scenario 1: face-to-face interview

3.2 Scenario 2: AI-based interview

4 Implications for research and practice

5 Conclusion

Data availability

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation