Skip to main content

Should AI allocate livers for transplant? Public attitudes and ethical considerations

Abstract

Background

Allocation of scarce organs for transplantation is ethically challenging. Artificial intelligence (AI) has been proposed to assist in liver allocation, however the ethics of this remains unexplored and the view of the public unknown. The aim of this paper was to assess public attitudes on whether AI should be used in liver allocation and how it should be implemented.

Methods

We first introduce some potential ethical issues concerning AI in liver allocation, before analysing a pilot survey including online responses from 172 UK laypeople, recruited through Prolific Academic.

Findings

Most participants found AI in liver allocation acceptable (69.2%) and would not be less likely to donate their organs if AI was used in allocation (72.7%). Respondents thought AI was more likely to be consistent and less biased compared to humans, although were concerned about the “dehumanisation of healthcare” and whether AI could consider important nuances in allocation decisions. Participants valued accuracy, impartiality, and consistency in a decision-maker, more than interpretability and empathy. Respondents were split on whether AI should be trained on previous decisions or programmed with specific objectives. Whether allocation decisions were made by transplant committee or AI, participants valued consideration of urgency, survival likelihood, life years gained, age, future medication compliance, quality of life, future alcohol use and past alcohol use. On the other hand, the majority thought the following factors were not relevant to prioritisation: past crime, future crime, future societal contribution, social disadvantage, and gender.

Conclusions

There are good reasons to use AI in liver allocation, and our sample of participants appeared to support its use. If confirmed, this support would give democratic legitimacy to the use of AI in this context and reduce the risk that donation rates could be affected negatively. Our findings on specific ethical concerns also identify potential expectations and reservations laypeople have regarding AI in this area, which can inform how AI in liver allocation could be best implemented.

Peer Review reports

Background

Livers are scarce, and transplantation is the only treatment for those with end-stage hepatic failure [1, 2]. Therefore, difficult decisions must be made in allocating donor livers. These involve complex predictions of donor organ and recipient interactions, and competing ethical values, including utility, urgency, justice, and responsibility [3].

Artificial Intelligence (AI) has been proposed to assist in decision-making in many areas of medicine [4]. While definitions vary, medical AI can be defined as “an information system capable of considering data and making clinical or patient care decisions commonly associated with a human”, which can use rule-based and/or non-rule based algorithms [5]. The former have specific rules set by experts and have been used in medicine since the 1970s [6]. Non-rule-based or machine learning algorithms are more complex and “learn” from vast amounts of data to detect patterns and make predictions [7, 8]. AI can now diagnose certain medical conditions equally (or more) accurately than specialist doctors [9,10,11]. AI has also been proposed to assist in resource allocation, for example in ICU prognosis and organ allocation [12, 13].

Algorithms have assisted in liver allocation for some time and have been increasing in complexity [14]. The Model for End-Stage Liver Disease (MELD) score is a basic algorithm which uses the results of three blood tests to predict how urgently a patient requires a liver (more accurately than clinicians) [15, 16]. Transplant units worldwide have used MELD scores to rank patients on waiting lists since the early 2000s [14].

However, using the MELD score (and its modifications) to rank patients is simplistic and there are concerns regarding its accuracy [17, 18]. This means human input by clinicians and transplant committees is required to consider additional factors which might be relevant (such as expected outcome) and to adjust decisions for certain circumstances. This potentially risks inconsistency, the influence of cognitive biases and deliberate manipulation [19, 20].

Furthermore, there has been a shift to incorporate further consideration of predicted outcomes into allocation, to maximise the utility of what is a scarce resource – particularly given a push to transplant poorer quality organs to increase supply [14]. In the UK, livers are first offered to those on the “super-urgent” list [21]. If there are no patients on this list, patients are ranked by their Transplant Benefit Score, an algorithm which uses 21 recipient and 7 donor characteristics to predict a patient’s “survival benefit” from a transplant [14]. This therefore incorporates both predicted urgency and outcome into allocation policy. However, independent simulations suggest that this algorithm may still be too simplistic to accurately predict survival benefit for certain subgroups of waiting list patients (i.e., those with hepatocellular carcinoma) [22].To consider additional factors and improve access to certain groups, the US has also started implementing a new algorithmic organ allocation system where specific factors are chosen and given a weighting, and patients are then ranked based on those factors [23].Footnote 1 The effects of this system on patient outcomes and fairness of distribution remain to be seen.

The development of liver allocation policies from MELD score (and its predecessors) to newer algorithms has been driven by a push to improve outcomes (for example via more accurate predictions of survival), and the desire to balance several complex factors in a way that is consistent. However, given the number of variables involved, the complexity of these allocation systems has the potential to increase even further [19, 25]. Many more complex algorithmic and AI models have been proposed or could be adapted for use in liver allocation (Table 1). These largely aim to improve on making predictions, for example of urgency, post-transplant survival or years of life to gain from a transplant. Given the potential beneficial effect on patient outcomes that may arise from more accurate predictions, and the trend towards using more complex allocation systems, we must explore the ethical implications of using algorithms and AI in this context.

Table 1 AI predictive model proposals

The ethical status of using AI for medical purposes remains controversial. Some have argued that, as well as improving accuracy, the use of algorithms will also lead to more impartial, consistent, and efficient decision-making [4, 5, 41,42,43,44]. On the other hand, there is ethical concern about AI bias [4, 45, 46] and that some (non-rule-based) types of AI are a “black box”, i.e., the decision-making process is uninterpretable to the user [45, 47, 48]. Broader concerns, such as the loss of important human elements or nuance in healthcare decision-making are also common [42, 45, 49, 50]. Specific concerns relating to AI in liver allocation have not been systematically explored, nor are the views of the public about this development known. However, a qualitative study of US transplant centre clinicians’ attitudes towards AI in liver allocation identified several key ethical themes in this context, relating to explainability, transparency, fairness, and trustworthiness [44]. Overall, the study found that clinicians were “cautiously optimistic” about the use of AI in this space.

On the other hand, studies have repeatedly shown that when all things are equal, laypeople prefer humans making medical or ethical decisions compared to computers [51,52,53,54]. However, it is unclear whether preference for human decision-making over AI extends to organ allocation. A study of attitudes to AI in kidney organ allocation found that preferences for AI or human decision-makers were relatively split [55]. No studies have thus far assessed layperson attitudes towards AI in liver allocation.

PART 1: why might we use algorithms and AI in liver allocation?

AI is increasingly used in medical domains – for example, to assist in diagnosing cancers using medical imaging [56]. Liver allocation, however, deals with a decision that is not (merely) medical (as in the case of medical diagnosis), but rather ethical: to whom should a scarce, life-saving resource be allocated?

There are challenges to using algorithms and AI in ethical decision-making [45, 47]. A main obstacle is achieving sufficient agreement on normative values to distil them into programmable variables. It may be obvious how to program AI to detect lung cancers on chest X-rays, but it would clearly be more difficult to create AI to decide when life support should be discontinued in intensive care. However, much depends on the approach we use to programming, and on the ethical values that are relevant to the specific context. We will introduce liver allocation as an area where there are some pro tanto reasons for using AI (although we will fall short of conclusively claiming that it should be used here). Firstly, there are top-down and bottom-up approaches to programming AI in liver allocation. Top-down would involve either single or multiple objectives being explicitly programmed. This could look similar to the recently approved US organ allocation policy, with specific weighted factors used to rank patients (Fig. 1) [24]. On the one hand, this sort of programming requires explicit endorsement of specific ethical values, which may be more difficult than native human ethical decision-making processes; health professionals in the emergency department may intuitively include factors in triaging patients, based on predicted urgency, outcome etc., but are not necessarily asked to write down factors and weightings used. However, in organ allocation, policies are generally transparent and publicly available anyway, even if humans make the decisions [57, 58]. If we can publish specific objectives in guidelines, programming specific objectives into AI is also feasible.

One potential concern with top-down AI, is that it may be too inflexible to account for nuances required for some types of decisions. For example, consider the difficulty in programming top-down AI with sufficient discrete variables to make end of life decisions for patients in intensive care, where (amongst other factors) judgments of future quality of life must be made. However, as mentioned, liver allocation is already largely algorithm-based [14] and factors which are currently already considered (such as urgency and predicted survival) may be more easily quantified. Given this, it might only be a small step to using top-down AI for this purpose, and an algorithmic process would allow for more consistent decision-making. Relative to traditional allocation algorithms, increasing the complexity of AI algorithms is also likely to mean more factors can be considered and decisions are more accurate [27, 32]. Accuracy may refer to making relevant predictions, for example of urgency and chance of survival, but also may refer to the accuracy by which an AI system applies programmed rules to a decision. For example, a transplant guideline may specify that livers should be primarily allocated based on urgency, but human decision-makers may erroneously give more weight to predicted survival when making a decision. AI programmed to give 70% weight to urgency and 30% weight to predicted survival is less likely to make this error. If AI can make predictions and weigh factors with a higher degree of accuracy and greater consistency than human decision-makers, then this provides one reason in favour of using it to use it to allocate livers. (This is a contingent empirical claim; if AI were less accurate/consistent than humans, we should not use it).

Alternately, AI could be programmed bottom-up, for example by training a machine learning algorithm on previous allocation decisions made by humans. Assuming the AI is accurate, this would result in similar decisions to those made currently or in the past, but with some of the advantages of speed, efficiency and decreased moral burden on those making decisions. Decisions would also potentially be more consistent, in that patients with identical features should yield the same decision. Another advantage of this method is that it bypasses the difficult question of how to explicitly weight multiple factors, as the AI would learn to do this based on how humans have in the past. Of course, this risks systematising and amplifying biases in the way decisions are currently made (although if trained on the verdicts of multiple transplant committees, using AI could dampen the effects of some biases which may affect any individual committee [59]). Implementing any changes to the current system would also be difficult — for example if we decided to improve access to transplants for certain groups (as is the case in the new US policy), some top-down element would be required to make this possible.

Fig. 1
figure 1

US Organ Procurement and Transplantation Network lung allocation system [24]. Each individual factor is given a specific weighting, which is used to give patients on the waiting list a ranking

Therefore, some have suggested that bottom-up AI should not be used as a decision-replacement, though it may have benefits as a decision-aid [59]. For example, if the AI recommends a course of action different to the transplant committee, and specifies the values and weights it used to make that recommendation (i.e., is not a black-box), this may assist human decision-makers to examine their own, comparatively opaque reasoning process.

There are several potential reasons to deploy AI in liver allocation. Both methods of programming AI in this context have advantages over human decision-making, but whether these outweigh the real concerns is unclear. One significant concern is whether AI in liver allocation would be accepted by the public. This is a particularly critical consideration as public approval of allocation policy may affect organ donation rates [20].

This pilot study aimed firstly, to indicate whether there is some public support for the use of AI in a high-stakes resource allocation context. Secondly, we aimed to identify respondents’ views on key issues regarding the design and implementation of AI in liver allocation. This may provide insight into whether the use of AI in this context is likely to be democratically legitimate. Additionally, these results may help identify ethical issues which require further exploration.

PART 2: public attitudes

Methods

Participants were recruited through the online platform Prolific Academic. Respondents were at least 18 years of age, fluent in English, based in the UK and had a minimum Prolific approval rate of 96%. The sample was gender balanced. The survey was created using Qualtrics XM and pre-tested on colleagues and a smaller Prolific sample.

A sample size of 200 was chosen based on time and resource constraints. Post-hoc power analysis suggested that a sample size of 172 (accounting for excluded responses) gave us 90% power to detect medium effect sizes (d = 0.5) in differences between conditions at a significance level of 0.05, and > 99% power to detect within-participant differences between questions. Precision analysis, based on a UK population size of 67 million [60], suggested that this would give a 7% margin of error at a 95% confidence level.

Questions were largely scenario-based, and responses recorded on 7-point Likert scales. Statements and questions in each section were presented in a randomised order (full survey in Appendix A).

Attitudes towards use of AI in liver allocation

AI acceptability

Participants were given explanations of AI and different areas of medicine, before being asked the extent to which they found the use of AI acceptable in those areas. They were also asked the extent to which they agreed with the statement: “If AI were used in liver allocation, I would be less likely to donate my organs”. To assess ethical concerns that could drive these acceptability ratings, participants were asked how they felt about AI compared to humans regarding four key ethical issues identified from previous literature [45]: consideration of decision-making nuances, dehumanisation of healthcare, bias, and consistency.

Decision-maker characteristics

Participants were told that decision-makers could possess different characteristics: interpretability, empathy, accuracy, consistency, and impartiality (which were chosen based on discussions amongst authors and are consistent with themes present in the relevant literature [44, 61, 62]). These terms were explained, and respondents were asked to divide a total of 100 points amongst these to indicate their relative importance.

Preferences for design of AI in liver allocation

To identify views about how AI might be used, participants were asked whether AI should be trained on previous human decisions (bottom-up) or programmed with specific factors (top-down).

Then, to assess whether the type of decision-maker would affect views on which factors should be incorporated into allocation, participants were randomised (within the survey platform) to either a “transplant committee” or “AI” condition. Both groups then were given 13 prompts regarding factors (identified in part from previous literature [3, 13]) that could be relevant to liver allocation. Participants rated the extent to which these factors should affect priority – when used by either a transplant committee or an AI decision-maker. These factors were: urgency, survival likelihood, life years gained, age, future medication compliance, quality of life, past alcohol use, future alcohol use, past crime, future crime, future societal contribution, socioeconomic status, and gender.

Finally, respondents were asked whether an AI decision should be overridable by a transplant committee. (Analysis of additional questions can be found in Appendix D.)

Statistical analysis

Statistical analysis was conducted using IBM SPSS Statistics. We used descriptive statistics to measure the frequency of various responses. For comparisons, Likert scales were assigned number values from 1 to 7 (for example where 1 = totally unacceptable and 7 = perfectly acceptable) and t-tests were performed to compare mean scores. Multiple linear regression was used to assess whether certain responses predicted views about the overall acceptability of AI for liver allocation. A p-value of < 0.05 was considered significant.

The project was reviewed and approved by the University of Oxford Central University Research Ethics Committee (R80692/RE003) as well as Monash University Ethics Committee (project number 34,555).

Results

Two hundred participants completed the survey. Twenty-eight were excluded for failing at least one of two attention checks (N = 172). The median age category was 35–44, 93.6% of respondents had completed high school or higher education, 88.4% identified as white, and 62.8% identified as having no religion (full demographics in Appendix B).

Attitudes towards use of AI in liver allocation

AI acceptability

A majority found the use of AI acceptable in all areas of medicine that were asked about (Fig. 2). Of all respondents, 84.3% found the use of AI in medicine acceptable (slightly acceptable, acceptable, or perfectly acceptable) compared to 69.2% in liver allocation. On average, participants found AI in liver allocation less acceptable than AI in medicine or resource allocation generally, although this effect was marginal. Behavioural questions revealed similarly positive views: 72.7% of respondents disagreed with the statement that if AI were used in liver allocation, they would be less likely to donate their organs, while 10.5% agreed and 16.8% felt neutrally (Appendix C).

Fig. 2
figure 2

Public attitudes towards AI in medicine. Bars represent how acceptable participants found the use of AI in different areas of medicine. **Participants found AI in liver allocation (M = 3.26, SD = 1.51) less acceptable compared to resource allocation (M = 2.99, SD = 1.43), t(171) = 3.19, p = .002. *** Participants found AI in liver allocation (M = 3.26, SD = 1.51) less acceptable compared to medicine (M = 2.55, SD = 1.10), t(171) = 7.26, p < .001. Where 1 = Perfectly acceptable, 7 = totally unacceptable

We then analysed attitudes towards AI that could drive acceptability ratings. Of all respondents, 82.0% thought that AI was likely to make less biased decisions than humans and 89% thought that AI would be more consistent than humans. However, 73.3% thought that AI was less likely to take into consideration nuances of individual liver allocation situations compared to humans, and 61.6% agreed with a statement that AI would lead to the dehumanisation of healthcare (Appendix D). Acceptability ratings were predicted by the extent to which AI is perceived as leading to the dehumanisation of healthcare (B = 0.294, p < .001), the extent to which it is perceived as more or less likely to consider the individual nuances of individual situations (β=-0.168, p = .021), and the extent to which AI is perceived to be more or less biased than humans (β = 0.157, p = .032). Perceptions of consistency did not significantly predict acceptability ratings (β=-0.097, p = .185). Further analysis can be found in Appendix D.

Decision-maker characteristics

Participants found accuracy the most important characteristic for a liver allocation decision-maker, followed by impartiality, consistency, interpretability, and empathy (Fig. 3). This same order was found by ranking the characteristics by highest average score or the number of participants choosing characteristics as their first or last ranked (Appendix E).

Fig. 3
figure 3

Public attitudes towards the importance of characteristics of decision-makers. The bars depict the average score (out of 100) that each characteristic received. Definitions that participants received are also depicted

Preferences for design of AI in liver allocation

Of all respondents, 40.7% preferred AI to learn from previous human decisions, 34.9% preferred AI to be programmed with specific objectives, and 24.4% were neutral towards this.

Most participants thought that the following characteristics should give patients priority for liver allocation: greater urgency, survival likelihood, life years gained, being younger, future medication compliance, quality of life, lower future alcohol use and lower previous alcohol use (Fig. 4). On the other hand, the majority thought the following factors were not relevant to prioritisation: past crime, future crime, future societal contribution, disadvantage, and female gender. Whether the decision was made by AI or transplant committee had very little impact on participant views of which factors should be included in allocation decisions. (Fig. 4). See Appendix F for further analysis.

Fig. 4
figure 4

Public attitudes to liver allocation priority factors (transplant committee and AI groups). Results are shown from the transplant committee condition on the left and the AI condition on the right. Each bar represents a factor which could be used in liver allocation. Green bars represent participants who thought that patients should be prioritised based on that factor (and red bars represent those who thought they should be deprioritised). Separate grey bars represent those who thought that factor was not relevant to liver allocation

Of all respondents, 87.2% agreed that liver allocation AI should be overridable by a transplant committee (either strongly agreed, agreed, or somewhat agreed).

Discussion

Attitudes towards use of AI in liver allocation

AI acceptability

Several findings in our study appear to indicate support for the use of AI in liver allocation. Firstly, respondents found the use of AI acceptable in all areas of medicine that were asked about, including liver allocation. Some studies have indicated that the public generally prefer human decision-makers to AI in medical and ethical decision-making, [51,52,53,54] even if there may not be an explicit reason for this [63]. Therefore, to avoid responses skewed unfairly against AI, we simplified the question and asked how acceptable people find AI without a human comparator. This may have resulted in more positive responses towards AI. Secondly, we found that most participants stated they would be just as likely to donate organs if AI were used in allocation. However, while only 10% of participants indicated that they were less likely to donate, a 10% reduction in donation rates would be a serious negative consequence. It would be important to confirm in other studies, whether those who indicated a negative response would donate in the absence of AI, i.e., to determine whether there would be a true reduction in donation rates.

A previous study has proposed “uniqueness neglect” – i.e., the idea that reducing patients to a number misses something important about their situation – as a reason for public aversion to medical AI [50]. Our study likewise indicates that this may be relevant, as most participants thought that AI would be less likely to consider nuances of allocation decisions, and, although the effect was not very strong, this predicted a slightly lower acceptance response for AI.Footnote 2 Most participants also agreed that AI would lead to the dehumanisation of healthcare, and the more they thought this, the more unacceptable they found AI for liver allocation. In fact, this was the strongest predictor of acceptability ratings of all the ethical concerns measured. This is consistent with a previous study [49]. These findings are useful as they offer an avenue towards practical solutions: for example, Longoni et al. found that explaining to patients the ways in which AI does actually consider individual factors appears to decrease aversion to AI [50]. Similarly, explaining to patients the extent that humans are still involved in the creation of algorithms may also alleviate some concern regarding dehumanisation.

Unsurprisingly, participants expected AI to be a more consistent decision-maker, although this did not appear to have an impact on overall acceptability ratings. While concerns about biased AI are common in the literature, they appear to be less significant in the public eye. One previous study found participants to be more concerned with human biases [49], which appears to complement our finding that participants thought that AI would be likely to make less biased decisions, compared to humans. This may reflect that participants may be more familiar with human biases that could affect healthcare professionals, compared to statistical biases that could affect AI decisions. Alternately, the public may also be more accepting of discrimination by algorithms rather than humans [64]. As familiarity should not guide whether discrimination or bias is problematic, some caution is warranted in attributing normative weight to these views.

Decision-maker characteristics

Participant ranking of characteristics for allocation decisions lends additional support to the use of AI in liver allocation. Respondents valued (in order of importance): accuracy, impartiality, consistency, interpretability, and empathy in those making decisions about liver allocation. AI is likely to be more accurate than humans in this context and most of our participants indicated that they expected AI to make less biased and more consistent decisions. Therefore, this appears to implicitly favour AI as the decision-maker. However, we acknowledge that these five characteristics do not necessarily capture all the potential characteristics of an allocation decision-maker - a different or expanded set of variables might have yielded a different conclusion.

Many studies have highlighted the importance of AI accuracy to the public [50, 51, 55], and it is perhaps unsurprising that this was seen as the most important factor for decisions. Interpretability was the second-least important characteristic, which is interesting, since interpretability is commonly discussed in AI ethics. There are often considered to be epistemic and ethical reasons why AI should be interpretable [12, 47]. Some have argued that black-box AI is inherently problematic because transparent explanations of decisions are indispensable for a fair decision-making process, and promote trust and acceptance - an (at least partially) contingent empirical claim [12, 48, 65]. However, previous work has similarly indicated that the public value accuracy more than interpretability in AI-assisted medical resource allocation [61, 62]. Empathy was rated as the least important characteristic by our participants. This appears to be consistent with some clinicians’ views that AI’s lack of emotion may be beneficial for the liver transplant evaluation process [44].

Preferences for design of AI in liver allocation

To our knowledge, this is the first study to attempt to gather public opinion comparing bottom-up and top-down AI used in ethical decision-making [66]. While AI based on previous human decisions (i.e., bottom-up AI) was slightly preferred, responses were split. This question is quite complex and therefore these results are perhaps not unexpected.

Prioritisation factors identified by participants match up well with those which are currently used in liver allocation policies across the world: for example urgency, survival, life-years gained, young age [14, 57].Footnote 3 Interestingly, participants indicated that the same ethical factors were relevant to liver allocation, regardless of whether top-down AI or a transplant committee made the predictions and weighted the factors. This might be practically useful information as this suggests that we could program top-down AI with the same values that we currently use for liver allocation guidelines. Secondly, some relevant factors (e.g., predicted life-years gained) involve calculations with hundreds of relevant variables and may be difficult for humans to make reliably [19]. If that factor is deemed relevant, this might support the use of AI in this area.

This was a small study and results may not extrapolate to the whole population (nor to other populations), although demographics were roughly comparable to the UK [60] and modest online convenience samples have been shown to yield similar results to representative sampling [68]. Prolific is a validated platform for recruiting survey participants, the population is limited to those who have access to the internet and have time for online surveys. AI and liver allocation are also complex: we attempted to provide sufficient information about AI without overloading readers, however, this is likely to be a survey of relatively intuitive responses rather than considered opinions. Further work assessing clinician perceptions of AI in liver allocation would also provide valuable information for the implementation of this technology (for example, it may indicate whether clinicians are likely to follow the allocation indicated by an AI). One recent study suggests US transplant centre clinicians are relatively optimistic about this use of AI [44].

Conclusion

This paper opens the door to further discussion and investigation in a relatively unexplored area of AI ethics. Complex algorithms are becoming more common in resource allocation, but the benefits of these must be further appraised and traded off with the potential concerns. One prior concern was whether the public would accept or reject this use of this technology. Our sample of UK participants appeared to support the use of AI in liver allocation and the majority were no less likely to donate their organs if AI was used. Interestingly, participants found accuracy to be a more important characteristic in allocation, compared to interpretability and empathy, which also may favour AI. These findings, if confirmed, would give democratic legitimacy to the use of AI in liver allocation, and mitigate concerns that donation rates could be adversely affected.

Additionally, our participants were open to either top-down programming with explicit values embedded into AI, or bottom-up programming utilising machine learning from human decision making. Regardless of whether livers were allocated by a transplant committee or AI, participants thought urgency, survival life years gained, and age were the most relevant factors to be considered, which is consistent with previous ethical analysis and current allocation policies. Further work is required to assess how these should be traded off, as well as to appraise some of the more contentious factors. Our findings on specific ethical concerns, also identify avenues for improving the way AI could be implemented.

The field of AI is moving rapidly. Since the time of this survey, large language models such as ChatGPT have gained enormous popularity and have become the topic of much debate. These models could feasibly be asked to choose between transplant patients using the methods described in this paper: by giving a set of ethical values and weightings (top-down) or by asking the model to summarise previous decisions (bottom-up). Clearly, this would require more rigorous testing, however it is apparent that we can no longer speak of this technology in hypothetical terms: AI could allocate livers. Although we have not concluded that AI should be used in high-stakes decision-making areas such as liver allocation, our study may help inform debate on this important question.

Data availability

The datasets used during the current study are available from the corresponding author on reasonable request.

Notes

  1. This has already been implemented for lung allocation in March 2023, but is still in development for liver allocation [24].

  2. There may be two concerns here: the idea that missed nuances may affect decision accuracy, and a perceived lack of patient-centredness.

  3. Other factors which were also deemed relevant such as quality of life are also consistent with other proposed approaches to scarce resource allocation [67].

References

  1. Lechler RI, Sykes M, Thomson AW, Turka LA. Organ transplantation—how much of the promise has been realized? Nat Med. 2005;11(6):605–13. https://doi.org/10.1038/nm1251.

    Article  Google Scholar 

  2. Ayllón MD, Ciria R, Cruz-Ramírez M, Pérez‐Ortiz M, Gómez I, Valente R, et al. Validation of artificial neural networks as a methodology for donor‐recipient matching for liver transplantation. Liver Transpl. 2018;24(2):192–203. https://doi.org/10.1002/lt.24870.

  3. Veatch RM, Ross LF. Allocating organs. Transplantation Ethics. Washington DC: Georgetown University Press; 2015.

    Google Scholar 

  4. Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med. 2019;25(1):44–56. https://doi.org/10.1038/s41591-018-0300-7.

    Article  Google Scholar 

  5. Clement J, Maldonado AQ. Augmenting the Transplant Team with Artificial Intelligence: toward meaningful AI use in solid organ transplant. Front Immunol. 2021;12:2278. https://doi.org/10.3389/fimmu.2021.694222.

    Article  Google Scholar 

  6. Buchanan BG, Shortliffe EH. Rule-based expert systems: the MYCIN experiments of the Stanford Heuristic Programming Project. 1984.

  7. Grote T, Berens P. On the ethics of algorithmic decision-making in healthcare. J Med Ethics. 2020;46(3):205–11. https://doi.org/10.1136/medethics-2019-105586.

    Article  Google Scholar 

  8. Durán JM, Jongsma KR. Who is afraid of black box algorithms? on the epistemological and ethical basis of trust in medical AI. J Med Ethics. 2021;47(5):329–35. https://doi.org/10.1136/medethics-2020-106820.

    Google Scholar 

  9. Haenssle HA, Fink C, Schneiderbauer R, Toberer F, Buhl T, Blum A, et al. Man against machine: diagnostic performance of a deep learning convolutional neural network for dermoscopic melanoma recognition in comparison to 58 dermatologists. Ann Oncol. 2018;29(8):1836–42. https://doi.org/10.1093/annonc/mdy166.

    Article  Google Scholar 

  10. Nam JG, Park S, Hwang EJ, Lee JH, Jin K-N, Lim KY, et al. Development and validation of deep learning–based automatic detection algorithm for malignant pulmonary nodules on chest radiographs. Radiology. 2019;290(1):218–28. https://doi.org/10.1148/radiol.2018180237.

    Article  Google Scholar 

  11. Abràmoff MD, Lavin PT, Birch M, Shah N, Folk JC. Pivotal trial of an autonomous AI-based diagnostic system for detection of diabetic retinopathy in primary care offices. NPJ Digit Med. 2018;1(1):1–8. https://doi.org/10.1038/s41746-018-0040-6.

    Article  Google Scholar 

  12. Rueda J, Rodríguez JD, Jounou IP, Hortal-Carmona J, Ausín T, Rodríguez-Arias D. “Just” accuracy? procedural fairness demands explainability in AI-based medical resource allocations. AI & Society. 2022. https://doi.org/10.1007/s00146-022-01614-9.

  13. Freedman R, Borg JS, Sinnott-Armstrong W, Dickerson JP, Conitzer V. Adapting a kidney exchange algorithm to align with human values. Artif Intell. 2020;283:103261. https://doi.org/10.1016/j.artint.2020.103261.

    Article  Google Scholar 

  14. Lee E, Johnston CJ, Oniscu GC. The trials and tribulations of liver allocation. Transpl Int. 2020;33(11):1343–52. https://doi.org/10.1111/tri.13710.

    Article  Google Scholar 

  15. Klein KB, Stafinski TD, Menon D. Predicting survival after liver transplantation based on pre-transplant MELD score: a systematic review of the literature. PLoS ONE. 2013;8(12):e80661. https://doi.org/10.1371/journal.pone.0080661.

    Article  Google Scholar 

  16. Fink MA, Angus PW, Gow PJ, Berry SR, Wang BZ, Muralidharan V, et al. Liver transplant recipient selection: MELD vs. clinical judgment. Liver Transpl. 2005;11(6):621–6. https://doi.org/10.1002/lt.20428.

    Article  Google Scholar 

  17. Peloso A, Moeckli B, Delaune V, Oldani G, Andres A, Compagnon P. Artificial Intelligence: present and future potential for solid organ transplantation. Transpl Int. 2022;35:10640. https://doi.org/10.3389/ti.2022.10640.

    Article  Google Scholar 

  18. Godfrey EL, Malik TH, Lai JC, Mindikoglu AL, Galván NTN, Cotton RT et al. The decreasing predictive power of MELD in an era of changing etiology of liver disease. Am J Transplant. 2019;19(12):3299 – 307. https://doi.org/10.1111/ajt.15559.

  19. Briceño J, Calleja R, Hervás C. Artificial intelligence and liver transplantation: looking for the best donor-recipient pairing. Hepatobiliary & Pancreatic Diseases International; 2022. https://doi.org/10.1016/j.hbpd.2022.03.001.

  20. Shaw D, Neuberger J, Murphy P. Lessons from the german organ donation scandal. 2013;14(3):200–1. https://doi.org/10.1177/175114371301400.

  21. NHS Blood and Transplant. Deceased Donor Liver Distribution and Allocation - POL196. UK: National Health Service; 2023. Available from: https://www.odt.nhs.uk/transplantation/tools-policies-and-guidance/policies-and-guidance/. Accessed 14 June 2023.

  22. Attia A, Rowe IA, Harrison EM, Gordon-Walker T, Stutchfield BM. Implausible algorithm output in UK liver transplantation allocation scheme: importance of transparency. The Lancet. 2023;401(10380):911–2. https://doi.org/10.1016/S0140-6736(23)00114-9.

    Article  Google Scholar 

  23. Organ Procurement and Transplantation Network. Continuous distribution. Richmond, Virginia, USA: OPTN; 2022. Available from: https://optn.transplant.hrsa.gov/policies-bylaws/a-closer-look/continuous-distribution. Accessed 15 June 2023.

  24. Organ Procurement and Transplantation Network. Continuous distribution - lung. Richmond, Virginia USA. OPTN; 2021. Available from: https://optn.transplant.hrsa.gov/policies-bylaws/a-closer-look/continuous-distribution/continuous-distribution-lung/. Accessed 15 June 2023.

  25. Ivanics T, Abreu P, De Martin E, Sapisochin G. Changing trends in Liver Transplantation: challenges and solutions. Transplantation. 2021;105(4):743–56. https://doi.org/10.1097/tp.0000000000003454.

    Article  Google Scholar 

  26. Cucchetti A, Vivarelli M, Heaton ND, Phillips S, Piscaglia F, Bolondi L, et al. Artificial neural network is superior to MELD in predicting mortality of patients with end-stage Liver Disease. Gut. 2007;56(2):253–8. https://doi.org/10.1136/gut.2005.084434.

    Article  Google Scholar 

  27. Bertsimas D, Kung J, Trichakis N, Wang Y, Hirose R, Vagefi PA. Development and validation of an optimized prediction of mortality for candidates awaiting liver transplantation. Am J Transplant. 2019;19(4):1109–18. https://doi.org/10.1111/ajt.15172.

    Article  Google Scholar 

  28. Cruz-Ramírez M, Hervás-Martínez C, Fernández JC, Briceño J, de la Mata M. Predicting patient survival after liver transplantation using evolutionary multi-objective artificial neural networks. Artificial Intelligence in Medicine. 2013;58(1):37–49. https://doi.org/10.1016/j.artmed.2013.02.004.

  29. Briceño J, Cruz-Ramírez M, Prieto M, Navasa M, De Urbina JO, Orti R, et al. Use of artificial intelligence as an innovative donor-recipient matching model for liver transplantation: results from a multicenter Spanish study. J Hepatol. 2014;61(5):1020–8. https://doi.org/10.1016/j.jhep.2014.05.039.

    Article  Google Scholar 

  30. Lau L, Kankanige Y, Rubinstein B, Jones R, Christophi C, Muralidharan V, et al. Machine-learning algorithms predict graft failure after liver transplantation. Transplantation. 2017;101(4):e125. https://doi.org/10.1097/TP.0000000000001600.

    Article  Google Scholar 

  31. Matis S, Doyle H, Marino I, Mural R, Uberbacher E, editors. Use of neural networks for prediction of graft failure following liver transplantation. Proceedings Eighth IEEE Symposium on Computer-Based Medical Systems; 1995: IEEE. https://doi.org/10.1109/CBMS.1995.465437.

  32. Ershoff BD, Lee CK, Wray CL, Agopian VG, Urban G, Baldi P et al. Training and validation of deep neural networks for the prediction of 90-day post-liver transplant mortality using UNOS registry data. Transplantation Proceedings. 2020;52(1):246 – 58. https://doi.org/10.1016/j.transproceed.2019.10.019.

  33. Haydon GH, Hiltunen Y, Lucey MR, Collett D, Gunson B, Murphy N, et al. Self-organizing maps can determine outcome and match recipients and donors at orthotopic liver transplantation. Transplantation. 2005;79(2):213–8. https://doi.org/10.1097/01.tp.0000146193.02231.e2.

    Article  Google Scholar 

  34. Hoot N, Aronsky D. Using Bayesian networks to predict survival of liver transplant patients. In: AMIA annual symposium proceedings 2005 (Vol. 2005, p. 345). American Medical Informatics Association.

  35. Khosravi B, Pourahmad S, Bahreini A, Nikeghbalian S, Mehrdad G. Five years survival of patients after liver transplantation and its effective factors by neural network and cox proportional hazard regression models. Hepat Monthly. 2015;15(9). https://doi.org/10.5812/hepatmon.25164.

  36. Dorado-Moreno M, Pérez-Ortiz M, Gutiérrez PA, Ciria R, Briceno J, Hervás-Martínez C. Dynamically weighted evolutionary ordinal neural network for solving an imbalanced liver transplantation problem. Artif Intell Med. 2017;77:1–11. https://doi.org/10.1016/j.artmed.2017.02.004.

    Article  Google Scholar 

  37. Zhang M, Yin F, Chen B, Li B, Li YP, Yan LN, et al. Mortality risk after liver transplantation in hepatocellular carcinoma recipients: a nonlinear predictive model. Surgery. 2012;151(6):889–97. https://doi.org/10.1016/j.surg.2011.12.034.

    Article  Google Scholar 

  38. Schaubel DE, Guidinger MK, Biggins SW, Kalbfleisch JD, Pomfret EA, Sharma P et al. Survival Benefit-Based Deceased-Donor Liver Allocation. Am J Transplant. 2009;9(4p2):970 – 81. https://doi.org/10.1111/j.1600-6143.2009.02571.x.

  39. Dancs PT, Saner FH, Benkö T, Molmenti EP, Büchter M, Paul A, et al. Balancing outcome vs. urgency in Modern Liver Transplantation. Front Surg. 2022;9. https://doi.org/10.3389/fsurg.2022.853727.

  40. Santos HGD, Zampieri FG, Normilio-Silva K, Silva GTD, Lima ACP, Cavalcanti AB, et al. Machine learning to predict 30-day quality-adjusted survival in critically ill patients with cancer. J Crit Care. 2020;55:73–8. https://doi.org/10.1016/j.jcrc.2019.10.015.

    Article  Google Scholar 

  41. Sherman M, Greenbaum D. Ethics of AI in transplant matching: is it better or just more of the same? Am J Bioeth. 2019;19(11):45–7. https://doi.org/10.1080/15265161.2019.1665734.

    Article  Google Scholar 

  42. Dalton-Brown S. The ethics of medical AI and the physician-patient relationship. Camb Q Healthc Ethics. 2020;29(1):115–21. https://doi.org/10.1017/S0963180119000847.

    Article  Google Scholar 

  43. He J, Baxter SL, Xu J, Xu J, Zhou X, Zhang K. The practical implementation of artificial intelligence technologies in medicine. Nat Med. 2019;25(1):30–6. https://doi.org/10.1038/s41591-018-0307-0.

    Article  Google Scholar 

  44. Strauss AT, Sidoti CN, Sung HC, Jain VS, Lehmann H, Purnell TS, et al. Artificial intelligence-based clinical decision support for liver transplant evaluation and considerations about fairness: a qualitative study. Hepatol Commun. 2023;7(10):e0239. https://doi.org/10.1097/hc9.0000000000000239.

    Article  Google Scholar 

  45. Mishra A, Savulescu J, Giubilini A. The Ethics of Medical AI. In: Véliz C, editor. The Oxford Handbook of Digital Ethics. Oxford Academic; 2022. https://doi.org/10.1093/oxfordhb/9780198857815.013.25.

  46. Parikh RB, Teeple S, Navathe AS. Addressing bias in artificial intelligence in health care. JAMA. 2019;322(24):2377–8. https://doi.org/10.1001/jama.2019.18058.

    Article  Google Scholar 

  47. Afnan MAM, Liu Y, Conitzer V, Rudin C, Mishra A, Savulescu J et al. Interpretable, not black-box, artificial intelligence should be used for embryo selection. Human Reproduction Open. 2021;2021(4). https://doi.org/10.1093/hropen/hoab040.

  48. Bjerring JC, Busch J. Artificial Intelligence and Patient-Centered Decision-Making. Philosophy & Technology. 2021;34(2):349–71. https://doi.org/10.1007/s13347-019-00391-6.

  49. Formosa P, Rogers W, Bankins S, Griep Y, Richards D. Medical AI and human dignity: contrasting perceptions of human and artificially intelligent (AI) decision making in diagnostic and medical resource allocation contexts. Comput Hum Behav. 2022:107296. https://doi.org/10.1016/j.chb.2022.107296.

  50. Longoni C, Bonezzi A, Morewedge CK. Resistance to medical artificial intelligence. J Consum Res. 2019;46(4):629–50. https://doi.org/10.1093/jcr/ucz013.

    Article  Google Scholar 

  51. Bigman YE, Gray K. People are averse to machines making moral decisions. Cognition. 2018;181:21–34. https://doi.org/10.1016/j.cognition.2018.08.003.

    Article  Google Scholar 

  52. Wu J, Xu L, Yu F, Peng K. Acceptance of medical treatment regimens provided by AI vs. Human. Appl Sci. 2021;12(1):110. https://doi.org/10.3390/app12010110.

    Article  Google Scholar 

  53. Promberger M, Baron J. Do patients trust computers? J Behav Decis Mak. 2006;19(5):455–68. https://doi.org/10.1002/bdm.542.

    Article  Google Scholar 

  54. Juravle G, Boudouraki A, Terziyska M, Rezlescu C. Trust in artificial intelligence for medical diagnoses. Prog Brain Res. Elsevier; 2020;253:263–82. https://doi.org/10.1016/bs.pbr.2020.06.006.

  55. Kramer MF, Schaich Borg J, Conitzer V, Sinnott-Armstrong W, editors. When do people want AI to make decisions? Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society; 2018. https://doi.org/10.1145/3278721.3278752.

  56. Jiang F, Jiang Y, Zhi H, Dong Y, Li H, Ma S, et al. Artificial intelligence in healthcare: past, present and future. Stroke Vasc Neurol. 2017;2(4):230–43. https://doi.org/10.1136/svn-2017-000101.

    Article  Google Scholar 

  57. The Transplantation Society of Australia and New Zealand. Clinical Guidelines for Organ Transplantation from Deceased Donors. TSANZ; 2021. Available from: https://tsanz.com.au/guidelinesethics-documents/organallocationguidelines.htm. Accessed 15 Mar 2023.

  58. Organ Procurement and Transplantation Network. Ethical Principles in the Allocation of Human Organs. Richmond, Virginia, USA: OPTN; 2015. Available from: https://optn.transplant.hrsa.gov/professionals/by-topic/ethical-considerations/ethical-principles-in-the-allocation-of-human-organs/. Accessed 15 June 2023.

  59. Demaree-Cotton J, Earp BD, Savulescu J. How to Use AI Ethically for Ethical Decision-Making. The American Journal of Bioethics. 2022;22(7):1–3. https://doi.org/10.1080/15265161.2022.2075968.

  60. Office for National Statistics. United Kingdom Census. 2023. Available from: https://www.ons.gov.uk/census. Accessed 15 Jul 2023.

  61. van der Veer SN, Riste L, Cheraghi-Sohi S, Phipps DL, Tully MP, Bozentko K et al. Trading off accuracy and explainability in AI decision-making: findings from 2 citizens’ juries. J Am Med Inf Assoc. 2021;28(10):2128-38. https://doi.org/10.1093/jamia/ocab127.

  62. Nussberger A-M, Luo L, Celis LE, Crockett MJ. Public attitudes value interpretability but prioritize accuracy in Artificial Intelligence. Nature Communications. 2022;13(1):5821. https://doi.org/10.1038/s41467-022-33417-3.

  63. Gogoll J, Uhl M. Rage against the machine: automation in the moral domain. J Behav Experimental Econ. 2018;74:97–103. https://doi.org/10.1016/j.socec.2018.04.003.

    Article  Google Scholar 

  64. Bigman Y, Gray K, Waytz A, Arnestad M, Wilson D. Algorithmic discrimination causes less moral outrage than human discrimination. 2020. https://doi.org/10.1037/xge0001250.

  65. Stoyanovich J, Van Bavel JJ, West TV. The imperative of interpretable machines. Nat Mach Intell. 2020;2(4):197–9. https://doi.org/10.1038/s42256-020-0171-8.

  66. Meier LJ, Hein A, Diepold K, Buyx A. Algorithms for Ethical Decision-Making in the Clinic: a Proof of Concept. The American Journal of Bioethics. 2022:1–17. https://doi.org/10.1080/15265161.2022.2040647.

  67. Savulescu J, Vergano M, Craxì L, Wilkinson D. An ethical algorithm for rationing life-sustaining treatment during the COVID-19 pandemic. Br J Anaesth. 2020;125(3):253–8. https://doi.org/10.1016/j.bja.2020.05.028.

    Article  Google Scholar 

  68. Mullinix KJ, Leeper TJ, Druckman JN, Freese J. The generalizability of survey experiments. J Experimental Political Sci. 2015;2(2):109–38. https://doi.org/10.1017/XPS.2015.19.

    Article  Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

This research was funded in part by the Wellcome Trust [203132/Z/16/Z] and by Monash University. The funders had no role in the preparation of this manuscript or the decision to submit for publication. For the purpose of open access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.

Author information

Authors and Affiliations

Authors

Contributions

MD performed the literature search, drafted the survey and formulated the paper. DW supervised the research, contributed to ethical analysis and survey design, and provided feedback on drafts. JS and JK provided ideas for the formulation of the paper and provided feedback on drafts. JD helped design the survey, performed the statistical analysis, and provided feedback on drafts. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Dominic Wilkinson.

Ethics declarations

Ethics approval and consent to participate

This project was reviewed and approved by the University of Oxford Central University Research Ethics Committee (R80692/RE003) as well as Monash University Ethics Committee (project number 34,555). Informed consent was obtained from participants. The study was conducted in accordance with the relevant guidelines and regulations.

Consent for publication

Not applicable.

Competing interests

JS is a Bioethics Committee consultant for Bayer and an Advisory Panel member for the Hevolution Foundation (2022-). JK receives funding from Ferring Pharmaceuticals. All other authors declare no conflicts of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Drezga-Kleiminger, M., Demaree-Cotton, J., Koplin, J. et al. Should AI allocate livers for transplant? Public attitudes and ethical considerations. BMC Med Ethics 24, 102 (2023). https://doi.org/10.1186/s12910-023-00983-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12910-023-00983-0

Keywords