Neural correlates of error-related learning deficits in individuals with psychopathy A. K. L. von Borries1,2*, I. A. Brazil2,3, B. H. Bulten2, J. K. Buitelaar1, R. J. Verkes1,2 and E. R. A. de Bruijn3 1 Radboud University Nijmegen Medical Centre, Donders Institute for Brain, Cognition and Behaviour, Department of Psychiatry, Nijmegen, The Netherlands 2 Pompestichting, Nijmegen, The Netherlands 3 Radboud University Nijmegen, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands Background. Psychopathy (PP) is associated with a performance deficit in a variety of stimulus–response and stimulus–reinforcement learning paradigms. We tested the hypothesis that failures in error monitoring underlie these learning deficits. Method. We measured electrophysiological correlates of error monitoring [error-related negativity (ERN)] during a probabilistic learning task in individuals with PP (n=13) and healthy matched control subjects (n=18). The task consisted of three graded learning conditions in which the amount of learning was manipulated by varying the degree to which the response was predictive of the value of the feedback (50, 80 and 100%). Results. Behaviourally, we found impaired learning and diminished accuracy in the group of individuals with PP. Amplitudes of the response ERN (rERN) were reduced. No differences in the feedback ERN (fERN) were found. Conclusions. The results are interpreted in terms of a deficit in initial rule learning and subsequent generalization of these rules to new stimuli. Negative feedback is adequately processed at a neural level but this information is not used to improve behaviour on subsequent trials. As learning is degraded, the process of error detection at the moment of the actual response is diminished. Therefore, the current study demonstrates that disturbed error-monitoring processes play a central role in the often reported learning deficits in individuals with PP. Received 27 April 2009 ; Revised 16 October 2009 ; Accepted 29 October 2009 ; First published online 9 December 2009 Key words : Behavioural adaptation, error-monitoring, error-related negativity, feedback learning, psychopathy, reinforcement learning. Introduction Individuals with psychopathy (PP) show little concern about the consequences of their actions for others and themselves. They often show poor planning skills and fail to avoid behaviours that have been punished previously (Hare, 1991). The latter is reflected in, for example, the amount and types of incidents occurring in clinical settings (Hildebrand, 2005) and in their poor response to treatment and the high relapse rates of criminal behaviour (D'Silva et al. 2004). In line with these observations, psychopathic individuals show performance deficits in different stimulus–response and stimulus–reinforcement learning situations. Cleckley (1976) found individuals with PP to have a reduced capacity to learn from experience. Other studies have demonstrated abnormally low levels of aversive learning (Flor et al. 2002), instrumental learning (Mitchell et al. 2006) and avoidance learning (Newman & Kosson, 1986 ; Blair et al. 2004). The latter is the process by which one learns that omitting a certain response will result in the termination or prevention of an aversive stimulus. Additionally, impairments in decision making to rewarding and punishing stimuli have been found (Blair et al. 2006). Furthermore, studies of post-error slowing, the phenomenon of slower response times (RTs) following erroneous trials, have shown that individuals with PP fail to utilize feedback to alter future responses (Newman, 1987). Finally, recent behavioural data from a probabilistic response–reversal task indicated that individuals with PP showed learning deficits in the reversal phase only, in which the earlier learned reinforcement contingencies were suddenly reversed (Budhani et al. 2006). These findings are mainly in line with the integrated emotion system (IES) interpretation of PP (Blair, 2005 ; Blair et al. 2005), which assumes orbitofrontal and * Address for correspondence : A. K. L. von Borries, M.Sc., Pompekliniek, PO Box 31435, 6503 CK Nijmegen, The Netherlands. (Email : k.v.borries@pompestichting.nl) Psychological Medicine (2010), 40, 1559–1568. f Cambridge University Press 2009 doi:10.1017/S0033291709992017 ORIGINAL ARTICLE amygdala abnormalities in PP. The model predicts individuals with PP to show deficits in both stimulus– reinforcement learning involving the amygdala and reversal learning served by orbitofrontal areas and the basal ganglia (Cools et al. 2002 ; Clarke et al. 2008). Importantly, the model would not predict deficits in stimulus–response learning, a process that relies on the posterior medial frontal cortex (pMFC), including the pre-supplementary motor area (pre-SMA) and the anterior cingulate (Carter et al. 1998). In our view, the above suggests that psychopathic individuals have difficulties in using negative feedback or error information to adapt their behaviour. Holroyd & Coles (2002) proposed the reinforcement learning (RL) theory of performance monitoring, which assumes that whenever outcomes are worse than expected, an error signal is conveyed from the basal ganglia to the anterior cingulate cortex (ACC). Upon arrival of this error signal in the ACC, the errorrelated negativity (ERN), an event-related potential (ERP) component measurable at the scalp, is generated (Dehaene et al. 1994 ; Carter et al. 1998 ; Holroyd et al. 1998 ; Holroyd & Coles, 2002). The ERN occurs not only when participants make errors but also when they receive feedback indicating that they gave an incorrect response (for an overview on ERN and performance monitoring, see Ullsperger & Falkenstein, 2004). The onset of the ERN coincides with response initiation (rERN; Gehring & Fencsik, 2001), or occurs 200 ms after the delivery of error feedback (fERN; Milter et al. 1997). The former reflects internal error signals, the latter external error signals. Studies have demonstrated that the ERN is generated at the first moment in time when the error can be detected (Holroyd & Coles, 2002; Nieuwenhuis et al. 2002). Thus, fERNs are elicited when the negative feedback itself was not, or was only partly, predicted by earlier events. This is, for example, the case when subjects are still learning the correct stimulus–response mapping by trial and error. However, as the system gradually learns the stimulus–response mapping, subjects will eventually be able to detect errors at the moment of response onset. At an electrophysiological level, this is reflected in the fERN 'propagating back in time' and 'becoming' an rERN. Consequently, while learning takes place, rERN amplitudes increase (Holroyd & Coles, 2002). Although several studies have investigated learning in individuals with psychopathic traits at a behavioural level, learning deficits in individuals diagnosed with PP have never been studied in relation to the underlying electrophysiological markers of performance or error monitoring. Until now, most studies either focused on individuals with behavioural patterns related to PP (Dikman & Allen, 2000 ; Hall et al. 2007) or investigated aspects of error monitoring unrelated to learning (Munro et al. 2007 ; Brazil et al. 2009). An investigation of reward and avoidance learning in low socialized individuals (a concept related to PP; Kosson & Newman, 1989) has shown diminished rERN amplitudes only in the punishment condition (Dikman & Allen, 2000). Another study demonstrated reduced rERN amplitudes in healthy individuals scoring high on externalizing psychopathology, a factor comparable to the behavioural deficit cluster in individuals with PP (Hall et al. 2007). Only two studies have investigated the rERN directly in individuals diagnosed with PP. Munro et al. (2007) used a neutral and an emotional choice– reaction task and found reduced rERNs in the emotional task only. Brazil et al. (2009) reported no differences in rERN amplitude between healthy controls and individuals with PP on a neutral task, but did demonstrate problems in the conscious evaluation and signalling of errors. Taken together, these studies point towards learning deficits associated with a failure to detect and use internal and external error signals. The present study was designed to examine the relationship between error monitoring and reinforcement learning in individuals diagnosed with PP, by investigating the rERN and fERN and the relationship between the two while learning progresses. To investigate this, a probabilistic learning task was used in which participants learned stimulus–response mappings based on feedback about their performance (trial-and-error learning; see, for example, Holroyd & Coles, 2002 ; Nieuwenhuis et al. 2002, 2005). A crucial aspect of the task is that the imperative stimulus presented on each trial differed in the degree to which the response was predictive of the value of the feedback (50, 80 and 100%). Compared with healthy controls, we expected individuals with PP to display learning difficulties, reflected behaviourally by reduced accuracy and electrophysiologically by smaller amplitudes of rERN, fERN and a slower propagation in time of the fERN to become an rERN. Method Participants Thirteen male violent offenders aged between 18 and 55 years (mean=37, S.D.=9.5 years) diagnosed with a psychopathy score of o26 according to the Hare Psychopathy Check List – Revised (PCL-R; Hare, 1991) were selected from the in-patient population of 1560 A. K. L. von Borries et al. a forensic psychiatric institute in The Netherlands# (mean PCL-R score=31, S.D.=3.4). Educational level was coded according to the Dutch educational system (1=primary education, 2=secondary education, 3= higher education; mean education patients=2.8, mean education controls=2.3). Eighteen healthy male controls matched for age (mean age=37, S.D.=6.5 years) and educational level and without a criminal record or a history of psychiatric disorders were recruited by advertisement. Participants in both groups were checked for drug use and for medical/neurological history. Exclusion criteria were : use of alcohol >3 units/day during the week preceding the experimental measure and use of alcohol within 24 h of the measurement ; use of cannabis or other illicit drugs within the week before measurement and use of psychotropic medication other than oxazepam during the 5 days before measurement ; use of oxazepam within 12 h before measurement ; smoking within 3 h before measurement ; history of trauma capitis, visual and auditive disorders, neurological disorders, first-degree relative with any relevant neurological disorders. The study was approved by the local Medical Ethical Committee and carried out in accordance with the Declaration of Helsinki. Task and procedure Participants received written information about the experiment and gave their written consent before being screened for psychiatric exclusion criteria by trained psychologists using the SCID-II (Groenestijn et al. 1999) and the M.I.N.I. (van Vliet et al. 2000). The psychiatric exclusion criteria included: depressive disorder, bipolar disorder, schizophrenia, schizoaffective disorder, schizophreniform disorder, delusional and other psychotic disorders, schizoid or schizotypal personality disorder, current alcohol and substance intoxication, first-degree relatives with DSM-IV Axis I schizophrenia or schizophreniform disorder. Participants performed the experimental task and received a financial reimbursement. Additionally, all subjects received a bonus earned during the experiment. Participants performed a probabilistic learning task requiring a two-choice decision to an imperative visual stimulus (Holroyd & Coles, 2002) (see Fig. 1). Following each response, a feedback stimulus representing reward information was presented, informing participants whether their response was correct (green dollar signs : +2 cents), incorrect (red dollar signs : x2 cents) or too late (a cherry ; x4 cents). 0 ms 500 ms 1000 ms 1500 ms Correct trial Incorrect trial Fig. 1. Trial details for a correct and an incorrect trial. Each trial started with the presentation of the imperative stimulus for 500 ms, a blank screen with a fixation cross (500 ms), the presentation of a feedback stimulus (500 ms), and then a blank screen with a fixation cross (500 ms). For each imperative stimulus, one of two buttons had to be pressed with the index finger (right or left). A response deadline (1000 ms) was used to ensure that participants made sufficient errors in the 100% easy learning condition. # The Pompestichting is a 'TBS clinic ' located in Nijmegen. TBS is a treatment measure on behalf of the state for people who have committed serious criminal offences in connection with having a mental disorder. TBS is not a punishment but an entrustment act for mentally disordered offenders (diminished responsibility). These court orders are an alternative to either long-term imprisonment or confinement in psychiatric hospital, with the aim of striking a balance between security, treatment and protection. Error-related learning deficits in psychopaths 1561 The amount of learning possible was manipulated in three different conditions (50, 80 and 100%) by varying the degree to which the response was predictive of the value of the feedback. For stimuli in the 50% control condition, the value of the feedback was uncorrelated with the selected response, making it impossible to learn stimulus–response mappings. In the 100% and 80% learning conditions, participants could learn the stimulus–response mappings to varying degrees. In each experimental block, participants were presented with a new set of six different stimuli (for task and stimulus details, see Nieuwenhuis et al. 2002, 2005), that is two for each condition. The two stimuli from the 100% condition mapped congruently to either the left or the right response button throughout the entire block. For two stimuli, feedback was delivered randomly (50% condition). Of the two remaining stimuli, one required a left button press in 80% ('80% valid') but a right button press in 20% of the trials ('80% invalid'), and vice versa for the other stimuli. Participants started with a bonus of E2.50 and were informed about the status of this bonus at the end of each block. The aim was to determine the financially most beneficial strategy by trial and error. First, participants completed a practice block of 100 trials followed by four experimental blocks of 300 trials each. The six stimuli in each block were presented randomly 50 times each (Holroyd & Coles, 2002 ; Nieuwenhuis et al. 2002, 2005). Fig. 1 depicts details of the duration of the trial, which are identical to previous studies using the same paradigm (Holroyd & Coles, 2002 ; Nieuwenhuis et al. 2002, 2005). Electrophysiological recording A QuickAmp amplifier (Brainproducts, Germany) with an ActiCap system holding 32 active electrodes was used for data acquisition. A electroencephalogram was recorded at a sampling rate of 500 Hz and referenced to the left ear, but was re-referenced offline to the average of both ears. Signals were filtered offline using a band-pass filter of 0.019–20 Hz. Data analysis Trials with RTs <150 ms or >700 ms were excluded from the analyses (6.06%, S.D.=5.44%; Nieuwenhuis et al. 2002, 2005). For the ERP analyses, single-trial epochs were extracted relative to the presentation of the feedback stimulus for the fERNs and relative to the response for the rERN. Single-trial electroencephalography (EEG) signals were corrected for electrooculography (EOG) artefacts (Gratton et al. 1983) and averaged for each subject and condition separately using a 200-ms pre-response/feedback baseline. In line with previous studies using the current paradigm (Holroyd & Coles, 2002; Nieuwenhuis et al. 2002, 2005), difference waves were created by subtracting the individual averages for correct responses/ feedback from the individual averages for incorrect responses/feedback. The rERN amplitude was defined as the most negative peak of the response-locked difference waves at electrode Cz in a window of 0– 200 ms (de Bruijn et al. 2007). For the fERN, a window of 200–400 ms (de Bruijn et al. 2004 ; Mars et al. 2004) on the feedback-locked difference waves was chosen. Analyses were conducted using repeated-measures general linear models (GLMs) with group (psychopaths, controls) as a between-subject factor and block half [first (BH1) and second (BH2)], block (1, 2, 3, 4) and condition as possible within-subject factors. Depending on the independent variable entered into the GLM, the number of levels for the factor condition varied. First, to test the validity of our design, all four levels (100%, 80% valid, 80% invalid, 50%) were entered. Second, to investigate learning processes in more detail, the two learning conditions (100% and 80%) were analysed by means of a repeated-measures GLM with group as a between-subject factor and BH1, BH2 and condition as within-subject factors. Because any response-locked error-related activity in the 50% condition is known to result from random fluctuations in the EEG signal (Nieuwenhuis et al. 2002, 2005) and learning cannot occur, we excluded this condition from the analyses. Note that for the rERN analyses the factor 'condition' includes the 80% condition but that no distinction is made between valid and invalid trials, as the actual validity of a trial in the 80% condition is unknown to the subject until the moment of feedback. Results Behavioural results Confirming the validity of our design, an overall analysis of condition (100%, 80% valid, 80% invalid and 50%) revealed that accuracy was highest in the 100% condition, followed by the 80% valid condition, and lowest in the 80% invalid condition [F(3, 27)= 86.0, p<0.001]. Accuracy in the 50% condition was around chance level (see Fig. 2). An analysis of the two learning conditions (100% and 80% valid) including block half revealed no overall group differences between psychopathic individuals and controls in accuracy [F(1, 29)=1.65, p=0.209]. However, the significant interaction between condition and group showed that, compared to 1562 A. K. L. von Borries et al. controls, psychopathic subjects were less accurate in the 100% condition but not in the 80% valid condition [F(1, 29)=6.90, p=0.014]. Planned comparisons by means of an independent t test confirmed this [twotailed t test 100%: t(29)=2.00, p=0.055 ; 80% valid : t(29)=0.449, p=0.657]. Accuracy was higher in the second block half than in the first [F(1, 29)=23.8, p<0.001] and this was the same for both groups [F(1, 29)=0.03, p=0.87]. The interaction between condition and block half revealed that the increase in accuracy with block half was more pronounced for the 100% condition (6.9%) than for the 80% valid condition [2.6%; F(1, 29)=14.9, p=0.001]. Most importantly, the three-way interaction between condition, block half and group showed a clear trend towards significance [F(1, 29)=4.05, p=0.054]. Psychopathic individuals show less increase in accuracy between block halves for the 100% condition compared to controls, but a steeper increase between block halves in the 80% valid condition (see Table 1 and Fig. 2). These effects were confirmed by planned independent t tests [two-tailed t test 100% 85 65 45 25 –7 –6 –5 –4 –3 –2 –1 0 –7 –6 –5 –4 –3 –2 –1 0 A cc ur ac y (% ) rE R N (μ V ) –7 –6 –5 –4 –3 –2 –1 0 –7 –6 –5 –4 –3 –2 –1 0 fE R N (μ V ) 100% 100% 80% 50% 100% 80% 50% 80% valid 50% 80% invalid 100% 80% valid 50% 80% invalid 100% 80% valid 50% 80% invalid 100% 80% valid 50% 80% invalid 85 65 45 25 Fig. 2. Behavioural accuracy for individuals with psychopathy (PP) and controls for each condition and the two block halves ( , block half 1 ; %, block half 2). Error bars indicate standard errors. Mean amplitudes are shown for the response error-related negativity (rERN) and feedback ERN (fERN), for each of the two groups, each condition and the two block halves. Error bars indicate standard errors. Table 1. Mean percentage correct responses (and standard deviations) for each group, condition and both block halves separately and across block halves (total) Condition Psychopathy group (n=13) Control group (n=18) Block half 1 Block half 2 Total Block half 1 Block half 2 Total 100% 69 (9) 75 (13) 72 (11) 74 (7) 82 (8) 79 (8) 80% valid 63 (8) 66 (9) 65 (9) 65 (7) 67 (11) 66 (9) 50% 49 (3) 49 (2) 49 (3) 52 (3) 49 (2) 51 (2) 80% invalid 39 (9) 29 (9) 34 (8) 40 (10) 29 (12) 35 (10) Error-related learning deficits in psychopaths 1563 BH1: t(29)=1.74, p=0.093; 100% BH2: t(29)=2.05, p= 0.049 ; 80% valid BH1: t(29)=0.804, p=0.428; t(29)= 0.136, p=0.892]. To examine acquisition and generalization of learning rules in the two learning conditions (100% and 80% valid), we investigated accuracy per block. Accuracy increased with each block [F(3, 27)=37.2, p<0.001 ; all contrasts : p<0.05] without an interaction between block and group [F(3, 27)=1.78, p=0.175]. Planned comparisons showed that individuals with PP had lower accuracy in the first block but not in the fourth [F(1, 29)=5.07, p=0.03, see Fig. 3]. ERP findings fERN In line with previous studies (Nieuwenhuis et al. 2002, 2005), comparison of fERN amplitudes between conditions revealed that amplitudes were largest in the 80% invalid condition, in which negative feedback was most unexpected, followed by the 50% condition, the 80% valid condition, and finally the 100% condition [F(3, 27)=7.97, p=0.001, all contrast p<0.05, see Figs 2 and 4]. For the fERN in the learning condition (80% valid, 80% invalid and 100%), we did not find any differences in fERN amplitudes between groups or block half, or any interaction between the two (all p's>0.10 ; see Figs 2 and 4 for mean amplitudes). rERN Comparison of rERN amplitudes revealed a main effect of condition [F(2, 28)=42.9, p<0.001, all contrast pf0.003]. Amplitudes were largest in the 100% condition, followed by the 80% condition, and almost absent in the 50% condition (see Figs 2 and 5). For the rERN in the learning conditions (80% and 100%), we found a main effect for group [F(1, 29)= 7.94, p=0.009] and a main effect for block half [F(1, 29)=8.50, p=0.007; see Figs 2 and 5]. The interaction between condition and block half revealed that amplitudes in the 100% condition were larger in BH2 than in BH1, but such a difference was present to a lesser extent or absent in the 80% condition [F(1, 29)= 9.03, p=0.005]. This was confirmed by means of a paired t test [two-tailed rERN100BH1 – rERN100BH2: t(30)=3.383, p=0.002 ; rERN80BH1 – rERN80BH2: t(30)=1.2, p=0.240]. The significant interaction between group and condition showed that, although amplitudes in the 80% condition did not differ between groups, subjects with PP displayed smaller amplitudes in the 100% condition [F(1, 29)=11.4, p=0.002]. Most importantly, the interaction between group and block half was significant [F(1, 29)=7.29, p=0.011], indicating that subjects with PP showed a smaller difference in amplitudes between BH1 and BH2 compared with control subjects. Finally, the three-way interaction between group, condition and block half was not significant [F(1, 29)=0.285, p=0.598]. Discussion The present study has revealed that individuals with PP showed lower accuracy in a reinforcement-learning paradigm. Furthermore, diminished rERN but normal fERN amplitudes were found in psychopathic individuals. The current study investigated the relationship between error-monitoring and learning in individuals with PP and healthy controls. At an electrophysiological level, psychopathic individuals showed similar responses as controls to negative external feedback, reflected in the fERN. However, individuals with PP did display problems in using this signal to optimize performance, which was reflected in both the behavioural and electrophysiological data. Behaviourally, patients showed reduced accuracy in the 100% learning condition but not in the 80% learning condition. Additionally, the PP group had a smaller increase in accuracy between block halves in the 100% learning condition and the accuracy rate analyses over blocks demonstrated that individuals with PP had specific problems in the initial learning phase in the first block, but not in the later blocks. Importantly, diminished learning was also associated with the compromised propagation of the fERN to become an rERN. This was mainly reflected in a diminished increase in rERN amplitudes while learning progressed. Behavioural findings To master the present task, subjects have to learn the rules and apply them to new pictures in subsequent 80 60 40 20 0 A cc ur ac y (% ) Block 1 Block 2 Block 3 Block 4 Fig. 3. Average amount of correct responses (%) in the two learning conditions (100% and 80% valid) for control (%) and psychopathic individuals ( , PP), separately for each block. Error bars indicate standard errors. 1564 A. K. L. von Borries et al. blocks. Therefore, accuracy is expected to be low in the initial learning phase (the first block) but to increase rapidly during the generalization process (later blocks). Although this pattern was found in both groups, individuals with PP showed diminished accuracy during the first block, suggesting a deficit in initial rule learning. Similar accuracy levels in the last block suggest that psychopathic individuals do reach the same performance level as healthy controls but need more time to do so. Of note, differences in accuracy were only found in the easiest learning condition and not for the more –5 –4 –3 –2 –1 0 1 2 3 –5 –4 –3 –2 –1 0 1 2 3 –5 –4 –3 –2 –1 0 1 2 3 –5 –4 –3 –2 –1 0 1 2 3 0 200 400 600 0 200 400 600 0 200 400 600 0 200 400 600 (ms) (ms) (ms) (ms) Feedback onset C z (μ V ) C z (μ V ) 100% 80% valid 80% invalid 50% fERN fERN Fig. 4. Grand average feedback error-related negativity (fERN) difference waves (incorrect feedback minus correct feedback) for the control group (-) and the psychopathy ( , PP) group for electrode site Cz and all four conditions (100%, 80% valid, 50% and 80% invalid). Feedback was given at 0 ms ; the fERN is indicated by the arrow. –5 –4 –3 –2 –1 0 1 2 3 0 200 400 C z (μ V ) 100% –5 –4 –3 –2 –1 0 1 2 3 0 200 400 80% –5 –4 –3 –2 –1 0 1 2 3 0 200 400 50% (ms)(ms)(ms) Response onset rERN rERN Fig. 5. Grand average event-related potential (ERP) difference waves (incorrect responses minus correct responses) for the 100%, 80% and 50% condition for the control group (-) and the psychopathy ( , PP) group at electrode Cz. Responses were given at 0 ms ; feedback error-related negativity (fERN) is indicated by an arrow. Error-related learning deficits in psychopaths 1565 difficult 80% condition. One explanation for this finding is based on the so-called low-fear hypothesis of PP (Lykken, 1957), which assumes that psychopathic individuals are insensitive to punishment because of a low level of fear. Furthermore, some studies suggest that punishment-based learning is more impaired in PP than reward-based learning (Blair et al. 2004). If we assume that subjects with PP are impaired in learning based on (negative) feedback, subjects with PP will use substantially less trials to learn from than control subjects in the 100% condition. This then leads to a greater degree of uncertainty, which in turn leads to less accurate responding. In the 80% condition, however, accuracy does not depend solely on the amount of feedback information used. In this condition accuracy increases if the subject reacts as if this was a 100% condition, ignoring the 20% invalid unpredictable trials. Performance thus depends on how many valid trials are processed as useful information and how much of the invalid information is ignored. Therefore, it does not depend on the total amount of feedback information used but on the proportion of valid versus invalid feedback that is used to learn the rule. This is not affected in individuals with PP, which explains why they show the same levels of accuracy in this condition. Impaired learning under conditions of reward and punishment in psychopathic individuals has been shown before. For example, psychopathic individuals showed impairments in passive avoidance learning (Newman & Kosson, 1986; Blair et al. 2004) and on a differential reward/punishment task (Blair et al. 2006). Contrary to the present results, Budhani et al. (2006) found no acquisition problems in psychopathic individuals during the initial learning phase of a probabilistic response-reversal task. However, some important differences between the response-reversal task by Budhani et al. (2006) and the present task exist that may explain the different outcomes. First, the current task involved more complex learning material because we included three different reinforcement contingencies whereas Budhani et al. (2006) included only two. Second, the total number of stimulus–response associations to be learned in our study was 24. In the response-reversal task of Budhani et al. (2006), only six stimuli had to be associated with a response. Third, their task had no RT restriction whereas the present study used a deadline of 1000 ms. It seems plausible that these differences in complexity largely account for the divergent findings of the two studies. Moreover, the differences demonstrate that possible impairments in PP may only become evident in more complex situations and might be missed in less demanding tasks. Electrophysiological findings According to the RL theory (Holroyd & Coles, 2002), the fERN elicited by negative feedback is used to update and learn the earliest predictor of punishment. The error signal is carried to the pMFC, where it is used as a reinforcement-learning signal, guiding the adaptation of behaviour. Although individuals with PP show intact processing of external negative feedback at an electrophysiological level, they do not seem to use the error signal to optimally form an internal template of the rules (stimulus–response mappings) at hand. For an rERN to occur, detection of a mismatch between expected and real outcome has to take place (Holroyd & Coles, 2002). Prerequisite for this is an internal template of the rules to which the current behaviour can be compared. As no internal template is formed, a comparison between real and expected outcome cannot be made and hence learning, reflected in adaptive behaviour, is compromised. The reduced rERN amplitude thus reflects higher uncertainty due to diminished learning at an electrophysiological level (Pailing & Segalowitz, 2004). It has been demonstrated that the performance of individuals with PP in certain learning paradigms is modulated by reward but not by punishment (Blair et al. 2006). Additionally, it has been reported that low socialized individuals (a trait closely related to PP) show diminished rERNs under conditions of punishment but not reward (Dikman & Allen, 2000). With regard to the current task, individuals with PP might have learned based on reward cues, but not on punishment cues, which leads to diminished learning performance as only some of the trials (the rewarded but not the punished) are used to adapt behaviour. An earlier investigation of the rERN in individuals diagnosed with PP outside a learning context (Munro et al. 2007) reported no indications for diminished amplitudes. Although Brazil et al. (2009) replicated this finding at an electrophysiological level, their behavioural data demonstrated problems in error signalling in individuals with PP. This suggests that rERN amplitudes are only decreased in PP when related to explicit behavioural adaptations or learning processes but not in the context of simple error detection in a neutral task. Integration It is noteworthy that the currently found learning deficits in individuals with PP would not have been predicted by the IES (Blair, 2005 ; Blair et al. 2005) hypothesis of PP. The IES interpretation proposes that an underlying amygdala deficit (Kiehl et al. 2001 ; Blair, 2003 ; Pridmore et al. 2005) leads to impairments in 1566 A. K. L. von Borries et al. stimulus–reinforcement associations but not in stimulus–response associations in individuals with PP. However, although the amygdala plays a central role in the first process, other brain structures are involved in the second. Functional magnetic resonance imaging (fMRI) and ERP studies using similar paradigms to the current one have demonstrated an important role for the pMFC (including the ACC and pre-SMA; Holroyd et al. 2004 ; Mars et al. 2005) and the basal ganglia (Holroyd & Coles, 2002 ; Ullsperger & von Cramon, 2006) in learning from errors. Currently, the IES interpretation of PP does not include these processes and brain areas and hence does not allow for any specific predictions to be made. Therefore, we argue that, for a better understanding of the learning deficits in PP, neurocognitive models should also focus on the areas involved in the processing of internal and external error messages and the subsequent adaptation of behaviour. To summarize, our results indicate that learning from negative feedback is compromised in PP. These results are supported by both behavioural and electrophysiological data. Deviancies in error processing may play a crucial role in the learning deficiencies associated with PP. The IES interpretation of PP predicts deficits in certain forms of learning, but does not relate these deficits to the processing of errors. Furthermore, although the model includes aspects of stimulus–response learning and stimulus– reinforcement learning, aspects of internal and external error processing relevant to trial-and-error learning are not included. This differentiation between learning processes also fits with a more recent model of decision making proposed by Rushworth et al. (2007), in which the orbitofrontal cortex, the ACC and the amygdala are part of a neural network involved in learning, action monitoring and social behaviour. Our data suggest that extending the IES interpretation to include error monitoring and areas involved in error monitoring, in addition to more diverse forms of learning, may lead to a broader understanding of the relationship between learning and PP. Acknowledgements E.D.B. was supported by a VENI grant from the Netherlands Organization for Scientific Research (NWO) (451-07-022). Declaration of Interest None. References Blair KS, Morton J, Leonard A, Blair RJR (2006). Impaired decision-making on the basis of both reward and punishment information in individuals with psychopathy. Personality and Individual Differences 41, 155–165. Blair RJR (2003). Neurobiological basis of psychopathy. British Journal of Psychiatry 182, 5–7. Blair RJR (2005). Applying a cognitive neuroscience perspective to the disorder of psychopathy. Development and Psychopathology 17, 865–891. Blair RJR, Mitchell DGH, Blair KS (2005). The Psychopath : Emotion and the Brain. Blackwell Publishing : Oxford, UK. Blair RJR, Mitchell DGH, Leonard A, Budhani S, Peschardt KS, Newman C (2004). Passive avoidance learning in individuals with psychopathy : modulation by reward but not by punishment. Personality and Individual Differences 37, 1179–1192. Brazil IA, de Bruijn ERA, Bulten BH, von Borries AKL, van Lankveld JJM, Buitelaar JK, Verkes RJ (2009). Early and late components of error-monitoring in violent offenders with psychopathy. Biological Psychiatry 65, 137–143. Budhani S, Richell RA, Blair RJR (2006). Impaired reversal but intact acquisition : probabilistic response reversal deficits in adult individuals with psychopathy. Journal of Abnormal Psychology 115, 552–558. Carter CS, Braver TS, Barch DM, Botvinick MM, Noll D, Cohen JD (1998). Anterior cingulate cortex, error detection, and the online monitoring of performance. Science 28, 747. Clarke HF, Robbins TW, Roberts AC (2008). Lesions of the medial striatum in monkeys produce perseverative impairments during reversal learning similar to those produced by lesions in the orbitofrontal cortex. Journal of Neuroscience 28, 10972–10982. Cleckley H (1976). The Mask of Sanity, 5th edn. Mosby : St Louis. Cools R, Clark L, Owen AM, Robbins TW (2002). Defining the neural mechanisms of probabilistic reversal learning using event-related functional magnetic resonance imaging. Journal of Neuroscience 22, 4563–4567. de Bruijn ERA, Hulstijn W, Verkes RJ, Ruigt GS, Sabbe BG (2004). Drug-induced stimulation and suppression of action monitoring in healthy volunteers. Psychopharmacology 177, 151–160. de Bruijn ERA, Schubotz RU, Ullsperger M (2007). An event-related potential study on the observation of erroneous everyday actions. Cognitive, Affective, and Behavioral Neuroscience 4, 278–285. Dehaene S, Posner MI, Tucker DM (1994). Localization of a neural system for error detection and compensation. Psychological Science 5, 303–305. Dikman ZA, Allen JJB (2000). Error monitoring during reward and avoidance learning in highand low-socialized individuals. Psychophysiology 27, 43–54. D'Silva K, Duggan C, McCarthy L (2004). Does treatment really make psychopaths worse? A review of evidence. Journal of Personality Disorders 18, 163–177. Error-related learning deficits in psychopaths 1567 Flor H, Birbaumer N, Hermann C, Ziegler S, Patrick CJ (2002). Aversive Pavlovian conditioning in psychopaths : peripheral and central correlates. Psychophysiology 39, 505–518. Gehring WJ, Fencsik DE (2001). Function of the medial frontal cortex in the processing of conflict and errors. Journal of Neuroscience 21, 9430–9437. Gratton G, Coles MGH, Donchin E (1983). A new method for off-line removal of ocular artifacts. Electroencephalography and Clinical Neurophysiology 55, 468–484. Groenestijn MAC, Akkerhuis GW, Kupka RW, Schneider N, Nolen WA (1999). Structured Clinical Interview for DSM-IV Axis I Disorders (SCID-I) [in Dutch]. Swets Test Publishers : Lisse, The Netherlands. Hall JR, Bernat EM, Patrick CJ (2007). Externalizing psychopathology and the error-related negativity. Psychological Science 18, 326–333. Hare R (1991). The Hare Psychopathy Checklist – Revised. Multi-Health Systems : Toronto. Hildebrand M (2005). Summary of research report : Escapes during leave, escapes and recidivism during TBS treatment in 2000–2005. Expertisecentrum Forensische Psychiatrie. Holroyd CB, Coles MGH (2002). The neural basis of human error processing : reinforcement learning, dopamine, and the error-related negativity. Psychological Review 109, 679–709. Holroyd CB, Dien J, Coles MGH (1998). Error-related scalp potentials elicited by hand and foot movements : evidence for an output-independent error-processing system in humans. Neuroscience Letters 242, 65–68. Holroyd CB, Nieuwehuis S, Yeung N, Nystrom L, Mars RB, Coles MGH, Cohen JD (2004). Dorsal anterior cingulated cortex shows fMRI response to internal and external error signals. Nature Neuroscience 7, 497–498. Kiehl KA, Smith AM, Hare RD, Forster BB, Brink J, Liddle PF (2001). Limbic abnormalities in affective processing by criminal psychopaths as revealed by functional magnetic resonance imaging. Biological Psychiatry 50, 677–684. Kosson DS, Newman JP (1989). Socialization and attentional deficits under focusing and divided attention conditions. Journal of Personality and Social Psychology 57, 175–184. Lykken DT (1957). A study of anxiety in the sociopathic personality. Journal of Abnormal and Social Psychology 55, 6–10. Mars RB, Coles MGH, Grol MJ, Holroyd CB, Nieuwenhuis S, Hulstijn W, Toni I (2005). Neural dynamics of error processing in medial frontal cortex. NeuroImage 29, 1007–1013. Mars RB, de Bruijn ERA, Hulstijn W, Milter WHR, Coles MGH (2004). What if I told you : 'You were wrong '? Brain potentials and behavioral adjustments elicited by feedback in a time-estimation task. In Errors, Conflict and the Brain. Current Opinions on Performance Monitoring (ed. M. Ullsperger and M. Falkenstein), pp. 129–134. Sächsisches Digitaldruck Zentrum GmbH: Dresden, Germany. Miltner WHR, Braun CH, Coles MGH (1997). Event-related brain potentials following incorrect feedback in a time-estimation task : evidence for a 'generic ' neural system for error detection. Journal of Cognitive Neuroscience 9, 788–798. Mitchell DGV, Fine C, Richell RA, Newman C, Lumsden J, Blair KS, Blair RJR (2006). Instrumental learning and relearning in individuals with psychopathy and in patients with lesions involving the amygdala or orbitofrontal cortex. Neuropsychology 20, 280–289. Munro GE, Dywan J, Harris GT, McKee S, Unsal A, Segalowitz SJ (2007). ERN varies with degree of psychopathy in an emotion discrimination task. Biological Psychiatry 76, 31–42. Newman JP (1987). Reaction to punishment in extraverts and psychopaths : implications for the impulsive behavior of disinhibited individuals. Journal of Research in Personality 21, 464–480. Newman JP, Kosson, DS (1986). Passive avoidance learning in psychopathic and nonpsychopathic offenders. Journal of Abnormal Psychology 95, 252–256. Nieuwenhuis S, NielenMM,Mol N, Hajcak G, Veltman DJ (2005). Performance monitoring in obsessive-compulsive disorder. Psychiatry Research 134, 111–122. Nieuwenhuis S, Ridderinkhof KR, Talsma D, Coles MGH, Holroyd CB, Kok A, van der Molen MW (2002). A computational account of altered error processing in older age : dopamine and the error-related negativity. Cognitive, Affective, and Behavioral Neuroscience 2, 19–36. Pailing PE, Segalowitz SJ (2004). The effects of uncertainty in error monitoring on associated ERPs. Brain and Cognition 56, 215–233. Pridmore S, Chambers A, McArthur M (2005). Neuroimaging in psychopathy. Australian and New Zealand Journal of Psychiatry 39, 856–865. Rushworth MFS, Behrens TEJ, Rudebeck PH, Walton ME (2007). Contrasting roles for cingulate and orbitofrontal cortex in decisions and social behavior. Trends in Cognitive Sciences 11, 168–176. Ullsperger M, Falkenstein M (2004). Errors, Conflict and the Brain. Current Opinions on Performance Monitoring. MPI Special Issue. Sächsisches Digitaldruck Zentrum GmbH: Dresden, Germany. Ullsperger M, von Cramon DY (2006). The role of intact frontostriatal circuits in error processing. Journal of Cognitive Neuroscience 18, 651–664. van Vliet IM, Leroy H, van Megen HJGM (2000). M.I.N.I. International Neuropsychiatric Interview, Dutch Version 5.0.0 [in Dutch]. LUMC: Leiden. 1568 A. K. L. von Borries et al.