Word associations contribute to machine learning in automatic scoring of degree of emotional tones in dream reports

doi:10.1016/j.concog.2011.08.003

Consciousness and Cognition

Volume 20, Issue 4, December 2011, Pages 1570-1576

https://doi.org/10.1016/j.concog.2011.08.003 Get rights and content

Abstract

Scientific study of dreams requires the most objective methods to reliably analyze dream content. In this context, artificial intelligence should prove useful for an automatic and non subjective scoring technique. Past research has utilized word search and emotional affiliation methods, to model and automatically match human judges’ scoring of dream report’s negative emotional tone. The current study added word associations to improve the model’s accuracy. Word associations were established using words’ frequency of co-occurrence with their defining words as found in a dictionary and an encyclopedia. It was hypothesized that this addition would facilitate the machine learning model and improve its predictability beyond those of previous models. With a sample of 458 dreams, this model demonstrated an improvement in accuracy from 59% to 63% (kappa = .485) on the negative emotional tone scale, and for the first time reached an accuracy of 77% (kappa = .520) on the positive scale.

Highlights

► Automatic analysis using machine learning can mimic human judgment of emotional tone in dreams. ► Both positive and negative tones can be predicted accurately. ► Word associations improve the accuracy of automatic dream scoring. ► Simple logistic regression is a sufficient model for machine learning.

Introduction

Scientific research on dreaming and dream content has strived to establish objective methods of collecting dreams and analyzing them. In order to limit the distortion and the loss of recall of dreaming activity, subjects are asked to immediately report their dreams upon awakening in laboratory or at home. The reports are then coded using a number of scales that have been developed (Winget & Kramer, 1979). The most comprehensive system was developed by Hall and Van de Castle (1966) which has eight dimensions ranging from characters to emotions. Domhoff (1996) has improved this system notably with DreamSAT which utilizes spreadsheets as means for data entry and statistical analyses (Domhoff, 2003a).

Such a scoring system relies on human judges’ reliability and inter-judge agreements as a requirement for scientific studies (for example, Barcaro, Cavallero, & Navona, 2005). As such, these studies may benefit from an automatic scoring technique. Our current research program attempts to develop an automatic scoring system that ultimately can match human judges’ and the dreamer’s own scoring. As a first step, we have focused on dream emotions. While it is well established that dream production is primarily a cognitive process (Cavallero and Foulkes, 1993, Foulkes, 1985), there is increased interest in the emotional component of dreams. Hartmann (2007) has gone so far as to propose that dream construction starts from central emotions. There are also some physiological studies that attempt to implicate emotions in REM sleep (Benca et al., 2000, Dang-Vu et al., 2007, Mancia, 2005). These observations are consistent with the typical prevalence of negative emotions in REM dreams (Roussy, Raymond, & De Koninck, 2000).

Emotions in dreams can be evaluated by human judges who read the transcript of dreams or by the dreamer. Studies have shown discrepancies between human judges and dreamer in estimating levels of emotions (Fosse, Stickgold, & Hobson, 2001). For example, dreamers attribute more emotions to their dreams than human judges. The Hall and Van de Castle system has five subscales: Happiness, Apprehension, Anger, Sadness and Confusion, which have been derived after extensive research (reviewed by Domhoff (2003b)). Given the complexity of these dimensions, however, we have chosen to start with two four-level scales to score the global positive and negative emotional tone of dreams in an attempt to model the human judge’s scoring, using machine learning. These scores are global measures of the degree of an emotion within a dream which may rely on the dynamics of emotions within the narrative.

In a first study, experienced dream researchers scored the emotional tone of 100 dreams. With disagreements rarely more than one level apart, it was found that the degree of agreement for negative sentiment of dream reports was 81% (MSE = .19) and for positive sentiment it was 58% (MSE = .54) (Nadeau, Sabourin, De Koninck, Matwin, & Turney, 2006). Therefore, further work concentrated on the negative scale. An application called Linguistic Inquiry and Word Count (LIWC) was first used to identify words pertaining to expression of positive and negative emotions in each dream report. The frequency of such words was used to generate additional attributes using mathematical functions; log ratio, square and square root. These attributes were used to train a linear regression model to predict negative affect score of a human judge on a subset of all dream reports, and tested against the remaining dream reports. The model proved to be efficient with an accuracy of 48% (MSE = .608) (Nadeau et al., 2006). The use of individual words as attributes in modeling a dream report was in line with other dream content analysis research (Bulkeley & Domhoff, 2010). In a second study, Razavi et al. (2008) recognized that a more accurate model could be implemented by including patterns of emotional fluctuation; deemed a good source of attributes as per Hartmann’s (2007) theory of emotionally driven dream. As such, attributes of emotional fluctuations, such as frequency of rises/falls of emotions, length of rise/falls, and maximum and minimum, were quantified and used in the model building. These attributes were recognized as Emotion Progression attributes. Finally in the same study, it was suspected that words may be linked to other sources (Barcaro et al., 2005) and to other words through proximal co-occurrence. Hence, the number of words separating any two words was used as a measure of their semantic closeness. In this manner, a new set of attributes was generated and used in this study. In conjunction with all these extracted attributes, the dreamer’s own report of Joy, Fear and Anxiety, and Hall and Van de Castle’s dimensions of Anger, Apprehension, Happiness, Sadness and Confusion were included. Where Hall and Van de Castle recognized the presence or absence of these dimensions, here the dreamers were asked to report the emotional intensities on a 4 level scale. In this experiment, the model produced an accuracy of 59% for negative sentiment. Matwin, Razavi, De Koninck, and Amini (2010) took one step further by incorporating co-occurrence vectors and advance machine learning techniques to achieve an agreement of 64% on the negative scale.

In the present study, we improve upon Razavi et al.’s (2008) model by replacing their word co-occurrence vectors (established using the dream reports under analysis) with word associations (established from external source). This is based on the notion that associative memory is used by dreamers when they report, and by human judges when they read and score for emotional tone. In this process it is assumed that there is communication of meaning through written words. This shared meaning exhibits itself in part in the network of word associations. Silberman, Bentin, and Miikkulainen (2007) suggest that these associations form over our life time, as semantic and episodic factors coincide. These associations can be observed to be syntagmatically structured in childhood and paradigmatically structured in adulthood (Gomes, 1995). In analyzing dream content of adults, the network of word associations can be built using words expressed in dictionary and encyclopedia definitions, as a whole, in order to better approximate paradigmatic word association. Alternatively, sequential proximity of words in definitions would better approximate syntagmatical structure which would benefit analysis of dream content for children. The frequency of word-coincidences can be a good basis for word association, although high frequencies do not necessarily mean high word association as suggested in the Search of Associative Memory (SAM) Model of semantic memory (Nelson and McEvoy, 2000, Raaijmakers and Schiffrin, 1981). As such, in this study, we normalized these frequencies starting with the highly frequent words first.

Incorporation of word-association inherently demands a different data structure than those of the past. Recently, word search (Bulkeley & Domhoff, 2010), word-strings (Domhoff & Schneider, 2008) and bag of words (Nadeau et al., 2006) provided means to identify and count specific occurrences. With the assumption that there is a hierarchical structure in meaning (Murphy & Lassaline, 1997), these frequencies were then used for grouping words to form discrete categories (Bulkeley, 2009) or fuzzy categorical membership (Frantova & Bergler, 2009) which were used to measure specific aspects: personality, emotions, etc. (Hall & Van de Castle, 1966). These aspects are quantified by means of search, which have the strength of objectivity, but suffer from narrow scope of observation due to discretization. It is proposed that word-association can open up this scope for a greater perspective on dream content. One dream report, as represented by a network of associations, can be compared to another’s for similarities and differences using machine learning algorithms. In the former paradigm, one is required to search for specific words in order to quantify and compare frequencies. That approach is limited by its narrow scope; by focusing on a few variables, one has to ignore other potential variables. In the proposed paradigm, differences in the network of associations can be identified using machine learning, relinquishing the need to focus on specific items.

Therefore, it was hypothesized that, with the greater scope of and access to implicit information provided by word associations, the accuracy in automatic scoring of the emotional tone of dreams as assessed by human judges will improve. More specifically, it was predicted that the inclusion of associated words would increase the percentage of agreement between machine and human judges.

Section snippets

Participants

Dream reports were submitted by participants to the Normative Study of Dreams of Canadians, conducted at the University of Ottawa. Upon signing a consent form, participants were instructed to keep a journal of their daily events, and their dreams. The task terminated when 10 consecutive daily event questionnaires were completed or 2–4 dreams had been reported. A subset of 458 English dreams met the 50–800 word count limit, and were used for this study. These dreams were reported by 172 females

Results

For the face validity test, the associations of a sample of words to positive and negative Affect Vector were calculated using the described dot product method. It is clear from the charted results (Fig. 3), that the words we suspected to be affiliated to positive affect and negative affect, do indeed meet expectations. A thorough validation is warranted for future research.

Machine performance results showed that the machine–human agreement score was 62.5% (kappa = .485, MSE = .388) on the negative

Discussion

This paper demonstrated that positive and negative affect could be represented using a set of words as extracted from LIWC. Furthermore, with artificial intelligence as means for learning and handling large volumes of data, it is observed that word associations enhance automatic scoring of dream emotional tone. The contribution rests in enriching the dataset and in describing dream reports beyond a set of discrete words. As a result it has allowed for a more accurate emotional progression whose

References (27)

R.M. Benca et al.
Effects of amygdala lesions on sleep in rhesus monkeys
Brain Research
(2000)
K. Bulkeley
Seeking patterns in dream content: A systematic approach to word searches
Consciousness and Cognition
(2009)
G.W. Domhoff et al.
Studying dream content using the archive and search engine on DreamBank.net
Consciousness and Cognition
(2008)
U. Barcaro et al.
A method for recognizing and describing the links among dream sources
Dreaming
(2005)
K. Bulkeley et al.
Detecting meaning in dream reports: An extension of a word search approach
Dreaming
(2010)
C. Cavallero et al.
Dreams as cognitions
(1993)
T.T. Dang-Vu et al.
Neuroimaging of REM sleep dreaming
G.W. Domhoff
Finding meaning in dreams: A quantitative approach
(1996)
Domhoff, G. W. (2003a). A new resource for content analysis (pp. 95–105). Washington, DC, US: American Psychological...
G.W. Domhoff
The scientific study of dreams
(2003)

M.J. Fosse et al.

The mind in REM sleep: Reports of emotional experience

Sleep

(2001)

D. Foulkes

Dreaming: A cognitive-psychological analysis

(1985)

Frantova, E., & Bergler, S., (2009). Automatic emotion annotation of dream diaries. In Analyzing social media to...

Cited by (12)

Independent discussion sections for improving inferential reproducibility in published research
2019, British Journal of Anaesthesia
Citation Excerpt :
For example, natural language processing (NLP), a form of computational linguistics, is one strategy to measure the tone of a Discussion with relative objectivity. NLP and machine learning approaches to text analysis have been used (among numerous uses in clinical medicine and research) to identify emotional tone in dream reports,41 evaluate psychological states,42 classify emotional expression in suicide notes,43 and identify positive and negative emotional polarity.44 Qualitative analysis of text using NLP algorithms could allow comparisons of Discussions by study authors vs independent authors.
There is a reproducibility crisis in science. There are many potential contributors to replication failure in research across the translational continuum. In this perspective piece, we focus on the narrow topic of inferential reproducibility. Although replication of methods and results is necessary to demonstrate reproducibility, it is not sufficient. Also fundamental is consistent interpretation in the Discussion section. Current deficiencies in the Discussion sections of manuscripts might limit the inferential reproducibility of scientific research. Lack of contextualisation using systematic reviews, overinterpretation and misinterpretation of results, and insufficient acknowledgement of limitations are common problems in Discussion sections; these deficiencies can harm the translational process. Proposed solutions include eliminating or not reading Discussions, writing accompanying editorials, and post-publication review and comments; however, none of these solutions works very well. A second Discussion written by an independent author with appropriate expertise in research methodology is a new testable solution that could help probe inferential reproducibility, and address some deficiencies in primary Discussion sections.
Automatic gender detection of dream reports: A promising approach
2016, Consciousness and Cognition
Citation Excerpt :
Later on, an Automatic Analysis (AA) technique was developed and used to combine the processes of Linguistic Inquiry and Word Count (LIWC), progressions of LIWC categories and emotions, and the transformations of the data resulting from these processes. Instead of searching through text for specific target words, Amini, Sabourin, and De Koninck (2011a,b) have incorporated word associations. For example, “sadness” may also be associated with words like “crying”, “tears”, or “hurt”, and such inclusions have been shown to improve the automatic scoring this time for both the negative and positive emotional tones of dreams.
A computer program was developed in an attempt to differentiate the dreams of males from females. Hypothesized gender predictors were based on previous literature concerning both dream content and written language features. Dream reports from home-collected dream diaries of 100 male (144 dreams) and 100 female (144 dreams) adolescent Anglophones were matched for equal length. They were first scored with the Hall and Van de Castle (HVDC) scales and quantified using DreamSAT. Two male and two female undergraduate students were asked to read all dreams and predict the dreamer’s gender. They averaged a pairwise percent correct gender prediction of 75.8% (κ = 0.516), while the Automatic Analysis showed that the computer program’s accuracy was 74.5% (κ = 0.492), both of which were higher than chance of 50% (κ = 0.00). The prediction levels were maintained when dreams containing obvious gender identifiers were eliminated and integration of HVDC scales did not improve prediction.
A differentiating empirical linguistic analysis of dreamer activity in reports of EEG-controlled REM-dreams and hypnagogic hallucinations
2013, Consciousness and Cognition
Citation Excerpt :
We can investigate in how far characteristics of dream phenomenology associated with different brain states – such as REM sleep and hypnagogic hallucinations – can be distinguished with the help of subjects’ verbal dream reports. With regard to a variety of dream research questions, empirical linguistic analysis may close the gap between existing approaches to the analysis of dream phenomenology: It combines the advantages of the high objectivity of standardized or even automatic counting systems (e.g. Amini, Sabourin, & De Koninck, 2011; Bulkeley, 2009; Domhoff & Schneider, 2008) with the flexibility of human rating and scoring (e.g. Fosse, Stickgold, & Hobson, 2001) – a flexibility that is necessary when we are working with language in the context of subjective verbal reports. The empirical linguistic analysis of dream reports wants to advance the tools of formal analysis by establishing clear rules for the linguistic testing of research questions which until now appeared to be approachable only with content analytical methods (e.g. Hall & Van de Castle, 1966).
We present Activity Analysis as a new method for the quantification of subjective reports of altered states of consciousness with regard to the indicated level of simulated motor activity. Empirical linguistic activity analysis was conducted with dream reports conceived immediately after EEG-controlled periods of hypnagogic hallucinations and REM-sleep in the sleep laboratory. Reports of REM-dreams exhibited a significantly higher level of simulated physical dreamer activity, while hypnagogic hallucinations appear to be experienced mostly from the point of passive observer. This study lays the groundwork for clinical research on the level of simulated activity in pathologically altered states of subjective experience, for example in the REM-dreams of clinically depressed patients, or in intrusions and dreams of patients diagnosed with PTSD.
Automatic Annotation of Dream Report’s Emotional Content with Large Language Models
2024, CLPsych 2024 - 9th Workshop on Computational Linguistics and Clinical Psychology, Proceedings of the Workshop
Automated Analysis of Dream Sentiment—The Royal Road to Dream Dynamics?
2022, Dreaming
Relevance in Web search: between content, authority and popularity
2022, Quality and Quantity

View all citing articles on Scopus

¹: Fax: +1 613 562 5147.

View full text

Word associations contribute to machine learning in automatic scoring of degree of emotional tones in dream reports

Abstract

Highlights

Introduction

Section snippets

Participants

Results

Discussion

Brain Research

Consciousness and Cognition

Consciousness and Cognition

A method for recognizing and describing the links among dream sources

Dreaming

Detecting meaning in dream reports: An extension of a word search approach

Dreaming

Dreams as cognitions

Neuroimaging of REM sleep dreaming

Finding meaning in dreams: A quantitative approach

The scientific study of dreams

The mind in REM sleep: Reports of emotional experience

Sleep

Dreaming: A cognitive-psychological analysis