Mental Health Records (MHRs) contain free-text documentation about patients’ suicide and suicidality. In this paper, we address the problem of determining whether grammatical variants (inflections) of the word “suicide” are affirmed or negated. To achieve this, we populate and annotate a dataset with over 6,000 sentences originating from a large repository of MHRs. The resulting dataset has high Inter-Annotator Agreement (κ = 0.93). Furthermore, we develop and propose a negation detection method that leverages syntactic features of text. Using parse trees, we build a set of basic rules that rely on minimal domain knowledge and render the problem as binary classification (affirmed vs. negated). Since the overall goal is to identify patients who are expected to be at high risk of suicide, we focus on the evaluation of positive (affirmed) cases as determined by our classifier. Our negation detection approach yields a recall (sensitivity) of 94.6% for the positive cases and an overall accuracy of 91.9%. We believe that our approach can be integrated with other clinical Natural Language Processing tools to further advance information extraction capabilities.
1. Don’t Let Notes Be Misunderstood: A Negation Detection Method for Assessing Risk of Suicide in Mental Health Records
George Gkotsis, Sumithra Velupillai, Anika Oellrich
Harry Dean, Maria Liakata and Rina Dutta
Biomedical Research Centre Nucleus – King’s College London
2. e-HOST-IT
Electronic health records to predict
HOspitalised Suicide attempts:
Targeting Information Technology solutions
Aim
To determine whether structured and free-text data in Electronic Health Records (EHRs) can be used to quantify changes in symptoms, behaviour patterns and health service utilisation, and to predict serious suicide attempts
4. The mysterious case of health records (1/2)
• Health records contain many structured fields, such as:
• Personal information/contact
• Diagnosis
• Prescription
• Interventions
• Scans & measurements
• Most of the structured fields are left blank
Max Weber, Theory of Formal Rationality
5. The mysterious case of health records (2/2)
• Free text contains a lot of information
• Traditional information access technology returns many false positives
Example
1. Patient is suicidal
2. Patient is not suicidal
• Meaning can be expressed in multiple ways
Example
1. He has suicidal thoughts
2. He wants to end his life
3. She wants to kill herself
7. CRIS database
• 226,000 patients
• 18.6 million documents (Event)
• Suicide-related data
• 783,000 documents contain the word suicid*
• 111,000 patients
Anonymous Reviewer:
“Overall, I think the paper is well thought out and written, and I am envious of their access to such a large patient dataset”
9. Negation detection – definition (1/2)
“The determination of whether a finding or disease mentioned within narrative medical reports is present or absent”*
*NegEx, Chapman et al., Journal of Biomedical Informatics, 2001
11. Towards negation detection resolution
• Fundamental NLP task
• Reduced to identifying the scope of negation
Examples:
(+) No issues other than her indicating that she might commit
(-) He continues to deny any suicidal thoughts and is happy to come to the XXX for medical review tomorrow
12. State-of-the-art: Negex (1/2)
• Lexical-based approach
• Collection of negation cues/expressions
• Pseudo-negation expressions
• Termination cues for scope
• Search scope of 6 words surrounding the target keyword
• pyConTextNLP
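The window-based idea above can be sketched in a few lines of Python. This is a minimal illustration with an invented cue list, not the published NegEx lexicon or pyConTextNLP:

```python
import re

# Toy cue lists (assumptions for illustration; NegEx ships a much larger lexicon).
NEGATION_CUES = {"no", "not", "denies", "denied", "without"}
TERMINATION_CUES = {"but", "however", "although"}

def is_negated(sentence: str, target: str, window: int = 6) -> bool:
    """Flag `target` as negated if a negation cue appears within `window`
    tokens before it, unless a termination cue cuts off the scope first."""
    tokens = re.findall(r"\w+", sentence.lower())
    if target not in tokens:
        return False
    idx = tokens.index(target)  # first occurrence only, for brevity
    # Scan leftwards from the target, nearest token first.
    for tok in reversed(tokens[max(0, idx - window):idx]):
        if tok in TERMINATION_CUES:
            return False  # scope terminated before a cue was found
        if tok in NEGATION_CUES:
            return True
    return False

print(is_negated("Patient denies suicidal ideation", "suicidal"))  # True
print(is_negated("Patient is suicidal", "suicidal"))               # False
```

The fixed window is what makes the approach fast but brittle: cues outside the window, or negations whose syntactic scope does not match surface distance, are missed.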
13. State-of-the-art: DEEPEN (2/2)
• Wrapper over NegEx
• Applied over the (predicted to be) negated sentences
• Uses a dependency parse tree
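The dependency filtering DEEPEN layers on top of NegEx can be caricatured as follows. The edges are hand-encoded (head, dependent) pairs standing in for real parser output, and the attachment rule is a deliberate simplification, not the published algorithm:

```python
def cue_attached(edges, cue, target):
    """Keep a lexically-flagged negation only if the cue attaches to the
    target in the dependency parse: either the cue directly modifies the
    target, or cue and target hang off the same head word."""
    head_of = {dep: head for head, dep in edges}
    return head_of.get(cue) == target or head_of.get(cue) == head_of.get(target)

# "He is not suicidal" -- "not" attaches directly to the target.
edges_neg = [("suicidal", "He"), ("suicidal", "is"), ("suicidal", "not")]
print(cue_attached(edges_neg, "not", "suicidal"))  # True

# "Not all patients are suicidal" -- "Not" attaches to "all", not to the
# target, so the dependency check filters out this lexical hit.
edges_fp = [("suicidal", "patients"), ("patients", "all"), ("all", "Not"),
            ("suicidal", "are")]
print(cue_attached(edges_fp, "Not", "suicidal"))   # False
```

The point of the second example is that a purely lexical window would fire on both sentences; the parse is what separates them.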
14. “Negation’s Not Solved”*
• Optimizable, but not generalizable
• Annotation guidelines are different
• Spans considered can be nouns or whole phrases
• Amount of overlap allowed (or not)
*Wu et al., PLOS ONE, 2014
17. Dataset and annotation
• Class distribution: 2,941 positive vs. 3,125 negative sentences
• Generation
• Random sampling from SLAM Events of 6k sentences containing the word “suicid*”
• Annotation
• One expert annotated the complete corpus
• Another expert repeated the annotations for 25% of the sentences
κ = 0.93 (IAA = 97.9%)
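For context, Cohen’s κ compares the observed agreement between the two annotators with the agreement expected by chance. A minimal sketch on invented toy labels (not the paper’s annotations):

```python
from collections import Counter

def cohens_kappa(a, b):
    """Cohen's kappa for two annotators' label sequences of equal length."""
    n = len(a)
    # Raw (observed) agreement: fraction of items labelled identically.
    observed = sum(x == y for x, y in zip(a, b)) / n
    # Chance agreement: product of each annotator's marginal label rates.
    ca, cb = Counter(a), Counter(b)
    expected = sum(ca[k] * cb[k] for k in set(a) | set(b)) / (n * n)
    return (observed - expected) / (1 - expected)

ann1 = ["pos", "pos", "neg", "neg", "pos", "neg"]
ann2 = ["pos", "pos", "neg", "neg", "pos", "pos"]
print(cohens_kappa(ann1, ann2))  # ≈ 0.667
```

Note how κ (0.667 here) is lower than raw agreement (5/6 ≈ 83%), because chance agreement is discounted; the paper’s κ = 0.93 alongside IAA = 97.9% follows the same pattern.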
Limitations
• Linguistic focus
• Patient-agnostic
28. Discussion
• Corpus of 6k sentences from Mental Health Records
• Annotation of high quality
• Evaluation: focus on positive cases
• Parse trees
+ Require a minimal number of negation keywords
+ Further potential
• Statement extraction (subject-predicate-object)
• Temporal characteristics
• Degree of suicidality
- Expensive
- Error-prone for long sentences
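A toy version of a parse-tree rule of this flavour (illustrative only; the authors’ actual rule set is richer): call the target negated when a negation word occurs under the same verb phrase (VP) node that dominates it. Trees are hand-built nested tuples standing in for real constituency-parser output:

```python
def leaves(tree):
    """Collect the terminal words of a (label, child, ...) tuple tree."""
    if isinstance(tree, str):
        return [tree]
    _, *children = tree
    return [leaf for c in children for leaf in leaves(c)]

def negated_in_vp(tree, target, cues={"not", "no", "denies", "denied"}):
    """True if some VP node dominates both the target and a negation cue."""
    if isinstance(tree, str):
        return False
    label, *children = tree
    if label == "VP":
        words = leaves(tree)
        if target in words and cues & set(words):
            return True
    return any(negated_in_vp(c, target, cues) for c in children)

# "He denies suicidal thoughts" -- cue and target share a VP.
tree = ("S", ("NP", "He"),
             ("VP", "denies", ("NP", ("JJ", "suicidal"), ("NNS", "thoughts"))))
print(negated_in_vp(tree, "suicidal"))   # True

# "He is suicidal" -- no cue anywhere in the VP.
tree2 = ("S", ("NP", "He"), ("VP", "is", ("ADJP", "suicidal")))
print(negated_in_vp(tree2, "suicidal"))  # False
```

The “expensive” and “error-prone for long sentences” caveats above apply to obtaining the tree in the first place: parsing is slower than a lexical scan, and parser accuracy degrades on long clinical sentences.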
29. Future work (1/2)
Expand on expressions of suicidality
• Study how negation detection can be used to strengthen the predictive power of mental health records
• Ongoing cohort study (pupils with ASD)
• Large-scale study based on hospitalisation events
30. Future work (2/2)
• Evaluate our tool against other datasets/domains
• Consider a syntactic dependency parser (instead of a constituency-based one)
• spaCy
• SyntaxNet