Extracting information
  from clinical notes

  H. Yang, I. Spasic, F. Sarafraz,
  John A. Keane, Goran Nenadic


     School of Computer Science
      University of Manchester
Motivation & aim
 Electronic clinical notes
    electronic medical/health records
    hospital discharge summaries
 Extract information on
    individual patients and their diseases
    clinical practice
      treatments, drugs used, etc.
 Aim: support data analytics
       e.g. monitoring quality
 Huge interest locally and internationally
Clinical notes
 Highly condensed text
    sometimes without proper sentences
    hospital discharge summaries are more structured
    list of medications, symptoms, etc.


 Terminological variability
    orthographic, acronyms, local conventions


 Various sections
    previous history, social/family background
NLP challenges in clinical data
 A series of international challenges in information
  extraction from clinical narratives
      organisers: Informatics for Integrating Biology & the
       Bedside (i2b2)

 3 shared tasks so far
   −   De-identification of medical records and identification of
       smokers from their clinical records (2007)
   −   Identification of obesity & related diseases in patients from
       hospital discharge documents (2008)
   −   Extraction of medications and related information from
       patients’ discharge documents (2009)

 2010 challenge
      concepts, assertions, relations
i2b2 2008
 Extract status of diseases in patients
       obesity, diabetes mellitus, hypercholesterolemia,
        hypertriglyceridemia, hypertension, heart failure (16 in total)
       status: yes, no, unmentioned, questionable
       on textual and “intuitive” level

 28 teams worldwide
       UoM ranked 1st in textual and 7th in intuitive

 Our methodology
       Term-based exact and approximate matching
       Context-based pattern- and rule-based matching
       Machine learning approach


Yang, H., Spasic, I., Keane, J., Nenadic, G.: A Text Mining Approach to the Prediction of a
Disease Status from Clinical Discharge Summaries, JAMIA 16(4):596-600
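
The term-based exact and approximate matching step can be sketched as follows. This is a minimal illustration, not the authors' implementation: the lexicon entries, the function name `match_term` and the 0.85 similarity cutoff are all assumptions, and Python's standard `difflib` stands in for whatever approximate matcher was actually used.

```python
import difflib

# Hypothetical mini-lexicon: canonical disease name -> known surface variants.
LEXICON = {
    "obesity": ["obesity", "obese"],
    "hypertension": ["hypertension", "htn", "high blood pressure"],
    "heart failure": ["heart failure", "chf", "congestive heart failure"],
}

# Reverse index from each lower-cased variant to its canonical disease name.
VARIANT_TO_DISEASE = {
    variant: disease
    for disease, variants in LEXICON.items()
    for variant in variants
}


def match_term(candidate, cutoff=0.85):
    """Return (disease, match_type) for a candidate phrase, or None."""
    key = candidate.lower().strip()
    # Exact lookup first: cheap and unambiguous.
    if key in VARIANT_TO_DISEASE:
        return VARIANT_TO_DISEASE[key], "exact"
    # Fall back to string similarity to absorb orthographic variation.
    close = difflib.get_close_matches(
        key, list(VARIANT_TO_DISEASE), n=1, cutoff=cutoff
    )
    if close:
        return VARIANT_TO_DISEASE[close[0]], "approximate"
    return None
```

For example, `match_term("CHF")` resolves through the exact index, while a misspelling such as "hypertention" is only caught by the similarity fallback. A high cutoff keeps short acronyms from matching each other by accident.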
Methodology
 Linguistic pre-processing
    section splitting, sentence splitting, chunking, POS tagging, parsing
 Information extraction (rules, machine learning)
    textual evidence extraction, section filtering, morphological
     clues (e.g. drug/disease name affixes)
 Constructing results
    template filling, filtering negative results, relations and heuristics:
     Organ : Symptom, Symptom : Disease, Disease : Drug,
     Drug : Mode of application
 Medical resources (used throughout)
    disease names, drug names, body parts, symptoms, abbreviations, synonyms
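
The "morphological clues" mentioned for the information extraction stage can be illustrated with a small suffix check. This is a sketch under stated assumptions: the suffix lists below are common drug-class and disease-name endings chosen for illustration, not the affix list used in the system, and `morphological_clue` is a hypothetical helper name.

```python
# Illustrative suffixes only (assumption), not the authors' actual affix list.
DRUG_SUFFIXES = ("olol", "pril", "statin", "sartan", "cillin")
DISEASE_SUFFIXES = ("itis", "emia", "osis", "pathy")


def morphological_clue(token):
    """Guess 'drug' or 'disease' from a token's suffix, or return None."""
    word = token.lower()
    # Skip short or non-alphabetic tokens, where a suffix is weak evidence.
    if not word.isalpha() or len(word) < 6:
        return None
    if word.endswith(DRUG_SUFFIXES):
        return "drug"
    if word.endswith(DISEASE_SUFFIXES):
        return "disease"
    return None
```

A clue of this kind is only a hint: it helps with names missing from the dictionaries, but it will also fire on ordinary words such as "diagnosis", so it would normally be combined with the dictionary and context evidence from the other stages.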
Rule-based IE
 Disease status patterns
 - context-based patterns
   [N] negative for CHF
   [Q] question of asthma
   [U] no known diagnosis of CAD
   [U] we should consider further asthma studies as an
   outpatient

 - semantics-based patterns
   [N] normal coronaries
   [N] a thin black man

 Clinical resources used in sentence extraction
    clinical inference rules, e.g. weight > 90 kg,
     LDL > 160 mg/dl, HDL < 35 mg/dl
    medications, e.g. ‘anti-depressant’
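
The context-based patterns and the numeric inference rules above can be sketched as follows. The regular expressions are hypothetical and modelled only on the slide's examples. The fallback to "Y" for an uncued mention is an assumption. The thresholds are the ones quoted on the slide, and since the slide does not say which disease each implies, the sketch only reports which rules fire.

```python
import re

# Hypothetical patterns modelled on the slide's examples; {d} is the disease.
# Labels: N = no, Q = questionable, U = unmentioned (Y = yes is the fallback).
STATUS_PATTERNS = [
    ("U", r"\bno known diagnosis of {d}\b"),
    ("U", r"\bconsider further {d} studies\b"),
    ("N", r"\bnegative for {d}\b"),
    ("Q", r"\bquestion of {d}\b"),
]


def disease_status(sentence, disease):
    """Return Y, N, Q or U for one disease mention in one sentence."""
    d = re.escape(disease)
    for label, template in STATUS_PATTERNS:
        if re.search(template.format(d=d), sentence, flags=re.IGNORECASE):
            return label
    # Assumption: a mention with no negating or hedging cue counts as "yes".
    if re.search(rf"\b{d}\b", sentence, flags=re.IGNORECASE):
        return "Y"
    return "U"


# Thresholds quoted on the slide, keyed by hypothetical measurement names.
INFERENCE_RULES = {
    "weight_kg": lambda value: value > 90,
    "ldl_mg_dl": lambda value: value > 160,
    "hdl_mg_dl": lambda value: value < 35,
}


def triggered_rules(measurements):
    """Return the sorted names of the inference rules a patient's values satisfy."""
    return sorted(
        name
        for name, rule in INFERENCE_RULES.items()
        if name in measurements and rule(measurements[name])
    )
```

Trying the cue patterns before the bare-mention fallback is what lets "negative for CHF" come out as N rather than Y.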
Textual Annotation Results

 Performance on Disease Status (Ranked 1st)
Micro-average: Accuracy (0.9723)
Macro-average: P (0.8482), R (0.7737), F-score (0.8052)

      #Eval   #Corr   #Gold   Precision   Recall   F-score
  Y   2267    2132    2192    0.9404      0.9726   0.9562
  N   56      40      65      0.7142      0.6153   0.6611
  Q   12      9       17      0.7500      0.5294   0.6206
  U   5709    5640    5770    0.9879      0.9774   0.9826
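
The averages can be recomputed from the per-class counts as a sanity check. The sketch below assumes that precision is #Corr/#Eval, recall is #Corr/#Gold, that the macro-average is the unweighted mean of the per-class scores, and that micro-averaged accuracy is total #Corr over total #Gold. Under those assumptions it agrees with the reported figures to within rounding. The function names are illustrative.

```python
# (#Eval, #Corr, #Gold) per status, copied from the textual annotation table.
COUNTS = {
    "Y": (2267, 2132, 2192),
    "N": (56, 40, 65),
    "Q": (12, 9, 17),
    "U": (5709, 5640, 5770),
}


def prf(evaluated, correct, gold):
    """Precision, recall and F-score for one class, with 0 for empty cases."""
    precision = correct / evaluated if evaluated else 0.0
    recall = correct / gold if gold else 0.0
    total = precision + recall
    f_score = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_score


def macro_average(counts):
    """Unweighted mean of per-class precision, recall and F-score."""
    scores = [prf(*row) for row in counts.values()]
    return tuple(sum(s[i] for s in scores) / len(scores) for i in range(3))


def micro_accuracy(counts):
    """Total correct decisions over total gold decisions."""
    return sum(r[1] for r in counts.values()) / sum(r[2] for r in counts.values())
```

Because the macro-average weights every class equally, the two rare classes (N and Q) pull it well below the micro-averaged accuracy, which is dominated by Y and U.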
Intuitive Annotation Results

 Performance on Disease Status (Ranked 7th)
Micro-average: Accuracy (0.9572)
Macro-average: P (0.6383), R (0.6294), F-score (0.6336)

      #Eval   #Corr   #Gold   Precision   Recall   F-score
  Y   2160    2068    2285    0.9574      0.9050   0.9304
  N   5236    5014    5100    0.9576      0.9831   0.9702
  Q   3       0       14      0           0        0
i2b2 2009
 Extract mentions of medication and related
  information
   drugs the patient takes
   dose, mode of application, frequency, duration, etc.
    (for each mention)
 19 teams worldwide
   UoM ranked 3rd
 Our approach was based on combining
   extensive dictionaries
   morphological and derivational patterns
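
Combining dictionaries with patterns can be sketched as a single regular expression built from small word lists. Everything here is illustrative: the three lists are hypothetical stand-ins for the extensive dictionaries, and the fixed field order (medication, dosage, mode, frequency) is a simplifying assumption that real discharge summaries often violate.

```python
import re

# Hypothetical mini-dictionaries; the real system used extensive ones.
DRUGS = ["aspirin", "lisinopril", "metformin"]
MODES = ["po", "iv", "orally", "topical"]
FREQUENCIES = ["twice daily", "daily", "bid", "tid", "qid"]


def _alternatives(terms):
    """Join dictionary terms into one escaped regex alternation."""
    return "|".join(re.escape(term) for term in terms)


# Medication name followed by optional dosage, mode and frequency, in that order.
MEDICATION_RE = re.compile(
    r"\b(?P<medication>" + _alternatives(DRUGS) + r")\b"
    r"(?:\s+(?P<dosage>\d+(?:\.\d+)?\s*(?:mg|mcg|g|ml|units?)\b))?"
    r"(?:\s+(?P<mode>" + _alternatives(MODES) + r")\b)?"
    r"(?:\s+(?P<frequency>" + _alternatives(FREQUENCIES) + r")\b)?",
    re.IGNORECASE,
)


def extract_medications(text):
    """Return one dict per medication mention; missing fields are None."""
    return [match.groupdict() for match in MEDICATION_RE.finditer(text)]
```

Each mention yields its own record, which mirrors the task's requirement that dose, mode and frequency are reported per mention rather than per drug.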
Evaluation (F-measure)

  Medication    83.59%
  Dosage        82.67%
  Frequency     83.49%
  Mode          85.33%
  Duration      51.00%
  Reason        38.81%

  All fields    78.47%

Spasić I, Sarafraz F, Keane JA, Nenadic G: “Medication Information Extraction
  with Linguistic Pattern Matching and Semantic Rules”, JAMIA (to appear)
Summary
 NLP and text mining techniques are useful for extraction
  of clinical data
  - disease status extraction: 96-97% micro-averaged accuracy
    (0.9572 intuitive, 0.9723 textual)
  - medication information extraction: about 78% F-measure over all fields

 Construction of reliable and sufficient resources
  - clinical terms and abbreviations (e.g., disease synonyms,
   symptoms, drugs)
  - context patterns related to diseases, medication, etc.

 Domain knowledge required
      construction of domain- and task-specific resources
      complex clinical facts and conditions for inference
        more comprehensive knowledge representation needed

Más contenido relacionado

La actualidad más candente

Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...
Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...
Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...ScHARR HEDS
 
Process Oriented Multidisciplinary Approach (POMA) -journal presentation
Process Oriented Multidisciplinary Approach (POMA) -journal presentationProcess Oriented Multidisciplinary Approach (POMA) -journal presentation
Process Oriented Multidisciplinary Approach (POMA) -journal presentationSanjana Nair
 
Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...
Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...
Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...Rodrigo Vargas Zapana
 
Otol HNS Better to be Young-2000-Lacy-Merritt
Otol HNS Better to be Young-2000-Lacy-MerrittOtol HNS Better to be Young-2000-Lacy-Merritt
Otol HNS Better to be Young-2000-Lacy-MerrittMichael (Mick) Merritt
 
Prescription Event Monitoring & Record Linkage Systems
Prescription Event Monitoring & Record Linkage SystemsPrescription Event Monitoring & Record Linkage Systems
Prescription Event Monitoring & Record Linkage SystemsSatish Veerla
 
AccessPoint Excerpt - The potential for RWE to improve care inmalignant melanoma
AccessPoint Excerpt - The potential for RWE to improve care inmalignant melanomaAccessPoint Excerpt - The potential for RWE to improve care inmalignant melanoma
AccessPoint Excerpt - The potential for RWE to improve care inmalignant melanomaIMSHealthRWES
 
The Envisia Genomic Classifier
The Envisia Genomic ClassifierThe Envisia Genomic Classifier
The Envisia Genomic ClassifierPhil J. Morrison
 
Consenso de Fibrose Pulmonar Idiopática da ATS
Consenso de Fibrose Pulmonar Idiopática da ATSConsenso de Fibrose Pulmonar Idiopática da ATS
Consenso de Fibrose Pulmonar Idiopática da ATSFlávia Salame
 
Chapter 25 assessment of clincal responses
Chapter 25 assessment of clincal responsesChapter 25 assessment of clincal responses
Chapter 25 assessment of clincal responsesNilesh Kucha
 
Overall patient satisfaction was significantly higher in homeopathic than in ...
Overall patient satisfaction was significantly higher in homeopathic than in ...Overall patient satisfaction was significantly higher in homeopathic than in ...
Overall patient satisfaction was significantly higher in homeopathic than in ...home
 
Nursesí practices and perception of delirium in the intensive care units of ...
Nursesí  practices and perception of delirium in the intensive care units of ...Nursesí  practices and perception of delirium in the intensive care units of ...
Nursesí practices and perception of delirium in the intensive care units of ...Alexander Decker
 

La actualidad más candente (20)

Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...
Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...
Diagnostic accuracy of echocardiography for co-existing pathologies in atrial...
 
Process Oriented Multidisciplinary Approach (POMA) -journal presentation
Process Oriented Multidisciplinary Approach (POMA) -journal presentationProcess Oriented Multidisciplinary Approach (POMA) -journal presentation
Process Oriented Multidisciplinary Approach (POMA) -journal presentation
 
Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...
Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...
Benefits os Statins in Elderly Subjects Without Established Cardiovascular Di...
 
Otol HNS Better to be Young-2000-Lacy-Merritt
Otol HNS Better to be Young-2000-Lacy-MerrittOtol HNS Better to be Young-2000-Lacy-Merritt
Otol HNS Better to be Young-2000-Lacy-Merritt
 
London 21.11.2008
London 21.11.2008London 21.11.2008
London 21.11.2008
 
UOG Journal Club: Intrafetal laser treatment for twin reversed arterial perfu...
UOG Journal Club: Intrafetal laser treatment for twin reversed arterial perfu...UOG Journal Club: Intrafetal laser treatment for twin reversed arterial perfu...
UOG Journal Club: Intrafetal laser treatment for twin reversed arterial perfu...
 
Journal of Immune Research
Journal of Immune Research Journal of Immune Research
Journal of Immune Research
 
Prescription Event Monitoring & Record Linkage Systems
Prescription Event Monitoring & Record Linkage SystemsPrescription Event Monitoring & Record Linkage Systems
Prescription Event Monitoring & Record Linkage Systems
 
Annotation Editorial
Annotation EditorialAnnotation Editorial
Annotation Editorial
 
AccessPoint Excerpt - The potential for RWE to improve care inmalignant melanoma
AccessPoint Excerpt - The potential for RWE to improve care inmalignant melanomaAccessPoint Excerpt - The potential for RWE to improve care inmalignant melanoma
AccessPoint Excerpt - The potential for RWE to improve care inmalignant melanoma
 
The Envisia Genomic Classifier
The Envisia Genomic ClassifierThe Envisia Genomic Classifier
The Envisia Genomic Classifier
 
Informed consent
Informed consentInformed consent
Informed consent
 
Bio 152 Paper
Bio 152 PaperBio 152 Paper
Bio 152 Paper
 
Aaa
AaaAaa
Aaa
 
Nódulos pulmonares
Nódulos pulmonares Nódulos pulmonares
Nódulos pulmonares
 
Consenso de Fibrose Pulmonar Idiopática da ATS
Consenso de Fibrose Pulmonar Idiopática da ATSConsenso de Fibrose Pulmonar Idiopática da ATS
Consenso de Fibrose Pulmonar Idiopática da ATS
 
Chapter 25 assessment of clincal responses
Chapter 25 assessment of clincal responsesChapter 25 assessment of clincal responses
Chapter 25 assessment of clincal responses
 
Overall patient satisfaction was significantly higher in homeopathic than in ...
Overall patient satisfaction was significantly higher in homeopathic than in ...Overall patient satisfaction was significantly higher in homeopathic than in ...
Overall patient satisfaction was significantly higher in homeopathic than in ...
 
159th publication jamdsr- 3rd name
159th publication  jamdsr- 3rd name159th publication  jamdsr- 3rd name
159th publication jamdsr- 3rd name
 
Nursesí practices and perception of delirium in the intensive care units of ...
Nursesí  practices and perception of delirium in the intensive care units of ...Nursesí  practices and perception of delirium in the intensive care units of ...
Nursesí practices and perception of delirium in the intensive care units of ...
 

Destacado

Destacado (19)

the_life_cycle_of_a_wireframe
the_life_cycle_of_a_wireframethe_life_cycle_of_a_wireframe
the_life_cycle_of_a_wireframe
 
BioNLP09 Winners
BioNLP09 WinnersBioNLP09 Winners
BioNLP09 Winners
 
Eoy
EoyEoy
Eoy
 
Rosario Hearst
Rosario HearstRosario Hearst
Rosario Hearst
 
Language
LanguageLanguage
Language
 
Crf
CrfCrf
Crf
 
Edu2
Edu2Edu2
Edu2
 
Susan Gray
Susan GraySusan Gray
Susan Gray
 
Workshop negations
Workshop negationsWorkshop negations
Workshop negations
 
Bionlp09
Bionlp09Bionlp09
Bionlp09
 
I2b209
I2b209I2b209
I2b209
 
Edu
EduEdu
Edu
 
Artspoken.com
Artspoken.comArtspoken.com
Artspoken.com
 
Six Month
Six MonthSix Month
Six Month
 
Tinsleys 7 Accomplishments
Tinsleys 7 AccomplishmentsTinsleys 7 Accomplishments
Tinsleys 7 Accomplishments
 
Nacsa úJ 4.1 Jav.
Nacsa úJ 4.1 Jav.Nacsa úJ 4.1 Jav.
Nacsa úJ 4.1 Jav.
 
Defense
DefenseDefense
Defense
 
Olivia Contradictions
Olivia ContradictionsOlivia Contradictions
Olivia Contradictions
 
Ambiguity
AmbiguityAmbiguity
Ambiguity
 

Similar a Health care special interest-i2b2

Using real-world evidence to investigate clinical research questions
Using real-world evidence to investigate clinical research questionsUsing real-world evidence to investigate clinical research questions
Using real-world evidence to investigate clinical research questionsKarin Verspoor
 
nuevos criterios de sepsis
nuevos criterios de sepsisnuevos criterios de sepsis
nuevos criterios de sepsisVeronica Dubay
 
How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...
How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...
How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...Crimsonpublishers-Rehabilitation
 
Electronic health records and machine learning
Electronic health records and machine learningElectronic health records and machine learning
Electronic health records and machine learningEman Abdelrazik
 
Sess_39_NAMCS&NHAMCS_hands-on_SCHAPPERT
Sess_39_NAMCS&NHAMCS_hands-on_SCHAPPERTSess_39_NAMCS&NHAMCS_hands-on_SCHAPPERT
Sess_39_NAMCS&NHAMCS_hands-on_SCHAPPERTguestfbf1e1
 
CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...
CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...
CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...lawrenceanchah
 
iOMICS Clinical & Omnia
iOMICS Clinical & OmniaiOMICS Clinical & Omnia
iOMICS Clinical & OmniaInterpretOmics
 
Metanalisis tratamientos ttm
Metanalisis tratamientos ttmMetanalisis tratamientos ttm
Metanalisis tratamientos ttmReynold Muñoz
 
Analysis of Medication Possession Ratio for Improved Blood Pressure Control
Analysis of Medication Possession Ratio for Improved Blood Pressure ControlAnalysis of Medication Possession Ratio for Improved Blood Pressure Control
Analysis of Medication Possession Ratio for Improved Blood Pressure ControlHealth Informatics New Zealand
 
EmergencyMedicine Research
EmergencyMedicine ResearchEmergencyMedicine Research
EmergencyMedicine Researchzybernav
 
Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...
Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...
Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...Nelson Hendler
 
Effective strategies to monitor clinical risks using biostatistics - Pubrica.pdf
Effective strategies to monitor clinical risks using biostatistics - Pubrica.pdfEffective strategies to monitor clinical risks using biostatistics - Pubrica.pdf
Effective strategies to monitor clinical risks using biostatistics - Pubrica.pdfPubrica
 
ai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptxai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptxssuser6b571f
 
Central mucoepidermoid carcinoma an up to-date analysis of 147 cases
Central mucoepidermoid carcinoma an up to-date analysis of 147 casesCentral mucoepidermoid carcinoma an up to-date analysis of 147 cases
Central mucoepidermoid carcinoma an up to-date analysis of 147 casesMNTan1
 
Nejm early goal shock septico 2019
Nejm early goal shock septico 2019Nejm early goal shock septico 2019
Nejm early goal shock septico 2019Lucia Tacanga
 
TADAA - Enabling Continuous Improvement for Anaesthetists
TADAA - Enabling Continuous Improvement for AnaesthetistsTADAA - Enabling Continuous Improvement for Anaesthetists
TADAA - Enabling Continuous Improvement for AnaesthetistsHealth Informatics New Zealand
 

Similar a Health care special interest-i2b2 (20)

Using real-world evidence to investigate clinical research questions
Using real-world evidence to investigate clinical research questionsUsing real-world evidence to investigate clinical research questions
Using real-world evidence to investigate clinical research questions
 
nuevos criterios de sepsis
nuevos criterios de sepsisnuevos criterios de sepsis
nuevos criterios de sepsis
 
How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...
How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...
How to Improve the Accuracy of the Initial Evaluation, Using a System Develop...
 
Electronic health records and machine learning
Electronic health records and machine learningElectronic health records and machine learning
Electronic health records and machine learning
 
Sess_39_NAMCS&NHAMCS_hands-on_SCHAPPERT
Sess_39_NAMCS&NHAMCS_hands-on_SCHAPPERTSess_39_NAMCS&NHAMCS_hands-on_SCHAPPERT
Sess_39_NAMCS&NHAMCS_hands-on_SCHAPPERT
 
Cavernoma JC
Cavernoma JCCavernoma JC
Cavernoma JC
 
CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...
CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...
CARDIAC REHABILITATION IN SARAWAK GENERAL HOSPITAL IN MALAYSIA Research Area:...
 
iOMICS Clinical & Omnia
iOMICS Clinical & OmniaiOMICS Clinical & Omnia
iOMICS Clinical & Omnia
 
Metanalisis tratamientos ttm
Metanalisis tratamientos ttmMetanalisis tratamientos ttm
Metanalisis tratamientos ttm
 
Analysis of Medication Possession Ratio for Improved Blood Pressure Control
Analysis of Medication Possession Ratio for Improved Blood Pressure ControlAnalysis of Medication Possession Ratio for Improved Blood Pressure Control
Analysis of Medication Possession Ratio for Improved Blood Pressure Control
 
EmergencyMedicine Research
EmergencyMedicine ResearchEmergencyMedicine Research
EmergencyMedicine Research
 
Emergency Medicine Research
Emergency Medicine ResearchEmergency Medicine Research
Emergency Medicine Research
 
Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...
Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...
Comparisonof Clinical Diagnoses versus Computerized Test Diagnoses Using the ...
 
Effective strategies to monitor clinical risks using biostatistics - Pubrica.pdf
Effective strategies to monitor clinical risks using biostatistics - Pubrica.pdfEffective strategies to monitor clinical risks using biostatistics - Pubrica.pdf
Effective strategies to monitor clinical risks using biostatistics - Pubrica.pdf
 
ai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptxai-in-healthcare-202011-201117103639.pptx
ai-in-healthcare-202011-201117103639.pptx
 
Central mucoepidermoid carcinoma an up to-date analysis of 147 cases
Central mucoepidermoid carcinoma an up to-date analysis of 147 casesCentral mucoepidermoid carcinoma an up to-date analysis of 147 cases
Central mucoepidermoid carcinoma an up to-date analysis of 147 cases
 
EMRs: Meaningful Use and Research
EMRs: Meaningful Use and ResearchEMRs: Meaningful Use and Research
EMRs: Meaningful Use and Research
 
Nejm early goal shock septico 2019
Nejm early goal shock septico 2019Nejm early goal shock septico 2019
Nejm early goal shock septico 2019
 
CIBM
CIBMCIBM
CIBM
 
TADAA - Enabling Continuous Improvement for Anaesthetists
TADAA - Enabling Continuous Improvement for AnaesthetistsTADAA - Enabling Continuous Improvement for Anaesthetists
TADAA - Enabling Continuous Improvement for Anaesthetists
 

Último

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 

Último (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 

Health care special interest-i2b2

  • 1. Extracting information from clinical notes H. Yang, I. Spasic, F. Sarafraz, John A. Keane, Goran Nenadic School of Computer Science University of Manchester
  • 2. Motivation & aim
       - Electronic clinical notes: electronic medical/health records, hospital discharge summaries
       - Extract information on individual patients and their diseases, and on clinical practice (treatments, drugs used, etc.)
       - Aim: support data analytics, e.g. monitoring quality
       - Huge interest locally and internationally
  • 3. Clinical notes
       - Highly condensed text: sometimes without proper sentences; hospital discharge summaries are more structured (list of medications, symptoms, etc.)
       - Terminological variability: orthographic, acronyms, local conventions
       - Various sections: previous history, social/family background
  • 4.
  • 5. NLP challenges in clinical data
       - A series of international challenges in information extraction from clinical narratives; organisers: Informatics for Integrating Biology & the Bedside (i2b2)
       - 3 shared tasks so far
           - De-identification of medical records and identification of smokers from their clinical records (2007)
           - Identification of obesity & related diseases in patients from hospital discharge documents (2008)
           - Extraction of medications and related information from patients’ discharge documents (2009)
       - 2010 challenge: concept, assertions, relations
  • 6. i2b2 2008
       - Extract status of diseases in patients
           - obesity, diabetes mellitus, hypercholesterolemia, hypertriglyceridemia, hypertension, heart failure (16 in total)
           - status: yes, no, unmentioned, questionable
           - on textual and “intuitive” level
       - 28 teams worldwide: UoM ranked 1st in textual and 7th in intuitive
       - Our methodology
           - Term-based exact and approximate matching
           - Context-based pattern- and rule-based matching
           - Machine learning approach
       Yang, H., Spasic, I., Keane, J., Nenadic, G.: A Text Mining Approach to the Prediction of a Disease Status from Clinical Discharge Summaries, JAMIA 16(4):596-600
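The slides do not say how the term-based exact and approximate matching was implemented. The following is only a minimal sketch of the general idea, assuming a small hypothetical lexicon (`DISEASE_TERMS`) and using Python's standard `difflib` similarity as a stand-in for whatever approximate matcher the authors used; the function name `match_term` and the 0.85 cutoff are illustrative choices.

```python
import difflib

# Hypothetical mini-lexicon: canonical disease name -> known surface variants.
# The real system used far larger resources; these entries are illustrative only.
DISEASE_TERMS = {
    "obesity": ["obesity", "obese"],
    "diabetes mellitus": ["diabetes mellitus", "diabetes", "dm"],
    "hypercholesterolemia": ["hypercholesterolemia", "high cholesterol"],
    "hypertension": ["hypertension", "htn"],
    "heart failure": ["heart failure", "chf"],
}

# Reverse index: surface variant -> canonical disease name.
VARIANT_TO_DISEASE = {v: d for d, vs in DISEASE_TERMS.items() for v in vs}


def match_term(text, cutoff=0.85):
    """Return (canonical_name, match_type) for a candidate term.

    Exact lookup is tried first; if it fails, the closest variant above
    the similarity cutoff (an assumed threshold) is accepted.
    """
    term = text.lower().strip()
    if term in VARIANT_TO_DISEASE:
        return VARIANT_TO_DISEASE[term], "exact"
    close = difflib.get_close_matches(term, list(VARIANT_TO_DISEASE), n=1, cutoff=cutoff)
    if close:
        return VARIANT_TO_DISEASE[close[0]], "approximate"
    return None, "none"


if __name__ == "__main__":
    for candidate in ["CHF", "hypercholesterolaemia", "hypertenson", "asthma"]:
        print(candidate, "->", match_term(candidate))
```

Approximate matching of this kind is one way to absorb the orthographic variability mentioned earlier (British vs. American spellings, typos), at the cost of occasional false matches on short acronyms, which is why the sketch keeps the cutoff high.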
  • 7. Methodology
       - Linguistic pre-processing: section splitting, sentence splitting, chunking, POS tagging, parsing
       - Information extraction (rules, machine learning): textual evidence extraction, section filtering, morphological clues (e.g. drug/disease name affixes)
       - Medical resources: disease names, drug names, body parts, symptoms, abbreviations, synonyms
       - Constructing results: template filling, filtering negative results, relations and heuristics (Organ : Symptom, Symptom : Disease, Disease : Drug, Drug : Mode of application)
  • 8. Rule-based IE
       - Disease status patterns
           - context-based patterns
               [N] negative for CHF
               [Q] question of asthma
               [U] no known diagnosis of CAD
               [U] we should consider further asthma studies as an outpatient
           - semantics-based patterns
               [N] normal coronaries, a thin black man
       - Clinical resources used in sentence extraction
           - clinical inference rules, e.g. weight>90kg, LDL>160mg/dl, HDL<35mg/dl
           - medications, e.g. ‘anti-depressant’
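The context-based patterns on this slide can be illustrated with a few regular expressions. This is a rough sketch rather than the authors' rule set: the cue phrases ("negative for", "question of", "no known diagnosis of") come from the slide, while the function name `disease_status`, the regex details, and the fallback to Y when a disease is mentioned without any cue are assumptions made for the example.

```python
import re

# Status labels from the slides: Y (yes), N (no), Q (questionable), U (unmentioned).
# Cue phrases are taken from the slide examples; everything else is assumed.
CONTEXT_PATTERNS = [
    ("U", r"no known diagnosis of {d}"),
    ("N", r"negative for {d}"),
    ("Q", r"question of {d}"),
]


def disease_status(sentence, disease):
    """Assign a status label to `disease` based on cue phrases in `sentence`."""
    d = re.escape(disease)
    for label, template in CONTEXT_PATTERNS:
        if re.search(template.format(d=d), sentence, flags=re.IGNORECASE):
            return label
    # Assumed fallback: a plain mention with no cue counts as Y,
    # and no mention at all counts as U.
    if re.search(rf"\b{d}\b", sentence, flags=re.IGNORECASE):
        return "Y"
    return "U"


if __name__ == "__main__":
    examples = [
        ("The patient was negative for CHF.", "CHF"),
        ("There was a question of asthma.", "asthma"),
        ("No known diagnosis of CAD.", "CAD"),
        ("History of hypertension.", "hypertension"),
    ]
    for sentence, disease in examples:
        print(disease, "->", disease_status(sentence, disease))
```

A sentence such as "we should consider further asthma studies as an outpatient", marked U on the slide, would receive Y from this simple fallback, which suggests why the full system needed a much richer pattern set than cue phrases alone.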
  • 9. Textual Annotation Results
       Performance on Disease Status (Ranked 1st)
       Micro-average: Accuracy (0.9723)
       Macro-average: P (0.8482), R (0.7737), F-score (0.8052)

       Class  #Eval  #Corr  #Gold  Precision  Recall  F-score
       Y       2267   2132   2192     0.9404  0.9726   0.9562
       N         56     40     65     0.7142  0.6153   0.6611
       Q         12      9     17     0.7500  0.5294   0.6206
       U       5709   5640   5770     0.9879  0.9774   0.9826
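The slide does not spell out how the summary figures were derived from the per-class counts. Under the usual definitions (precision = #Corr/#Eval, recall = #Corr/#Gold, macro-averages as unweighted means over classes, micro-averaged accuracy as total #Corr over total #Gold), the counts in the textual annotation table agree with the reported values to within rounding. The sketch below shows that calculation; the function names are illustrative, and the definitions are an assumption rather than something stated in the deck.

```python
# Counts copied from the textual annotation table: (evaluated, correct, gold).
COUNTS = {
    "Y": (2267, 2132, 2192),
    "N": (56, 40, 65),
    "Q": (12, 9, 17),
    "U": (5709, 5640, 5770),
}


def per_class(evaluated, correct, gold):
    """Precision, recall and F-score for one class (0.0 when undefined)."""
    p = correct / evaluated if evaluated else 0.0
    r = correct / gold if gold else 0.0
    f = 2 * p * r / (p + r) if (p + r) else 0.0
    return p, r, f


def summarize(counts):
    """Macro-averaged P/R/F (unweighted class means) and micro-averaged accuracy."""
    scores = [per_class(*c) for c in counts.values()]
    n = len(scores)
    total_correct = sum(c[1] for c in counts.values())
    total_gold = sum(c[2] for c in counts.values())
    return {
        "macro_p": sum(s[0] for s in scores) / n,
        "macro_r": sum(s[1] for s in scores) / n,
        "macro_f": sum(s[2] for s in scores) / n,
        "micro_accuracy": total_correct / total_gold,
    }


if __name__ == "__main__":
    for name, value in summarize(COUNTS).items():
        print(f"{name}: {value:.4f}")
```

Because the macro-average weights every class equally, the two rare classes (N and Q, with 65 and 17 gold instances) pull the macro F-score well below the micro-averaged accuracy, which is dominated by Y and U.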
  • 10. Intuitive Annotation Results
       Performance on Disease Status (Ranked 7th)
       Micro-average: Accuracy (0.9572)
       Macro-average: P (0.6383), R (0.6294), F-score (0.6336)

       Class  #Eval  #Corr  #Gold  Precision  Recall  F-score
       Y       2160   2068   2285     0.9574  0.9050   0.9304
       N       5236   5014   5100     0.9576  0.9831   0.9702
       Q          3      0     14     0       0        0
  • 11. i2b2 2009
       - Extract mentions of medication and related information
           - drugs the patient takes
           - dose, mode of application, frequency, duration, etc. (for each mention)
       - 19 teams worldwide: UoM ranked 3rd
       - Our approach was based on combining
           - extensive dictionaries
           - morphological and derivational patterns
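As a rough illustration of combining dictionaries with morphological patterns, the sketch below recognises a drug name either by dictionary lookup or by a name suffix, then looks for a dose and frequency shortly after it. Everything specific here is an assumption for the example: the tiny `DRUG_DICTIONARY`, the suffix list, the regular expressions, and the 40-character look-ahead window are not taken from the authors' system.

```python
import re

# Hypothetical resources; the real system relied on extensive dictionaries.
DRUG_DICTIONARY = {"aspirin", "metformin", "insulin", "warfarin"}
# Illustrative drug-name suffixes used as morphological clues.
DRUG_SUFFIXES = ("statin", "pril", "cillin", "olol")

DOSE_RE = re.compile(r"\b\d+(?:\.\d+)?\s*(?:mg|mcg|g|ml|units?)\b", re.IGNORECASE)
FREQ_RE = re.compile(r"\b(?:once daily|twice daily|daily|bid|tid|qd|prn)\b", re.IGNORECASE)

WINDOW = 40  # assumed look-ahead (in characters) for fields attached to a drug


def extract_medications(sentence):
    """Return one record per drug mention, with dose and frequency if found nearby."""
    records = []
    for m in re.finditer(r"[A-Za-z]+", sentence):
        word = m.group().lower()
        if word in DRUG_DICTIONARY:
            source = "dictionary"
        elif len(word) > 6 and word.endswith(DRUG_SUFFIXES):
            source = "suffix"
        else:
            continue
        tail = sentence[m.end():m.end() + WINDOW]
        dose = DOSE_RE.search(tail)
        freq = FREQ_RE.search(tail)
        records.append({
            "drug": word,
            "dose": dose.group() if dose else None,
            "frequency": freq.group().lower() if freq else None,
            "source": source,
        })
    return records


if __name__ == "__main__":
    text = "Started atorvastatin 20 mg daily and aspirin 81 mg daily."
    for record in extract_medications(text):
        print(record)
```

A fixed look-ahead window is a crude way to attach fields to a mention: with several drugs close together it can pick up a neighbour's dose, which is one reason fields such as duration and reason are generally harder to extract than the drug name itself.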
  • 12. Evaluation (F-measure)
       Field       F-measure
       Medication  83.59%
       Dosage      82.67%
       Frequency   83.49%
       Mode        85.33%
       Duration    51.00%
       Reason      38.81%
       All fields  78.47%
       Spasić I, Sarafraz F, Keane JA, Nenadic G: “Medication Information Extraction with Linguistic Pattern Matching and Semantic Rules”, JAMIA (to appear)
  • 13. Summary
       - NLP and text mining techniques are useful for extraction of clinical data
           - disease status extraction: 95-97% accuracy
           - medication information extraction: 80% F-measure
       - Construction of reliable and sufficient resources
           - clinical terms and abbreviations (e.g., disease synonyms, symptoms, drugs)
           - context patterns related to diseases, medication, etc.
       - Domain knowledge required
           - construction of domain- and task-specific resources
           - complex clinical facts and conditions for inference
           - more comprehensive knowledge representation needed