SlideShare una empresa de Scribd logo
1 de 9
Descargar para leer sin conexión
Introduction to Database Research Projects
@ Center for Women’s Health Research(CWHR)
                 OB/GYN
    Ritu Khare, Roderick Price, Kalatu Davies, Michele Follen
                                               March 14 2012



1
Background
       Information Science, Systems, and Technology

       Information Science Techniques
           Database Modeling and Design
           Information Extraction and Retrieval        Healthcare

           Data Integration, Mining, and Warehousing




    2
Projects
       Dissertation Project

       CWHR Project 1
           Clinical Form Encoding


       CWHR Project 2
           EMR Error Detection


       CWHR Project 3
           Query and Data Extraction Tool


    3
Dissertation Project
    Making Databases Easier to Use

   Enable users to SEARCH and QUERY databases
   Enable users to DESIGN databases
       GoogleForms, FormsAssembly, SurveyMonkey, REDCap




                                            Some
                                         Algorithms




            A user-designed form
                                                                 A New Database
                                                      (a collection of interconnected tables)
    4
Dissertation Project
Accommodate Changing User Needs




                         FormMapper

                         - Form understanding
                         - Equivalent elements discovery
                         - Birthing new database elements
Dissertation Project
The FormMapper Tool
                                                           Solution:
                                                               Novel Techniques
                                      Evolved                  Re-use Existing
Form             FormMapper           Database
                                      Database
                                                           Experiments in Healthcare
                                                             52 clinical forms
       Objective:
                                                             6 databases: 35-500 tables
         To evolve a high-quality database.
                                                                   84.5% match with gold
                                                                    standard
                                                                   74% high-quality



          Lesson Learnt
         Quality of evolved databases could be improved …
                   … if the terms used on forms are standardized (clinically encoded)

    6
Collaborators
CWHR Project 1:                                                           Dr Sandra Hartmann
                                                                            Dr Aasta Mehta
Clinical Form Encoding                                                 Dr Yuan An, Dr Xiaohua Hu,
                                                                             Dr Il-Yeol Song



                                                ?
    MRN         Med Rec #     Medical Record
                              Number
    Blood       Diastolic
    Pressure    Systolic      BP
                              Physical Status
 Constitutional Vital Signs




                                                    Form      SNOMED CT Concept
                                                    Term
                                                              11615400: Patient (person)
                                                    Patient
                                                              398225001: Medical record
                                                    MRN       number (observable entity)




7
Collaborators
CWHR Project 2:                                                            Dr Edgar Chou
                                                                           Dr Paul Nyirjesy
EMR Error Detection                                                          Dr Yuan An
                                                                            Dr Xiaohua Hu

       EMR data-entry errors cost            Allscripts Error Handling
           time, efforts, money                  Not enough
                                                  Clinical guidelines are not
           misdiagnosis, patient health
                                                   appropriately integrated
            (not so benign)                       Surface – level checks



                           Alert!!!                      Allscripts
                                                            DB
                      Alert!!!


                                                                      DUCOM
                                                                       Clinical
                                                                      Warehouse



    8
CWHR Project 3:
  Query and Data Extraction Tool
  Allscripts Database (Feb ’12)                      Experts:
         420k+ patients                                 Dr Xiaohua Tony Hu (Data Mining)
         140 -160 providers                             Dr Yuan An (Data Integration)
         5TB of Data                                    Dr Kalatu Davies (Biostatistics)
                                                         Dr Il-Yeol Song (Data Warehousing)
Drowning in data
and information
                                Summarize, Discover
                               patterns and knowledge

                                                                      Starving for
           DUCOM               Natural Language Query
            Clinical            using NLP techniques
           Warehouse
                               Customized reports for
                               different providers, and
                                   residents research


      9

Más contenido relacionado

Destacado

8 things you should not do when selecting a prem
8 things you should not do when selecting a prem8 things you should not do when selecting a prem
8 things you should not do when selecting a premKeith Meadows
 
The diabetes health profile ebook
The diabetes health profile ebookThe diabetes health profile ebook
The diabetes health profile ebookKeith Meadows
 
White paper 5 things you need to know about patient reported outcome (pro) ...
White paper   5 things you need to know about patient reported outcome (pro) ...White paper   5 things you need to know about patient reported outcome (pro) ...
White paper 5 things you need to know about patient reported outcome (pro) ...Keith Meadows
 
Thepatientoutcomesblog survey results 2012
Thepatientoutcomesblog survey results 2012Thepatientoutcomesblog survey results 2012
Thepatientoutcomesblog survey results 2012Keith Meadows
 
The Diabetes Health Profile - Development and applications
The Diabetes Health Profile - Development and applicationsThe Diabetes Health Profile - Development and applications
The Diabetes Health Profile - Development and applicationsKeith Meadows
 

Destacado (6)

8 things you should not do when selecting a prem
8 things you should not do when selecting a prem8 things you should not do when selecting a prem
8 things you should not do when selecting a prem
 
Rassa dikit juga enak
Rassa dikit juga enakRassa dikit juga enak
Rassa dikit juga enak
 
The diabetes health profile ebook
The diabetes health profile ebookThe diabetes health profile ebook
The diabetes health profile ebook
 
White paper 5 things you need to know about patient reported outcome (pro) ...
White paper   5 things you need to know about patient reported outcome (pro) ...White paper   5 things you need to know about patient reported outcome (pro) ...
White paper 5 things you need to know about patient reported outcome (pro) ...
 
Thepatientoutcomesblog survey results 2012
Thepatientoutcomesblog survey results 2012Thepatientoutcomesblog survey results 2012
Thepatientoutcomesblog survey results 2012
 
The Diabetes Health Profile - Development and applications
The Diabetes Health Profile - Development and applicationsThe Diabetes Health Profile - Development and applications
The Diabetes Health Profile - Development and applications
 

Similar a Introduction to Database Research Projects @ CWHR

eHealth Governance in a Local Organisation. The Experience from Pompidou Hosp...
eHealth Governance in a Local Organisation. The Experience from Pompidou Hosp...eHealth Governance in a Local Organisation. The Experience from Pompidou Hosp...
eHealth Governance in a Local Organisation. The Experience from Pompidou Hosp...Plan de Calidad para el SNS
 
Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24Sage Base
 
State of the Art Natural Language Processing at Scale with Alexander Thomas a...
State of the Art Natural Language Processing at Scale with Alexander Thomas a...State of the Art Natural Language Processing at Scale with Alexander Thomas a...
State of the Art Natural Language Processing at Scale with Alexander Thomas a...Databricks
 
Applying NLP to Personalized Healthcare - 2021
Applying NLP to Personalized Healthcare - 2021Applying NLP to Personalized Healthcare - 2021
Applying NLP to Personalized Healthcare - 2021David Talby
 
The Translational Medicine
The Translational MedicineThe Translational Medicine
The Translational MedicineJoanne Luciano
 
OpenEHR modeling case studies in China
OpenEHR modeling case studies in ChinaOpenEHR modeling case studies in China
OpenEHR modeling case studies in Chinaxudong_lu
 
Clinician Decision Support Dashboard
Clinician Decision Support DashboardClinician Decision Support Dashboard
Clinician Decision Support DashboardIccha Sethi
 
Friend p4c 2012-11-29
Friend p4c 2012-11-29Friend p4c 2012-11-29
Friend p4c 2012-11-29Sage Base
 
Biomedical Informatics Program -- Atlanta CTSA (ACTSI)
Biomedical Informatics Program -- Atlanta CTSA (ACTSI)Biomedical Informatics Program -- Atlanta CTSA (ACTSI)
Biomedical Informatics Program -- Atlanta CTSA (ACTSI)Joel Saltz
 
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Remedy Informatics
 
Friend Gastein 2012-10-04
Friend Gastein 2012-10-04Friend Gastein 2012-10-04
Friend Gastein 2012-10-04Sage Base
 
Informatics in Clinical Practice: Designing and Implementing an Electronic Re...
Informatics in Clinical Practice: Designing and Implementing an Electronic Re...Informatics in Clinical Practice: Designing and Implementing an Electronic Re...
Informatics in Clinical Practice: Designing and Implementing an Electronic Re...Health Informatics New Zealand
 
Real-time Analysis of Next Generation Sequencing Data
Real-time Analysis of Next Generation Sequencing DataReal-time Analysis of Next Generation Sequencing Data
Real-time Analysis of Next Generation Sequencing DataMatthieu Schapranow
 
Stephen Friend CRUK-MD Anderson Cancer Workshop 2012-02-28
Stephen Friend CRUK-MD Anderson Cancer Workshop 2012-02-28Stephen Friend CRUK-MD Anderson Cancer Workshop 2012-02-28
Stephen Friend CRUK-MD Anderson Cancer Workshop 2012-02-28Sage Base
 

Similar a Introduction to Database Research Projects @ CWHR (20)

Mining public domain data as a basis for drug repurposing
Mining public domain data as a basis for drug repurposingMining public domain data as a basis for drug repurposing
Mining public domain data as a basis for drug repurposing
 
Secondary Use of Healthcare Data for Translational Research
Secondary Use of Healthcare Data for Translational ResearchSecondary Use of Healthcare Data for Translational Research
Secondary Use of Healthcare Data for Translational Research
 
eHealth Governance in a Local Organisation. The Experience from Pompidou Hosp...
eHealth Governance in a Local Organisation. The Experience from Pompidou Hosp...eHealth Governance in a Local Organisation. The Experience from Pompidou Hosp...
eHealth Governance in a Local Organisation. The Experience from Pompidou Hosp...
 
Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24Stephen Friend Dana Farber Cancer Institute 2011-10-24
Stephen Friend Dana Farber Cancer Institute 2011-10-24
 
State of the Art Natural Language Processing at Scale with Alexander Thomas a...
State of the Art Natural Language Processing at Scale with Alexander Thomas a...State of the Art Natural Language Processing at Scale with Alexander Thomas a...
State of the Art Natural Language Processing at Scale with Alexander Thomas a...
 
SLAS Screen Design and Assay Technology SIG: SLAS2013 Presentation
SLAS Screen Design and Assay Technology SIG: SLAS2013 PresentationSLAS Screen Design and Assay Technology SIG: SLAS2013 Presentation
SLAS Screen Design and Assay Technology SIG: SLAS2013 Presentation
 
Applying NLP to Personalized Healthcare - 2021
Applying NLP to Personalized Healthcare - 2021Applying NLP to Personalized Healthcare - 2021
Applying NLP to Personalized Healthcare - 2021
 
The Translational Medicine
The Translational MedicineThe Translational Medicine
The Translational Medicine
 
OpenEHR modeling case studies in China
OpenEHR modeling case studies in ChinaOpenEHR modeling case studies in China
OpenEHR modeling case studies in China
 
Clinician Decision Support Dashboard
Clinician Decision Support DashboardClinician Decision Support Dashboard
Clinician Decision Support Dashboard
 
Can scan final 2012 berkeley
Can scan final 2012 berkeleyCan scan final 2012 berkeley
Can scan final 2012 berkeley
 
Friend p4c 2012-11-29
Friend p4c 2012-11-29Friend p4c 2012-11-29
Friend p4c 2012-11-29
 
Biomedical Informatics Program -- Atlanta CTSA (ACTSI)
Biomedical Informatics Program -- Atlanta CTSA (ACTSI)Biomedical Informatics Program -- Atlanta CTSA (ACTSI)
Biomedical Informatics Program -- Atlanta CTSA (ACTSI)
 
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
Ontology-Driven Clinical Intelligence: A Path from the Biobank to Cross-Disea...
 
Friend Gastein 2012-10-04
Friend Gastein 2012-10-04Friend Gastein 2012-10-04
Friend Gastein 2012-10-04
 
MedGIFT projects in medical imaging
MedGIFT projects in medical imagingMedGIFT projects in medical imaging
MedGIFT projects in medical imaging
 
Informatics in Clinical Practice: Designing and Implementing an Electronic Re...
Informatics in Clinical Practice: Designing and Implementing an Electronic Re...Informatics in Clinical Practice: Designing and Implementing an Electronic Re...
Informatics in Clinical Practice: Designing and Implementing an Electronic Re...
 
Medical image analysis and big data evaluation infrastructures
Medical image analysis and big data evaluation infrastructuresMedical image analysis and big data evaluation infrastructures
Medical image analysis and big data evaluation infrastructures
 
Real-time Analysis of Next Generation Sequencing Data
Real-time Analysis of Next Generation Sequencing DataReal-time Analysis of Next Generation Sequencing Data
Real-time Analysis of Next Generation Sequencing Data
 
Stephen Friend CRUK-MD Anderson Cancer Workshop 2012-02-28
Stephen Friend CRUK-MD Anderson Cancer Workshop 2012-02-28Stephen Friend CRUK-MD Anderson Cancer Workshop 2012-02-28
Stephen Friend CRUK-MD Anderson Cancer Workshop 2012-02-28
 

Último

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 

Último (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

Introduction to Database Research Projects @ CWHR

  • 1. Introduction to Database Research Projects @ Center for Women’s Health Research(CWHR) OB/GYN Ritu Khare, Roderick Price, Kalatu Davies, Michele Follen March 14 2012 1
  • 2. Background  Information Science, Systems, and Technology  Information Science Techniques  Database Modeling and Design  Information Extraction and Retrieval Healthcare  Data Integration, Mining, and Warehousing 2
  • 3. Projects  Dissertation Project  CWHR Project 1  Clinical Form Encoding  CWHR Project 2  EMR Error Detection  CWHR Project 3  Query and Data Extraction Tool 3
  • 4. Dissertation Project Making Databases Easier to Use  Enable users to SEARCH and QUERY databases  Enable users to DESIGN databases  GoogleForms, FormsAssembly, SurveyMonkey, REDCap Some Algorithms A user-designed form A New Database (a collection of interconnected tables) 4
  • 5. Dissertation Project Accommodate Changing User Needs FormMapper - Form understanding - Equivalent elements discovery - Birthing new database elements
  • 6. Dissertation Project The FormMapper Tool  Solution:  Novel Techniques Evolved  Re-use Existing Form FormMapper Database Database  Experiments in Healthcare  52 clinical forms  Objective:  6 databases: 35-500 tables  To evolve a high-quality database.  84.5% match with gold standard  74% high-quality Lesson Learnt Quality of evolved databases could be improved … … if the terms used on forms are standardized (clinically encoded) 6
  • 7. Collaborators CWHR Project 1: Dr Sandra Hartmann Dr Aasta Mehta Clinical Form Encoding Dr Yuan An, Dr Xiaohua Hu, Dr Il-Yeol Song ? MRN Med Rec # Medical Record Number Blood Diastolic Pressure Systolic BP Physical Status Constitutional Vital Signs Form SNOMED CT Concept Term 11615400: Patient (person) Patient 398225001: Medical record MRN number (observable entity) 7
  • 8. Collaborators CWHR Project 2: Dr Edgar Chou Dr Paul Nyirjesy EMR Error Detection Dr Yuan An Dr Xiaohua Hu  EMR data-entry errors cost  Allscripts Error Handling  time, efforts, money  Not enough  Clinical guidelines are not  misdiagnosis, patient health appropriately integrated (not so benign)  Surface – level checks Alert!!! Allscripts DB Alert!!! DUCOM Clinical Warehouse 8
  • 9. CWHR Project 3: Query and Data Extraction Tool Allscripts Database (Feb ’12) Experts:  420k+ patients  Dr Xiaohua Tony Hu (Data Mining)  140 -160 providers  Dr Yuan An (Data Integration)  5TB of Data  Dr Kalatu Davies (Biostatistics)  Dr Il-Yeol Song (Data Warehousing) Drowning in data and information Summarize, Discover patterns and knowledge Starving for DUCOM Natural Language Query Clinical using NLP techniques Warehouse Customized reports for different providers, and residents research 9