SlideShare una empresa de Scribd logo
1 de 23
Visual Tools for Queries and
Display of Quantitative Information
  in a Cancer Research Database

     JESSE STEWART and JERZY W. JAROMCZYK
         Department of Computer Science
        University of Kentucky, Lexington KY
The Kentucky Cancer Registry

• The Markey Cancer has the singular mission to eliminate the
  morbidity and mortality of cancer
• Since its founding, the Markey Cancer Center and the UK
  Chandler hospital have served 2000-2200 new patients a
  year and is one of the few institutions nationwide that
  address both clinical care as well as cancer research.
• The KCR’s case count exceeded 30,000 annually as of 2009
• The KCR houses a wealth of historical data for hundreds of
  cancer variants, associated treatments, and their relative
  success across the state of Kentucky.
Data Collection


Patient    Abstracting   Internet   Registry DB
Events     CPDMS.NET     HTTPS        MySQL
Accelerating Cancer Research



                        Discover
Develop   Visualize    Important
Queries   Data Sets   Correlations
Registry Databases and Research
                 Valuable Information
                 •Survival Trends
                 •Incidence Rates
                 •Behavioral and
                 Geographical Correlation

    Challenges in Research
    •Coded Data
    •SQL
    •Complex DB Schemas
    •Access Control
    •Visualization
Software Solutions

• Define Queries (Data Sets)
  –   Intuitive: no programming required
  –   Flexible: allow any data set to be explored
  –   Accessible: Visual cross-browser application
  –   Re-use: Save, modify and combine Data Sets
• Data Analysis and Visualization:
  – Context-specific diagrams
  – Compare data sets singularly or side-by-side
  – Customizable appearance
The Query Builder
• Presents a high-level abstraction of the
  Registry Database
• Patient, Case, Therapy data variables are
  easily recognizable and categorized
• Separates the user from the actual database
  structure and coded information
  – Example: Treatment is encoded as:
     • No Treatment=0, Treatment=1, Surveillance=2
The Query Builder

• Translates a question about cancer data into
  SQL (Structured Query Language) which can be
  understood by the computer system
• Parses and stores the query for modification and
  reuse later
Example Query
• Patients diagnosed between Jan 1, 2005 and Dec
  31, 2008
• Patients diagnosed in Kentucky
• Patients treated with immunotherapy

• SQL may be complex

case_data.diagdate >= 20050101 and case_data.diagdate <= 20081231 and
   case_tx.txtype = ‘I’ and case_data.diagstate = ‘KY’ from case_data, case_tx
   where case_tx.hospkey = case_data.hospkey and case_tx.patkey =
   case_data.patkey and case_data.incomplete = 0;
Query Builder in Action
Syntax Tree
Query Management
Visualization Tools
– Scaled Venn Diagrams
   • User can quickly ascertain relative size of data sets and
     their relationship to one another
– Bar and Histogram Charts
   • Flexible view of variable distribution for different sets
– Survival Trends
   • View and compare survival rates over time
– Statistics
   • Common descriptive statistics
   • Comparison with Chi-square, Log rank, T-, Z-tests
Visualization: Venn Diagrams
Visualization: Venn Diagrams
Visualization: Histogram
Visualization: Survival Trends
Cross-Tab Analysis
Chi-square Analysis
Censored Life Table
Success
• The Visual Query Builder and Data Analysis tools have
  become an integral part of CPDMS.NET – the online
  abstracting system developed at the KCR.
• Over 5000 study groups have been created by users of
  the system.
• Features have been added and improved resulting
  from feedback given by researchers and registrars
  (cancer data professionals).
• Future developments may include:
   – Wider array of statistical tests
   – Functions to analyze more than two data sets at once
Acknowledgements

    KCR Informatics and Registry Management
      Eric Durbin, MS - Director of Informatics
 Frances Ross, CTR - Director of Registry Operations
Isaac Hands - Lead Programmer and Systems Analyst
Software

Más contenido relacionado

Destacado

Comview11 Visual Learning Tools in Business Management
Comview11 Visual Learning Tools in Business ManagementComview11 Visual Learning Tools in Business Management
Comview11 Visual Learning Tools in Business ManagementAmanda Ritter
 
Visual thinking for business analysis
Visual thinking for business analysisVisual thinking for business analysis
Visual thinking for business analysisDanny D. Kosasih
 
Power Of Visual Thinking
Power Of Visual ThinkingPower Of Visual Thinking
Power Of Visual Thinkingsmehro
 
An Introduction to Benefits Realization Management
An Introduction to Benefits Realization ManagementAn Introduction to Benefits Realization Management
An Introduction to Benefits Realization ManagementCraig Letavec
 
Visual Thinking for Brainstorming, Planning, Learning, Collaborating, Harvesting
Visual Thinking for Brainstorming, Planning, Learning, Collaborating, HarvestingVisual Thinking for Brainstorming, Planning, Learning, Collaborating, Harvesting
Visual Thinking for Brainstorming, Planning, Learning, Collaborating, HarvestingGiulia Forsythe
 
The Value of Visual Thinking in Social Business
The Value of Visual Thinking in Social BusinessThe Value of Visual Thinking in Social Business
The Value of Visual Thinking in Social BusinessDavid Armano
 

Destacado (10)

Comview11 Visual Learning Tools in Business Management
Comview11 Visual Learning Tools in Business ManagementComview11 Visual Learning Tools in Business Management
Comview11 Visual Learning Tools in Business Management
 
Why Visual Business Analysis is More Effective?
Why Visual Business Analysis is More Effective?Why Visual Business Analysis is More Effective?
Why Visual Business Analysis is More Effective?
 
Defense of my BSc-Thesis
Defense of my BSc-ThesisDefense of my BSc-Thesis
Defense of my BSc-Thesis
 
Visual thinking for business analysis
Visual thinking for business analysisVisual thinking for business analysis
Visual thinking for business analysis
 
Visual Thinking
Visual ThinkingVisual Thinking
Visual Thinking
 
Power Of Visual Thinking
Power Of Visual ThinkingPower Of Visual Thinking
Power Of Visual Thinking
 
An Introduction to Benefits Realization Management
An Introduction to Benefits Realization ManagementAn Introduction to Benefits Realization Management
An Introduction to Benefits Realization Management
 
Visual Thinking for Brainstorming, Planning, Learning, Collaborating, Harvesting
Visual Thinking for Brainstorming, Planning, Learning, Collaborating, HarvestingVisual Thinking for Brainstorming, Planning, Learning, Collaborating, Harvesting
Visual Thinking for Brainstorming, Planning, Learning, Collaborating, Harvesting
 
Thinking Visually
Thinking VisuallyThinking Visually
Thinking Visually
 
The Value of Visual Thinking in Social Business
The Value of Visual Thinking in Social BusinessThe Value of Visual Thinking in Social Business
The Value of Visual Thinking in Social Business
 

Similar a Visual tools for databade queries and analysis

Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short TimeBig Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short TimeDataWorks Summit
 
Population Health Management
Population Health ManagementPopulation Health Management
Population Health ManagementShaheen Gauher
 
Quartesian capabilities-2013
Quartesian capabilities-2013Quartesian capabilities-2013
Quartesian capabilities-2013Benjamin Jackson
 
Data base and data entry presentation by mj n somya
Data base and data entry presentation by mj n somyaData base and data entry presentation by mj n somya
Data base and data entry presentation by mj n somyaMukesh Jaiswal
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsKen Karapetyan
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemWarren Kibbe
 
Brisbane Health-y Data: RedCap
Brisbane Health-y Data: RedCapBrisbane Health-y Data: RedCap
Brisbane Health-y Data: RedCapARDC
 
2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)Michael Atkins
 
Industrial IoT to Predictive Analytics: A Reverse Engineering Approach from S...
Industrial IoT to Predictive Analytics: A Reverse Engineering Approach from S...Industrial IoT to Predictive Analytics: A Reverse Engineering Approach from S...
Industrial IoT to Predictive Analytics: A Reverse Engineering Approach from S...Lokukaluge Prasad Perera
 
Rich Feeds for RESCUE and PALMS
Rich Feeds for RESCUE and PALMSRich Feeds for RESCUE and PALMS
Rich Feeds for RESCUE and PALMSbdemchak
 
Iscram 2008 presentation
Iscram 2008 presentationIscram 2008 presentation
Iscram 2008 presentationbdemchak
 
Exascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataExascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataJoel Saltz
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemWarren Kibbe
 
FedCentric_Presentation
FedCentric_PresentationFedCentric_Presentation
FedCentric_PresentationYatpang Cheung
 
Call for Papers *** International Journal of Database Management Systems (IJDMS)
Call for Papers *** International Journal of Database Management Systems (IJDMS)Call for Papers *** International Journal of Database Management Systems (IJDMS)
Call for Papers *** International Journal of Database Management Systems (IJDMS)ijdms
 
City of hope research informatics common data elements
City of hope research informatics common data elementsCity of hope research informatics common data elements
City of hope research informatics common data elementsAbdul-Malik Shakir
 

Similar a Visual tools for databade queries and analysis (20)

Big Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short TimeBig Data at Geisinger Health System: Big Wins in a Short Time
Big Data at Geisinger Health System: Big Wins in a Short Time
 
Population Health Management
Population Health ManagementPopulation Health Management
Population Health Management
 
Cri big data
Cri big dataCri big data
Cri big data
 
Quartesian capabilities-2013
Quartesian capabilities-2013Quartesian capabilities-2013
Quartesian capabilities-2013
 
Data base and data entry presentation by mj n somya
Data base and data entry presentation by mj n somyaData base and data entry presentation by mj n somya
Data base and data entry presentation by mj n somya
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
 
Brisbane Health-y Data: RedCap
Brisbane Health-y Data: RedCapBrisbane Health-y Data: RedCap
Brisbane Health-y Data: RedCap
 
2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)2015 GU-ICBI Poster (third printing)
2015 GU-ICBI Poster (third printing)
 
Industrial IoT to Predictive Analytics: A Reverse Engineering Approach from S...
Industrial IoT to Predictive Analytics: A Reverse Engineering Approach from S...Industrial IoT to Predictive Analytics: A Reverse Engineering Approach from S...
Industrial IoT to Predictive Analytics: A Reverse Engineering Approach from S...
 
Rich Feeds for RESCUE and PALMS
Rich Feeds for RESCUE and PALMSRich Feeds for RESCUE and PALMS
Rich Feeds for RESCUE and PALMS
 
Iscram 2008 presentation
Iscram 2008 presentationIscram 2008 presentation
Iscram 2008 presentation
 
Exascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataExascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor Data
 
Data Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health SystemData Harmonization for a Molecularly Driven Health System
Data Harmonization for a Molecularly Driven Health System
 
FedCentric_Presentation
FedCentric_PresentationFedCentric_Presentation
FedCentric_Presentation
 
Call for Papers *** International Journal of Database Management Systems (IJDMS)
Call for Papers *** International Journal of Database Management Systems (IJDMS)Call for Papers *** International Journal of Database Management Systems (IJDMS)
Call for Papers *** International Journal of Database Management Systems (IJDMS)
 
Rdm slides march 2014
Rdm slides march 2014Rdm slides march 2014
Rdm slides march 2014
 
Ncicbiit
NcicbiitNcicbiit
Ncicbiit
 
10th Annual Utah's Health Services Research Conference - Data Quality in Mult...
10th Annual Utah's Health Services Research Conference - Data Quality in Mult...10th Annual Utah's Health Services Research Conference - Data Quality in Mult...
10th Annual Utah's Health Services Research Conference - Data Quality in Mult...
 
City of hope research informatics common data elements
City of hope research informatics common data elementsCity of hope research informatics common data elements
City of hope research informatics common data elements
 

Último

Morse OER Some Benefits and Challenges.pptx
Morse OER Some Benefits and Challenges.pptxMorse OER Some Benefits and Challenges.pptx
Morse OER Some Benefits and Challenges.pptxjmorse8
 
Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...
Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...
Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...Denish Jangid
 
Behavioral-sciences-dr-mowadat rana (1).pdf
Behavioral-sciences-dr-mowadat rana (1).pdfBehavioral-sciences-dr-mowadat rana (1).pdf
Behavioral-sciences-dr-mowadat rana (1).pdfaedhbteg
 
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...Nguyen Thanh Tu Collection
 
The Last Leaf, a short story by O. Henry
The Last Leaf, a short story by O. HenryThe Last Leaf, a short story by O. Henry
The Last Leaf, a short story by O. HenryEugene Lysak
 
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽中 央社
 
Open Educational Resources Primer PowerPoint
Open Educational Resources Primer PowerPointOpen Educational Resources Primer PowerPoint
Open Educational Resources Primer PowerPointELaRue0
 
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...Nguyen Thanh Tu Collection
 
Application of Matrices in real life. Presentation on application of matrices
Application of Matrices in real life. Presentation on application of matricesApplication of Matrices in real life. Presentation on application of matrices
Application of Matrices in real life. Presentation on application of matricesRased Khan
 
Neurulation and the formation of the neural tube
Neurulation and the formation of the neural tubeNeurulation and the formation of the neural tube
Neurulation and the formation of the neural tubeSaadHumayun7
 
IATP How-to Foreign Travel May 2024.pdff
IATP How-to Foreign Travel May 2024.pdffIATP How-to Foreign Travel May 2024.pdff
IATP How-to Foreign Travel May 2024.pdff17thcssbs2
 
How to the fix Attribute Error in odoo 17
How to the fix Attribute Error in odoo 17How to the fix Attribute Error in odoo 17
How to the fix Attribute Error in odoo 17Celine George
 
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45MysoreMuleSoftMeetup
 
Capitol Tech Univ Doctoral Presentation -May 2024
Capitol Tech Univ Doctoral Presentation -May 2024Capitol Tech Univ Doctoral Presentation -May 2024
Capitol Tech Univ Doctoral Presentation -May 2024CapitolTechU
 
slides CapTechTalks Webinar May 2024 Alexander Perry.pptx
slides CapTechTalks Webinar May 2024 Alexander Perry.pptxslides CapTechTalks Webinar May 2024 Alexander Perry.pptx
slides CapTechTalks Webinar May 2024 Alexander Perry.pptxCapitolTechU
 
Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17Celine George
 
philosophy and it's principles based on the life
philosophy and it's principles based on the lifephilosophy and it's principles based on the life
philosophy and it's principles based on the lifeNitinDeodare
 
The Ultimate Guide to Social Media Marketing in 2024.pdf
The Ultimate Guide to Social Media Marketing in 2024.pdfThe Ultimate Guide to Social Media Marketing in 2024.pdf
The Ultimate Guide to Social Media Marketing in 2024.pdfdm4ashexcelr
 

Último (20)

Morse OER Some Benefits and Challenges.pptx
Morse OER Some Benefits and Challenges.pptxMorse OER Some Benefits and Challenges.pptx
Morse OER Some Benefits and Challenges.pptx
 
Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...
Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...
Basic Civil Engineering notes on Transportation Engineering, Modes of Transpo...
 
Behavioral-sciences-dr-mowadat rana (1).pdf
Behavioral-sciences-dr-mowadat rana (1).pdfBehavioral-sciences-dr-mowadat rana (1).pdf
Behavioral-sciences-dr-mowadat rana (1).pdf
 
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...
TỔNG HỢP HƠN 100 ĐỀ THI THỬ TỐT NGHIỆP THPT VẬT LÝ 2024 - TỪ CÁC TRƯỜNG, TRƯ...
 
The Last Leaf, a short story by O. Henry
The Last Leaf, a short story by O. HenryThe Last Leaf, a short story by O. Henry
The Last Leaf, a short story by O. Henry
 
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽會考英聽
 
Open Educational Resources Primer PowerPoint
Open Educational Resources Primer PowerPointOpen Educational Resources Primer PowerPoint
Open Educational Resources Primer PowerPoint
 
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
 
Application of Matrices in real life. Presentation on application of matrices
Application of Matrices in real life. Presentation on application of matricesApplication of Matrices in real life. Presentation on application of matrices
Application of Matrices in real life. Presentation on application of matrices
 
Neurulation and the formation of the neural tube
Neurulation and the formation of the neural tubeNeurulation and the formation of the neural tube
Neurulation and the formation of the neural tube
 
IATP How-to Foreign Travel May 2024.pdff
IATP How-to Foreign Travel May 2024.pdffIATP How-to Foreign Travel May 2024.pdff
IATP How-to Foreign Travel May 2024.pdff
 
How to the fix Attribute Error in odoo 17
How to the fix Attribute Error in odoo 17How to the fix Attribute Error in odoo 17
How to the fix Attribute Error in odoo 17
 
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
 
Capitol Tech Univ Doctoral Presentation -May 2024
Capitol Tech Univ Doctoral Presentation -May 2024Capitol Tech Univ Doctoral Presentation -May 2024
Capitol Tech Univ Doctoral Presentation -May 2024
 
slides CapTechTalks Webinar May 2024 Alexander Perry.pptx
slides CapTechTalks Webinar May 2024 Alexander Perry.pptxslides CapTechTalks Webinar May 2024 Alexander Perry.pptx
slides CapTechTalks Webinar May 2024 Alexander Perry.pptx
 
Word Stress rules esl .pptx
Word Stress rules esl               .pptxWord Stress rules esl               .pptx
Word Stress rules esl .pptx
 
Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 2 STEPS Using Odoo 17
 
Post Exam Fun(da) Intra UEM General Quiz - Finals.pdf
Post Exam Fun(da) Intra UEM General Quiz - Finals.pdfPost Exam Fun(da) Intra UEM General Quiz - Finals.pdf
Post Exam Fun(da) Intra UEM General Quiz - Finals.pdf
 
philosophy and it's principles based on the life
philosophy and it's principles based on the lifephilosophy and it's principles based on the life
philosophy and it's principles based on the life
 
The Ultimate Guide to Social Media Marketing in 2024.pdf
The Ultimate Guide to Social Media Marketing in 2024.pdfThe Ultimate Guide to Social Media Marketing in 2024.pdf
The Ultimate Guide to Social Media Marketing in 2024.pdf
 

Visual tools for databade queries and analysis

  • 1. Visual Tools for Queries and Display of Quantitative Information in a Cancer Research Database JESSE STEWART and JERZY W. JAROMCZYK Department of Computer Science University of Kentucky, Lexington KY
  • 2. The Kentucky Cancer Registry • The Markey Cancer has the singular mission to eliminate the morbidity and mortality of cancer • Since its founding, the Markey Cancer Center and the UK Chandler hospital have served 2000-2200 new patients a year and is one of the few institutions nationwide that address both clinical care as well as cancer research. • The KCR’s case count exceeded 30,000 annually as of 2009 • The KCR houses a wealth of historical data for hundreds of cancer variants, associated treatments, and their relative success across the state of Kentucky.
  • 3. Data Collection Patient Abstracting Internet Registry DB Events CPDMS.NET HTTPS MySQL
  • 4. Accelerating Cancer Research Discover Develop Visualize Important Queries Data Sets Correlations
  • 5. Registry Databases and Research Valuable Information •Survival Trends •Incidence Rates •Behavioral and Geographical Correlation Challenges in Research •Coded Data •SQL •Complex DB Schemas •Access Control •Visualization
  • 6. Software Solutions • Define Queries (Data Sets) – Intuitive: no programming required – Flexible: allow any data set to be explored – Accessible: Visual cross-browser application – Re-use: Save, modify and combine Data Sets • Data Analysis and Visualization: – Context-specific diagrams – Compare data sets singularly or side-by-side – Customizable appearance
  • 7. The Query Builder • Presents a high-level abstraction of the Registry Database • Patient, Case, Therapy data variables are easily recognizable and categorized • Separates the user from the actual database structure and coded information – Example: Treatment is encoded as: • No Treatment=0, Treatment=1, Surveillance=2
  • 8. The Query Builder • Translates a question about cancer data into SQL (Structured Query Language) which can be understood by the computer system • Parses and stores the query for modification and reuse later
  • 9. Example Query • Patients diagnosed between Jan 1, 2005 and Dec 31, 2008 • Patients diagnosed in Kentucky • Patients treated with immunotherapy • SQL may be complex case_data.diagdate >= 20050101 and case_data.diagdate <= 20081231 and case_tx.txtype = ‘I’ and case_data.diagstate = ‘KY’ from case_data, case_tx where case_tx.hospkey = case_data.hospkey and case_tx.patkey = case_data.patkey and case_data.incomplete = 0;
  • 13. Visualization Tools – Scaled Venn Diagrams • User can quickly ascertain relative size of data sets and their relationship to one another – Bar and Histogram Charts • Flexible view of variable distribution for different sets – Survival Trends • View and compare survival rates over time – Statistics • Common descriptive statistics • Comparison with Chi-square, Log rank, T-, Z-tests
  • 21. Success • The Visual Query Builder and Data Analysis tools have become an integral part of CPDMS.NET – the online abstracting system developed at the KCR. • Over 5000 study groups have been created by users of the system. • Features have been added and improved resulting from feedback given by researchers and registrars (cancer data professionals). • Future developments may include: – Wider array of statistical tests – Functions to analyze more than two data sets at once
  • 22. Acknowledgements KCR Informatics and Registry Management Eric Durbin, MS - Director of Informatics Frances Ross, CTR - Director of Registry Operations Isaac Hands - Lead Programmer and Systems Analyst

Notas del editor

  1. Patient Data is collected by Medical facilities across the state of KY.Abstractors read paper/electronic records and code the data as a cancer abstract according to standards.Abstracting is performed using the KCR’s custom CPDMS.NET reporting system.The abstract is transmitted across the internet and stored in the registry database.
  2. Take KCR’s data into something a computer can process and analyze quicklyCreate the tools for analysisDevelop useful ways to present the results of analysisPresent the information in a user friendly manner
  3. Many valuable statistics and trends are hidden in the registry database.Retrieving this information is an arduous task, especially for those without knowledge of SQL
  4. When this information can be analyzed and visualized, life-saving discoveries may be uncovered by research experts. Advancing the understanding of cancer and toward the development of new models and modes of intervention in malignant processes.Take this old mine of information and simplify it visually and numerically;It is hoped that this may help advance the understanding of cancer, and in turn help science fight one of its biggest battles: to better treat and prevent disease.
  5. The Query Builder tool aims to solve the aforementioned problems by providing a visual interface forconstructing database queries without the need to understand the underlying structure of the database orwrite formal SQL expressions.1) Provide access to important registry database objects including Patient, Case, and Therapy information.2) Provide a list of important attributes/fields associated with each object.3) Allow search criteria be entered with minimal effort, and no knowledge of SQL language.4) Show descriptive database field values where appropriate - in addition to or in lieu of coded values.a. Display an appropriate input field for different data types like dates, numbers, and lists.5) Allow the user to construct arbitrarily complex searches by adding as many criteria as needed tothe query.6) Support a set of Boolean operators: AND, OR, XOR, NOT - so search criteria can be joined invarious ways.7) Allow searches to be saved for later use.
  6. Direct interaction with the database system involves the use of a structured query language (SQL)used by most relational database systems. This includes operations like reading, adding, removing, andmodifying data stored by the system. Although this language is readable by humans, special understandingof the syntax and structure of an SQL statement is required for a user to “talk” to the database systemAnd find what he or she is looking for. This can at the very least be cumbersome or nearly impossiblefor those without much experience with programming languages or similar, especially when one tries todescribe a very specific data set.There are several factors that contribute to the disparity between a database language like SQL, and anatural language such as English, each reason of course being related to the way a computer stores andprocesses information in a digital form.Encoding of Each Attribute: helps reduce the database storage space required andincrease performance. Unfortunately the trade-off of is that any SQL statement describing such a recordmust use the coded version of the attribute data rather than a natural textual description. For example,a person’s assigned treatment could be encoded as No Treatment=0, Treatment=1, Surveillance=2. Normalize the Data: avoid duplicatin information and wasting storage space, records are often split up into multiple tablesand associated with one another.
  7. Each condition of the query can be entered with several mouse clicksThe conditions may be joined with Boolean operators AND, OR, etcEncoded values are shown with descriptive translationsThe Query Builder shows a data-type sensitive input for each variableSeparates researchers from data encoding
  8. Syntax Tree is generated from the query and stored in serialized form for later use.Once the user is satisfied with the query, it can be given a title and saved for analysis!
  9. Queries are saved indefinitely for later for each user account.Metadata showing the last modified and edited times are displayedStudy groups can be copied, deleted, edited or created from this interface
  10. Compare the survival distributions of two samples. Nonparametric test – used with data that is censoredUsed frequently in clinical trials applications