SlideShare una empresa de Scribd logo
1 de 6
Information Management Systems
in Behavior Science
Theme: Statistic, Epidemiology Modeling and Diabetes
Lab #2
Etienne Z. Gnimpieba
BRIN WS 2013
Mount Marty College – June 24th 2013
Etienne.gnimpieba@usd.edu
Context
0. Specification & Aims
Lab #2
Statement of problem / Case study: Interdisciplinary and transversal research development invites each scientific field to initiate an evolution toward integration to /for other field. The
bioinformatics and computational biology used in Behavior science remain difficult to describe. We propose here to use Information management tools (data collected, mining, load, statistic
analysis) and systems Biology modeling (epidemiology modeling) as key point for that translational interaction. Epidemiology is the study of the distribution and determinants of health-related
states or events (including disease), and the application of this study to the control of diseases and other health problems. Various methods can be used to carry out epidemiological investigations:
surveillance and descriptive studies can be used to study distribution; analytical studies are used to study determinants.
Bioinformatics and Information Management in Behavior Science
Aim: The aim of this lab is to create a broader
understanding of behavioral science data analysis
and mining using statistic tools. As a part of life
science area, we propose here to use
Bioinformatics and Systems Biology tools in
Epidemiology modeling to predict disease spread.
Acquired skills
Online Server Tools:
- Survey design (from hypothesis to questionnaire)
- Google Apps (design forms, updating questions)
- Data analysis, data learning, data mining
- Using NetLogo (modeling approach)
2
Resolution Process
T1. Creating a Google Survey
Objective: Learn how to make a Google survey
T2. Descriptive and Inference Statistics in Excel
Objective: Extract Load and Treat (ELT) data set for excel statistics used.
T1.1. Setting up the Survey
T1.2 Creating the Questions
T1.3. Collected Data and Visualize the Summary
T2.1. Import/edit/export from different formats (text, tab, xml, …)
T2.2. Descriptive analysis in excel (max, min, typos, count, Stde, average, sum) and
data visualization in excel (histogram, scatter plot, …)
T.2.3. Case Study: Social Reajustment Rating Scale (SRRS) question on diabete
T4. Epidemiology Modeling
Objective: How to use modeling approach to analyze the epidemiology problems
T4.1. Using NetLogo
T4.2. Running epiDEM simulation
T3. Inference statistics in Tanagra (data mining)
Objective: learn, extract knowledge from data using data mining tools (associations rules,
clustering, neuronal network) in Tanagra.
T3.1. Data Mining using Associaton Rule on Dataset
T3.2. K-Means Clustering Method for Data Learning
Information Management in Behavior Science
T1. Creating a Google Survey
Objective : Learn how to make a Google Survey
T1.1. Setting up the Survey
On the Google website: http://www.google.com
o Click on “Drive” tab
o Login or create an account
o Click on the red “Create” button and select the “Form”
o In “Title” you name your survey. You can also select your desired theme. For our example we will name our
survey “Diabetes”
o Click “Ok” to begin creating your survey
o A screen will open and you can complete an number of functions such as
o Providing a description
o Titling your questions
o Providing help text if needed
o Choosing you question type
o Adding more questions
o When you are finished with your survey questions click “Done”
o You can now choose the options to
o Show link to submit another response
o Publish and show the link to the results of the form to all the responders
o Allow responders to edit responses after submitting
o You can now share the link of your survey through Google +, Facebook, and twitter, or sending forms via e-mail.
o When finished selecting recipients click “Done”
Etienne Z. Gnimpieba
BRIN WS 2013
Mount Marty College – June 24th 2013
T1.2. Creating the Questions
T1.3. Collected Data and Visualize the Summary
Statistics in Behavior Science
Information Management in Behavior Science
T2. Descriptive and Inference Statistics in Excel
Objective: Extract Load and Treat (ELT) data set for excel statistics used
T2.1. Import/Edit/Export from different Formats (text, tab, xml…)
Etienne Z. Gnimpieba
BRIN WS 2013
Mount Marty College – June 24th 2013
o Open Excel and go to the “Data” tab. Click on “From Text” and select the file “data_lab8” from the student folder.
o Click “Next” Under “Delimiters” select only the “comma” Click “Finish” and “OK”
o Click on the column letter you want to sort by and click on “Sort and Filter” on the top left of the “Data” tab.
o Select the “Sort A though Z” Make sure “Expand with selection is selected and click “Sort”
T2.2. Descriptive Analysis in Excel and Data Visualisation in Excel
o Click on the cell under the title “Average” Under the “Formulas” tab click on “More Functions”, “Statistical” and “Average”
o To find the average of the cars that traveled enter “D2:D21” in “Number 1” and click “OK”
o To find the standard deviation of the number accidents select the cell under the title “Stdev” and click on “More Functions”, “Statistical”,
and “STDEV.S”. In “Number 1” put “E2:E21”.
o Highlight the data under “Cars Travel” (D2:D21) Go to the “Insert” tab and click on “Line” and select the first dot line option given
o You can edit the chart multiple ways and change the looks of in by going under the tabs that show up when the chart is selected
o Under the “Layout” tab you can add titles for the X and Y axis's.
o Right click the chart and select “Change Chart Type…” if you want a different type of graph
T.2.3. Case Study: Social Reajustment Rating Scale (SRRS) question on Diabetes
o Open “Case Study-Diabetes” from your student folder
o Select cell C2 and type in the equation “=$B2*$B2” and press enter.
o Hover your mouse over the lower right corner and drag the black box down to C7 to apply this equation to the rest of the cells.
o Select cell E2 and type in the equation “=$D2*$D2” and again drag down until cell E7
o Select cell F2 and type in the equation “=$B2*$D2” and again drag down until cell F7
o Select cell B8 and click on the button “Math and Trig” and select “SUM” Type in “Number 1” “B2:B7” and then press “OK”
o Drag the black box across to F8. Now your table should be complete
o Click on the “Insert” tab and pull down the “Scatter” button and select the first chart
Statistics in Behavior Science
Statistics in Behavior Science Information Management in Behavior Science
T3. Inference statistics in Tanagra
Objective: learn, extract knowledge from data using data mining tools (associations rules,
clustering, neuronal network) in Tanagra.
Data Mining
Etienne Z. Gnimpieba
BRIN WS 2013
Mount Marty College – June 24th 2013
T3.1. Data Mining using Association Rule on Dataset
o Open Tanagra. Pull down “File” and select “Open…” Pull down “Files of Type” and select “Binary data mining diagram
(*.bdm) Then open the file “T41Bin_transactions.bdm” in your student folder
o Click on the “Define Attributes Status” icon [ ] Select all “Attributes” by clicking on [ ] and move over into “Input”
by clicking on the arrow. Click “OK”
o Right click “Define Status 1” on the left and select “Execute”
o Click on “Association” on the bottom. Pull “A priori” on the top of “Define Status 1” on top left. Then double click “A priori 1”
to see the results
T3.2. K-Means Clustering Method for Data Learning
o Open Tanagra, pull down “File” and click on this icon [ ] Then click on this icon [ ]Change file type to “Excel File..”
and open the file “cars” from your student folder. Click “Open”
o Click on this icon [ ] Move “MPG”, “Weight”, and “Drive Ratio” into the “Input” and click “OK”
o Click on “Clustering” at the bottom, and pull “HAC” on top of “Define status1” Then right click “HAC 1” and change “Best
Clusters” to “Detect”
o You can open the “Dendrogram” tab on the top of the view window. Move back to the “Report” view
o Add another “Define Status” under “HAC 1” and select all of the “Attributes” into “Input” except “Car” ad
“Cluster_HAC_1”
o Open “Target” tab and put “Cluster_HAC_1” in
o Click on “Statistics” and put “Group characterization” under “Define status 2”
o Click on “Factorial Analysis” and put “Principal Component Analysis 1” under “HAC 1” Add a “Define status” under
“Principal Component Analysis 1” as well. Change the “Parameters” for the new “Define status 3” by putting “MPG”,
“Wieght”, “Drive_Ration”, “Horsepower”, “Displacement”, and “cylinders” in “Input” and “PCA_1_Axis _1” and
“PCA_1_Axis_2” in “Target.
o Click on “Data Visualization” and put “Correlation scatterplot” under “Define status 3” In any top right pull down in the
vieww window select “Cluster_HAC_1”. In the other two pull downs put “PCA_1_Axis_1” and “PCA_1_Axis_2”
Information Management in Behavior Science
T4. Epidemiology Modeling
Objective : How to use modeling approach to analyze the epidemiology problems
T4.1 Using NetLogo
o Click “Run epiDEM Travel and Control in you browser” to launch it
o You can change the parameters of the system as you please and simulate the experiment
o Click “Setup” and the system will automatically build a population
o Execute by clicking “Go”
o You will be able to see the results of the population changes in the left graphs
o Right Click and select “Copy Image” in order to copy the image of your results
Etienne Z. Gnimpieba
BRIN WS 2013
Mount Marty College – June 24th 2013
T4.2. Running epiDEM Simulation
On the NetLogo website: http://ccl.northwestern.edu/netlogo/ (Make sure you are using
Chrome browser)
o Click on “Library” and scroll to the bottom of the page and select “epiDEM Travel and Control”
Statistics in Behavior Science

Más contenido relacionado

La actualidad más candente

NPA Data Science: Outcomes and PCs
NPA Data Science: Outcomes and PCsNPA Data Science: Outcomes and PCs
NPA Data Science: Outcomes and PCsKate Farrell
 
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINEA NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINEaciijournal
 
Effect of elements on linear elastic stress analysis
Effect of elements on linear elastic stress analysisEffect of elements on linear elastic stress analysis
Effect of elements on linear elastic stress analysiseSAT Publishing House
 
Advanced Filter Concepts in MS-Excel
Advanced Filter Concepts in MS-ExcelAdvanced Filter Concepts in MS-Excel
Advanced Filter Concepts in MS-ExcelP. SUNDARI ARUN
 
IRJET- Analysis of Health-Care Capabilities of Different Countries in Order t...
IRJET- Analysis of Health-Care Capabilities of Different Countries in Order t...IRJET- Analysis of Health-Care Capabilities of Different Countries in Order t...
IRJET- Analysis of Health-Care Capabilities of Different Countries in Order t...IRJET Journal
 
IRJET- Breast Cancer Relapse Prognosis by Classic and Modern Structures o...
IRJET-  	  Breast Cancer Relapse Prognosis by Classic and Modern Structures o...IRJET-  	  Breast Cancer Relapse Prognosis by Classic and Modern Structures o...
IRJET- Breast Cancer Relapse Prognosis by Classic and Modern Structures o...IRJET Journal
 
A02610104
A02610104A02610104
A02610104theijes
 
Application of Exponential Gamma Distribution in Modeling Queuing Data
Application of Exponential Gamma Distribution in Modeling Queuing DataApplication of Exponential Gamma Distribution in Modeling Queuing Data
Application of Exponential Gamma Distribution in Modeling Queuing Dataijtsrd
 
DATA MINING METHODOLOGIES TO STUDY STUDENT'S ACADEMIC PERFORMANCE USING THE...
DATA MINING METHODOLOGIES TO  STUDY STUDENT'S ACADEMIC  PERFORMANCE USING THE...DATA MINING METHODOLOGIES TO  STUDY STUDENT'S ACADEMIC  PERFORMANCE USING THE...
DATA MINING METHODOLOGIES TO STUDY STUDENT'S ACADEMIC PERFORMANCE USING THE...ijcsa
 
G. Barcaroli, The use of machine learning in official statistics
G. Barcaroli, The use of machine learning in official statisticsG. Barcaroli, The use of machine learning in official statistics
G. Barcaroli, The use of machine learning in official statisticsIstituto nazionale di statistica
 
Session ii g1 lab genomics and gene expression mmc-corr
Session ii g1 lab genomics and gene expression mmc-corrSession ii g1 lab genomics and gene expression mmc-corr
Session ii g1 lab genomics and gene expression mmc-corrUSD Bioinformatics
 
EE660 Project_sl_final
EE660 Project_sl_finalEE660 Project_sl_final
EE660 Project_sl_finalShanglin Yang
 
Risk Factors and Identifiers for Alzheimer’s Disease: A Data Mining Analysis
Risk Factors and Identifiers for Alzheimer’s Disease:  A Data Mining AnalysisRisk Factors and Identifiers for Alzheimer’s Disease:  A Data Mining Analysis
Risk Factors and Identifiers for Alzheimer’s Disease: A Data Mining Analysisertekg
 
Decision tree induction
Decision tree inductionDecision tree induction
Decision tree inductionthamizh arasi
 

La actualidad más candente (19)

NPA Data Science: Outcomes and PCs
NPA Data Science: Outcomes and PCsNPA Data Science: Outcomes and PCs
NPA Data Science: Outcomes and PCs
 
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINEA NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE
 
Biodata analysis
Biodata analysisBiodata analysis
Biodata analysis
 
Effect of elements on linear elastic stress analysis
Effect of elements on linear elastic stress analysisEffect of elements on linear elastic stress analysis
Effect of elements on linear elastic stress analysis
 
Advanced Filter Concepts in MS-Excel
Advanced Filter Concepts in MS-ExcelAdvanced Filter Concepts in MS-Excel
Advanced Filter Concepts in MS-Excel
 
IRJET- Analysis of Health-Care Capabilities of Different Countries in Order t...
IRJET- Analysis of Health-Care Capabilities of Different Countries in Order t...IRJET- Analysis of Health-Care Capabilities of Different Countries in Order t...
IRJET- Analysis of Health-Care Capabilities of Different Countries in Order t...
 
XL-MINER: Data Utilities
XL-MINER: Data UtilitiesXL-MINER: Data Utilities
XL-MINER: Data Utilities
 
IRJET- Breast Cancer Relapse Prognosis by Classic and Modern Structures o...
IRJET-  	  Breast Cancer Relapse Prognosis by Classic and Modern Structures o...IRJET-  	  Breast Cancer Relapse Prognosis by Classic and Modern Structures o...
IRJET- Breast Cancer Relapse Prognosis by Classic and Modern Structures o...
 
A02610104
A02610104A02610104
A02610104
 
Application of Exponential Gamma Distribution in Modeling Queuing Data
Application of Exponential Gamma Distribution in Modeling Queuing DataApplication of Exponential Gamma Distribution in Modeling Queuing Data
Application of Exponential Gamma Distribution in Modeling Queuing Data
 
DATA MINING METHODOLOGIES TO STUDY STUDENT'S ACADEMIC PERFORMANCE USING THE...
DATA MINING METHODOLOGIES TO  STUDY STUDENT'S ACADEMIC  PERFORMANCE USING THE...DATA MINING METHODOLOGIES TO  STUDY STUDENT'S ACADEMIC  PERFORMANCE USING THE...
DATA MINING METHODOLOGIES TO STUDY STUDENT'S ACADEMIC PERFORMANCE USING THE...
 
G. Barcaroli, The use of machine learning in official statistics
G. Barcaroli, The use of machine learning in official statisticsG. Barcaroli, The use of machine learning in official statistics
G. Barcaroli, The use of machine learning in official statistics
 
R12 i setup doc
R12 i setup docR12 i setup doc
R12 i setup doc
 
Session ii g1 lab genomics and gene expression mmc-corr
Session ii g1 lab genomics and gene expression mmc-corrSession ii g1 lab genomics and gene expression mmc-corr
Session ii g1 lab genomics and gene expression mmc-corr
 
Q45019899
Q45019899Q45019899
Q45019899
 
EE660 Project_sl_final
EE660 Project_sl_finalEE660 Project_sl_final
EE660 Project_sl_final
 
Risk Factors and Identifiers for Alzheimer’s Disease: A Data Mining Analysis
Risk Factors and Identifiers for Alzheimer’s Disease:  A Data Mining AnalysisRisk Factors and Identifiers for Alzheimer’s Disease:  A Data Mining Analysis
Risk Factors and Identifiers for Alzheimer’s Disease: A Data Mining Analysis
 
Dma unit 1
Dma unit   1Dma unit   1
Dma unit 1
 
Decision tree induction
Decision tree inductionDecision tree induction
Decision tree induction
 

Destacado

Clinical Application of RNA Sequencing - Bladder Cancer
Clinical Application of RNA Sequencing - Bladder CancerClinical Application of RNA Sequencing - Bladder Cancer
Clinical Application of RNA Sequencing - Bladder CancerUSD Bioinformatics
 
Session ii g3 overview behavior science mmc
Session ii g3 overview behavior science mmcSession ii g3 overview behavior science mmc
Session ii g3 overview behavior science mmcUSD Bioinformatics
 
Session ii g2 overview metabolic network modeling mcc
Session ii g2 overview metabolic network modeling mccSession ii g2 overview metabolic network modeling mcc
Session ii g2 overview metabolic network modeling mccUSD Bioinformatics
 
Session ii g3 overview epidemiology modeling mmc
Session ii g3 overview epidemiology modeling mmcSession ii g3 overview epidemiology modeling mmc
Session ii g3 overview epidemiology modeling mmcUSD Bioinformatics
 
Session ii g2 overview protein modeling mmc
Session ii g2 overview protein modeling mmcSession ii g2 overview protein modeling mmc
Session ii g2 overview protein modeling mmcUSD Bioinformatics
 
Next Generation Sequencing - the basics
Next Generation Sequencing - the basicsNext Generation Sequencing - the basics
Next Generation Sequencing - the basicsUSD Bioinformatics
 
Improved Reagents & Methods for Target Enrichment in Next Generation Sequencing
Improved Reagents & Methods for Target Enrichment in Next Generation SequencingImproved Reagents & Methods for Target Enrichment in Next Generation Sequencing
Improved Reagents & Methods for Target Enrichment in Next Generation SequencingIntegrated DNA Technologies
 
Small Molecule Real Time Sequencing
Small Molecule Real Time SequencingSmall Molecule Real Time Sequencing
Small Molecule Real Time SequencingUSD Bioinformatics
 

Destacado (19)

Clinical Application of RNA Sequencing - Bladder Cancer
Clinical Application of RNA Sequencing - Bladder CancerClinical Application of RNA Sequencing - Bladder Cancer
Clinical Application of RNA Sequencing - Bladder Cancer
 
Session ii g3 overview behavior science mmc
Session ii g3 overview behavior science mmcSession ii g3 overview behavior science mmc
Session ii g3 overview behavior science mmc
 
Session ii g2 overview metabolic network modeling mcc
Session ii g2 overview metabolic network modeling mccSession ii g2 overview metabolic network modeling mcc
Session ii g2 overview metabolic network modeling mcc
 
Session ii g3 overview epidemiology modeling mmc
Session ii g3 overview epidemiology modeling mmcSession ii g3 overview epidemiology modeling mmc
Session ii g3 overview epidemiology modeling mmc
 
Session ii g2 overview protein modeling mmc
Session ii g2 overview protein modeling mmcSession ii g2 overview protein modeling mmc
Session ii g2 overview protein modeling mmc
 
Clinical Application 2.0
Clinical Application 2.0Clinical Application 2.0
Clinical Application 2.0
 
Visualization Tools
Visualization ToolsVisualization Tools
Visualization Tools
 
Clinical Application 1.0
Clinical Application 1.0Clinical Application 1.0
Clinical Application 1.0
 
Bridge Amplification Part 2
Bridge Amplification Part 2Bridge Amplification Part 2
Bridge Amplification Part 2
 
True Single Molecule Sequencing
True Single Molecule SequencingTrue Single Molecule Sequencing
True Single Molecule Sequencing
 
Next Generation Sequencing - the basics
Next Generation Sequencing - the basicsNext Generation Sequencing - the basics
Next Generation Sequencing - the basics
 
Ion Torrent Sequencing
Ion Torrent SequencingIon Torrent Sequencing
Ion Torrent Sequencing
 
Improved Reagents & Methods for Target Enrichment in Next Generation Sequencing
Improved Reagents & Methods for Target Enrichment in Next Generation SequencingImproved Reagents & Methods for Target Enrichment in Next Generation Sequencing
Improved Reagents & Methods for Target Enrichment in Next Generation Sequencing
 
Bridge Amplification Part 1
Bridge Amplification Part 1Bridge Amplification Part 1
Bridge Amplification Part 1
 
Sanger Dideoxy Method
Sanger Dideoxy MethodSanger Dideoxy Method
Sanger Dideoxy Method
 
Small Molecule Real Time Sequencing
Small Molecule Real Time SequencingSmall Molecule Real Time Sequencing
Small Molecule Real Time Sequencing
 
Pyrosequencing 454
Pyrosequencing 454Pyrosequencing 454
Pyrosequencing 454
 
Illumina Sequencing
Illumina SequencingIllumina Sequencing
Illumina Sequencing
 
Basic Steps of the NGS Method
Basic Steps of the NGS MethodBasic Steps of the NGS Method
Basic Steps of the NGS Method
 

Similar a Session ii g3 lab behavior science mmc

Tutorial for Starting to Analyse Data with SPSS
Tutorial for Starting to Analyse Data with SPSSTutorial for Starting to Analyse Data with SPSS
Tutorial for Starting to Analyse Data with SPSSMathew Toll
 
UNIT1-2.pptx
UNIT1-2.pptxUNIT1-2.pptx
UNIT1-2.pptxcsecem
 
Week 2 Project - STAT 3001Student Name Type your name here.docx
Week 2 Project - STAT 3001Student Name Type your name here.docxWeek 2 Project - STAT 3001Student Name Type your name here.docx
Week 2 Project - STAT 3001Student Name Type your name here.docxcockekeshia
 
SPSS :Introduction for beginners
SPSS :Introduction for beginners SPSS :Introduction for beginners
SPSS :Introduction for beginners PRAKASAM C P
 
IRJET- A Survey on Prediction of Heart Disease Presence using Data Mining and...
IRJET- A Survey on Prediction of Heart Disease Presence using Data Mining and...IRJET- A Survey on Prediction of Heart Disease Presence using Data Mining and...
IRJET- A Survey on Prediction of Heart Disease Presence using Data Mining and...IRJET Journal
 
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARECLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CAREijistjournal
 
Creating visual representations of time period data
Creating visual representations of time period dataCreating visual representations of time period data
Creating visual representations of time period dataQHRClinicalOps
 
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARECLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CAREijistjournal
 
monitoring_loads_duration_loadcurve_tutorial.ppt
monitoring_loads_duration_loadcurve_tutorial.pptmonitoring_loads_duration_loadcurve_tutorial.ppt
monitoring_loads_duration_loadcurve_tutorial.pptssuserf71928
 
MAT 240 Creating Histograms in Excel This tutorial wil
MAT 240 Creating Histograms in Excel   This tutorial wilMAT 240 Creating Histograms in Excel   This tutorial wil
MAT 240 Creating Histograms in Excel This tutorial wilAbramMartino96
 
MAT 240 Creating Histograms in Excel This tutorial wil
MAT 240 Creating Histograms in Excel   This tutorial wilMAT 240 Creating Histograms in Excel   This tutorial wil
MAT 240 Creating Histograms in Excel This tutorial wilAbramMartino96
 
Machine learning ppt unit one syllabuspptx
Machine learning ppt unit one syllabuspptxMachine learning ppt unit one syllabuspptx
Machine learning ppt unit one syllabuspptxVenkateswaraBabuRavi
 
Lecture 1 practical_guidelines_assignment
Lecture 1 practical_guidelines_assignmentLecture 1 practical_guidelines_assignment
Lecture 1 practical_guidelines_assignmentDaria Bogdanova
 

Similar a Session ii g3 lab behavior science mmc (20)

Tutorial for Starting to Analyse Data with SPSS
Tutorial for Starting to Analyse Data with SPSSTutorial for Starting to Analyse Data with SPSS
Tutorial for Starting to Analyse Data with SPSS
 
UNIT1-2.pptx
UNIT1-2.pptxUNIT1-2.pptx
UNIT1-2.pptx
 
Week 2 Project - STAT 3001Student Name Type your name here.docx
Week 2 Project - STAT 3001Student Name Type your name here.docxWeek 2 Project - STAT 3001Student Name Type your name here.docx
Week 2 Project - STAT 3001Student Name Type your name here.docx
 
Ijcatr04041015
Ijcatr04041015Ijcatr04041015
Ijcatr04041015
 
Tinkerplots
TinkerplotsTinkerplots
Tinkerplots
 
SPSS :Introduction for beginners
SPSS :Introduction for beginners SPSS :Introduction for beginners
SPSS :Introduction for beginners
 
Chapter 3
Chapter 3Chapter 3
Chapter 3
 
IRJET- A Survey on Prediction of Heart Disease Presence using Data Mining and...
IRJET- A Survey on Prediction of Heart Disease Presence using Data Mining and...IRJET- A Survey on Prediction of Heart Disease Presence using Data Mining and...
IRJET- A Survey on Prediction of Heart Disease Presence using Data Mining and...
 
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARECLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
 
Creating visual representations of time period data
Creating visual representations of time period dataCreating visual representations of time period data
Creating visual representations of time period data
 
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARECLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
CLUSTERING DICHOTOMOUS DATA FOR HEALTH CARE
 
monitoring_loads_duration_loadcurve_tutorial.ppt
monitoring_loads_duration_loadcurve_tutorial.pptmonitoring_loads_duration_loadcurve_tutorial.ppt
monitoring_loads_duration_loadcurve_tutorial.ppt
 
EDM405 4.pptx
EDM405 4.pptxEDM405 4.pptx
EDM405 4.pptx
 
MAT 240 Creating Histograms in Excel This tutorial wil
MAT 240 Creating Histograms in Excel   This tutorial wilMAT 240 Creating Histograms in Excel   This tutorial wil
MAT 240 Creating Histograms in Excel This tutorial wil
 
MAT 240 Creating Histograms in Excel This tutorial wil
MAT 240 Creating Histograms in Excel   This tutorial wilMAT 240 Creating Histograms in Excel   This tutorial wil
MAT 240 Creating Histograms in Excel This tutorial wil
 
SPSS FINAL.pdf
SPSS FINAL.pdfSPSS FINAL.pdf
SPSS FINAL.pdf
 
Statistics
StatisticsStatistics
Statistics
 
Machine learning ppt unit one syllabuspptx
Machine learning ppt unit one syllabuspptxMachine learning ppt unit one syllabuspptx
Machine learning ppt unit one syllabuspptx
 
3320 lab1
3320 lab13320 lab1
3320 lab1
 
Lecture 1 practical_guidelines_assignment
Lecture 1 practical_guidelines_assignmentLecture 1 practical_guidelines_assignment
Lecture 1 practical_guidelines_assignment
 

Más de USD Bioinformatics

Más de USD Bioinformatics (11)

Session ii g2 overview chemical modeling mmc
Session ii g2 overview chemical modeling mmcSession ii g2 overview chemical modeling mmc
Session ii g2 overview chemical modeling mmc
 
Session ii g2 lab modeling mmc
Session ii g2 lab modeling mmcSession ii g2 lab modeling mmc
Session ii g2 lab modeling mmc
 
Session ii g1 overview genomics and gene expression mmc-good
Session ii g1 overview genomics and gene expression mmc-goodSession ii g1 overview genomics and gene expression mmc-good
Session ii g1 overview genomics and gene expression mmc-good
 
Session i overview bioinfo dm and app mmc
Session i overview bioinfo dm and app mmcSession i overview bioinfo dm and app mmc
Session i overview bioinfo dm and app mmc
 
Session i lab bioinfo dm and app mmc
Session i lab bioinfo dm and app mmcSession i lab bioinfo dm and app mmc
Session i lab bioinfo dm and app mmc
 
Swiss model evaluation
Swiss model evaluationSwiss model evaluation
Swiss model evaluation
 
Amino acid sequence
Amino acid sequenceAmino acid sequence
Amino acid sequence
 
Brin bws13 quiz mmc
Brin bws13 quiz mmcBrin bws13 quiz mmc
Brin bws13 quiz mmc
 
Brin annimation
Brin annimationBrin annimation
Brin annimation
 
Huber brin pb1_f2_poster_2012
Huber brin pb1_f2_poster_2012Huber brin pb1_f2_poster_2012
Huber brin pb1_f2_poster_2012
 
Lab Gene Expression Data Analysis
Lab Gene Expression Data AnalysisLab Gene Expression Data Analysis
Lab Gene Expression Data Analysis
 

Último

Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 

Último (20)

Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 

Session ii g3 lab behavior science mmc

  • 1. Information Management Systems in Behavior Science Theme: Statistic, Epidemiology Modeling and Diabetes Lab #2 Etienne Z. Gnimpieba BRIN WS 2013 Mount Marty College – June 24th 2013 Etienne.gnimpieba@usd.edu
  • 2. Context 0. Specification & Aims Lab #2 Statement of problem / Case study: Interdisciplinary and transversal research development invites each scientific field to initiate an evolution toward integration to /for other field. The bioinformatics and computational biology used in Behavior science remain difficult to describe. We propose here to use Information management tools (data collected, mining, load, statistic analysis) and systems Biology modeling (epidemiology modeling) as key point for that translational interaction. Epidemiology is the study of the distribution and determinants of health-related states or events (including disease), and the application of this study to the control of diseases and other health problems. Various methods can be used to carry out epidemiological investigations: surveillance and descriptive studies can be used to study distribution; analytical studies are used to study determinants. Bioinformatics and Information Management in Behavior Science Aim: The aim of this lab is to create a broader understanding of behavioral science data analysis and mining using statistic tools. As a part of life science area, we propose here to use Bioinformatics and Systems Biology tools in Epidemiology modeling to predict disease spread. Acquired skills Online Server Tools: - Survey design (from hypothesis to questionnaire) - Google Apps (design forms, updating questions) - Data analysis, data learning, data mining - Using NetLogo (modeling approach) 2 Resolution Process T1. Creating a Google Survey Objective: Learn how to make a Google survey T2. Descriptive and Inference Statistics in Excel Objective: Extract Load and Treat (ELT) data set for excel statistics used. T1.1. Setting up the Survey T1.2 Creating the Questions T1.3. Collected Data and Visualize the Summary T2.1. Import/edit/export from different formats (text, tab, xml, …) T2.2. Descriptive analysis in excel (max, min, typos, count, Stde, average, sum) and data visualization in excel (histogram, scatter plot, …) T.2.3. Case Study: Social Reajustment Rating Scale (SRRS) question on diabete T4. Epidemiology Modeling Objective: How to use modeling approach to analyze the epidemiology problems T4.1. Using NetLogo T4.2. Running epiDEM simulation T3. Inference statistics in Tanagra (data mining) Objective: learn, extract knowledge from data using data mining tools (associations rules, clustering, neuronal network) in Tanagra. T3.1. Data Mining using Associaton Rule on Dataset T3.2. K-Means Clustering Method for Data Learning
  • 3. Information Management in Behavior Science T1. Creating a Google Survey Objective : Learn how to make a Google Survey T1.1. Setting up the Survey On the Google website: http://www.google.com o Click on “Drive” tab o Login or create an account o Click on the red “Create” button and select the “Form” o In “Title” you name your survey. You can also select your desired theme. For our example we will name our survey “Diabetes” o Click “Ok” to begin creating your survey o A screen will open and you can complete an number of functions such as o Providing a description o Titling your questions o Providing help text if needed o Choosing you question type o Adding more questions o When you are finished with your survey questions click “Done” o You can now choose the options to o Show link to submit another response o Publish and show the link to the results of the form to all the responders o Allow responders to edit responses after submitting o You can now share the link of your survey through Google +, Facebook, and twitter, or sending forms via e-mail. o When finished selecting recipients click “Done” Etienne Z. Gnimpieba BRIN WS 2013 Mount Marty College – June 24th 2013 T1.2. Creating the Questions T1.3. Collected Data and Visualize the Summary Statistics in Behavior Science
  • 4. Information Management in Behavior Science T2. Descriptive and Inference Statistics in Excel Objective: Extract Load and Treat (ELT) data set for excel statistics used T2.1. Import/Edit/Export from different Formats (text, tab, xml…) Etienne Z. Gnimpieba BRIN WS 2013 Mount Marty College – June 24th 2013 o Open Excel and go to the “Data” tab. Click on “From Text” and select the file “data_lab8” from the student folder. o Click “Next” Under “Delimiters” select only the “comma” Click “Finish” and “OK” o Click on the column letter you want to sort by and click on “Sort and Filter” on the top left of the “Data” tab. o Select the “Sort A though Z” Make sure “Expand with selection is selected and click “Sort” T2.2. Descriptive Analysis in Excel and Data Visualisation in Excel o Click on the cell under the title “Average” Under the “Formulas” tab click on “More Functions”, “Statistical” and “Average” o To find the average of the cars that traveled enter “D2:D21” in “Number 1” and click “OK” o To find the standard deviation of the number accidents select the cell under the title “Stdev” and click on “More Functions”, “Statistical”, and “STDEV.S”. In “Number 1” put “E2:E21”. o Highlight the data under “Cars Travel” (D2:D21) Go to the “Insert” tab and click on “Line” and select the first dot line option given o You can edit the chart multiple ways and change the looks of in by going under the tabs that show up when the chart is selected o Under the “Layout” tab you can add titles for the X and Y axis's. o Right click the chart and select “Change Chart Type…” if you want a different type of graph T.2.3. Case Study: Social Reajustment Rating Scale (SRRS) question on Diabetes o Open “Case Study-Diabetes” from your student folder o Select cell C2 and type in the equation “=$B2*$B2” and press enter. o Hover your mouse over the lower right corner and drag the black box down to C7 to apply this equation to the rest of the cells. o Select cell E2 and type in the equation “=$D2*$D2” and again drag down until cell E7 o Select cell F2 and type in the equation “=$B2*$D2” and again drag down until cell F7 o Select cell B8 and click on the button “Math and Trig” and select “SUM” Type in “Number 1” “B2:B7” and then press “OK” o Drag the black box across to F8. Now your table should be complete o Click on the “Insert” tab and pull down the “Scatter” button and select the first chart Statistics in Behavior Science
  • 5. Statistics in Behavior Science Information Management in Behavior Science T3. Inference statistics in Tanagra Objective: learn, extract knowledge from data using data mining tools (associations rules, clustering, neuronal network) in Tanagra. Data Mining Etienne Z. Gnimpieba BRIN WS 2013 Mount Marty College – June 24th 2013 T3.1. Data Mining using Association Rule on Dataset o Open Tanagra. Pull down “File” and select “Open…” Pull down “Files of Type” and select “Binary data mining diagram (*.bdm) Then open the file “T41Bin_transactions.bdm” in your student folder o Click on the “Define Attributes Status” icon [ ] Select all “Attributes” by clicking on [ ] and move over into “Input” by clicking on the arrow. Click “OK” o Right click “Define Status 1” on the left and select “Execute” o Click on “Association” on the bottom. Pull “A priori” on the top of “Define Status 1” on top left. Then double click “A priori 1” to see the results T3.2. K-Means Clustering Method for Data Learning o Open Tanagra, pull down “File” and click on this icon [ ] Then click on this icon [ ]Change file type to “Excel File..” and open the file “cars” from your student folder. Click “Open” o Click on this icon [ ] Move “MPG”, “Weight”, and “Drive Ratio” into the “Input” and click “OK” o Click on “Clustering” at the bottom, and pull “HAC” on top of “Define status1” Then right click “HAC 1” and change “Best Clusters” to “Detect” o You can open the “Dendrogram” tab on the top of the view window. Move back to the “Report” view o Add another “Define Status” under “HAC 1” and select all of the “Attributes” into “Input” except “Car” ad “Cluster_HAC_1” o Open “Target” tab and put “Cluster_HAC_1” in o Click on “Statistics” and put “Group characterization” under “Define status 2” o Click on “Factorial Analysis” and put “Principal Component Analysis 1” under “HAC 1” Add a “Define status” under “Principal Component Analysis 1” as well. Change the “Parameters” for the new “Define status 3” by putting “MPG”, “Wieght”, “Drive_Ration”, “Horsepower”, “Displacement”, and “cylinders” in “Input” and “PCA_1_Axis _1” and “PCA_1_Axis_2” in “Target. o Click on “Data Visualization” and put “Correlation scatterplot” under “Define status 3” In any top right pull down in the vieww window select “Cluster_HAC_1”. In the other two pull downs put “PCA_1_Axis_1” and “PCA_1_Axis_2”
  • 6. Information Management in Behavior Science T4. Epidemiology Modeling Objective : How to use modeling approach to analyze the epidemiology problems T4.1 Using NetLogo o Click “Run epiDEM Travel and Control in you browser” to launch it o You can change the parameters of the system as you please and simulate the experiment o Click “Setup” and the system will automatically build a population o Execute by clicking “Go” o You will be able to see the results of the population changes in the left graphs o Right Click and select “Copy Image” in order to copy the image of your results Etienne Z. Gnimpieba BRIN WS 2013 Mount Marty College – June 24th 2013 T4.2. Running epiDEM Simulation On the NetLogo website: http://ccl.northwestern.edu/netlogo/ (Make sure you are using Chrome browser) o Click on “Library” and scroll to the bottom of the page and select “epiDEM Travel and Control” Statistics in Behavior Science

Notas del editor

  1. Th is is the lab template: The context is a biological context based on a real biological problem. And a given hypothesisI don’t use computer science, strong word.When you read this template, you have a different view than an informatician.You want to understand the process to build the used tools.The architecture of the systemThe algorithm implementationThe quality of the resulting dataAnd so on