SlideShare una empresa de Scribd logo
1 de 19
Descargar para leer sin conexión
Data Science (Informatics)
    Programs at GMU

                   Kirk Borne
                  George Mason University
  School of Physics, Astronomy, & Computational Sciences
                 http://spacs.gmu.edu/
Outline
• Graduate Program at GMU
• Undergraduate Program at GMU
• Challenge Areas and Reflections




                                    2
CSI Graduate Program at GMU
• http://spacs.gmu.edu/content/academic-programs
• CSI = Computational Science & Informatics
   – CSI graduate program has existed at GMU since 1991
   – Over 200 PhD’s graduated in past 20 years
   – Approximately 90 students currently enrolled
   – About 10% of students end with M.S. (Masters in Computational
     Science)
   – We have a Graduate Certificate in Computational Techniques and
     Applications (non-degree professional certification program)
   – We are developing online Distance Education courses




                                                                      3
CSI Graduate Program at GMU
• http://spacs.gmu.edu/content/academic-programs
• CSI = Computational Science & Informatics
   – Students can choose a concentration from several choices:
       •   Computational Astrophysics
       •   Space Science
       •   Computational Physics
       •   Computational Fluid Dynamics
       •   Computational Statistics
       •   Computational Learning
       •   Computational Mathematics
       •   Computational Materials Science (Physical Chemistry)
   – A student may “create” their own concentration, such as one of these
     previously approved concentrations:
       • Computational Finance
       • Remote Sensing
       • Computational Economics
                                                                            4
CSI Graduate Program at GMU
• http://spacs.gmu.edu/content/academic-programs
• CSI = Computational Science & Informatics
   – Note that there is no specific X-Informatics concentration in CSI.
   – However, GMU students can enroll in other departments for specific
     X-Informatics disciplines:
       • Geoinformatics (including Geospatial Intelligence)
       • Bioinformatics
       • Health Informatics
   – We have discussed adding an AstroInformatics concentration in CSI,
     but it is not really necessary since interested students are already
     doing it – i.e., “creating” their own concentration and working with
     the resident AstroInformatics person (K.B.)




                                                                            5
CSI Graduate Program at GMU
• http://spacs.gmu.edu/content/academic-programs
• CSI = Computational Science & Informatics
  Students must complete 4 core courses from this set of 5:
   – CSI 700 Numerical Methods
   – CSI 701 Foundations of Computational Science
   – CSI 702 High Performance Computing
   – CSI 703 Scientific and Statistical Visualization
   – CSI 710 Scientific Databases
  There are also many electives (at least 5 additional CSI
  courses are required, plus concentration science electives).
  For example:
   – Data Mining, Knowledge Mining, Computational Learning,
     Statistical Learning, Computational Statistics, Statistical Graphics,
     Data Exploration, etc.                                                  6
CSI Graduate Program at GMU
• http://spacs.gmu.edu/content/academic-programs
• CSI = Computational Science & Informatics
  Students must complete 4 core courses from this set of 5:
   – CSI 700 Numerical Methods
   – CSI 701 Foundations of Computational Science
   – CSI 702 High Performance Computing
   – CSI 703 Scientific and Statistical Visualization Informatics
   – CSI 710 Scientific Databases                       Courses !
  There are also many electives (at least 5 additional CSI
  courses are required, plus concentration science electives).
  For example:
   – Data Mining, Knowledge Mining, Computational Learning,
     Statistical Learning, Computational Statistics, Statistical Graphics,
     Data Exploration, etc.                                                  7
Example Course Syllabus:
          CSI 710 – Scientific Databases
• CSI 710 Scientific Databases (taught by K.B.) –
  lectures include:
      •   Relational Databases: Modeling, Schemas, Normalization, SQL
      •   Scientific Databases, Big Data in Science, The 4th Paradigm
      •   E-Science, Ontologies, Semantic E-Science, X-Informatics
      •   Distributed Data, Federated Data, Virtual Observatories
      •   Citizen Science with Big Data
      •   Scientific Data Mining I
      •   Scientific Data Mining II
      •   Astroinformatics and Astro databases
      •   Bioinformatics and Bio databases
      •   Geoinformatics and Geo databases
      •   Health Informatics
      •   Online Science (Jim Gray’s KDD-2003 lecture)
      •   Intelligent Archives of the Future
                                                                        8
Outline
• Graduate Program at GMU
• Undergraduate Program at GMU
• Challenge Areas and Reflections




                                    9
CDS Undergraduate Program at GMU
• http://spacs.gmu.edu/content/academic-programs
• CDS = Computational and Data Sciences
   –   Undergraduate B.S. degree program at GMU since 2008
   –   Graduating ~2 students per year
   –   10 students currently enrolled
   –   … this is considered “under-performing”
   –   … we need strategies for increasing enrollment.




                                                             10
CDS Undergraduate Program at GMU
• http://spacs.gmu.edu/content/academic-programs
• CDS = Computational and Data Sciences
   –   Undergraduate B.S. degree program at GMU since 2008
   –   Graduating ~2 students per year
   –   10 students currently enrolled
   –   … this is considered “under-performing”
   –   Strategies for increasing enrollment:
       • Target marketing – send email to all “local” high school students interested in science
         *and* interested in GMU College of Science programs already
       • Advertising materials posted on campus and emailed to existing GMU undergraduates
       • Modify the curriculum to satisfy student interests (e.g., “Computing for Scientists”**)
       • Future idea: find a sponsor for a CDS scholarship program
       • REU = Research Experiences for Undergraduates: students want this experience!
       • … or … wait and see what happens, because …
       • Enrollment is already increasing this year due to a new course** that is very exciting for
         the students and is attracting ~100 students each semester, some of whom have already
         converted to CDS and several others are thinking about it.
                                                                                                 11
CDS Undergraduate Program at GMU
• http://spacs.gmu.edu/content/academic-programs
• CDS = Computational and Data Sciences
   – Students can choose the general CDS degree, or else choose one of
     these concentrations:
       • Physics
       • Chemistry
       • Biology
   – A student may “create” their own concentration. For example:
       • Environmental Science
   – There is no AstroInformatics option, but one course is being
     modified to include this material, plus students can take Independent
     Study and Independent Research courses that cover this material.




                                                                             12
CDS Undergraduate Program at GMU
• http://spacs.gmu.edu/content/academic-programs
• CDS = Computational and Data Sciences
   – The DATA SCIENCE component of the curriculum was developed
     with the support of a grant from the NSF (National Science
     Foundation):
      • CUPIDS = Curriculum for an Undergraduate Program In Data
         Sciences
   – Primary Goal: to increase student’s understanding of the role that
     data plays across the sciences as well as to increase the student’s
     ability to use the technologies associated with data acquisition,
     mining, analysis, and visualization.
   – Objectives – students are trained:
   –   … to access large distributed data repositories
   –   … to conduct meaningful inquiries into the data
   –   … to mine, visualize, and analyze the data
   –   … to make objective data-driven inferences, discoveries, and decisions
                                                                                13
CDS Undergraduate Program at GMU
• http://spacs.gmu.edu/content/academic-programs
• CDS = Computational and Data Sciences
    Students must complete 6 core courses from this set of 8 courses:
•   CDS 101 – Introduction to Computational and Data Sciences
•   CDS 130 – Computing for Scientists
•   CDS 251 – Introduction to Scientific Programming
•   CDS 301 – Scientific Information and Data Visualization
•   CDS 302 – Scientific Data and Databases
•   CDS 401 – Scientific Data Mining
•   CDS 410 – Modeling and Simulations I
•   CDS 411 – Modeling and Simulations II

    Additional required courses include Math, Statistics, Computer Science,
    Physics I and II, plus courses in student’s chosen science concentration

                                                                               14
CDS Undergraduate Program at GMU
• http://spacs.gmu.edu/content/academic-programs
• CDS = Computational and Data Sciences
  We are also “infiltrating” the entire undergraduate program at GMU
  through 3 of our courses that satisfy university General Education
  graduation requirements for all students at the university:
• CDS 101 – Introduction to Computational and Data Sciences
   – Satisfies GMU’s Natural Science requirement
• CDS 130 – Computing for Scientists
   – Satisfies GMU’s I.T. requirement
• CDS 151 – Data Ethics
   – Satisfies GMU’s Ethics requirement




                                                                       15
Example of Learning Objectives:
                CDS 401 – Scientific Data Mining
• Be able to explain the role of data mining within scientific knowledge discovery.
• Be able to describe the most well known data mining algorithms and correctly
  use data mining terminology.
• Be able to express the application of statistics, similarity measures, and indexing
  to data mining tasks.
• Identify appropriate techniques for classification and clustering applications.
• Determine approaches used for mining large scientific databases (e.g., genomics,
  virtual observatories).
• Recognize techniques used for spatial and temporal data mining applications.
• Express the steps in a data mining project (e.g., cleaning, transforming, indexing,
  mining, analysis).
• Analyze classic data mining examples and use cases, and assess the applicatio of
  different data mining techniques.
• Effectively prepare data for mining.
• Effectively use software packages for data exploration, visualization, and mining.

                                                                                   16
Outline
• Graduate Program at GMU
• Undergraduate Program at GMU
• Challenge Areas and Reflections




                                    17
Challenge Areas
        (primarily for the undergraduate program,
      since the graduate program is doing very well)
• Attracting students: nobody says “I want to be a computational and data
  scientist when I grow up.”
• Visibility: most other science departments are not aware of the
  importance of our courses for their majors. (Note: Biology and
  Neuroscience now require our Science Computing course.)
• Retention: after Year 1 courses, there are no other CDS courses until
  junior year (Year 3)
• Lab course: in order to attract sufficient enrollment, the CDS 101
  General Education Natural Science course should be accompanied with a
  lab section.
• Scientific computing course: this was identified 2 years ago as a
  necessary course to attract students to CDS … we now have this course,
  and it is very popular (nearly 100 students each semester, and growing …
  up to 400+)
                                                                         18
Reflections
• Students with a broad interest in computers and sciences will benefit from
  these types of programs: Computational and Data Sciences.
   – Actual quote from high school senior visiting the university:
     “I plan to major in biology, but I wish I could do something with
       computing also.”
• Students graduating with a traditional discipline-based bachelors degree
  in biology, chemistry, or physics generally do not have the required
  computational background necessary to participate as productive
  members of modern interdisciplinary scientific research teams, which are
  becoming increasingly computational- and data-intensive.
• The motivating theme and goal of these program should be to train the
  next-generation scientists in the tools and techniques of cyber-enabled
  science (e-Science) to prepare them to confront the emerging petascale
  challenges of data-intensive science.
• It is also good for society in general that all members of the 21st century
  workforce are trained in computational and data science skills – i.e.,
  computational literacy and data literacy are critical for all.              19

Más contenido relacionado

Similar a Kirkborne panel presentation-rdap2012

Classroom Pilot-Testing for GETSI NON-Authors Presentation
Classroom Pilot-Testing for GETSI NON-Authors Presentation Classroom Pilot-Testing for GETSI NON-Authors Presentation
Classroom Pilot-Testing for GETSI NON-Authors Presentation SERC at Carleton College
 
Graduate Engineering Programs at USC (India_Fall 2016)
Graduate Engineering Programs at USC (India_Fall 2016)Graduate Engineering Programs at USC (India_Fall 2016)
Graduate Engineering Programs at USC (India_Fall 2016)Laura Amalia Hartman
 
Introducing FIU CAESCIR
Introducing FIU CAESCIRIntroducing FIU CAESCIR
Introducing FIU CAESCIRJason Liu
 
03 slide num & mlk - persiapan akreditasi internasional abet untuk teknik i...
03 slide   num & mlk - persiapan akreditasi internasional abet untuk teknik i...03 slide   num & mlk - persiapan akreditasi internasional abet untuk teknik i...
03 slide num & mlk - persiapan akreditasi internasional abet untuk teknik i...Ainul Yaqin
 
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013SALCTG
 
AppCEDA Webinar UW-Madison 12-10-15
AppCEDA Webinar UW-Madison 12-10-15AppCEDA Webinar UW-Madison 12-10-15
AppCEDA Webinar UW-Madison 12-10-15Matt Griswold
 
Digital Badging and Microcredentialing
Digital Badging and MicrocredentialingDigital Badging and Microcredentialing
Digital Badging and MicrocredentialingPlamen Miltenoff
 
Introduction to NSF-sponsored Big Data Education Project
Introduction to NSF-sponsored Big Data Education ProjectIntroduction to NSF-sponsored Big Data Education Project
Introduction to NSF-sponsored Big Data Education ProjectBig Data Education
 
School of Geosciences MSc recruitment event
School of Geosciences MSc recruitment eventSchool of Geosciences MSc recruitment event
School of Geosciences MSc recruitment eventEmily350847
 
Data Science and Online Education
Data Science and Online EducationData Science and Online Education
Data Science and Online EducationGeoffrey Fox
 
Digital badges and Microcredentialing
Digital badges and MicrocredentialingDigital badges and Microcredentialing
Digital badges and MicrocredentialingPlamen Miltenoff
 
2011.10.10 Multi-Disciplinary Research Themes and Training
2011.10.10 Multi-Disciplinary Research Themes and Training2011.10.10 Multi-Disciplinary Research Themes and Training
2011.10.10 Multi-Disciplinary Research Themes and TrainingNUI Galway
 
uw-platteville_chepd_Mei-4-2
uw-platteville_chepd_Mei-4-2uw-platteville_chepd_Mei-4-2
uw-platteville_chepd_Mei-4-2Mei Reeder
 
Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...Geoffrey Fox
 
Auburn CSSE graduate student orientation
Auburn CSSE graduate student orientationAuburn CSSE graduate student orientation
Auburn CSSE graduate student orientationXiao Qin
 
TOP UNIVERSITIES IN US FOR MS IN DATA SCIENCE
TOP UNIVERSITIES IN US FOR MS IN DATA SCIENCETOP UNIVERSITIES IN US FOR MS IN DATA SCIENCE
TOP UNIVERSITIES IN US FOR MS IN DATA SCIENCESKILL-LYNC SUPPORT
 
Esriuk_track8_newcastle_embedding_gis
Esriuk_track8_newcastle_embedding_gisEsriuk_track8_newcastle_embedding_gis
Esriuk_track8_newcastle_embedding_gisEsri UK
 

Similar a Kirkborne panel presentation-rdap2012 (20)

Classroom Pilot-Testing for GETSI NON-Authors Presentation
Classroom Pilot-Testing for GETSI NON-Authors Presentation Classroom Pilot-Testing for GETSI NON-Authors Presentation
Classroom Pilot-Testing for GETSI NON-Authors Presentation
 
Graduate Engineering Programs at USC (India_Fall 2016)
Graduate Engineering Programs at USC (India_Fall 2016)Graduate Engineering Programs at USC (India_Fall 2016)
Graduate Engineering Programs at USC (India_Fall 2016)
 
Introducing FIU CAESCIR
Introducing FIU CAESCIRIntroducing FIU CAESCIR
Introducing FIU CAESCIR
 
03 slide num & mlk - persiapan akreditasi internasional abet untuk teknik i...
03 slide   num & mlk - persiapan akreditasi internasional abet untuk teknik i...03 slide   num & mlk - persiapan akreditasi internasional abet untuk teknik i...
03 slide num & mlk - persiapan akreditasi internasional abet untuk teknik i...
 
CSTA.ppt
CSTA.pptCSTA.ppt
CSTA.ppt
 
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
Birgit Plietzsch “RDM within research computing support” SALCTG June 2013
 
AppCEDA Webinar UW-Madison 12-10-15
AppCEDA Webinar UW-Madison 12-10-15AppCEDA Webinar UW-Madison 12-10-15
AppCEDA Webinar UW-Madison 12-10-15
 
Digital Badging and Microcredentialing
Digital Badging and MicrocredentialingDigital Badging and Microcredentialing
Digital Badging and Microcredentialing
 
Introduction to NSF-sponsored Big Data Education Project
Introduction to NSF-sponsored Big Data Education ProjectIntroduction to NSF-sponsored Big Data Education Project
Introduction to NSF-sponsored Big Data Education Project
 
School of Geosciences MSc recruitment event
School of Geosciences MSc recruitment eventSchool of Geosciences MSc recruitment event
School of Geosciences MSc recruitment event
 
Data Science and Online Education
Data Science and Online EducationData Science and Online Education
Data Science and Online Education
 
Digital badges and Microcredentialing
Digital badges and MicrocredentialingDigital badges and Microcredentialing
Digital badges and Microcredentialing
 
RDM @ UoE
RDM @ UoERDM @ UoE
RDM @ UoE
 
2011.10.10 Multi-Disciplinary Research Themes and Training
2011.10.10 Multi-Disciplinary Research Themes and Training2011.10.10 Multi-Disciplinary Research Themes and Training
2011.10.10 Multi-Disciplinary Research Themes and Training
 
uw-platteville_chepd_Mei-4-2
uw-platteville_chepd_Mei-4-2uw-platteville_chepd_Mei-4-2
uw-platteville_chepd_Mei-4-2
 
Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...
 
Auburn CSSE graduate student orientation
Auburn CSSE graduate student orientationAuburn CSSE graduate student orientation
Auburn CSSE graduate student orientation
 
TOP UNIVERSITIES IN US FOR MS IN DATA SCIENCE
TOP UNIVERSITIES IN US FOR MS IN DATA SCIENCETOP UNIVERSITIES IN US FOR MS IN DATA SCIENCE
TOP UNIVERSITIES IN US FOR MS IN DATA SCIENCE
 
Online ms open house revised march 26 215
Online ms open house    revised march 26 215Online ms open house    revised march 26 215
Online ms open house revised march 26 215
 
Esriuk_track8_newcastle_embedding_gis
Esriuk_track8_newcastle_embedding_gisEsriuk_track8_newcastle_embedding_gis
Esriuk_track8_newcastle_embedding_gis
 

Más de ASIS&T

RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)
RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)
RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)ASIS&T
 
RDAP 16: Sustainability of data infrastructure: The history of science scienc...
RDAP 16: Sustainability of data infrastructure: The history of science scienc...RDAP 16: Sustainability of data infrastructure: The history of science scienc...
RDAP 16: Sustainability of data infrastructure: The history of science scienc...ASIS&T
 
RDAP 16: DMPs and Public Access: Agency and Data Service Experiences
RDAP 16: DMPs and Public Access: Agency and Data Service ExperiencesRDAP 16: DMPs and Public Access: Agency and Data Service Experiences
RDAP 16: DMPs and Public Access: Agency and Data Service ExperiencesASIS&T
 
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...ASIS&T
 
RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...
RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...
RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...ASIS&T
 
RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...
RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...
RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...ASIS&T
 
RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)
RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)
RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)ASIS&T
 
RDAP 16 Poster: Challenges and Opportunities in an Institutional Repository S...
RDAP 16 Poster: Challenges and Opportunities in an Institutional Repository S...RDAP 16 Poster: Challenges and Opportunities in an Institutional Repository S...
RDAP 16 Poster: Challenges and Opportunities in an Institutional Repository S...ASIS&T
 
RDAP 16 Poster: Interpreting Local Data Policies in Practice
RDAP 16 Poster: Interpreting Local Data Policies in PracticeRDAP 16 Poster: Interpreting Local Data Policies in Practice
RDAP 16 Poster: Interpreting Local Data Policies in PracticeASIS&T
 
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...ASIS&T
 
RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...
RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...
RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...ASIS&T
 
RDAP 16 Lightning: Spreading the love: Bringing data management training to s...
RDAP 16 Lightning: Spreading the love: Bringing data management training to s...RDAP 16 Lightning: Spreading the love: Bringing data management training to s...
RDAP 16 Lightning: Spreading the love: Bringing data management training to s...ASIS&T
 
RDAP 16 Lightning: RDM Discussion Group: How'd that go?
RDAP 16 Lightning: RDM Discussion Group: How'd that go?RDAP 16 Lightning: RDM Discussion Group: How'd that go?
RDAP 16 Lightning: RDM Discussion Group: How'd that go?ASIS&T
 
RDAP 16 Lightning: Data Practices and Perspectives of Atmospheric and Enginee...
RDAP 16 Lightning: Data Practices and Perspectives of Atmospheric and Enginee...RDAP 16 Lightning: Data Practices and Perspectives of Atmospheric and Enginee...
RDAP 16 Lightning: Data Practices and Perspectives of Atmospheric and Enginee...ASIS&T
 
RDAP 16 Lightning: Working Across Cultures: Data Librarian as Knowledge Broker
RDAP 16 Lightning: Working Across Cultures: Data Librarian as Knowledge BrokerRDAP 16 Lightning: Working Across Cultures: Data Librarian as Knowledge Broker
RDAP 16 Lightning: Working Across Cultures: Data Librarian as Knowledge BrokerASIS&T
 
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...ASIS&T
 
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...ASIS&T
 
RDAP 16 Lightning: Personas as a Policy Development Tool for Research Data
RDAP 16 Lightning: Personas as a Policy Development Tool for Research DataRDAP 16 Lightning: Personas as a Policy Development Tool for Research Data
RDAP 16 Lightning: Personas as a Policy Development Tool for Research DataASIS&T
 
RDAP 16 Lightning: Growing Data in Utah: A Model for Statewide Collaboration
RDAP 16 Lightning: Growing Data in Utah: A Model for Statewide CollaborationRDAP 16 Lightning: Growing Data in Utah: A Model for Statewide Collaboration
RDAP 16 Lightning: Growing Data in Utah: A Model for Statewide CollaborationASIS&T
 
RDAP 16: Building Without a Plan: How do you assess structural strength? (Pan...
RDAP 16: Building Without a Plan: How do you assess structural strength? (Pan...RDAP 16: Building Without a Plan: How do you assess structural strength? (Pan...
RDAP 16: Building Without a Plan: How do you assess structural strength? (Pan...ASIS&T
 

Más de ASIS&T (20)

RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)
RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)
RDAP 16: Sustaining Research Data Services (Panel 2: Sustainability)
 
RDAP 16: Sustainability of data infrastructure: The history of science scienc...
RDAP 16: Sustainability of data infrastructure: The history of science scienc...RDAP 16: Sustainability of data infrastructure: The history of science scienc...
RDAP 16: Sustainability of data infrastructure: The history of science scienc...
 
RDAP 16: DMPs and Public Access: Agency and Data Service Experiences
RDAP 16: DMPs and Public Access: Agency and Data Service ExperiencesRDAP 16: DMPs and Public Access: Agency and Data Service Experiences
RDAP 16: DMPs and Public Access: Agency and Data Service Experiences
 
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
 
RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...
RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...
RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...
 
RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...
RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...
RDAP 16: If I could turn back time: Looking back on 2+ years of DMP consultin...
 
RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)
RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)
RDAP 16: Data Management Plan Perspectives (Panel 5, DMPs and Public Access)
 
RDAP 16 Poster: Challenges and Opportunities in an Institutional Repository S...
RDAP 16 Poster: Challenges and Opportunities in an Institutional Repository S...RDAP 16 Poster: Challenges and Opportunities in an Institutional Repository S...
RDAP 16 Poster: Challenges and Opportunities in an Institutional Repository S...
 
RDAP 16 Poster: Interpreting Local Data Policies in Practice
RDAP 16 Poster: Interpreting Local Data Policies in PracticeRDAP 16 Poster: Interpreting Local Data Policies in Practice
RDAP 16 Poster: Interpreting Local Data Policies in Practice
 
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
RDAP 16 Poster: Measuring adoption of Electronic Lab Notebooks and their impa...
 
RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...
RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...
RDAP 16 Poster: Responding to Data Management and Sharing Requirements in the...
 
RDAP 16 Lightning: Spreading the love: Bringing data management training to s...
RDAP 16 Lightning: Spreading the love: Bringing data management training to s...RDAP 16 Lightning: Spreading the love: Bringing data management training to s...
RDAP 16 Lightning: Spreading the love: Bringing data management training to s...
 
RDAP 16 Lightning: RDM Discussion Group: How'd that go?
RDAP 16 Lightning: RDM Discussion Group: How'd that go?RDAP 16 Lightning: RDM Discussion Group: How'd that go?
RDAP 16 Lightning: RDM Discussion Group: How'd that go?
 
RDAP 16 Lightning: Data Practices and Perspectives of Atmospheric and Enginee...
RDAP 16 Lightning: Data Practices and Perspectives of Atmospheric and Enginee...RDAP 16 Lightning: Data Practices and Perspectives of Atmospheric and Enginee...
RDAP 16 Lightning: Data Practices and Perspectives of Atmospheric and Enginee...
 
RDAP 16 Lightning: Working Across Cultures: Data Librarian as Knowledge Broker
RDAP 16 Lightning: Working Across Cultures: Data Librarian as Knowledge BrokerRDAP 16 Lightning: Working Across Cultures: Data Librarian as Knowledge Broker
RDAP 16 Lightning: Working Across Cultures: Data Librarian as Knowledge Broker
 
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
 
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
RDAP 16 Lightning: Quantifying Needs for a University Research Repository Sys...
 
RDAP 16 Lightning: Personas as a Policy Development Tool for Research Data
RDAP 16 Lightning: Personas as a Policy Development Tool for Research DataRDAP 16 Lightning: Personas as a Policy Development Tool for Research Data
RDAP 16 Lightning: Personas as a Policy Development Tool for Research Data
 
RDAP 16 Lightning: Growing Data in Utah: A Model for Statewide Collaboration
RDAP 16 Lightning: Growing Data in Utah: A Model for Statewide CollaborationRDAP 16 Lightning: Growing Data in Utah: A Model for Statewide Collaboration
RDAP 16 Lightning: Growing Data in Utah: A Model for Statewide Collaboration
 
RDAP 16: Building Without a Plan: How do you assess structural strength? (Pan...
RDAP 16: Building Without a Plan: How do you assess structural strength? (Pan...RDAP 16: Building Without a Plan: How do you assess structural strength? (Pan...
RDAP 16: Building Without a Plan: How do you assess structural strength? (Pan...
 

Último

P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdf
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdfP4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdf
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdfYu Kanazawa / Osaka University
 
The basics of sentences session 10pptx.pptx
The basics of sentences session 10pptx.pptxThe basics of sentences session 10pptx.pptx
The basics of sentences session 10pptx.pptxheathfieldcps1
 
Practical Research 1: Lesson 8 Writing the Thesis Statement.pptx
Practical Research 1: Lesson 8 Writing the Thesis Statement.pptxPractical Research 1: Lesson 8 Writing the Thesis Statement.pptx
Practical Research 1: Lesson 8 Writing the Thesis Statement.pptxKatherine Villaluna
 
How to Use api.constrains ( ) in Odoo 17
How to Use api.constrains ( ) in Odoo 17How to Use api.constrains ( ) in Odoo 17
How to Use api.constrains ( ) in Odoo 17Celine George
 
Clinical Pharmacy Introduction to Clinical Pharmacy, Concept of clinical pptx
Clinical Pharmacy  Introduction to Clinical Pharmacy, Concept of clinical pptxClinical Pharmacy  Introduction to Clinical Pharmacy, Concept of clinical pptx
Clinical Pharmacy Introduction to Clinical Pharmacy, Concept of clinical pptxraviapr7
 
CAULIFLOWER BREEDING 1 Parmar pptx
CAULIFLOWER BREEDING 1 Parmar pptxCAULIFLOWER BREEDING 1 Parmar pptx
CAULIFLOWER BREEDING 1 Parmar pptxSaurabhParmar42
 
Patient Counselling. Definition of patient counseling; steps involved in pati...
Patient Counselling. Definition of patient counseling; steps involved in pati...Patient Counselling. Definition of patient counseling; steps involved in pati...
Patient Counselling. Definition of patient counseling; steps involved in pati...raviapr7
 
Philosophy of Education and Educational Philosophy
Philosophy of Education  and Educational PhilosophyPhilosophy of Education  and Educational Philosophy
Philosophy of Education and Educational PhilosophyShuvankar Madhu
 
How to Make a Field read-only in Odoo 17
How to Make a Field read-only in Odoo 17How to Make a Field read-only in Odoo 17
How to Make a Field read-only in Odoo 17Celine George
 
AUDIENCE THEORY -- FANDOM -- JENKINS.pptx
AUDIENCE THEORY -- FANDOM -- JENKINS.pptxAUDIENCE THEORY -- FANDOM -- JENKINS.pptx
AUDIENCE THEORY -- FANDOM -- JENKINS.pptxiammrhaywood
 
Maximizing Impact_ Nonprofit Website Planning, Budgeting, and Design.pdf
Maximizing Impact_ Nonprofit Website Planning, Budgeting, and Design.pdfMaximizing Impact_ Nonprofit Website Planning, Budgeting, and Design.pdf
Maximizing Impact_ Nonprofit Website Planning, Budgeting, and Design.pdfTechSoup
 
HED Office Sohayok Exam Question Solution 2023.pdf
HED Office Sohayok Exam Question Solution 2023.pdfHED Office Sohayok Exam Question Solution 2023.pdf
HED Office Sohayok Exam Question Solution 2023.pdfMohonDas
 
How to Add Existing Field in One2Many Tree View in Odoo 17
How to Add Existing Field in One2Many Tree View in Odoo 17How to Add Existing Field in One2Many Tree View in Odoo 17
How to Add Existing Field in One2Many Tree View in Odoo 17Celine George
 
Quality Assurance_GOOD LABORATORY PRACTICE
Quality Assurance_GOOD LABORATORY PRACTICEQuality Assurance_GOOD LABORATORY PRACTICE
Quality Assurance_GOOD LABORATORY PRACTICESayali Powar
 
3.21.24 The Origins of Black Power.pptx
3.21.24  The Origins of Black Power.pptx3.21.24  The Origins of Black Power.pptx
3.21.24 The Origins of Black Power.pptxmary850239
 
Practical Research 1 Lesson 9 Scope and delimitation.pptx
Practical Research 1 Lesson 9 Scope and delimitation.pptxPractical Research 1 Lesson 9 Scope and delimitation.pptx
Practical Research 1 Lesson 9 Scope and delimitation.pptxKatherine Villaluna
 
The Singapore Teaching Practice document
The Singapore Teaching Practice documentThe Singapore Teaching Practice document
The Singapore Teaching Practice documentXsasf Sfdfasd
 
Benefits & Challenges of Inclusive Education
Benefits & Challenges of Inclusive EducationBenefits & Challenges of Inclusive Education
Benefits & Challenges of Inclusive EducationMJDuyan
 
M-2- General Reactions of amino acids.pptx
M-2- General Reactions of amino acids.pptxM-2- General Reactions of amino acids.pptx
M-2- General Reactions of amino acids.pptxDr. Santhosh Kumar. N
 
Drug Information Services- DIC and Sources.
Drug Information Services- DIC and Sources.Drug Information Services- DIC and Sources.
Drug Information Services- DIC and Sources.raviapr7
 

Último (20)

P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdf
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdfP4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdf
P4C x ELT = P4ELT: Its Theoretical Background (Kanazawa, 2024 March).pdf
 
The basics of sentences session 10pptx.pptx
The basics of sentences session 10pptx.pptxThe basics of sentences session 10pptx.pptx
The basics of sentences session 10pptx.pptx
 
Practical Research 1: Lesson 8 Writing the Thesis Statement.pptx
Practical Research 1: Lesson 8 Writing the Thesis Statement.pptxPractical Research 1: Lesson 8 Writing the Thesis Statement.pptx
Practical Research 1: Lesson 8 Writing the Thesis Statement.pptx
 
How to Use api.constrains ( ) in Odoo 17
How to Use api.constrains ( ) in Odoo 17How to Use api.constrains ( ) in Odoo 17
How to Use api.constrains ( ) in Odoo 17
 
Clinical Pharmacy Introduction to Clinical Pharmacy, Concept of clinical pptx
Clinical Pharmacy  Introduction to Clinical Pharmacy, Concept of clinical pptxClinical Pharmacy  Introduction to Clinical Pharmacy, Concept of clinical pptx
Clinical Pharmacy Introduction to Clinical Pharmacy, Concept of clinical pptx
 
CAULIFLOWER BREEDING 1 Parmar pptx
CAULIFLOWER BREEDING 1 Parmar pptxCAULIFLOWER BREEDING 1 Parmar pptx
CAULIFLOWER BREEDING 1 Parmar pptx
 
Patient Counselling. Definition of patient counseling; steps involved in pati...
Patient Counselling. Definition of patient counseling; steps involved in pati...Patient Counselling. Definition of patient counseling; steps involved in pati...
Patient Counselling. Definition of patient counseling; steps involved in pati...
 
Philosophy of Education and Educational Philosophy
Philosophy of Education  and Educational PhilosophyPhilosophy of Education  and Educational Philosophy
Philosophy of Education and Educational Philosophy
 
How to Make a Field read-only in Odoo 17
How to Make a Field read-only in Odoo 17How to Make a Field read-only in Odoo 17
How to Make a Field read-only in Odoo 17
 
AUDIENCE THEORY -- FANDOM -- JENKINS.pptx
AUDIENCE THEORY -- FANDOM -- JENKINS.pptxAUDIENCE THEORY -- FANDOM -- JENKINS.pptx
AUDIENCE THEORY -- FANDOM -- JENKINS.pptx
 
Maximizing Impact_ Nonprofit Website Planning, Budgeting, and Design.pdf
Maximizing Impact_ Nonprofit Website Planning, Budgeting, and Design.pdfMaximizing Impact_ Nonprofit Website Planning, Budgeting, and Design.pdf
Maximizing Impact_ Nonprofit Website Planning, Budgeting, and Design.pdf
 
HED Office Sohayok Exam Question Solution 2023.pdf
HED Office Sohayok Exam Question Solution 2023.pdfHED Office Sohayok Exam Question Solution 2023.pdf
HED Office Sohayok Exam Question Solution 2023.pdf
 
How to Add Existing Field in One2Many Tree View in Odoo 17
How to Add Existing Field in One2Many Tree View in Odoo 17How to Add Existing Field in One2Many Tree View in Odoo 17
How to Add Existing Field in One2Many Tree View in Odoo 17
 
Quality Assurance_GOOD LABORATORY PRACTICE
Quality Assurance_GOOD LABORATORY PRACTICEQuality Assurance_GOOD LABORATORY PRACTICE
Quality Assurance_GOOD LABORATORY PRACTICE
 
3.21.24 The Origins of Black Power.pptx
3.21.24  The Origins of Black Power.pptx3.21.24  The Origins of Black Power.pptx
3.21.24 The Origins of Black Power.pptx
 
Practical Research 1 Lesson 9 Scope and delimitation.pptx
Practical Research 1 Lesson 9 Scope and delimitation.pptxPractical Research 1 Lesson 9 Scope and delimitation.pptx
Practical Research 1 Lesson 9 Scope and delimitation.pptx
 
The Singapore Teaching Practice document
The Singapore Teaching Practice documentThe Singapore Teaching Practice document
The Singapore Teaching Practice document
 
Benefits & Challenges of Inclusive Education
Benefits & Challenges of Inclusive EducationBenefits & Challenges of Inclusive Education
Benefits & Challenges of Inclusive Education
 
M-2- General Reactions of amino acids.pptx
M-2- General Reactions of amino acids.pptxM-2- General Reactions of amino acids.pptx
M-2- General Reactions of amino acids.pptx
 
Drug Information Services- DIC and Sources.
Drug Information Services- DIC and Sources.Drug Information Services- DIC and Sources.
Drug Information Services- DIC and Sources.
 

Kirkborne panel presentation-rdap2012

  • 1. Data Science (Informatics) Programs at GMU Kirk Borne George Mason University School of Physics, Astronomy, & Computational Sciences http://spacs.gmu.edu/
  • 2. Outline • Graduate Program at GMU • Undergraduate Program at GMU • Challenge Areas and Reflections 2
  • 3. CSI Graduate Program at GMU • http://spacs.gmu.edu/content/academic-programs • CSI = Computational Science & Informatics – CSI graduate program has existed at GMU since 1991 – Over 200 PhD’s graduated in past 20 years – Approximately 90 students currently enrolled – About 10% of students end with M.S. (Masters in Computational Science) – We have a Graduate Certificate in Computational Techniques and Applications (non-degree professional certification program) – We are developing online Distance Education courses 3
  • 4. CSI Graduate Program at GMU • http://spacs.gmu.edu/content/academic-programs • CSI = Computational Science & Informatics – Students can choose a concentration from several choices: • Computational Astrophysics • Space Science • Computational Physics • Computational Fluid Dynamics • Computational Statistics • Computational Learning • Computational Mathematics • Computational Materials Science (Physical Chemistry) – A student may “create” their own concentration, such as one of these previously approved concentrations: • Computational Finance • Remote Sensing • Computational Economics 4
  • 5. CSI Graduate Program at GMU • http://spacs.gmu.edu/content/academic-programs • CSI = Computational Science & Informatics – Note that there is no specific X-Informatics concentration in CSI. – However, GMU students can enroll in other departments for specific X-Informatics disciplines: • Geoinformatics (including Geospatial Intelligence) • Bioinformatics • Health Informatics – We have discussed adding an AstroInformatics concentration in CSI, but it is not really necessary since interested students are already doing it – i.e., “creating” their own concentration and working with the resident AstroInformatics person (K.B.) 5
  • 6. CSI Graduate Program at GMU • http://spacs.gmu.edu/content/academic-programs • CSI = Computational Science & Informatics Students must complete 4 core courses from this set of 5: – CSI 700 Numerical Methods – CSI 701 Foundations of Computational Science – CSI 702 High Performance Computing – CSI 703 Scientific and Statistical Visualization – CSI 710 Scientific Databases There are also many electives (at least 5 additional CSI courses are required, plus concentration science electives). For example: – Data Mining, Knowledge Mining, Computational Learning, Statistical Learning, Computational Statistics, Statistical Graphics, Data Exploration, etc. 6
  • 7. CSI Graduate Program at GMU • http://spacs.gmu.edu/content/academic-programs • CSI = Computational Science & Informatics Students must complete 4 core courses from this set of 5: – CSI 700 Numerical Methods – CSI 701 Foundations of Computational Science – CSI 702 High Performance Computing – CSI 703 Scientific and Statistical Visualization Informatics – CSI 710 Scientific Databases Courses ! There are also many electives (at least 5 additional CSI courses are required, plus concentration science electives). For example: – Data Mining, Knowledge Mining, Computational Learning, Statistical Learning, Computational Statistics, Statistical Graphics, Data Exploration, etc. 7
  • 8. Example Course Syllabus: CSI 710 – Scientific Databases • CSI 710 Scientific Databases (taught by K.B.) – lectures include: • Relational Databases: Modeling, Schemas, Normalization, SQL • Scientific Databases, Big Data in Science, The 4th Paradigm • E-Science, Ontologies, Semantic E-Science, X-Informatics • Distributed Data, Federated Data, Virtual Observatories • Citizen Science with Big Data • Scientific Data Mining I • Scientific Data Mining II • Astroinformatics and Astro databases • Bioinformatics and Bio databases • Geoinformatics and Geo databases • Health Informatics • Online Science (Jim Gray’s KDD-2003 lecture) • Intelligent Archives of the Future 8
  • 9. Outline • Graduate Program at GMU • Undergraduate Program at GMU • Challenge Areas and Reflections 9
  • 10. CDS Undergraduate Program at GMU • http://spacs.gmu.edu/content/academic-programs • CDS = Computational and Data Sciences – Undergraduate B.S. degree program at GMU since 2008 – Graduating ~2 students per year – 10 students currently enrolled – … this is considered “under-performing” – … we need strategies for increasing enrollment. 10
  • 11. CDS Undergraduate Program at GMU • http://spacs.gmu.edu/content/academic-programs • CDS = Computational and Data Sciences – Undergraduate B.S. degree program at GMU since 2008 – Graduating ~2 students per year – 10 students currently enrolled – … this is considered “under-performing” – Strategies for increasing enrollment: • Target marketing – send email to all “local” high school students interested in science *and* interested in GMU College of Science programs already • Advertising materials posted on campus and emailed to existing GMU undergraduates • Modify the curriculum to satisfy student interests (e.g., “Computing for Scientists”**) • Future idea: find a sponsor for a CDS scholarship program • REU = Research Experiences for Undergraduates: students want this experience! • … or … wait and see what happens, because … • Enrollment is already increasing this year due to a new course** that is very exciting for the students and is attracting ~100 students each semester, some of whom have already converted to CDS and several others are thinking about it. 11
  • 12. CDS Undergraduate Program at GMU • http://spacs.gmu.edu/content/academic-programs • CDS = Computational and Data Sciences – Students can choose the general CDS degree, or else choose one of these concentrations: • Physics • Chemistry • Biology – A student may “create” their own concentration. For example: • Environmental Science – There is no AstroInformatics option, but one course is being modified to include this material, plus students can take Independent Study and Independent Research courses that cover this material. 12
  • 13. CDS Undergraduate Program at GMU • http://spacs.gmu.edu/content/academic-programs • CDS = Computational and Data Sciences – The DATA SCIENCE component of the curriculum was developed with the support of a grant from the NSF (National Science Foundation): • CUPIDS = Curriculum for an Undergraduate Program In Data Sciences – Primary Goal: to increase student’s understanding of the role that data plays across the sciences as well as to increase the student’s ability to use the technologies associated with data acquisition, mining, analysis, and visualization. – Objectives – students are trained: – … to access large distributed data repositories – … to conduct meaningful inquiries into the data – … to mine, visualize, and analyze the data – … to make objective data-driven inferences, discoveries, and decisions 13
  • 14. CDS Undergraduate Program at GMU • http://spacs.gmu.edu/content/academic-programs • CDS = Computational and Data Sciences Students must complete 6 core courses from this set of 8 courses: • CDS 101 – Introduction to Computational and Data Sciences • CDS 130 – Computing for Scientists • CDS 251 – Introduction to Scientific Programming • CDS 301 – Scientific Information and Data Visualization • CDS 302 – Scientific Data and Databases • CDS 401 – Scientific Data Mining • CDS 410 – Modeling and Simulations I • CDS 411 – Modeling and Simulations II Additional required courses include Math, Statistics, Computer Science, Physics I and II, plus courses in student’s chosen science concentration 14
  • 15. CDS Undergraduate Program at GMU • http://spacs.gmu.edu/content/academic-programs • CDS = Computational and Data Sciences We are also “infiltrating” the entire undergraduate program at GMU through 3 of our courses that satisfy university General Education graduation requirements for all students at the university: • CDS 101 – Introduction to Computational and Data Sciences – Satisfies GMU’s Natural Science requirement • CDS 130 – Computing for Scientists – Satisfies GMU’s I.T. requirement • CDS 151 – Data Ethics – Satisfies GMU’s Ethics requirement 15
  • 16. Example of Learning Objectives: CDS 401 – Scientific Data Mining • Be able to explain the role of data mining within scientific knowledge discovery. • Be able to describe the most well known data mining algorithms and correctly use data mining terminology. • Be able to express the application of statistics, similarity measures, and indexing to data mining tasks. • Identify appropriate techniques for classification and clustering applications. • Determine approaches used for mining large scientific databases (e.g., genomics, virtual observatories). • Recognize techniques used for spatial and temporal data mining applications. • Express the steps in a data mining project (e.g., cleaning, transforming, indexing, mining, analysis). • Analyze classic data mining examples and use cases, and assess the applicatio of different data mining techniques. • Effectively prepare data for mining. • Effectively use software packages for data exploration, visualization, and mining. 16
  • 17. Outline • Graduate Program at GMU • Undergraduate Program at GMU • Challenge Areas and Reflections 17
  • 18. Challenge Areas (primarily for the undergraduate program, since the graduate program is doing very well) • Attracting students: nobody says “I want to be a computational and data scientist when I grow up.” • Visibility: most other science departments are not aware of the importance of our courses for their majors. (Note: Biology and Neuroscience now require our Science Computing course.) • Retention: after Year 1 courses, there are no other CDS courses until junior year (Year 3) • Lab course: in order to attract sufficient enrollment, the CDS 101 General Education Natural Science course should be accompanied with a lab section. • Scientific computing course: this was identified 2 years ago as a necessary course to attract students to CDS … we now have this course, and it is very popular (nearly 100 students each semester, and growing … up to 400+) 18
  • 19. Reflections • Students with a broad interest in computers and sciences will benefit from these types of programs: Computational and Data Sciences. – Actual quote from high school senior visiting the university: “I plan to major in biology, but I wish I could do something with computing also.” • Students graduating with a traditional discipline-based bachelors degree in biology, chemistry, or physics generally do not have the required computational background necessary to participate as productive members of modern interdisciplinary scientific research teams, which are becoming increasingly computational- and data-intensive. • The motivating theme and goal of these program should be to train the next-generation scientists in the tools and techniques of cyber-enabled science (e-Science) to prepare them to confront the emerging petascale challenges of data-intensive science. • It is also good for society in general that all members of the 21st century workforce are trained in computational and data science skills – i.e., computational literacy and data literacy are critical for all. 19