SlideShare una empresa de Scribd logo
1 de 12
MOOCdb:
Developing Data Standards for MOOCs
Srećko Joksimović
s.joksimovic@ed.ac.uk
@s_joksimovic
Kalyan Veeramachaneni
kalyan@csail.mit.edu
Dragan Gašević
dragan.gasevic@ed.ac.uk
FutureLearn Academic Network Conference
15 June 2015
Data sources
SQL_anonymized_forum.sql
SQL_anonymized_general.sql
SQL_hash_mapping.sql
standard extract.csvSQL_unanonymizable.sql.gz
clickstream_export.gz
Personal data
Demographic data
…
Data sources
Weekly data packages
auth_user-{site}-analytics.sql
auth_userprofile-{site}-analytics.sql
certificates_generatedcertificate-{site}-analytics.sql
Daily data packages
course_structure-{site}-analytics.json
courseware_studentmodule-{site}-analytics.sql
email_opt_in-{site}-analytics.csv
student_courseenrollment-{site}-analytics.sql
user_api_usercoursetag-{site}-analytics.sql
user_id_map-{site}-analytics.sql
{org}-{course}-{date}-{site}.mongo
wiki_article-{site}-analytics.sql
wiki_articlerevision-{site}-analytics.sql
{org}-{site}-events-{date}.log.gz.gpg
Challenges
• Analytics across several courses
• Analytics across different platforms
• Analytics across different institutions
• Sharing data
Solution?
• Collaborative data science platform
– Standardize data storage
– Generalizable across courses and data providers
(currently OpenEdX, edX and Coursera)
– “Data being shared without data being
exchanged”
– Sharing and reproducing the results
MOOC data science commons
MOOCdb
Observing mode
- Observed Events table
- Resources table
- Resources Types table
- URLs table
- Resource URLs table
Submitting mode
- Problem Types table
- Problems table
- Submissions table
- Assessments table
Collaborating mode
- Collaborations table
- Collaboration Types table
Feedback mode
- Feedbacks table
- Questions table
- Answers table
- Surveys table
User information
- User PII table
- Global User table
- Course User table
http://moocdb.csail.mit.edu/wiki/index.php?title=MOOCdb
Collaborative platform and
applications
edX Coursera MOOCdb doc Github repo
Feature factory LabelMe Digital learner quantified Problem analytics
My MOOCViz Social network analysis Forum analysis Dropout prediction
Current state
• Established network of institutions
– MIT, Stanford, University of Michigan, University
of Edinburgh, University of Queensland, University
of Texas (Austin)
• Release of open source software
• Development and release of the first data
analytics framework
Next steps
Digital Learner Quantified
Discussion forum analysis
LabelMe
Problem analytics
Dropout prediction
Social network analysis
Collaboration
• If you are interested in…
– Development
– Feature modeling
– Translating your data
– Testing
kalyan@csail.mit.edu
s.Joksimovic@ed.ac.uk
Q&A
MOOCdb:
Developing Data Standards for MOOCs
Srećko Joksimović
s.joksimovic@ed.ac.uk
@s_joksimovic
Kalyan Veeramachaneni
kalyan@csail.mit.edu
Dragan Gašević
dragan.gasevic@ed.ac.uk
FutureLearn Academic Network Conference
15 June 2015

Más contenido relacionado

Similar a MOOCdb: Developing Data Standards for MOOCs

Building PHP Applications for IMS Basic LTI
Building PHP Applications for IMS Basic LTIBuilding PHP Applications for IMS Basic LTI
Building PHP Applications for IMS Basic LTI
Charles Severance
 
Manoj(Java Developer)_Resume
Manoj(Java Developer)_ResumeManoj(Java Developer)_Resume
Manoj(Java Developer)_Resume
Vamsi Manoj
 
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
Chengjen Lee
 
Web crawler with seo analysis
Web crawler with seo analysis Web crawler with seo analysis
Web crawler with seo analysis
Vikram Parmar
 

Similar a MOOCdb: Developing Data Standards for MOOCs (20)

Step by Step to learn Azure and get MCSE
Step by Step to learn Azure and get MCSEStep by Step to learn Azure and get MCSE
Step by Step to learn Azure and get MCSE
 
SCORE REPORT
SCORE REPORTSCORE REPORT
SCORE REPORT
 
Silverlight & WCF RIA
Silverlight & WCF RIASilverlight & WCF RIA
Silverlight & WCF RIA
 
Jisc learning analytics service core slides
Jisc learning analytics service core slidesJisc learning analytics service core slides
Jisc learning analytics service core slides
 
Data Science in the Elastic Stack
Data Science in the Elastic StackData Science in the Elastic Stack
Data Science in the Elastic Stack
 
Exploring learning analytics: Jisc learning analytics service
Exploring learning analytics: Jisc learning analytics serviceExploring learning analytics: Jisc learning analytics service
Exploring learning analytics: Jisc learning analytics service
 
Building PHP Applications for IMS Basic LTI
Building PHP Applications for IMS Basic LTIBuilding PHP Applications for IMS Basic LTI
Building PHP Applications for IMS Basic LTI
 
CS8592_Notes_008_edubuzz360.pdf
CS8592_Notes_008_edubuzz360.pdfCS8592_Notes_008_edubuzz360.pdf
CS8592_Notes_008_edubuzz360.pdf
 
Library Management Python, MySQL
Library Management Python, MySQLLibrary Management Python, MySQL
Library Management Python, MySQL
 
Resume_Susmita
Resume_SusmitaResume_Susmita
Resume_Susmita
 
Telecom datascience master_public
Telecom datascience master_publicTelecom datascience master_public
Telecom datascience master_public
 
Models in Minutes using AutoML
Models in Minutes using AutoMLModels in Minutes using AutoML
Models in Minutes using AutoML
 
Online examination documentation
Online examination documentationOnline examination documentation
Online examination documentation
 
Start Your Career in Data Science Using Data Camp.pptx
Start Your Career in Data Science Using Data Camp.pptxStart Your Career in Data Science Using Data Camp.pptx
Start Your Career in Data Science Using Data Camp.pptx
 
20180701 - 1st Meeting - Data Science Orientation
20180701 - 1st Meeting - Data Science Orientation20180701 - 1st Meeting - Data Science Orientation
20180701 - 1st Meeting - Data Science Orientation
 
Manoj(Java Developer)_Resume
Manoj(Java Developer)_ResumeManoj(Java Developer)_Resume
Manoj(Java Developer)_Resume
 
Jisc learning analytics MASHEIN Jan 2017
Jisc learning analytics MASHEIN Jan 2017Jisc learning analytics MASHEIN Jan 2017
Jisc learning analytics MASHEIN Jan 2017
 
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
 
Web crawler with seo analysis
Web crawler with seo analysis Web crawler with seo analysis
Web crawler with seo analysis
 
Resume
ResumeResume
Resume
 

Último

Isolation of AMF by wet sieving and decantation method pptx
Isolation of AMF by wet sieving and decantation method pptxIsolation of AMF by wet sieving and decantation method pptx
Isolation of AMF by wet sieving and decantation method pptx
GOWTHAMIM22
 
Jet reorientation in central galaxies of clusters and groups: insights from V...
Jet reorientation in central galaxies of clusters and groups: insights from V...Jet reorientation in central galaxies of clusters and groups: insights from V...
Jet reorientation in central galaxies of clusters and groups: insights from V...
Sérgio Sacani
 
Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...
Sérgio Sacani
 
Detectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a TechnosignatureDetectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a Technosignature
Sérgio Sacani
 
Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!
University of Hertfordshire
 

Último (20)

TEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdfTEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdf
 
Land use land cover change analysis and detection of its drivers using geospa...
Land use land cover change analysis and detection of its drivers using geospa...Land use land cover change analysis and detection of its drivers using geospa...
Land use land cover change analysis and detection of its drivers using geospa...
 
NuGOweek 2024 full programme - hosted by Ghent University
NuGOweek 2024 full programme - hosted by Ghent UniversityNuGOweek 2024 full programme - hosted by Ghent University
NuGOweek 2024 full programme - hosted by Ghent University
 
Isolation of AMF by wet sieving and decantation method pptx
Isolation of AMF by wet sieving and decantation method pptxIsolation of AMF by wet sieving and decantation method pptx
Isolation of AMF by wet sieving and decantation method pptx
 
Jet reorientation in central galaxies of clusters and groups: insights from V...
Jet reorientation in central galaxies of clusters and groups: insights from V...Jet reorientation in central galaxies of clusters and groups: insights from V...
Jet reorientation in central galaxies of clusters and groups: insights from V...
 
Hemoglobin metabolism: C Kalyan & E. Muralinath
Hemoglobin metabolism: C Kalyan & E. MuralinathHemoglobin metabolism: C Kalyan & E. Muralinath
Hemoglobin metabolism: C Kalyan & E. Muralinath
 
Virulence Analysis of Citrus canker caused by Xanthomonas axonopodis pv. citr...
Virulence Analysis of Citrus canker caused by Xanthomonas axonopodis pv. citr...Virulence Analysis of Citrus canker caused by Xanthomonas axonopodis pv. citr...
Virulence Analysis of Citrus canker caused by Xanthomonas axonopodis pv. citr...
 
Biochemistry and Biomolecules - Science - 9th Grade by Slidesgo.pptx
Biochemistry and Biomolecules - Science - 9th Grade by Slidesgo.pptxBiochemistry and Biomolecules - Science - 9th Grade by Slidesgo.pptx
Biochemistry and Biomolecules - Science - 9th Grade by Slidesgo.pptx
 
Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...Climate extremes likely to drive land mammal extinction during next supercont...
Climate extremes likely to drive land mammal extinction during next supercont...
 
Detectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a TechnosignatureDetectability of Solar Panels as a Technosignature
Detectability of Solar Panels as a Technosignature
 
Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!Quantifying Artificial Intelligence and What Comes Next!
Quantifying Artificial Intelligence and What Comes Next!
 
Triploidy ...............................pptx
Triploidy ...............................pptxTriploidy ...............................pptx
Triploidy ...............................pptx
 
A Giant Impact Origin for the First Subduction on Earth
A Giant Impact Origin for the First Subduction on EarthA Giant Impact Origin for the First Subduction on Earth
A Giant Impact Origin for the First Subduction on Earth
 
Lubrication System in forced feed system
Lubrication System in forced feed systemLubrication System in forced feed system
Lubrication System in forced feed system
 
Plasmapheresis - Dr. E. Muralinath - Kalyan . C.pptx
Plasmapheresis - Dr. E. Muralinath - Kalyan . C.pptxPlasmapheresis - Dr. E. Muralinath - Kalyan . C.pptx
Plasmapheresis - Dr. E. Muralinath - Kalyan . C.pptx
 
PLANT DISEASE MANAGEMENT PRINCIPLES AND ITS IMPORTANCE
PLANT DISEASE MANAGEMENT PRINCIPLES AND ITS IMPORTANCEPLANT DISEASE MANAGEMENT PRINCIPLES AND ITS IMPORTANCE
PLANT DISEASE MANAGEMENT PRINCIPLES AND ITS IMPORTANCE
 
WASP-69b’s Escaping Envelope Is Confined to a Tail Extending at Least 7 Rp
WASP-69b’s Escaping Envelope Is Confined to a Tail Extending at Least 7 RpWASP-69b’s Escaping Envelope Is Confined to a Tail Extending at Least 7 Rp
WASP-69b’s Escaping Envelope Is Confined to a Tail Extending at Least 7 Rp
 
VILLAGE ATTACHMENT For rural agriculture PPT.pptx
VILLAGE ATTACHMENT For rural agriculture  PPT.pptxVILLAGE ATTACHMENT For rural agriculture  PPT.pptx
VILLAGE ATTACHMENT For rural agriculture PPT.pptx
 
Alternative method of dissolution in-vitro in-vivo correlation and dissolutio...
Alternative method of dissolution in-vitro in-vivo correlation and dissolutio...Alternative method of dissolution in-vitro in-vivo correlation and dissolutio...
Alternative method of dissolution in-vitro in-vivo correlation and dissolutio...
 
Manganese‐RichSandstonesasanIndicatorofAncientOxic LakeWaterConditionsinGale...
Manganese‐RichSandstonesasanIndicatorofAncientOxic  LakeWaterConditionsinGale...Manganese‐RichSandstonesasanIndicatorofAncientOxic  LakeWaterConditionsinGale...
Manganese‐RichSandstonesasanIndicatorofAncientOxic LakeWaterConditionsinGale...
 

MOOCdb: Developing Data Standards for MOOCs

  • 1. MOOCdb: Developing Data Standards for MOOCs Srećko Joksimović s.joksimovic@ed.ac.uk @s_joksimovic Kalyan Veeramachaneni kalyan@csail.mit.edu Dragan Gašević dragan.gasevic@ed.ac.uk FutureLearn Academic Network Conference 15 June 2015
  • 3. Data sources Weekly data packages auth_user-{site}-analytics.sql auth_userprofile-{site}-analytics.sql certificates_generatedcertificate-{site}-analytics.sql Daily data packages course_structure-{site}-analytics.json courseware_studentmodule-{site}-analytics.sql email_opt_in-{site}-analytics.csv student_courseenrollment-{site}-analytics.sql user_api_usercoursetag-{site}-analytics.sql user_id_map-{site}-analytics.sql {org}-{course}-{date}-{site}.mongo wiki_article-{site}-analytics.sql wiki_articlerevision-{site}-analytics.sql {org}-{site}-events-{date}.log.gz.gpg
  • 4. Challenges • Analytics across several courses • Analytics across different platforms • Analytics across different institutions • Sharing data
  • 5. Solution? • Collaborative data science platform – Standardize data storage – Generalizable across courses and data providers (currently OpenEdX, edX and Coursera) – “Data being shared without data being exchanged” – Sharing and reproducing the results
  • 7. MOOCdb Observing mode - Observed Events table - Resources table - Resources Types table - URLs table - Resource URLs table Submitting mode - Problem Types table - Problems table - Submissions table - Assessments table Collaborating mode - Collaborations table - Collaboration Types table Feedback mode - Feedbacks table - Questions table - Answers table - Surveys table User information - User PII table - Global User table - Course User table http://moocdb.csail.mit.edu/wiki/index.php?title=MOOCdb
  • 8. Collaborative platform and applications edX Coursera MOOCdb doc Github repo Feature factory LabelMe Digital learner quantified Problem analytics My MOOCViz Social network analysis Forum analysis Dropout prediction
  • 9. Current state • Established network of institutions – MIT, Stanford, University of Michigan, University of Edinburgh, University of Queensland, University of Texas (Austin) • Release of open source software • Development and release of the first data analytics framework
  • 10. Next steps Digital Learner Quantified Discussion forum analysis LabelMe Problem analytics Dropout prediction Social network analysis
  • 11. Collaboration • If you are interested in… – Development – Feature modeling – Translating your data – Testing kalyan@csail.mit.edu s.Joksimovic@ed.ac.uk
  • 12. Q&A MOOCdb: Developing Data Standards for MOOCs Srećko Joksimović s.joksimovic@ed.ac.uk @s_joksimovic Kalyan Veeramachaneni kalyan@csail.mit.edu Dragan Gašević dragan.gasevic@ed.ac.uk FutureLearn Academic Network Conference 15 June 2015

Notas del editor

  1. MOOCdb which is our solution to centralizing and generalizing MOOC data organization and providing general purpose analytics for MOOC education research.
  2. “How does amount of time spent on the videos during a certain week correlate to performance on the homework?”
  3. CAN WE HAVE STANDARDIZED DATA STORAGE? Sharing and reproducing the results: When they publish research, analysts can share the scripts by depositing them into a public archive where they are retrievable and cross-referenced to their donor and publication.
  4. The MOOCdb project aims to brings together educational researchers, computer science researchers, machine learning researchers, technologists, database and big data experts to advance MOOC data science. The project founded at MIT includes a platform agnostic functional data model for data exhaust from MOOCs, a collaborative-open source-open access data visualization framework, a crowd sourced knowledge discovery framework and a privacy preserving software framework. The team is currently working to release a number of these tools and frameworks as open source. WHAT MOOCdb PROVIDES? Concise data storage: MOOCdb's proposed schema is \loss-less" with respect to research relevant information, i.e. no information is lost in translating raw data to it. Access Control Levels for Anonymized Data: The data schema offers an organized means of structuring anonymized user identities safeguard them further.
  5. Sharing of data extraction scripts: Scripts for data extraction and descriptive statistics extraction can be open source and shared by everyone because they reference data organized according to the schema. Crowd source potential: Machine learning frequently involves humans identifying explanatory variables that could drive a response. Enabling the crowd to help propose variables could greatly scale the com- munity's progress in mining MOOC data.