Implementing SpokenMedia for the Indian Institute for Human Settlements

•Descargar como PPT, PDF•

1 recomendación•1,430 vistas

The SpokenMedia project implemented a proof-of-concept demonstration of its automatic lecture transcription and video player technologies for the Indian Institute for Human Settlements. This presentation describes the development of the technologies, production of the automatic lecture transcripts, deployment of the transcripts via a video player, initial reactions and future directions. Presented remotely by Brandon Muramatsu at the Technology for Education Conference, Mumbai, India, July 1, 2010

Educación Tecnología

Implementing SpokenMedia for the Indian Institute for Human Settlements Brandon Muramatsu Andrew McKinney Peter Wilkins July 2010 Unless otherwise specified this work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License ( creativecommons.org/licenses/by-nc-sa/3.0/us/ ) Citation: Muramatsu, B., McKinney, A., Wilkins, P. (2010). Implementing SpokenMedia for the Indian Institute for Human Settlements. Presented at Technology for Education 2010: Mumbai, India, July 1, 2010.

Case Study of Using SpokenMedia for IIHS ,[object Object],[object Object],[object Object],[object Object],[object Object]

MIT Office for Educational Innovation and Technology ,[object Object],[object Object],[object Object],[object Object],Experiment Incubate Transition Service

SpokenMedia Project Experiment Incubate Transition Service ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

The Indian Institute for Human Settlements (IIHS) will… “create India’s first independent National Innovation University focused on the challenges and opportunities of its urbanisation.” – Indian Institute for Human Settlements: Curriculum Framework Version 3.0 January 2010

“ The IIHS Website is our commitment to a different way of looking at things.” – Aromar Revi 5 January 2010

“ The Institution will fail or scale based on language.” – Aromar Revi 5 January 2010

What did we do? Auto Transcribe Edit Translate Play

How did we do it? Auto Transcribe Edit Translate Play

How do we do it? Spoken Lecture Research ,[object Object],[object Object],[object Object],[object Object],[object Object],James Glass [email_address] Supported with iCampus MIT/Microsoft Alliance funding

SpokenMedia Process We used a portion of the SpokenMedia process for the demo

Edit & Translate: Accuracy Automatic Transcription Hand Transcription Time Adjusted Translated Hindi I I I मेरे खयाल से think think think once one one नयोजन की एक मुख्य चुनौती है and central so challenge central the of challenger planning challenge of planning is planning nice legitimacy is legitimacy of legitimacy of of government government सरकार की एक ऐसी मुख्य संस्थान के रूप में वैधता government as as

Automatic Speech Recognition Accuracy ,[object Object],[object Object],[object Object],[object Object],[object Object]

The Player ,[object Object],[object Object],[object Object],[object Object],Search within Transcript Highlight text as spoken Switch transcript to different language(s)

SpokenMedia (circa January) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Challenges ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Where are we heading? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Check it out for yourself ,[object Object],[object Object],[object Object]

Thank You! Brandon Muramatsu, [email_address] Andrew McKinney, [email_address] Peter Wilkins, [email_address] ° Unless otherwise specified this work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License ( creativecommons.org/licenses/by-nc-sa/3.0/us/ )

Más contenido relacionado

Destacado

IIHS Open Framework-SpokenMediaBrandon Muramatsu

Settlement Characteristicswhiskeyhj

Unit 1 planning c oncepts pptEzhil Tamizh

Rural housing in indiakumaresan2704

Evolution of settlementsyusra_gul

Human Settlementspeyne

Rural development pptManish Kumar Sinha

Destacado (7)

IIHS Open Framework-SpokenMedia

Settlement Characteristics

Unit 1 planning c oncepts ppt

Rural housing in india

Evolution of settlements

Human Settlements

Rural development ppt

Similar a Implementing SpokenMedia for the Indian Institute for Human Settlements

How to Get Online Course Translations of the Highest QualityCommLab India – Rapid eLearning Solutions

visH (fin).pptxtefflontrolegdy

AI secrets - become a pro at using AI with these slidesstaskeviciustomas121

Rapid Innovative Design Notesspotlearning

TechnologyGillian Lord

Video Captioning for Accessibility: University of Florida and Regis Universit...3Play Media

Master eLearning Translation in 7 Simple Stepssaikumarmba2023

IRJET - Analysis on Code-Mixed Data for Movie ReviewsIRJET Journal

UPDATED! Microsoft Education - ATIAMicrosoft Accessibility

Proposal presentation.pptxNhlakanipho Majola

Microsoft Education - ATIAMicrosoft Accessibility

Intelligent ChatBotantimo musone

Voice Tech TO #1Voice Tech Global

An Application for Performing Real Time Speech Translation in Mobile EnvironmentAssociation of Scientists, Developers and Faculties

Tips for Creating an Accessible & Engaging Virtual Classroom3Play Media

Intro to watson bluemix servicesVikas Manoria

Video Accessibility Toolkit for Success in a Virtual Environment3Play Media

VloggingAiden Yeh

Ebook media-and-entertainmentBnsplBraahmam

Subtitles You tube Videos.pdfmikemichael30

Similar a Implementing SpokenMedia for the Indian Institute for Human Settlements (20)

How to Get Online Course Translations of the Highest Quality

visH (fin).pptx

AI secrets - become a pro at using AI with these slides

Rapid Innovative Design Notes

Technology

Video Captioning for Accessibility: University of Florida and Regis Universit...

Master eLearning Translation in 7 Simple Steps

IRJET - Analysis on Code-Mixed Data for Movie Reviews

UPDATED! Microsoft Education - ATIA

Proposal presentation.pptx

Microsoft Education - ATIA

Intelligent ChatBot

Voice Tech TO #1

An Application for Performing Real Time Speech Translation in Mobile Environment

Tips for Creating an Accessible & Engaging Virtual Classroom

Intro to watson bluemix services

Video Accessibility Toolkit for Success in a Virtual Environment

Vlogging

Ebook media-and-entertainment

Subtitles You tube Videos.pdf

Más de Brandon Muramatsu

Digital Credentials Enabling Mobility and Verification of Educational Achieve...Brandon Muramatsu

Sustainability of OER Initiatives: An Interactive DiscussionBrandon Muramatsu

Bridging the Gap: Mixing approaches, content and tools to help college studentsBrandon Muramatsu

Federations & Backstage: Thoughts for a Geoscience Education InfrastructureBrandon Muramatsu

The Connected Learning Initiative Quality at Scale in IndiaBrandon Muramatsu

Strategic Education Initiatives, MIT Open LearningBrandon Muramatsu

Open Embedded Assessments:Play, Author; Anywhere, AnytimeBrandon Muramatsu

DXtera Enabling Digital ExchangeBrandon Muramatsu

Evaluating and Selecting Digital Learning ResourcesBrandon Muramatsu

Online Recitation SessionsBrandon Muramatsu

Connected Learning Initiative: Learning at ScaleBrandon Muramatsu

CLIx-Connected Learning IntiativeBrandon Muramatsu

The Best of Both Worlds: Transforming OpenCourseWare in an age of InteractivityBrandon Muramatsu

Innovative Educational Technologyand Educational Infrastructureat MITBrandon Muramatsu

Strategic Education InitiativesBrandon Muramatsu

Workshop: Emerging Possibilities and Takeaways for KFUPMBrandon Muramatsu

Workshop: Lessons from Online and edX / MITx CoursesBrandon Muramatsu

Workshop: Design Considerations for Online / Digital CoursesBrandon Muramatsu

Workshop: Educational Technology Opportunities for KFUPMBrandon Muramatsu

Más de Brandon Muramatsu (20)

Digital Credentials Enabling Mobility and Verification of Educational Achieve...

Sustainability of OER Initiatives: An Interactive Discussion

Bridging the Gap: Mixing approaches, content and tools to help college students

Federations & Backstage: Thoughts for a Geoscience Education Infrastructure

The Connected Learning Initiative Quality at Scale in India

Strategic Education Initiatives, MIT Open Learning

Open Embedded Assessments:Play, Author; Anywhere, Anytime

DXtera Enabling Digital Exchange

Evaluating and Selecting Digital Learning Resources

Online Recitation Sessions

Connected Learning Initiative: Learning at Scale

CLIx-Connected Learning Intiative

The Best of Both Worlds: Transforming OpenCourseWare in an age of Interactivity

Innovative Educational Technologyand Educational Infrastructureat MIT

Strategic Education Initiatives

Workshop: Emerging Possibilities and Takeaways for KFUPM

Workshop: Lessons from Online and edX / MITx Courses

Workshop: Design Considerations for Online / Digital Courses

Workshop: Educational Technology Opportunities for KFUPM

Último

Spellings Wk 3 English CAPS CARES Please PractiseAnaAcapella

Fostering Friendships - Enhancing Social Bonds in the ClassroomPooky Knightsmith

The basics of sentences session 3pptx.pptxheathfieldcps1

SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxAmanpreet Kaur

Holdier Curriculum Vitae (April 2024).pdfagholdier

Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University of Engineering & Technology, Jamshoro

ComPTIA Overview | Comptia Security+ Book SY0-701bronxfugly43

Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417

How to Give a Domain for a Field in Odoo 17Celine George

Spatium Project Simulation student briefAssociation for Project Management

Graduate Outcomes Presentation Slides - Englishneillewis46

Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic

FSB Advising Checklist - Orientation 2024Elizabeth Walsh

Application orientated numerical on hev.pptRamjanShidvankar

Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam

Towards a code of practice for AI in AT.pptxJisc

How to Create and Manage Wizard in Odoo 17Celine George

Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh

How to Manage Global Discount in Odoo 17 POSCeline George

Salient Features of India constitution especially power and functionsKarakKing

Implementing SpokenMedia for the Indian Institute for Human Settlements

1. Implementing SpokenMedia for the Indian Institute for Human Settlements Brandon Muramatsu Andrew McKinney Peter Wilkins July 2010 Unless otherwise specified this work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License ( creativecommons.org/licenses/by-nc-sa/3.0/us/ ) Citation: Muramatsu, B., McKinney, A., Wilkins, P. (2010). Implementing SpokenMedia for the Indian Institute for Human Settlements. Presented at Technology for Education 2010: Mumbai, India, July 1, 2010.

5. The Indian Institute for Human Settlements (IIHS) will… “create India’s first independent National Innovation University focused on the challenges and opportunities of its urbanisation.” – Indian Institute for Human Settlements: Curriculum Framework Version 3.0 January 2010

6. “ The IIHS Website is our commitment to a different way of looking at things.” – Aromar Revi 5 January 2010

7. “ The Institution will fail or scale based on language.” – Aromar Revi 5 January 2010

8. What did we do? Auto Transcribe Edit Translate Play

9. The Demo

10. How did we do it? Auto Transcribe Edit Translate Play

11.

12. SpokenMedia Process We used a portion of the SpokenMedia process for the demo

13. How did we do it? Auto Transcribe Edit Translate Play

14. Edit & Translate: Accuracy Automatic Transcription Hand Transcription Time Adjusted Translated Hindi I I I मेरे खयाल से think think think once one one नयोजन की एक मुख्य चुनौती है and central so challenge central the of challenger planning challenge of planning is planning nice legitimacy is legitimacy of legitimacy of of government government सरकार की एक ऐसी मुख्य संस्थान के रूप में वैधता government as as

15.

16. How did we do it? Auto Transcribe Edit Translate Play

17.

18.

19.

20. SpokenMedia (July 2010)

21.

22.

23. Thank You! Brandon Muramatsu, [email_address] Andrew McKinney, [email_address] Peter Wilkins, [email_address] ° Unless otherwise specified this work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License ( creativecommons.org/licenses/by-nc-sa/3.0/us/ )

Notas del editor

Citation: Muramatsu, B., McKinney, A., Wilkins, P. (2010). Implementing SpokenMedia for the Indian Institute for Human Settlements. Presented at Technology for Education 2010: Mumbai, India, July 1, 2010. Unless otherwise specified, this work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License
Spoken Lecture Project Supported by iCampus Includes the browser (which was just demo’d) the processor (back end lecture transcription) and a hand workflow to do the processing Approximately 400 hours of video indexed SpokenMedia • Technology transfer—get code running outside of original environment • 4 components: automatic lecture transcription, search, player, transcript editor • Suuported by iCampus SpokenMedia as a Service? • Still investigating
Four step process (simple) First we used the SpokenMedia to do automatic transcription. Next we did hand edit and translation steps. Then we created a player for the presentation of the video and transcripts…
Demo In this demo, we see a video interview with Prof. Bish Sanyal from MIT. We’ll see three things in this demo: • As Prof. Sanyal speaks, we will see the text in the transcript highlighted and the highlighting will follow along (“bouncing ball”) • Switching the transcript from English to a hand translation into Hindi that is synchronized with the audio, as the switch occurs the playback is seamless • Searching within the transcript, after entering the search term and selecting the result, the video and transcript seek to that point in the video and playback continues
First we used the SpokenMedia to do automatic transcription.
Lecture Transcription Jim Glass and his group have years of research experience for spoken languages Lectures are a different type of spoken language Much of the speech recognition research has focused on real time transcription of news broadcasts, or interactive voice response systems (telephone) Broadcast news has something like 300 unique words in an hour long broadcast Broadcast news is well structured, prepared copy (in studio via teleprompters), clear transitions between speakers, etc. Lectures are conversational and spontaneous Can use highly specialized vocabularies, engineering, physical sciences, mathematics
We only used part of the process due to time constraints. Audio separation, speech processing, time-coded transcript, and then presentation through a SpokenMedia player.
Next we did hand edit and translation steps.
For this demo, we did computer-based automatic transcription, sent a file to IIHS for “editing” that consisted of performing a hand transcription (due to the format we sent, and the low accuracy of the automatic transcription in this case), a time alignment (though I actually feel that it’s “off” or “slow”) and then a hand translation by IIHS. Automatic transcription is in the ~50-55% accuracy range (by way of comparison YouTube Auto Caption for this same video is ~68% accuracy).
Recognizer Accuracy Base accuracy is approximately 50% (generic domain and speaker models) Increase accuracy with speaker model up to 80-85%, and specific domain model This approach is good for courses with multiple lectures by the same speaker Domain models get more useful as more relevant text documents are indexed (keyword/noun phrase extraction) Initial results indicate that doing one 99% accurate (by hand/manual) transcript can help immensely for additional lectures by the same speaker Better use of limited resources Search accuracy is closer to 90%, searches tend to be for unique words which the processor is better at recognizing
Then we created a player for the presentation of the video and transcripts…
SpokenMedia Player 1.0 • Video-linked transcript • Highlighted text follows along as the speaker speaks • Switch transcript to a different transcript track seamlessly during playback • Search within a transcript
What we have today It’s not perfect, but a pretty good start Prototype has a number of useful features that demonstrate search interfaces and interaction interfaces
The bright yellow indicates features working in the last two months… • Text processing to create a custom domain model • Creation of custom acoustic models in unsupervised mode • Updated speech recognition software The gray indicates features we’ve had working since December 2009 and that were used for IIHS The light yellow indicates features on which we’ve just started working. • Accuracy measurement • SpokenMedia Search (search across multiple videos) • Transcript Editor
Where are we heading? Transition from research project to service Explore new interactions—what we’re calling Rich Media Notebooks
Citation: Muramatsu, B., McKinney, A., Wilkins, P. (2010). Implementing SpokenMedia for the Indian Institute for Human Settlements. Presented at Technology for Education 2010: Mumbai, India, July 1, 2010. Unless otherwise specified, this work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License

Implementing SpokenMedia for the Indian Institute for Human Settlements

Recomendados

Recomendados

Más contenido relacionado

Destacado

Destacado (7)

Similar a Implementing SpokenMedia for the Indian Institute for Human Settlements

Similar a Implementing SpokenMedia for the Indian Institute for Human Settlements (20)

Más de Brandon Muramatsu

Más de Brandon Muramatsu (20)

Último

Último (20)

Implementing SpokenMedia for the Indian Institute for Human Settlements

Notas del editor