Opening Up IIHS Video with SpokenMedia

Unless otherwise specified this work is licensed under a Creative Commons Attribution-Noncommercial-Share Alike 3.0 United States License (creativecommons.org/licenses/by-nc-sa/3.0/us/).

Citation: Muramatsu, B., McKinney, A., Wilkins, P. (2010). Opening Up IIHS Video with SpokenMedia. Presented at OpenCourseWare Consortium Global 2010: Hanoi, Vietnam, May 7, 2010.

description

The Indian Institute for Human Settlements (IIHS, www.iihs.co.in) and the SpokenMedia (spokenmedia.mit.edu) team from the MIT Office of Educational Innovation and Technology (OEIT) have been discussing how IIHS might use SpokenMedia technologies as a cost-effective way to make video and audio course materials accessible to the diverse student body IIHS expects. This presentation provides a case study of the proof-of-concept demonstration SpokenMedia developed for IIHS. Presented by Brandon Muramatsu at OCWC Global 2010, Hanoi, Vietnam, May 5, 2010.

Transcript of Opening Up IIHS Video with SpokenMedia

Page 1: Opening Up IIHS Video with SpokenMedia

Opening Up IIHS Video with SpokenMedia

Brandon Muramatsu

Andrew McKinney

Peter Wilkins

May 2010

Citation: Muramatsu, B., McKinney, A., Wilkins, P. (2010). Opening Up IIHS Video with SpokenMedia. Presented at OpenCourseWare Consortium Global 2010: Hanoi, Vietnam, May 7, 2010.

Page 2: Opening Up IIHS Video with SpokenMedia

Case Study of Using SpokenMedia for IIHS

Demonstrate transcripts and translations of IIHS videos

Describe the process and our experiences: Transcribe -> Edit -> Translate -> Present

Page 3: Opening Up IIHS Video with SpokenMedia

The Indian Institute for Human Settlements (IIHS) will… “create India’s first independent National Innovation University focused on the challenges and opportunities of its urbanisation.”

– Indian Institute for Human Settlements: Curriculum Framework Version 3.0, January 2010

Page 4: Opening Up IIHS Video with SpokenMedia

“The IIHS Website is our commitment to a different way of looking at things.”

– Aromar Revi, 5 January 2010

Page 5: Opening Up IIHS Video with SpokenMedia

“The Institution will fail or scale based on language.”

– Aromar Revi, 5 January 2010

Page 6: Opening Up IIHS Video with SpokenMedia

What did we do?

AutoTranscribe -> Edit -> Translate -> Present

Page 7: Opening Up IIHS Video with SpokenMedia

The Demo

Page 8: Opening Up IIHS Video with SpokenMedia

How did we do it?

AutoTranscribe -> Edit -> Translate -> Present

Page 9: Opening Up IIHS Video with SpokenMedia

How do we do it? Lecture Transcription

• Spoken Lecture: research project

• Speech recognition & automated transcription of lectures

• Why lectures?

– Conversational, spontaneous, starts/stops

– Different from broadcast news, other types of speech recognition

– Specialized vocabularies

James [email protected]

Page 10: Opening Up IIHS Video with SpokenMedia

Spoken Lecture Project

• Processor, browser, workflow

• Prototyped with lecture & seminar video

– MIT OCW (~300 hours, lectures)

– MIT World (~80 hours, seminar speakers)

Supported with iCampus MIT/Microsoft Alliance funding

James [email protected]

Page 11: Opening Up IIHS Video with SpokenMedia

SpokenMedia Process

We used a portion of the SpokenMedia process for the demo

Page 12: Opening Up IIHS Video with SpokenMedia

How did we do it?

AutoTranscribe -> Edit -> Translate -> Present

Page 13: Opening Up IIHS Video with SpokenMedia

Edit & Translate: Accuracy

Automatic Transcription: I think once and so the challenger planning nice legitimacy of government

Hand Transcription: I think one central challenge of planning is legitimacy of government as

Time Adjusted: I think one central challenge of planning is legitimacy of government as

Translated Hindi: मेरे खयाल से / नियोजन की एक मुख्य चुनौती है / सरकार की एक ऐसे मुख्य संस्थान के रूप में वैधता
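
One way to put a number on the gap between the automatic and hand transcriptions on this slide is word error rate (WER): the word-level edit distance between the two, divided by the number of words in the hand transcript. The slides do not say which accuracy measure the SpokenMedia team used, so the choice of metric and the function name `word_error_rate` are assumptions; the two sample strings are the word columns from the slide. A minimal sketch:

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        cur = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = cur[j - 1] + 1  # extra word in hypothesis
            cur.append(min(substitution, deletion, insertion))
        prev = cur
    return prev[-1] / len(ref)


# Word columns taken from the slide.
HAND = "I think one central challenge of planning is legitimacy of government as"
AUTO = "I think once and so the challenger planning nice legitimacy of government"

if __name__ == "__main__":
    print(f"WER of automatic vs. hand transcript: {word_error_rate(HAND, AUTO):.1%}")
```

A lower value means the automatic transcript needs less hand editing before translation.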

Page 14: Opening Up IIHS Video with SpokenMedia

Automatic Speech Recognition Accuracy

Accuracy

Domain Model and Speaker Model

Internal validity measure

Seed with transcript

Ongoing research by Jim Glass and his team @ MIT

Page 15: Opening Up IIHS Video with SpokenMedia

How did we do it?

AutoTranscribe -> Edit -> Translate -> Present

Page 16: Opening Up IIHS Video with SpokenMedia

The Player

Simple Player

Hopes for more features

– Bookmarks

– Create snippets

Page 17: Opening Up IIHS Video with SpokenMedia

SpokenMedia today…

Features

• Video linked transcripts

• Automated Lecture Transcript creation

• Simple transcript editor (April 2010)

• SpokenMedia Player

– “Bouncing Ball” (underline text) follow along

– Search within a video

– Multiple transcript language support

Challenges

• Accuracy (partial toolset)

SpokenMedia Player could be used for MIT OCW Videos
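
A "bouncing ball" follow-along and search within a video both rest on a transcript whose words carry start times. The sketch below is not the SpokenMedia Player's code: the `Word` type, the function names, and the timings are all hypothetical, chosen only to illustrate the idea.

```python
from bisect import bisect_right
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Word:
    start: float  # seconds from the start of the video (made-up values below)
    text: str


def word_at(words: List[Word], t: float) -> Optional[int]:
    """Index of the word to highlight at playback time t.

    Returns None if t is before the first word starts.
    """
    starts = [w.start for w in words]
    i = bisect_right(starts, t) - 1
    return i if i >= 0 else None


def search(words: List[Word], query: str) -> List[float]:
    """Start times of every occurrence of the query phrase (case-insensitive)."""
    q = query.lower().split()
    if not q:
        return []
    texts = [w.text.lower() for w in words]
    return [
        words[i].start
        for i in range(len(texts) - len(q) + 1)
        if texts[i:i + len(q)] == q
    ]


# Hypothetical timings attached to the sentence from the accuracy slide.
TRANSCRIPT = [
    Word(0.0, "I"), Word(0.4, "think"), Word(0.8, "one"), Word(1.1, "central"),
    Word(1.6, "challenge"), Word(2.2, "of"), Word(2.4, "planning"),
    Word(3.0, "is"), Word(3.2, "legitimacy"), Word(3.9, "of"),
    Word(4.1, "government"),
]

if __name__ == "__main__":
    i = word_at(TRANSCRIPT, 1.0)
    print("highlight:", TRANSCRIPT[i].text if i is not None else None)
    print("seek targets for 'challenge of':", search(TRANSCRIPT, "challenge of"))
```

A player would call `word_at` on each playback time update to move the underline, and seek the video to one of the times returned by `search` when a viewer picks a result.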

Page 18: Opening Up IIHS Video with SpokenMedia

Where are we heading?

Improved accuracy

Search across multiple video transcripts

New players with bookmarking, annotation, “paper-based video”

Automate and improve processing > Starting a lecture transcription service

Page 19: Opening Up IIHS Video with SpokenMedia

Check it out for yourself

IIHS Demo: http://spokenmedia.mit.edu/demo/iihs/

SpokenMedia Website: http://spokenmedia.mit.edu/

Upload Videos for Automated Lecture Transcription: http://sm.mit.edu/upload

Page 20: Opening Up IIHS Video with SpokenMedia

Thank You!

Brandon Muramatsu, [email protected]

Andrew McKinney, [email protected]

Peter Wilkins, [email protected]
