AUTOMATED EXTRACTION OF LECTURE OUTLINES FROM LECTURE VIDEOS - A Hybrid Solution for Lecture Video Indexing

Haojin Yang, Franka Gruenewald, Christoph Meinel

2012

Abstract

Multimedia-based tele-teaching and lecture video portals have become increasingly popular in the last few years, and the amount of multimedia data available on the World Wide Web (WWW) is growing rapidly. Thus, finding lecture video data on the web or within a lecture video portal has become a significant and challenging task. In this paper, we present an approach for lecture video indexing based on automated video segmentation and extracted lecture outlines. First, we developed a novel video segmenter intended to extract the unique slide frames from the lecture video. Then we adopted video OCR (Optical Character Recognition) technology to recognize text in the video. Finally, we developed a novel method for extracting lecture outlines from the OCR transcripts. Both the video segments and the extracted lecture outlines are then used for video indexing. The accuracy of the proposed approach is demonstrated by evaluation.



Paper Citation


in Harvard Style

Yang H., Gruenewald F. and Meinel C. (2012). AUTOMATED EXTRACTION OF LECTURE OUTLINES FROM LECTURE VIDEOS - A Hybrid Solution for Lecture Video Indexing. In Proceedings of the 4th International Conference on Computer Supported Education - Volume 1: CSEDU, ISBN 978-989-8565-06-8, pages 13-22. DOI: 10.5220/0003903700130022


in Bibtex Style

@conference{csedu12,
author={Haojin Yang and Franka Gruenewald and Christoph Meinel},
title={AUTOMATED EXTRACTION OF LECTURE OUTLINES FROM LECTURE VIDEOS - A Hybrid Solution for Lecture Video Indexing},
booktitle={Proceedings of the 4th International Conference on Computer Supported Education - Volume 1: CSEDU},
year={2012},
pages={13-22},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003903700130022},
isbn={978-989-8565-06-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 4th International Conference on Computer Supported Education - Volume 1: CSEDU
TI - AUTOMATED EXTRACTION OF LECTURE OUTLINES FROM LECTURE VIDEOS - A Hybrid Solution for Lecture Video Indexing
SN - 978-989-8565-06-8
AU - Yang H.
AU - Gruenewald F.
AU - Meinel C.
PY - 2012
SP - 13
EP - 22
DO - 10.5220/0003903700130022

