Expressive Talking Head for Interactive Conversational Systems

Paula Dornhofer Paro Costa, José Mario De Martino

2014

Abstract

The synthesis of expressive speech videorealistic facial animation remains a challenging problem in computer graphics. The objective of this work is to propose a new synthesis methodology for an expressive talking head based on the manipulation of photographs, also referred as 2D facial animation. Our focus is directed to applications where the talking head act as an embodied conversational agent, with the ultimate goal of creating animated faces capable of inspiring user thrust and empathy.

References

  1. Anderson, R., Stenger, B., Wan, V., and Cipolla, R. (2013). Expressive Visual Text-to-Speech Using Active Appearance Models. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 3382- 3389.
  2. Beskow, J. and Nordenberg, M. (2005). Data-driven synthesis of expressive visual speech using an MPEG-4 talking head. In INTERSPEECH, pages 793-796. ISCA.
  3. Brand, M. (1999). Voice puppetry. In SIGGRAPH 7899: Proceedings of the 26th annual conference on Computer graphics and interactive techniques, pages 21-28, New York, NY, USA. ACM Press/Addison-Wesley Publishing Co.
  4. Bregler, C., Covell, M., and Slaney, M. (1997). Video Rewrite: driving visual speech with audio. In SIGGRAPH, pages 353-360.
  5. Cao, Y., Tien, W. C., Faloutsos, P., and Pighin, F. H. (2005). Expressive speech-driven facial animation. ACM Transactions on Graphics, 24(4):1283-1302.
  6. Chuang, E. and Bregler, C. (2005). Mood swings: expressive speech animation. ACM Transactions on Graphics, 24(2):331-347.
  7. Cosatto, E. and Graf, H. P. (2000). Photo-Realistic TalkingHeads from Image Samples. IEEE Transactions on Multimedia, 2(3):152-163.
  8. De Martino, J. M., Magalha˜es, L. P., and Violaro, F. (2006). Facial animation based on context-dependent visemes. Computer & Graphics, 30:971-980.
  9. Deng, Z., Neumann, U., Lewis, J., Kim, T.-Y., Bulut, M., and Narayanan, S. (2006). Expressive Facial Animation Synthesis by Learning Speech Coarticulation and Expression Spaces. IEEE Transactions on Visualization and Computer Graphics, 12(6):1523-1534.
  10. Ekman, P. (1972). Universals and Cultural Differences in Facial Expressions of Emotion. In Proceedings of the Nebraska Symposium on Motivation, number 19, pages 207-282. Lincoln University of Nebraska Press.
  11. Ezzat, T., Geiger, G., and Poggio, T. (2002). Trainable videorealistic speech animation. In SIGGRAPH, pages 388-398.
  12. Ezzat, T. and Poggio, T. (1998). MikeTalk: A Talking Facial Display Based on Morphing Visemes. In CA, pages 96-102.
  13. Jia, J., Zhang, S., Meng, F., Wang, Y., and Cai, L. (2011). Emotional Audio-Visual Speech Synthesis Based on PAD. IEEE Transactions on Audio, Speech & Language Processing, 19(3):570-582.
  14. Ortony, A., Clore, G., and Collins, A. (1988). Cognitive Structure of Emotions. Cambridge University Press.
  15. Parke, F. I. (1972). Computer generated animation of faces. In ACM'72: Proceedings of the ACM annual conference, pages 451-457, New York, NY, USA. ACM Press.
  16. Pasquariello, S. and Pelachaud, C. (2002). Greta: A simple facial animation engine. Soft Computing and Industry, pages 511-525.
Download


Paper Citation


in Harvard Style

Dornhofer Paro Costa P. and De Martino J. (2014). Expressive Talking Head for Interactive Conversational Systems . In Doctoral Consortium - DCVISIGRAPP, (VISIGRAPP 2014) ISBN Not Available, pages 20-24


in Bibtex Style

@conference{dcvisigrapp14,
author={Paula Dornhofer Paro Costa and José Mario De Martino},
title={Expressive Talking Head for Interactive Conversational Systems},
booktitle={Doctoral Consortium - DCVISIGRAPP, (VISIGRAPP 2014)},
year={2014},
pages={20-24},
publisher={SciTePress},
organization={INSTICC},
doi={},
isbn={Not Available},
}


in EndNote Style

TY - CONF
JO - Doctoral Consortium - DCVISIGRAPP, (VISIGRAPP 2014)
TI - Expressive Talking Head for Interactive Conversational Systems
SN - Not Available
AU - Dornhofer Paro Costa P.
AU - De Martino J.
PY - 2014
SP - 20
EP - 24
DO -