Visual Analytics for Narrative Text - Visualizing Characters and their Relationships as Extracted from Novels

Markus John, Steffen Lohmann, Steffen Koch, Michael Wörner, Thomas Ertl

Abstract

The study of novels and the analysis of their plots, characters, and other entities are time-consuming and complex tasks in literary science. The digitization of literature and the proliferation of electronic books provide new opportunities to support these tasks with visual abstractions. Methods from the field of computational linguistics can be used to automatically extract entities and their relations from digitized novels, which can then be visualized to ease exploration and analysis tasks. This paper presents a web-based approach that combines automatic analysis methods with effective visualization techniques. It provides different views on the extracted entities and indicates the relations between them across the plot. Two usage scenarios show successful applications of the approach and demonstrate its benefits and limitations.


Paper Citation


in Harvard Style

John M., Lohmann S., Koch S., Wörner M. and Ertl T. (2016). Visual Analytics for Narrative Text - Visualizing Characters and their Relationships as Extracted from Novels. In Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: IVAPP, (VISIGRAPP 2016), ISBN 978-989-758-175-5, pages 27-38. DOI: 10.5220/0005669800270038


in BibTeX Style

@conference{ivapp16,
author={Markus John and Steffen Lohmann and Steffen Koch and Michael Wörner and Thomas Ertl},
title={Visual Analytics for Narrative Text - Visualizing Characters and their Relationships as Extracted from Novels},
booktitle={Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: IVAPP, (VISIGRAPP 2016)},
year={2016},
pages={27-38},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005669800270038},
isbn={978-989-758-175-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: IVAPP, (VISIGRAPP 2016)
TI - Visual Analytics for Narrative Text - Visualizing Characters and their Relationships as Extracted from Novels
SN - 978-989-758-175-5
AU - John M.
AU - Lohmann S.
AU - Koch S.
AU - Wörner M.
AU - Ertl T.
PY - 2016
SP - 27
EP - 38
DO - 10.5220/0005669800270038