VISUAL ANALYSIS OF DOCUMENT TRIAGE DATA

Zhao Geng, Robert S. Laramee, Fernando Loizides, George Buchanan

Abstract

Considerable effort has been invested in studying how information seekers search through documents in order to assess their relevance. This search and assessment of document relevance, known as document triage, is an important information seeking process, but it is not yet well understood. Human-computer interaction (HCI) and digital library scientists have undertaken a series of user studies involving information seeking and have collected a large amount of data describing information seekers’ behavior during document search. At the same time, we have witnessed a rapid increase in the number of off-the-shelf visualization tools that can benefit the study of document triage. Here we set out to use existing information visualization techniques and tools to gain a better understanding of the large amount of user-study data collected by HCI and digital library researchers. We describe the range of available tools and visualizations we use to increase our knowledge of document triage. Treemaps, parallel coordinates, stack graphs, matrix charts, and other visualization methods prove to be insightful in exploring, analyzing and presenting user behavior during document triage. Our findings and visualizations are evaluated by HCI and digital library researchers studying this problem.


Paper Citation


in Harvard Style

Geng Z., Laramee R. S., Loizides F. and Buchanan G. (2011). VISUAL ANALYSIS OF DOCUMENT TRIAGE DATA. In Proceedings of the International Conference on Imaging Theory and Applications and International Conference on Information Visualization Theory and Applications - Volume 1: IVAPP, (VISIGRAPP 2011) ISBN 978-989-8425-46-1, pages 151-163. DOI: 10.5220/0003320401510163


in Bibtex Style

@conference{ivapp11,
author={Zhao Geng and Robert S. Laramee and Fernando Loizides and George Buchanan},
title={VISUAL ANALYSIS OF DOCUMENT TRIAGE DATA},
booktitle={Proceedings of the International Conference on Imaging Theory and Applications and International Conference on Information Visualization Theory and Applications - Volume 1: IVAPP, (VISIGRAPP 2011)},
year={2011},
pages={151-163},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003320401510163},
isbn={978-989-8425-46-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Imaging Theory and Applications and International Conference on Information Visualization Theory and Applications - Volume 1: IVAPP, (VISIGRAPP 2011)
TI - VISUAL ANALYSIS OF DOCUMENT TRIAGE DATA
SN - 978-989-8425-46-1
AU - Geng Z.
AU - Laramee R. S.
AU - Loizides F.
AU - Buchanan G.
PY - 2011
SP - 151
EP - 163
DO - 10.5220/0003320401510163