This book contains the proceedings of the International Conference on Knowledge Discovery and Information Retrieval – KDIR, which was sponsored by the Institute for Systems and Technologies of Information, Control and Communication (INSTICC) and held in Barcelona, Spain. KDIR was organized in cooperation with the Association for the Advancement of Artificial Intelligence (AAAI). The main objective of the International Conference on Knowledge Discovery and Information Retrieval was to provide a point of contact for scientists, engineers and practitioners interested in the scientific and technical advancement of knowledge discovery and information retrieval both theoretically and in a broad range of application fields. Knowledge Discovery is an interdisciplinary area focusing upon methodologies for identifying valid, novel, potentially useful and meaningful patterns from data, often based on underlying large data sets. A major aspect of Knowledge Discovery is data mining, i.e.
applying data analysis and discovery algorithms that produce a particular enumeration of patterns (or models) over the data. Knowledge Discovery also includes the evaluation of patterns and identification of which add to knowledge. This has proven to be a promising approach for enhancing the intelligence of software systems and services. The ongoing rapid growth of online data due to the Internet and the widespread use of large databases have created an important need for knowledge discovery methodologies. The challenge of extracting knowledge from data draws upon research in a large number of disciplines including statistics, databases, pattern recognition, machine learning, data visualization, optimization, and high-performance computing, to deliver advanced business intelligence and web discovery solutions. Information retrieval (IR) is concerned with gathering relevant information from unstructured and semantically fuzzy data in texts and other media, searching for information within documents and for metadata about documents, as well as searching relational databases and the Web. Automation of information retrieval enables the reduction of what has been called "information overload". As always KDIR is one of the three integrated conferences that are co-located and constitute the International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K). The other two are KEOD - International Conference on Knowledge Engineering and Ontology Development and KMIS - International Conference on Knowledge Management and Information Sharing. The joint conference, IC3K received 347 paper submissions from 59 countries, which demonstrates the success and global dimension of this conference. From these, 44 papers were published as full papers, 88 were accepted for short presentation and another 60 for poster presentation. These numbers, leading to a “full-paper” acceptance ratio of about 13% and an oral paper acceptance ratio close to 38%, show the intention of preserving a high quality forum for the next editions of this conference. The high quality of the papers received imposed difficult choices in the review process. To evaluate each submission, a double blind paper review was performed by the Program Committee, whose members are highly qualified independent researchers in KDIR topic areas. Moreover, the conferences also featured a number of keynote lectures delivered by internationally well known experts, namely Daniel O'Leary (University of Southern California, United States), Sophia Ananiadou (University of Manchester, United Kingdom), Alfred Inselberg (Tel Aviv University, Israel), Peter F. Patel-Schneider (United States) and Florian Michahelles (ETH Zürich, Switzerland), thus contributing to increase the overall quality of the conferences and to provide a deeper understanding of the conferences interest fields. Workshops provide interactive forae that allow for a more in-depth discussion of particular areas within the scope of the conference. We would like to thank the workshop chairs for their collaboration in providing this added-value satellite event of IC3K 2012 namely: 3rd International Workshop on Software Knowledge – SKY (chaired by Iaakov Exman, Juan Llorens and Anabel Fraga). The meeting was also complemented with two special sessions, the Special Session on Text Mining – SSTM (chaired by Ana Fred) and the Special Session on Information Filtering and Retrieval: Novel Distributed Systems and Applications – DART (chaired by Cristian Lai, Giovanni Semeraro and Alessandro Giuliani). To recognize the best submissions and the best student contributions, awards based on the best combined marks of paper reviewing, as assessed by the Program Committee, and the quality of the presentation, as assessed by session chairs at the conference venue, were conferred at the closing session of the conference. We trust that you will find these proceedings interesting and to be a helpful reference in the future. All presented papers will be submitted for indexation by Thomson Reuters Conference Proceedings Citation Index (ISI), INSPEC, DBLP and EI (Elsevier Index), as well as being made available at the SciTePress Digital Library. Additionally, a short list of presented papers will be selected to be expanded into a forthcoming book of IC3K Selected Papers to be published by Springer Verlag. Building an interesting and successful program for the conference required the dedicated effort of many people. We would like to express our thanks, first of all, to all authors including those whose papers were not included in the program. We would also like to express our gratitude to all members of the Program Committee and auxiliary reviewers, who helped us with their expertise and valuable time. Furthermore, we thank the invited speakers for their invaluable contribution and for taking the time to synthesize and prepare their talks. Moreover, we thank the workshop and special session chairs whose contribution to the diversity of the program was decisive. Finally, we gratefully acknowledge the professional support of the INSTICC team for all organizational processes.
Vol. 1 - 978-989-8565-29-7