loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Manuel Álvarez ; Fidel Cacheda ; Rafael López-García and Víctor M. Prieto

Affiliation: University of A Coruña, Spain

Keyword(s): Information retrieval, Hidden Web, Spanish Web.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Business Analytics ; Data Analytics ; Data Engineering ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Symbolic Systems

Abstract: This article submits a study about the web sites of the “.es” domains which focuses on the level of use of the technologies that hinder the traversal of the Web to the crawling systems. The study is centred on HTML scripts and forms, since they are two well-known entry points to the “Hidden Web”. For the case of scripts, it pays special attention to redirection and dynamic construction of URLs. The article concludes that a crawler should process those technologies in order to obtain most of the documents of the Web.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 44.204.204.14

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Álvarez, M.; Cacheda, F.; López-García, R. and M. Prieto, V. (2011). THE SPANISH WEB IN NUMBERS - Main Features of the Spanish Hidden Web. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2011) - KDIR; ISBN 978-989-8425-79-9; ISSN 2184-3228, SciTePress, pages 363-366. DOI: 10.5220/0003626603710374

@conference{kdir11,
author={Manuel Álvarez. and Fidel Cacheda. and Rafael López{-}García. and Víctor {M. Prieto}.},
title={THE SPANISH WEB IN NUMBERS - Main Features of the Spanish Hidden Web},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2011) - KDIR},
year={2011},
pages={363-366},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003626603710374},
isbn={978-989-8425-79-9},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval (IC3K 2011) - KDIR
TI - THE SPANISH WEB IN NUMBERS - Main Features of the Spanish Hidden Web
SN - 978-989-8425-79-9
IS - 2184-3228
AU - Álvarez, M.
AU - Cacheda, F.
AU - López-García, R.
AU - M. Prieto, V.
PY - 2011
SP - 363
EP - 366
DO - 10.5220/0003626603710374
PB - SciTePress