CiteULike is a free online bibliography manager. Register and you can start organising your references online.

Exploiting web search engines to search structured databases Export

In WWW '09: Proceedings of the 18th international conference on World wide web (2009), pp. 501-510.

Citation Format

[Posts]

View FullText article


X Reviews [Write a review of this article]

X Notes for this article

ChaTo has 0 private notes and 1 public note for this article.

[Talk] Structured information is interesting for users, as many queries are about entities; however sometimes the info in the database is not enough for a free text query.

Complement database search using web info from web pages.

Identify and aggregate entities that are in close proximity of ocurrences of a query on web pages. These pages are the same ones returned by the search engine for the query, so there is no extra call to the web search backend.

Scoring: aggregates using factors such as proximity or document importance.

ChaTo (public note) - 2009-04-24 16:30:44

X Find related articles from these CiteULike users

X Find related articles with these CiteULike tags

X Posting History

X Abstract

Web search engines often federate many user queries to relevant structured databases. For example, a product related query might be federated to a product database containing their descriptions and specifications. The relevant structured data items are then returned to the user along with web search results. However, each structured database is searched in isolation. Hence, the search often produces empty or incomplete results as the database may not contain the required information to answer the query. In this paper, we propose a novel integrated search architecture. We establish and exploit the relationships between web search results and the items in structured databases to identify the relevant structured data items for a much wider range of queries.Our architecture leverages existing search engine components to implement this functionality at very low overhead. We demonstrate the quality and efficiency of our techniques through an extensive experimental study.


X BibTeX record

X RIS record


Privacy Statement | Terms & Conditions
CiteULike organises scholarly (or academic) papers or literature and provides bibliographic (which means it makes bibliographies) for universities and higher education establishments. It helps undergraduates and postgraduates. People studying for PhDs or in postdoctoral (postdoc) positions. The service is similar in scope to EndNote or RefWorks or any other reference manager like BibTeX, but it is a social bookmarking service for scientists and humanities researchers.