![]() |
CiteULike | ![]() |
tulaydemir's CiteULike | ![]() |
![]() |
|
![]() |
Register | ![]() |
Log in | ![]() |
SQL Queries Over Unstructured Text DatabasesData Engineering, 2007. ICDE 2007. IEEE 23rd International Conference on In Data Engineering, 2007. ICDE 2007. IEEE 23rd International Conference on (2007), pp. 1255-1257.
|
Reviews
[Write a review of this article]
Notes for this article
Find related articles from these CiteULike users
Find related articles with these CiteULike tags
Posting History
AbstractText documents often embed data that is structured in nature. By processing a text database with information extraction systems, we can define a variety of structured "relations" over which we can then issue SQL queries. Processing SQL queries in this text-based scenario presents multiple challenges. One key challenge is efficiency: information extraction is a time-consuming process, so query processing strategies should pick efficient extraction systems whenever possible, and also minimize the number of documents that they process. Another key challenge is result quality: extraction systems might output erroneous information or miss information that they should capture; also, efficiency-related query processing decisions (e.g., to avoid processing large numbers of useless documents) may compromise result completeness. To address these challenges, we characterize SQL query processing strategies in terms of their efficiency and result quality, and discuss the (user-specific) tradeoff between these two properties.
BibTeX record
RIS record