![]() |
CiteULike | ![]() |
fmccown's CiteULike | ![]() |
![]() |
|
![]() |
Register | ![]() |
Log in | ![]() |
The Invisible Web: Uncovering Information Sources Search Engines Can't Seeby: Gary Price, Chris Sherman
|
|
Reviews
[Write a review of this article]
Notes for this articlep. 61 lists types of invisible web content: 1) disconnect pages 2) Page consisting primarily of images, audio, or video 3) Pages consisting primarily of PDF, PS, Flash, exe, compressed files (tar, zip) 4) Content in relational databases 5) Real-time content 6) Dynamically generated content
Of course, this book is somewhat outdated. Google has crawled PDF and PS for some time and now crawls Flash.
Find related articles from these CiteULike users
Find related articles with these CiteULike tags
Posting History
AbstractEnormous expanses of the Internet are unreachable with standard Web search engines. This book provides the key to finding these hidden resources by identifying how to uncover and use invisible Web resources. Mapping the invisible Web, when and how to use it, assessing the validity of the information, and the future of Web searching are topics covered in detail. Only 16 percent of Net-based information can be located using a general search engine. The other 84 percent is what is referred to as the invisible Web-made up of information stored in databases. Unlike pages on the visible Web, information in databases is generally inaccessible to the software spiders and crawlers that compile search engine indexes. As Web technology improves, more and more information is being stored in databases that feed into dynamically generated Web pages. The tips provided in this resource will ensure that those databases are exposed and Net-based research will be conducted in the most thorough and effective manner.
BibTeX record
RIS record