reCAPTCHA: Human-Based Character Recognition via Web Security Measures
Abstract

CAPTCHAs (Completely Automated Public Turing test to tell Computers and Humans Apart) are widespread security measures in the World Wide Web that prevent automated programs from abusing online services. They do so by asking humans to perform a task that computers cannot yet perform, such as deciphering distorted characters. Our research explored whether such human effort can be channeled into a useful purpose: helping to digitize old printed material by asking users to decipher scanned words from books that computerized optical character recognition (OCR) failed to recognize. We showed that this method can transcribe text with word accuracy over 99%, matching the guarantee of professional human transcribers. Our apparatus is deployed in over 40,000 Web sites and has transcribed over 440 million words.

DOI: 10.1126/science.1160379