![]() |
CiteULike | ![]() |
yoavg's CiteULike | ![]() |
![]() |
|
![]() |
Register | ![]() |
Log in | ![]() |
Impact of imperfect OCR on part-of-speech taggingby: Xiaofan Lin
|
Reviews
[Write a review of this article]
Notes for this articleperformance drops linearly with character error-rate
Find related articles from these CiteULike users
Find related articles with these CiteULike tags
Posting History
AbstractPart-of-speech (POS) tagging is the foundation of natural language processing (NLP) systems, and thus has been an active area of research for many years. However, one question remains unanswered: How will a POS tagger behave when the input text is not error-free? This issue can be of great importance when the text comes from imperfect sources like Optical Character Recognition (OCR). This paper analyzes the performance of both individual POS taggers and combination systems on imperfect text. ...
BibTeX record
RIS record