Representing text chunks

(1999)

Abstract

Dividing sentences into chunks of words is a useful preprocessing step for parsing, information extraction and information retrieval. Ramshaw and Marcus (1995) introduced a "convenient" data representation for chunking by converting it to a tagging task. In this paper we will examine seven different data representations for the problem of recognizing noun phrase chunks. We will show that the data representation choice has a minor influence on chunking performance. However, equipped with ...
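
To make the "chunking as tagging" idea concrete, here is a minimal sketch of two tag encodings for noun phrase chunks. The abstract does not list the seven representations, so the IOB1/IOB2 names and rules below follow the convention commonly used in the chunking literature and are an assumption here; the function names and the example sentence fragment are hypothetical.

```python
def spans_to_iob2(n_tokens, spans):
    """Encode chunk spans as IOB2 tags (assumed convention).

    spans: non-overlapping (start, end) token index pairs, end exclusive.
    B marks the first word of every chunk, I the remaining chunk words,
    O every word outside a chunk.
    """
    tags = ["O"] * n_tokens
    for start, end in spans:
        tags[start] = "B"
        for i in range(start + 1, end):
            tags[i] = "I"
    return tags


def iob2_to_iob1(tags):
    """Convert IOB2 tags to IOB1 tags (assumed convention).

    In IOB1, B is kept only when a chunk starts immediately after
    another chunk; every other chunk-initial word is tagged I.
    """
    out = []
    for i, tag in enumerate(tags):
        if tag == "B" and (i == 0 or tags[i - 1] == "O"):
            out.append("I")
        else:
            out.append(tag)
    return out


# Hypothetical example with three noun phrase chunks:
# In [early trading] in [Hong Kong] [Monday]
tokens = ["In", "early", "trading", "in", "Hong", "Kong", "Monday"]
spans = [(1, 3), (4, 6), (6, 7)]

iob2 = spans_to_iob2(len(tokens), spans)
iob1 = iob2_to_iob1(iob2)
for token, t2, t1 in zip(tokens, iob2, iob1):
    print(f"{token}\t{t2}\t{t1}")
```

The two encodings describe the same chunks and differ only in when B is emitted, which is the kind of representational choice the paper compares.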

This article has been bookmarked 3 times, initially on 2006-06-25.

2007-02-25 User yoavg , 1 note

Compares various tag representations for chunking (and concludes they are all the same??)

2007-02-25 23:40:52
Group NLP
2006-06-25 User scis0000001