Text pattern visualization for analysis of biology full text and captions

Bioinformatics Conference, 2003. CSB 2003. Proceedings of the 2003 IEEE (2003), pp. 648-651.

biomedical-nlp's tags for this article

fulltext-use

Abstract

Large textbanks comprised of thousands of full-text biology papers are rapidly becoming available. We describe an approach to characterize all major language patterns in biology text in terms of frameworks. Frameworks are "containers" made up of common phrases surrounding specific informational items such as gene and protein names. A framework viewer has been developed that shows similar text frameworks aligned on the screen much as biosequence visualization tools do. Using the viewer, it is evident that frameworks have the power to find the types of structures needed to develop useful information retrieval systems. As a simple example, one framework was able to concisely select 45,000 nouns from a corpus of 5 million words without error. This work points the way to highly automated systems that will be able to extract and index information in biology textbanks. Work in progress includes extensions to characterize recursive structures in text, subsystems to retrieve figures in papers, and the discovery of semantic relations to aid concept-based retrieval.
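The abstract describes frameworks only in general terms, so the following is a minimal sketch of the idea and not the authors' implementation. It assumes a framework can be approximated as a fixed left phrase and a fixed right phrase around a one-token slot, and it prints the matches in aligned columns, loosely in the spirit of a sequence-alignment view. The function names `find_framework` and `align`, the regular expression, and the sample sentences are all invented for illustration.

```python
import re


def find_framework(text, left, right):
    """Return the slot fillers found between the phrases `left` and `right`.

    Assumption: the slot holds a single token (letters, digits, hyphens).
    """
    pattern = re.compile(
        r"\b" + re.escape(left) + r"\s+(\w[\w-]*)\s+" + re.escape(right) + r"\b",
        flags=re.IGNORECASE,
    )
    return [m.group(1) for m in pattern.finditer(text)]


def align(fillers, left, right):
    """Format each match so the slot fillers line up in one column."""
    width = max((len(f) for f in fillers), default=0)
    return [f"{left} | {f.ljust(width)} | {right}" for f in fillers]


# Toy sentences written for this example; they are not from the paper's corpus.
sample = (
    "Expression of the p53 gene was reduced. "
    "Mutation of the BRCA1 gene was detected. "
    "We cloned the actin gene from yeast."
)

fillers = find_framework(sample, "the", "gene")
rows = align(fillers, "the", "gene")
for row in rows:
    print(row)
```

With the fixed phrases held constant, the middle column collects the candidate informational items (here, gene names), which is the sense in which a framework acts as a "container".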

