Name This! Automating Metadata Extraction through a Named Entity Recognition Tool

DLF Spring 2009 Forum (5 May 2009)

AlisonBabeu's tags for this article

authority_control automatic_metadata_generation named_entities--historical named-entity-recognition

Abstract

The Extracting Metadata for Preservation (EMP) Project, funded by the National Digital Information Infrastructure and Preservation (NDIIPP) Program, addresses the ongoing challenge of identifying proper names to improve authority control in metadata creation and extraction, as well as accuracy in end-user information access via web-based search and retrieval. As a collaboration among the University of Illinois at Urbana-Champaign, OCLC, and the University of Maryland, EMP researchers bring multidisciplinary perspectives from the library, computer science, and linguistics communities to the problem of high-quality identification and disambiguation of names. This presentation reports on three activities. First, we describe an open-source name extractor tool developed by computational linguists at Illinois, configured with a plug-in interface that lowers barriers to access to state-of-the-art research tools. Second, we demonstrate the use of this tool by integrating it into two applications developed at the collaborating institutions: summary views of FRBR-ized MARC records hosted at OCLC and metadata generated by CLiMB (Computational Linguistics for Metadata Building) at Maryland. Finally, we describe the results of an evaluation that compares the output of EMP with previously available solutions. This research will be of interest to those who develop search interfaces, metadata creation tools, institutional repositories, and applications requiring names management.

