
Multiclass composite N-gram language model based on connection direction

Systems and Computers in Japan, Vol. 34, No. 7. (2003), pp. 108-114.




Abstract

The authors propose a method to generate a compact, highly reliable language model for speech recognition based on the efficient classification of words. In this method, the connectedness with the words immediately before and after the word is taken to represent separate attributes, and individual classification is performed for each word. The resulting composite word class is created separately based on the distribution of words connected before or after. As a result, classification of classes is efficient and reliable. In a multiclass composite N-gram, which uses the proposed method for the variable-order N-gram to bring in chain words, the entry size is reduced to one-tenth, and the word recognition rate is higher than that of a conventional composite N-gram for particles or variable-length word arrays. © 2003 Wiley Periodicals, Inc. Syst Comp Jpn, 34(7): 108-114, 2003; Published online in Wiley InterScience. DOI 10.1002/scj.1210
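
The idea of giving each word two separate class attributes, one for its connection to the preceding word and one for its connection to the following word, can be sketched for the bigram case as below. This is a minimal illustration, not the authors' implementation. It assumes the factorization P(w_i | w_{i-1}) ≈ P(C_t(w_i) | C_f(w_{i-1})) · P(w_i | C_t(w_i)), which the abstract does not state. The names `to_class`, `from_class`, and `train_multiclass_bigram` are hypothetical, the class maps are written by hand rather than learned by clustering, and no smoothing or variable-order composite step is included.

```python
from collections import Counter


def train_multiclass_bigram(sentences, to_class, from_class):
    """Build a bigram scorer that uses two class maps per word (assumed setup).

    to_class[w]   : class of w as the predicted word (its backward connection).
    from_class[w] : class of w as the history word (its forward connection).
    """
    transition = Counter()   # (from-class of previous word, to-class of current word)
    history = Counter()      # from-class of previous word
    emission = Counter()     # (to-class, word)
    class_total = Counter()  # to-class

    for sent in sentences:
        for w in sent:
            emission[(to_class[w], w)] += 1
            class_total[to_class[w]] += 1
        for prev, cur in zip(sent, sent[1:]):
            transition[(from_class[prev], to_class[cur])] += 1
            history[from_class[prev]] += 1

    def prob(cur, prev):
        """Unsmoothed estimate of P(cur | prev) under the assumed factorization."""
        cf, ct = from_class[prev], to_class[cur]
        if history[cf] == 0 or class_total[ct] == 0:
            return 0.0
        p_class = transition[(cf, ct)] / history[cf]
        p_word = emission[(ct, cur)] / class_total[ct]
        return p_class * p_word

    return prob


# Toy data, invented for illustration only.
sentences = [["a", "cat", "runs"], ["the", "dog", "runs"], ["a", "dog", "sleeps"]]
to_class = {"a": "T_det", "the": "T_det", "cat": "T_noun",
            "dog": "T_noun", "runs": "T_verb", "sleeps": "T_verb"}
from_class = {"a": "F_det", "the": "F_det", "cat": "F_noun",
              "dog": "F_noun", "runs": "F_verb", "sleeps": "F_verb"}

prob = train_multiclass_bigram(sentences, to_class, from_class)
print(prob("cat", "the"))  # the word pair "the cat" never occurs in the toy data
```

Because the counts are stored per class pair rather than per word pair, the table stays small and an unseen word pair such as "the cat" still receives a nonzero estimate; this is the kind of compactness the abstract refers to, although the reported one-tenth reduction applies to the authors' full model, not to this sketch.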

