
Predicting Oral Reading Miscues

In ICSLP (2002)



mote's tags for this article

asr call child cmu good language levenshtein_distance listen pedagogy sla


Notes for this article

mote has 0 private notes and 2 public notes for this article.

System to generate an n-best list of likely child reading mispronunciations.

Compares two different methods ("rote" and "extrapolative").

Training data: Colorado DB (Olson et al.) of 112k transcribed child miscues (vocabulary of 881 distinct words).

The rote method selects miscues that have actually occurred in the past (avg. 34.2 per word, pared down to 7.4 by keeping only miscues that more than one student made).
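As a rough illustration of the paring-down step in the rote method, here is a minimal Python sketch. It assumes the miscue data can be read as (student, target word, miscue) triples; that record format, the function name `rote_candidates`, and the `min_students` parameter are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def rote_candidates(observations, min_students=2):
    """Group observed miscues by target word, keeping only shared ones.

    observations: iterable of (student_id, target_word, miscue) triples.
    This record format is an assumption made for illustration.
    """
    # Collect the set of distinct students who produced each miscue.
    students = defaultdict(set)
    for student_id, target, miscue in observations:
        students[(target, miscue)].add(student_id)

    # Keep a miscue only if enough distinct students made it.
    candidates = defaultdict(list)
    for (target, miscue), who in students.items():
        if len(who) >= min_students:
            candidates[target].append(miscue)

    return {target: sorted(miscues) for target, miscues in candidates.items()}
```

With the default `min_students=2`, a miscue that a single student repeats many times is still dropped, which matches the "more than one student" wording above.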

The extrapolative method uses machine learning to find other miscue words that are similar in pronunciation to the target word. It uses a number of features, primarily a modified edit distance on pronunciation (more on that in another note).

Results: as would be expected, rote works well for very common words (lots of data) and extrapolative works well for uncommon words.

One big shortcoming is that miscues in the extrapolative method seem to be restricted to actual words in the dictionary. The miscues were also limited to words that start with the same phone as the target. That last restriction seems like a mistake: when they compare the utility of different features for training the extrapolative model, the "same-first-phoneme" feature, while useful, scores rather low.

mote (public note) - 2006-03-19 02:57:14

They use a neat modified Levenshtein (edit) distance:
- 0-2 point penalty for substituting similar phones
- 5 point penalty for substituting non-similar phonemes
- unspecified penalty for insertion/deletion
All of this is normalized by the number of phones in the target word.

mote (public note) - 2006-03-19 03:05:25
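The distance described in the note above can be sketched in Python as follows. This is an illustration under stated assumptions, not the paper's implementation: the insertion/deletion penalty is unspecified in the note, so `indel_cost=3.0` is an arbitrary placeholder, and the `similar` table of similar-phone penalties (values in the 0-2 range) is hypothetical and must be supplied by the caller.

```python
def phone_edit_distance(target, candidate, similar, indel_cost=3.0):
    """Weighted edit distance between two phone sequences.

    target, candidate: lists of phone symbols.
    similar: dict mapping frozenset({a, b}) to a penalty in the 0-2 range
             for phone pairs considered similar (hypothetical table).
    indel_cost: insertion/deletion penalty (assumed; not given in the note).
    """
    if not target:
        raise ValueError("target must contain at least one phone")

    def sub_cost(a, b):
        if a == b:
            return 0.0
        # Similar pairs get their table penalty; all other pairs cost 5.
        return similar.get(frozenset((a, b)), 5.0)

    n, m = len(target), len(candidate)
    # prev[j] holds the cost of aligning the first i-1 target phones
    # with the first j candidate phones.
    prev = [j * indel_cost for j in range(m + 1)]
    for i in range(1, n + 1):
        cur = [i * indel_cost] + [0.0] * m
        for j in range(1, m + 1):
            cur[j] = min(
                prev[j] + indel_cost,        # delete a target phone
                cur[j - 1] + indel_cost,     # insert a candidate phone
                prev[j - 1] + sub_cost(target[i - 1], candidate[j - 1]),
            )
        prev = cur

    # Normalize by the number of phones in the target word.
    return prev[m] / n
```

Candidate miscues for a target word could then be ranked by ascending distance, although in the paper this distance is only one feature among several.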
