CiteULike is a free online bibliography manager. Register and you can start organising your references online.

The string edit distance matching problem with moves Export

ACM Trans. Algorithms, Vol. 3, No. 1. (2007)

Citation Format

[Posts]

View FullText article


markusd's tags for this article

dynamic-programming edit-distance

X Reviews [Write a review of this article]

X Find related articles from these CiteULike users

X Find related articles with these CiteULike tags

X Posting History

X Abstract

The edit distance between two strings S and R is defined to be the minimum number of character inserts, deletes, and changes needed to convert R to S . Given a text string t of length n , and a pattern string p of length m , informally, the string edit distance matching problem is to compute the smallest edit distance between p and substrings of t .We relax the problem so that: (a) we allow an additional operation, namely, substring moves ; and (b) we allow approximation of this string edit distance. Our result is a near-linear time deterministic algorithm to produce a factor of O (log n log* n ) approximation to the string edit distance with moves. This is the first known significantly subquadratic algorithm for a string edit distance problem in which the distance involves nontrivial alignments. Our results are obtained by embedding strings into L 1 vector space using a simplified parsing technique, which we call edit-sensitive parsing (ESP).


X BibTeX record

X RIS record


Privacy Statement | Terms & Conditions
CiteULike organises scholarly (or academic) papers or literature and provides bibliographic (which means it makes bibliographies) for universities and higher education establishments. It helps undergraduates and postgraduates. People studying for PhDs or in postdoctoral (postdoc) positions. The service is similar in scope to EndNote or RefWorks or any other reference manager like BibTeX, but it is a social bookmarking service for scientists and humanities researchers.