Re: [R] string edit distance

From: Tobias Verbeke <tobias.verbeke_at_gmail.com>
Date: Sat 07 Apr 2007 - 20:14:15 GMT

Thomas Hills wrote:
> I have a column of words, for example
>
> "DOG"
> "DOOG"
> "GOD"
> "GOOD"
> "DOOR"
> ...
>
> and I am interested in creating a matrix that contains the string
> edit distances between each pair of words. I am this close -> ' '
> <- to writing the algorithm myself (which will allow for different
> variations on the string edit rules, indels, plus or minus
> transpositions, and possibly some variations on that), but I figured
> I'd see if anyone on the list has any experience with this and might
> already have some shoulders for me to stand on.
>
See

http://wiki.r-project.org/rwiki/doku.php?id=tips:data-strings:levenshtein

for some R code which might be useful.
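
In case it helps as a starting point, here is a minimal sketch of the usual
dynamic-programming approach in base R. It is not the code from the wiki page
above. The function name levenshtein() and the cost arguments ins, del and sub
are my own choices for illustration, and transpositions are not handled, so
that variation would still have to be added by hand.

```r
# Edit distance between two single strings, allowing insertions,
# deletions and substitutions. The costs are illustrative parameters
# (unit cost by default), not part of any existing API.
levenshtein <- function(a, b, ins = 1, del = 1, sub = 1) {
  x <- strsplit(a, "")[[1]]
  y <- strsplit(b, "")[[1]]
  n <- length(x)
  m <- length(y)

  # d[i + 1, j + 1] holds the distance between the first i characters
  # of a and the first j characters of b.
  d <- matrix(0, n + 1, m + 1)
  d[, 1] <- (0:n) * del
  d[1, ] <- (0:m) * ins

  for (i in seq_len(n)) {
    for (j in seq_len(m)) {
      cost <- if (x[i] == y[j]) 0 else sub
      d[i + 1, j + 1] <- min(d[i, j + 1] + del,   # delete x[i]
                             d[i + 1, j] + ins,   # insert y[j]
                             d[i, j] + cost)      # match or substitute
    }
  }
  d[n + 1, m + 1]
}

words <- c("DOG", "DOOG", "GOD", "GOOD", "DOOR")

# Apply the scalar function to every pair of words.
dist_mat <- outer(words, words, Vectorize(levenshtein))
dimnames(dist_mat) <- list(words, words)
dist_mat
```

Changing the rules then amounts to changing the three costs or adding a
further term inside min(). Note also that recent versions of R provide
adist() in the utils package, which returns such a matrix directly for the
plain (generalized Levenshtein) case.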

HTH,
Tobias

-- 

Tobias Verbeke - Consultant
Business & Decision Benelux
Rue de la révolution 8
1000 Brussels - BELGIUM

+32 499 36 33 15
tobias.verbeke@businessdecision.com

______________________________________________
R-help@stat.math.ethz.ch mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
Received on Sun Apr 08 06:19:17 2007