A comparison of string distance metrics for name-matching tasks (2003)
Abstract
Using an open-source, Java toolkit of name-matching methods, we experimentally compare string distance metrics on the task of matching entity names. We investigate a number of different metrics proposed by different communities, including edit-distance metrics, fast heuristic string comparators, token-based distance metrics, and hybrid methods. Overall, the best-performing method is a hybrid scheme combining a TFIDF weighting scheme, which is widely used in information retrieval, with ...
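To make two of the metric families named in the abstract concrete, here is a minimal Python sketch of an edit-distance metric (Levenshtein) and a token-based metric (cosine similarity over TF-IDF weighted tokens). It is an illustration only and is not taken from the paper's Java toolkit. The function names, the whitespace tokenizer, and the smoothed IDF formula are all assumptions, and the hybrid scheme the abstract refers to is not reproduced here.

```python
import math
from collections import Counter


def levenshtein(a, b):
    """Edit distance: minimum number of single-character insertions,
    deletions, and substitutions needed to turn a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def tfidf_cosine(a, b, corpus):
    """Cosine similarity between TF-IDF token vectors of a and b.
    Document frequencies are estimated from corpus, a list of names.
    Tokenization (lowercase, split on whitespace) is an assumption."""
    docs = [set(name.lower().split()) for name in corpus]
    n = len(docs)

    def idf(token):
        df = sum(1 for d in docs if token in d)
        # Smoothed IDF (hypothetical choice) so unseen tokens get a finite weight.
        return math.log((n + 1) / (df + 1)) + 1.0

    def vector(s):
        tf = Counter(s.lower().split())
        return {t: count * idf(t) for t, count in tf.items()}

    va, vb = vector(a), vector(b)
    dot = sum(w * vb.get(t, 0.0) for t, w in va.items())
    norm_a = math.sqrt(sum(w * w for w in va.values()))
    norm_b = math.sqrt(sum(w * w for w in vb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0
```

The two families behave differently on reordered names: `levenshtein("William Cohen", "Cohen William")` is well above zero because many characters must change, while `tfidf_cosine` on the same pair is close to 1 because the token sets are identical.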