Evaluating entity resolution results

Cited 0 time in webofscience Cited 53 time in scopus
  • Hit : 142
  • Download : 0
Entity Resolution (ER) is the process of identifying groups of records that refer to the same real-world entity. Various measures (e.g., pairwise F1, cluster F1) have been used for evaluating ER results. However, ER measures tend to be chosen in an ad-hoc fashion without careful thought as to what defines a good result for the specific application at hand. In this paper, our contributions are twofold. First, we conduct an analysis on existing ER measures, showing that they can often con ict with each other by ranking the results of ER algorithms differently. Second, we explore a new distance measure for ER (called “generalized merge distance” or GMD)
Publisher
The VLDB Endowment
Issue Date
2010-09
Language
English
Citation

36th International Conference on Very Large Data Bases, VLDB 2010, pp.208 - 219

ISSN
2150-8097
DOI
10.14778/1920841.1920871
URI
http://hdl.handle.net/10203/260221
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0