@mstrohm

SimFusion: measuring similarity using unified relationship matrix

SIGIR '05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 130--137. New York, NY, USA, ACM, (2005)
DOI: http://doi.acm.org/10.1145/1076034.1076059

Abstract

In this paper we use a Unified Relationship Matrix (URM) to represent a set of heterogeneous data objects (e.g., web pages, queries) and their interrelationships (e.g., hyperlinks, user click-through sequences). We claim that iterative computations over the URM can help overcome the data sparseness problem and detect latent relationships among heterogeneous data objects, thus, can improve the quality of information applications that require combination of information from heterogeneous sources. To support our claim, we present a unified similarity-calculating algorithm, SimFusion. By iteratively computing over the URM, SimFusion can effectively integrate relationships from heterogeneous sources when measuring the similarity of two data objects. Experiments based on a web search engine query log and a web page collection demonstrate that SimFusion can improve similarity measurement of web objects over both traditional content based algorithms and the cutting edge SimRank algorithm.
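
The idea of iterating over a single matrix that holds all object relationships can be illustrated with a small sketch. The abstract does not state the update rule, so the code below assumes one plausible form: start from the identity similarity matrix and repeatedly apply S <- L S L^T, where L is a row-normalized relationship matrix. The toy data (two queries, two pages), the function names, and the normalization are all hypothetical and are not taken from the paper.

```python
# Illustrative sketch only: the update rule S <- L * S * L^T and the
# row normalization are assumptions, not details given in the abstract.

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

def transpose(m):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*m)]

def row_normalize(m):
    """Scale each non-zero row so that its entries sum to 1."""
    out = []
    for row in m:
        total = sum(row)
        out.append([x / total for x in row] if total else list(row))
    return out

def iterate_similarity(urm, steps=3):
    """Start from the identity matrix and apply S <- L * S * L^T `steps` times."""
    n = len(urm)
    sim = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    urm_t = transpose(urm)
    for _ in range(steps):
        sim = matmul(matmul(urm, sim), urm_t)
    return sim

# Hypothetical toy data. Object order: query q0, query q1, page p0, page p1.
# q0 clicked p0; q1 clicked p0 and p1; each object is also related to itself.
raw = [
    [1, 0, 1, 0],
    [0, 1, 1, 1],
    [1, 1, 1, 0],
    [0, 1, 0, 1],
]

similarity = iterate_similarity(row_normalize(raw), steps=2)
```

In this toy setup the two queries share no direct relationship, yet `similarity[0][1]` becomes positive after the first iteration because both queries are linked to page p0. This is the kind of latent relationship the abstract refers to.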

Description

on URM (Unified Relationship Matrix)

Links and resources

Tags

Community

  • @chato
  • @francesco.k
  • @dblp
  • @mstrohm