Document Similarity Measure Based on Topic Model

Medientyp: E-Artikel
Titel: Document Similarity Measure Based on Topic Model
Beteiligte: He, Ming; Wang, Zhen Zhen; Du, Yong Ping
Erschienen: Trans Tech Publications, Ltd., 2014
Erschienen in: Applied Mechanics and Materials
Sprache: Nicht zu entscheiden
DOI: 10.4028/www.scientific.net/amm.513-517.1280
ISSN: 1662-7482
Schlagwörter: General Engineering
Entstehung:
Anmerkungen:
Beschreibung: <jats:p>Document similarity computation is an exciting research topic in information retrieval (IR) and it is a key issue for automatic document categorization, clustering analysis, fuzzy query and question answering. Topic model is an emerging field in natural language processing (NLP), IR and machine learning (ML). In this paper, we apply a latent Dirichlet allocation (LDA) topic model-based method to compute similarity between documents. By mapping a document with term space representation into a topic space, a distribution over topics derived for computing document similarity. An empirical study using real data set demonstrates the efficiency of our method.</jats:p>

Nur in Feld suchen: