copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Investigating text power in predicting semantic similarity

Z. Yousefi, H. Sotudeh, M. Mirzabeigi, S. Fakhrahmad, A. Nikseresht, and M. Mohammadi. International Journal of Information Science and Management (IJISM), 17 (1): 17 (January 2019)

Abstract

This article presents an empirical evaluation to investigate the distributional semantic power of abstract, body and full-text, as different text levels, in predicting the semantic similarity using a collection of open access articles from PubMed. The semantic similarity is measured based on two criteria namely, linear MeSH terms intersection and hierarchical MeSH terms distance. As such, a random sample of 200 queries and 20000 documents are selected from a test collection built on CITREC open source code. Sim Pack Java Library is used to calculate the textual and semantic similarities. The nDCG value corresponding to two of the semantic similarity criteria is calculated at three precision points. Finally, the nDCG values are compared by using the Friedman test to determine the power of each text level in predicting the semantic similarity. The results showed the effectiveness of the text in representing the semantic similarity in such a way that texts with maximum textual similarity are also shown to be 77\% and 67\% semantically similar in terms of linear and hierarchical criteria, respectively. Furthermore, the text length is found to be more effective in representing the hierarchical semantic compared to the linear one. Based on the findings, it is concluded that when the subjects are homogenous in the tree of knowledge, abstracts provide effective semantic capabilities, while in heterogeneous milieus, full-texts processing or knowledge bases is needed to acquire IR effectiveness.

Links and resources

BibTeX key: yousefi_investigating_2019
entry type: article
year: 2019
month: jan
journal: International Journal of Information Science and Management (IJISM)
number: 1
pages: 17
volume: 17
copyright: Copyright (c) 2019 International Journal of Information Science and Management (IJISM)
language: en
file: Full Text PDF:/Users/le/Zotero/storage/TWMD42NV/Yousefi et al. - 2019 - Investigating text power in predicting semantic si.pdf:application/pdf;Snapshot:/Users/le/Zotero/storage/W5NQ6PG4/1297.html:text/html
issn: 2008-8310
urldate: 2019-02-07
url: https://ijism.ricest.ac.ir/index.php/ijism/article/view/1297

Cite this publication

@article{yousefi_investigating_2019, abstract = {This article presents an empirical evaluation to investigate the distributional semantic power of abstract, body and full-text, as different text levels, in predicting the semantic similarity using a collection of open access articles from PubMed. The semantic similarity is measured based on two criteria namely, linear MeSH terms intersection and hierarchical MeSH terms distance. As such, a random sample of 200 queries and 20000 documents are selected from a test collection built on CITREC open source code. Sim Pack Java Library is used to calculate the textual and semantic similarities. The nDCG value corresponding to two of the semantic similarity criteria is calculated at three precision points. Finally, the nDCG values are compared by using the Friedman test to determine the power of each text level in predicting the semantic similarity. The results showed the effectiveness of the text in representing the semantic similarity in such a way that texts with maximum textual similarity are also shown to be 77\% and 67\% semantically similar in terms of linear and hierarchical criteria, respectively. Furthermore, the text length is found to be more effective in representing the hierarchical semantic compared to the linear one. Based on the findings, it is concluded that when the subjects are homogenous in the tree of knowledge, abstracts provide effective semantic capabilities, while in heterogeneous milieus, full-texts processing or knowledge bases is needed to acquire IR effectiveness.}, added-at = {2019-02-22T00:55:04.000+0100}, author = {Yousefi, Zahra and Sotudeh, Hajar and Mirzabeigi, Mahdieh and Fakhrahmad, Seyed Mostafa and Nikseresht, Alireza and Mohammadi, Mehdi}, biburl = {https://www.bibsonomy.org/bibtex/2ff15bc993ab0fdae2bac4089b7c07bc5/lepsky}, copyright = {Copyright (c) 2019 International Journal of Information Science and Management (IJISM)}, file = {Full Text PDF:/Users/le/Zotero/storage/TWMD42NV/Yousefi et al. - 2019 - Investigating text power in predicting semantic si.pdf:application/pdf;Snapshot:/Users/le/Zotero/storage/W5NQ6PG4/1297.html:text/html}, interhash = {9047f9bbc126ca0878984494b3c22da3}, intrahash = {ff15bc993ab0fdae2bac4089b7c07bc5}, issn = {2008-8310}, journal = {International Journal of Information Science and Management (IJISM)}, keywords = {semantik}, language = {en}, month = jan, number = 1, pages = 17, timestamp = {2019-02-22T00:58:33.000+0100}, title = {Investigating text power in predicting semantic similarity}, url = {https://ijism.ricest.ac.ir/index.php/ijism/article/view/1297}, urldate = {2019-02-07}, volume = 17, year = 2019 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Investigating text power in predicting semantic similarity

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Investigating text power in predicting semantic similarity

Abstract

Links and resources

Tags

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Investigating text power in predicting semantic similarity

Comments and Reviews
(0)