
Efficient Word Embedding for Nepali

Proceedings of 12th IOE Graduate Conference, 12, pages 621--627. Institute of Engineering, Tribhuvan University, Nepal (October 2022)

Abstract

Word embeddings are a vital part of most modern Natural Language Processing (NLP) tasks. It is, however, difficult to determine whether a given word embedding model works well. Especially with larger models, training takes a very long time, and when the time required to train the eventual model for the NLP task is added, a very large share of the effort can be spent just training different word embedding models to identify which one works well. Because of this, selecting between different word embedding models can be very difficult. For this reason, intrinsic evaluation is used to assess the performance of word embedding systems instead of directly using each model in the eventual NLP task. For Nepali, however, this is difficult due to the lack of resources in the Nepali language. We show that, using intrinsic evaluation adapted with small modifications from a similar language such as Hindi, we can gain insight into the effectiveness of word embeddings. This is justified by the results of extrinsic evaluation, which are in agreement with the results from intrinsic evaluation. Using this, we find that, among the three models considered, the fastText model performs best when out-of-vocabulary words are taken into account.
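
The sketch below illustrates the kind of workflow the abstract describes: training a fastText model and evaluating it intrinsically against a human-judged word-similarity list, with fastText's subword n-grams handling out-of-vocabulary words. It is a minimal illustration using gensim, not the paper's actual setup; the toy corpus and the file name "nepali_wordsim.tsv" (a similarity list adapted from a Hindi dataset) are assumptions.

```python
# Minimal sketch: fastText training and intrinsic evaluation for Nepali.
# Corpus, hyperparameters, and the similarity file are illustrative assumptions,
# not taken from the paper.
from gensim.models import FastText

# Toy corpus of tokenised Nepali sentences (a real corpus is needed in practice).
sentences = [
    ["नेपाल", "सुन्दर", "देश", "हो"],
    ["काठमाडौं", "नेपालको", "राजधानी", "हो"],
]

# Train a small fastText model; character n-grams (min_n..max_n) let it
# compose vectors for words never seen during training.
model = FastText(vector_size=100, window=5, min_count=1, min_n=3, max_n=6)
model.build_vocab(corpus_iterable=sentences)
model.train(corpus_iterable=sentences, total_examples=len(sentences), epochs=10)

# Intrinsic evaluation: Spearman correlation between model similarities and
# human judgements over word pairs (requires a TSV of word1, word2, score).
# print(model.wv.evaluate_word_pairs("nepali_wordsim.tsv"))  # hypothetical file

# Out-of-vocabulary handling: the second word may be absent from the training
# vocabulary, but fastText still returns a similarity from shared n-grams.
print(model.wv.similarity("नेपाल", "नेपालमा"))
```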
