copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Identifying and Reducing Gender Bias in Word-Level Language Models

S. Bordia, and S. Bowman. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, page 7--15. Minneapolis, Minnesota, Association for Computational Linguistics, (June 2019)
DOI: 10.18653/v1/N19-3002

Abstract

Many text corpora exhibit socially problematic biases, which can be propagated or amplified in the models trained on such data. For example, doctor cooccurs more frequently with male pronouns than female pronouns. In this study we (i) propose a metric to measure gender bias; (ii) measure bias in a text corpus and the text generated from a recurrent neural network language model trained on the text corpus; (iii) propose a regularization loss term for the language model that minimizes the projection of encoder-trained embeddings onto an embedding subspace that encodes gender; (iv) finally, evaluate efficacy of our proposed method on reducing gender bias. We find this regularization method to be effective in reducing gender bias up to an optimal weight assigned to the loss term, beyond which the model becomes unstable as the perplexity increases. We replicate this study on three training corpora---Penn Treebank, WikiText-2, and CNN/Daily Mail---resulting in similar conclusions.

Description

Identifying and Reducing Gender Bias in Word-Level Language Models - ACL Anthology

Links and resources

BibTeX key: bordia-bowman-2019-identifying
entry type: inproceedings
address: Minneapolis, Minnesota
booktitle: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop
year: 2019
month: jun
pages: 7--15
publisher: Association for Computational Linguistics
DOI: 10.18653/v1/N19-3002
url: https://www.aclweb.org/anthology/N19-3002

@schwemmlein's tags highlighted

Cite this publication

@inproceedings{bordia-bowman-2019-identifying, abstract = {Many text corpora exhibit socially problematic biases, which can be propagated or amplified in the models trained on such data. For example, doctor cooccurs more frequently with male pronouns than female pronouns. In this study we (i) propose a metric to measure gender bias; (ii) measure bias in a text corpus and the text generated from a recurrent neural network language model trained on the text corpus; (iii) propose a regularization loss term for the language model that minimizes the projection of encoder-trained embeddings onto an embedding subspace that encodes gender; (iv) finally, evaluate efficacy of our proposed method on reducing gender bias. We find this regularization method to be effective in reducing gender bias up to an optimal weight assigned to the loss term, beyond which the model becomes unstable as the perplexity increases. We replicate this study on three training corpora{---}Penn Treebank, WikiText-2, and CNN/Daily Mail{---}resulting in similar conclusions.}, added-at = {2021-01-25T14:11:49.000+0100}, address = {Minneapolis, Minnesota}, author = {Bordia, Shikha and Bowman, Samuel R.}, biburl = {https://www.bibsonomy.org/bibtex/2359958f27783947bc6a554ce5b351aab/schwemmlein}, booktitle = {Proceedings of the 2019 Conference of the North {A}merican Chapter of the Association for Computational Linguistics: Student Research Workshop}, description = {Identifying and Reducing Gender Bias in Word-Level Language Models - ACL Anthology}, doi = {10.18653/v1/N19-3002}, interhash = {1925874bacd03ff45ddc9ab1a75cb202}, intrahash = {359958f27783947bc6a554ce5b351aab}, keywords = {bias gender language lm models nlp word}, month = jun, pages = {7--15}, publisher = {Association for Computational Linguistics}, timestamp = {2021-01-25T14:11:49.000+0100}, title = {Identifying and Reducing Gender Bias in Word-Level Language Models}, url = {https://www.aclweb.org/anthology/N19-3002}, year = 2019 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Identifying and Reducing Gender Bias in Word-Level Language Models

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Identifying and Reducing Gender Bias in Word-Level Language Models

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Identifying and Reducing Gender Bias in Word-Level Language Models

Comments and Reviews
(0)