You reap what you sow: On the Challenges of Bias Evaluation Under Multilingual Settings

28 May

Evaluating bias, fairness, and social impact in monolingual language models is a difficult task. This challenge is further compounded when language modeling occurs in a multilingual context. Considering the implication of evaluation biases for large multilingual language models, we situate the discussion of bias evaluation within a wider context of social scientific research with computational work.We highlight three dimensions of developing multilingual bias evaluation frameworks: (1) increasing transparency through documentation, (2) expanding targets of bias beyond gender, and (3) addressing cultural differences that exist between languages.We further discuss the power dynamics and consequences of training large language models and recommend that researchers remain cognizant of the ramifications of developing such technologies.

MultilingualNLPEthics and Policy

Stella Biderman

You reap what you sow: On the Challenges of Bias Evaluation Under Multilingual Settings

The BigScience ROOTS Corpus: A 1.6 TB Composite Multilingual Dataset

VQGAN-CLIP: Open domain image generation and editing