XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation

Ruder, Sebastian; Constant, Noah; Botha, Jan; Siddhant, Aditya; Firat, Orhan; Fu, Jinlan; Liu, Pengfei; Hu, Junjie; Garrette, Dan; Neubig, Graham; Johnson, Melvin

Computer Science > Computation and Language

arXiv:2104.07412 (cs)

[Submitted on 15 Apr 2021 (v1), last revised 7 Oct 2021 (this version, v2)]

Title:XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation

Authors:Sebastian Ruder, Noah Constant, Jan Botha, Aditya Siddhant, Orhan Firat, Jinlan Fu, Pengfei Liu, Junjie Hu, Dan Garrette, Graham Neubig, Melvin Johnson

View PDF

Abstract:Machine learning has brought striking advances in multilingual natural language processing capabilities over the past year. For example, the latest techniques have improved the state-of-the-art performance on the XTREME multilingual benchmark by more than 13 points. While a sizeable gap to human-level performance remains, improvements have been easier to achieve in some tasks than in others. This paper analyzes the current state of cross-lingual transfer learning and summarizes some lessons learned. In order to catalyze meaningful progress, we extend XTREME to XTREME-R, which consists of an improved set of ten natural language understanding tasks, including challenging language-agnostic retrieval tasks, and covers 50 typologically diverse languages. In addition, we provide a massively multilingual diagnostic suite (MultiCheckList) and fine-grained multi-dataset evaluation capabilities through an interactive public leaderboard to gain a better understanding of such models. The leaderboard and code for XTREME-R will be made available at this https URL and this https URL respectively.

Comments:	EMNLP 2021 camera-ready
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2104.07412 [cs.CL]
	(or arXiv:2104.07412v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2104.07412

Submission history

From: Sebastian Ruder [view email]
[v1] Thu, 15 Apr 2021 12:26:12 UTC (5,184 KB)
[v2] Thu, 7 Oct 2021 09:51:04 UTC (5,195 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2021-04

Change to browse by:

cs
cs.AI

References & Citations

DBLP - CS Bibliography

listing | bibtex

Sebastian Ruder
Noah Constant
Aditya Siddhant
Orhan Firat
Jinlan Fu

…

export BibTeX citation

Computer Science > Computation and Language

Title:XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators