Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Open Language Data Initiative

community
https://oldi.org/
openlanguagedata
Activity Feed

AI & ML interests

Multilingual NLP, underserved languages

Recent Activity

cointegrated  updated a collection 20 days ago
OLDI and friends
cointegrated  updated a dataset about 1 month ago
openlanguagedata/flores_plus
cointegrated  new activity about 1 month ago
openlanguagedata/flores_plus:Add Khakas data (kjh_Cyrl)
View all activity

Laurie Burchell's profile pictureJean's profile pictureSkyler Wang's profile pictureDavid Dale's profile pictureIsaac Caswell's profile picture

openlanguagedata 's collections 1

OLDI and friends
This collection groups the datasets that have been featured as part of WMT’s Open Language Data Initiative shared task.
  • openlanguagedata/flores_plus

    Viewer • Updated Mar 10 • 893k • 16.5k • 123
  • openlanguagedata/oldi_seed

    Viewer • Updated Mar 8 • 564k • 785 • 11
  • google/smol

    Viewer • Updated Oct 31, 2025 • 798k • 2.81k • 94
  • google/wmt24pp

    Viewer • Updated Jan 22 • 54.9k • 5.96k • 87
OLDI and friends
This collection groups the datasets that have been featured as part of WMT’s Open Language Data Initiative shared task.
  • openlanguagedata/flores_plus

    Viewer • Updated Mar 10 • 893k • 16.5k • 123
  • openlanguagedata/oldi_seed

    Viewer • Updated Mar 8 • 564k • 785 • 11
  • google/smol

    Viewer • Updated Oct 31, 2025 • 798k • 2.81k • 94
  • google/wmt24pp

    Viewer • Updated Jan 22 • 54.9k • 5.96k • 87
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs