BigScience Workshop

non-profit

https://bigscience.huggingface.co

bigscience-workshop

AI & ML interests

A one-year long research workshop on large language models: the Summer of Language Models 21 🌸

Recent Activity

israel authored a paper 5 days ago

CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation

israel authored a paper 5 days ago

AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text

israel authored a paper 5 days ago

Afri-MCQA: Multimodal Cultural Question Answering for African Languages

View all activity

authored 2 papers 3 days ago

Scaling Low-Resource MT via Synthetic Data Generation with LLMs

Paper • 2505.14423 • Published May 20, 2025 • 2

Open Machine Translation for Esperanto

Paper • 2603.29345 • Published 13 days ago

in bigscience/bloom about 1 month ago

[SPAM] Deleted

#289 opened about 1 month ago by

pretokenizer Regex issues?

#278 opened almost 2 years ago by

authored 2 papers about 1 month ago

Beyond Transcripts: A Renewed Perspective on Audio Chaptering

Paper • 2602.08979 • Published Feb 9

Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024

Paper • 2406.16777 • Published Jun 24, 2024 • 1

in bigscience/bloom about 1 month ago

Test PR

#286 opened about 1 month ago by

Test discussion

#287 opened about 1 month ago by

Test discussion

#288 opened about 1 month ago by

authored a paper about 2 months ago

A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn't)

Paper • 2602.14696 • Published Feb 16

submitted a paper to Daily Papers about 2 months ago

A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn't)

Paper • 2602.14696 • Published Feb 16

submitted a paper to Daily Papers 2 months ago

Shaping capabilities with token-level data filtering

Paper • 2601.21571 • Published Jan 29 • 27

pjox

authored a paper 2 months ago

SciLaD: A Large-Scale, Transparent, Reproducible Dataset for Natural Scientific Language Processing

Paper • 2512.11192 • Published Dec 12, 2025

authored a paper 3 months ago

Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers

Paper • 2601.04890 • Published Jan 8 • 44

in bigscience/bloomz-560m 4 months ago

Fails to load with transformers v4.57+

#14 opened 4 months ago by

authored a paper 4 months ago

Revisiting Generalization Across Difficulty Levels: It's Not So Easy

Paper • 2511.21692 • Published Nov 26, 2025 • 15

posted an update 4 months ago

Post

458

PatchDNA, a DNA foundation model based on Meta's BLT tokenization strategy https://www.biorxiv.org/content/10.1101/2025.11.28.691095v1

in bigscience/petals-api 5 months ago

Bloom

#2 opened 5 months ago by

Zaid

authored a paper 5 months ago

Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures

Paper • 2510.24081 • Published Oct 28, 2025 • 21

authored a paper 6 months ago

Boomerang Distillation Enables Zero-Shot Model Size Interpolation

Paper • 2510.05064 • Published Oct 6, 2025 • 1