Image Generation with a Sphere Encoder
AI & ML interests
AI security & privacy, algorithmic bias, foundations of ML
Recent Activity
Papers
- Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
- Gemstones: A Model Suite for Multi-Faceted Scaling Laws
- Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence (Paper • 2511.07384 • Published • 19)
- smcleish/Recurrent-Llama-3.2-train-recurrence-32 (Text Generation • 1B • Updated • 611 • 1)
- smcleish/Recurrent-Llama-3.2-train-recurrence-16 (Text Generation • 1B • Updated • 26)
- smcleish/Recurrent-Llama-3.2-train-recurrence-8 (Text Generation • 1B • Updated • 410)
This collection contains models described in the refusal token paper published in COLM 2025.
LoRI adapters for natural language understanding, code generation, mathematical reasoning, and safety alignment, based on LLaMA-3-8B and Mistral-7B.
These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space.
- tomg-group-umd/huginn-0125 (Text Generation • Updated • 2.83k • 291)
- Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach (Paper • 2502.05171 • Published • 154)
- tomg-group-umd/huginn_swa_100_10_avg_0.9_merge (Text Generation • 4B • Updated • 7)
- tomg-group-umd/step-00010752-recurrence_full_512_0 (Text Generation • 4B • Updated • 2)
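The idea behind these checkpoints (scaling test-time compute by recurring in latent space) can be illustrated with a toy model: a single weight-tied core block is applied r times between the embedding and the output head, and r can be raised at inference to spend more compute without adding parameters. This is only a sketch under stated assumptions, not the Huginn architecture; the real models differ in important ways (for example, training with randomized recurrence), and all names below are illustrative.

```python
import torch
import torch.nn as nn

class LatentRecurrentLM(nn.Module):
    """Toy depth-recurrent LM: one weight-tied block looped r times.

    Illustrative only; not the actual Huginn architecture.
    """

    def __init__(self, vocab_size=100, d_model=32):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        # One core block whose weights are shared across all iterations.
        self.core = nn.TransformerEncoderLayer(
            d_model, nhead=4, dim_feedforward=64, batch_first=True
        )
        self.head = nn.Linear(d_model, vocab_size)

    def forward(self, ids, recurrence=4):
        h = self.embed(ids)
        # The same core block is reused `recurrence` times; raising this
        # value at inference buys more latent-space compute with no new
        # weights, which is the knob the collection description refers to.
        for _ in range(recurrence):
            h = self.core(h)
        return self.head(h)
```

Because the loop reuses one block, the parameter count is independent of the recurrence depth chosen at test time.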
This collection contains artifacts from our paper "Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs."
- Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs (Paper • 2406.10209 • Published • 8)
- ahans1/wikipedia-en-2k-samples (Viewer • Updated • 4k • 31)
- ahans1/3-goldfish-loss-llama-1B (Text Generation • 1B • Updated • 12)
- goldfish-loss/4-goldfish-loss-llama-1B (Text Generation • 1B • Updated • 2)
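The core idea of the goldfish loss is to exclude a subset of tokens from the training objective so the model never trains on a full verbatim sequence and thus cannot reproduce it exactly. A minimal sketch of the static variant, dropping every k-th token (the paper also describes a hashed, context-dependent mask; `goldfish_loss` and its arguments here are illustrative names, not the authors' code):

```python
import torch
import torch.nn.functional as F

def goldfish_loss(logits, labels, k=4):
    """Next-token loss that drops every k-th token from the objective.

    Sketch of the static goldfish-loss variant; the paper also uses a
    hashed, context-dependent token mask.
    """
    # Positions k-1, 2k-1, ... contribute nothing to the loss or the
    # gradient, so the model never fits the full verbatim sequence.
    mask = torch.ones_like(labels, dtype=torch.bool)
    mask[:, k - 1 :: k] = False
    per_token = F.cross_entropy(
        logits.reshape(-1, logits.size(-1)),
        labels.reshape(-1),
        reduction="none",
    ).reshape(labels.shape)
    # Average only over the tokens that remain in the objective.
    return (per_token * mask).sum() / mask.sum()
```

Note that dropped positions still appear in the model's input context; they are only removed from the loss, which is why perplexity on unseen text is largely preserved.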
Models to accompany "Multi-Token Prediction via Self-Distillation" (arxiv:2602.06019)
https://arxiv.org/abs/2509.02563
A collection of synthetic datasets for studying memorization and knowledge acquisition.
Our 22 open-source Gemstone models for scaling laws range from 50M to 2B parameters, spanning 11 widths (256 to 3072) and 18 depths (3 to 80).
How to extract style from images: the model, dataset, and paper
Hugging Face collection for all things CLRS-Text