Shawn Nie

shawn2333

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

Meta-Harness: End-to-End Optimization of Model Harnesses

liked a Space 3 months ago

lzumot/lean-prover-validator

liked a model 5 months ago

allenai/Olmo-3-7B-Instruct

Organizations

None yet

upvoted a paper 10 days ago

Meta-Harness: End-to-End Optimization of Model Harnesses

Paper • 2603.28052 • Published 16 days ago • 18

liked a Space 3 months ago

Lean Prover Validator

📈

generate a proof then test it with lean4

liked a model 5 months ago

allenai/Olmo-3-7B-Instruct

Text Generation • 528k • Updated Jan 5 • 496k • 125

upvoted a collection 5 months ago

Olmo 3

Collection

Artifacts for the Olmo 3 release. • 7 items • Updated Mar 2 • 168

liked a model 6 months ago

microsoft/UserLM-8b

Text Generation • 8B • Updated Oct 9, 2025 • 1.69k • 365

upvoted 2 papers 6 months ago

Flipping the Dialogue: Training and Evaluating User Language Models

Paper • 2510.06552 • Published Oct 8, 2025 • 1

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30, 2025 • 74

commented a paper 7 months ago

Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR

Paper • 2509.02522 • Published Sep 2, 2025 • 25

liked a model 7 months ago

Qwen/Qwen3-Next-80B-A3B-Instruct

Text Generation • 81B • Updated Sep 17, 2025 • 372k • 959

liked a dataset 7 months ago

facebook/recycling_the_web

Viewer • Updated Aug 28, 2025 • 60.3M • 752 • 66

upvoted a paper 7 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 199

liked a dataset 7 months ago

OpenAssistant/oasst2

Viewer • Updated Jan 11, 2024 • 135k • 10.6k • 288

upvoted 2 papers 7 months ago

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Paper • 2509.01363 • Published Sep 1, 2025 • 61

Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR

Paper • 2509.02522 • Published Sep 2, 2025 • 25

New activity in nvidia/Nemotron-Post-Training-Dataset-v1 8 months ago

Why is the user content always empty?

👍 5

#10 opened 8 months ago by Hanfeng

liked a model 8 months ago

Skywork/Skywork-Reward-V2-Llama-3.1-8B

Text Classification • 8B • Updated Jul 6, 2025 • 48.3k • 42

liked a dataset 9 months ago

nvidia/Nemotron-Post-Training-Dataset-v1

Viewer • Updated Aug 25, 2025 • 25.7M • 6.48k • 179

liked a model 9 months ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5

Text Generation • 50B • Updated Oct 15, 2025 • 203k • 230

New activity in nvidia/Llama-3_3-Nemotron-Super-49B-v1_5 9 months ago

Missing `modeling_decilm.py` when loading the model

#1 opened 9 months ago by