ldwang

ftgreat

AI & ML interests

LLM, MLLM, Infra

Recent Activity

liked a model about 8 hours ago

MiniMaxAI/MiniMax-M2.7

upvoted a paper 1 day ago

Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes

liked a dataset 3 days ago

nvidia/Nemotron-SFT-OpenCode-v1

Organizations

Collections 8

Papers 8

models 4

datasets 3

ldwang/lighteval-ceval-exam

Updated Nov 14, 2024 • 25

ldwang/OpenHermes-2.5-zh

Preview • Updated Sep 2, 2024 • 31 • 1

ldwang/lighteval-cmmlu

Updated Aug 13, 2024 • 20

Collections 8

Scaling test-time compute

FineWeb: decanting the web for the finest text data at scale

The Ultra-Scale Playbook

FineVision: Open Data is All You Need

OpenPipe/art-e-008

corbt/enron_emails_sample_questions

smolagents LLM leaderboard

models 4

ldwang/DeepScaleR-1.5B-Preview-Reproduce

ldwang/fasttext-oh-zh

ldwang/mamba-1.4b-aquila-400b-sft

ldwang/mamba-1.4b-aquila-400b
