4 10

Chenyang Zhang

zcyeee

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

liked a Space about 2 months ago

HuggingFaceTB/smol-training-playbook

liked a model 4 months ago

yiyanghkust/finbert-tone-chinese

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

Paper • 2602.01058 • Published Feb 1 • 43

liked a Space about 2 months ago

The Smol Training Playbook

📚

3.1k

The secrets to building world-class LLMs

liked a model 4 months ago

yiyanghkust/finbert-tone-chinese

Text Classification • 0.1B • Updated Feb 6, 2024 • 27.3k • • 53

upvoted an article 5 months ago

Article

Visualize and understand GPU memory in PyTorch

Dec 24, 2024

•

269

liked 2 datasets 5 months ago

PeterJinGo/nq_hotpotqa_train

Viewer • Updated Mar 13, 2025 • 221k • 1.29k • 14

hotpotqa/hotpot_qa

Viewer • Updated Aug 11, 2025 • 203k • 78.4k • 282

liked a dataset 6 months ago

agentrl/ReCall-data

Viewer • Updated Apr 25, 2025 • 30.1k • 51 • 6

liked a model 6 months ago

Qwen/Qwen2.5-3B-Instruct

Text Generation • 3B • Updated Sep 25, 2024 • 9.11M • 439

liked 2 datasets 6 months ago

FreedomIntelligence/medical-o1-reasoning-SFT

Viewer • Updated Apr 22, 2025 • 90.1k • 7.14k • 1.08k

FreedomIntelligence/Huatuo26M-Lite

Viewer • Updated Nov 29, 2023 • 178k • 2.05k • 64

upvoted a paper 6 months ago

DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation

Paper • 2510.09116 • Published Oct 10, 2025 • 97

liked a model 9 months ago

ByteDance/Dolphin

Image-Text-to-Text • Updated Jul 16, 2025 • 450 • 515

upvoted a paper 10 months ago

MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation

Paper • 2506.14028 • Published Jun 16, 2025 • 94

liked a dataset about 1 year ago

zexuanqiu22/CLongEval

Preview • Updated Mar 6, 2024 • 29 • 8