4 285 49

Dazhi Jiang

thuzhizhi

jiangzizi

AI & ML interests

None yet

Recent Activity

liked a dataset 3 days ago

TAAC2026/data_sample_1000

liked a model about 2 months ago

Nanbeige/Nanbeige4.1-3B

liked a Space about 2 months ago

OpenHands/openhands-index

View all activity

Organizations

None yet

liked a dataset 3 days ago

TAAC2026/data_sample_1000

Viewer • Updated 7 days ago • 1k • 7.77k • 53

liked a model about 2 months ago

Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated 23 days ago • 303k • • 1.09k

liked a Space about 2 months ago

OpenHands Index

🤖

A Holistic Benchmark for Software Engineering

liked 2 models 2 months ago

zai-org/GLM-5

Text Generation • 754B • Updated 12 days ago • 491k • • 2.07k

moonshotai/Kimi-K2.5

Image-Text-to-Text • 1.1T • Updated Feb 27 • 5.35M • • 2.74k

upvoted an article 3 months ago

Article

The Optimal Architecture for Small Language Models

Dec 26, 2025

•

120

liked a dataset 3 months ago

BytedTsinghua-SIA/DAPO-Math-17k

Viewer • Updated Apr 18, 2025 • 1.79M • 9.04k • 166

liked a model 3 months ago

zai-org/GLM-4.7

Text Generation • 358B • Updated Jan 29 • 124k • • 2.02k

liked a dataset 5 months ago

qi6776/Recflow

Updated Jul 11, 2025 • 195 • 1

upvoted a paper 5 months ago

Data-Efficient RLVR via Off-Policy Influence Guidance

Paper • 2510.26491 • Published Oct 30, 2025 • 11

liked a Space 6 months ago

The Smol Training Playbook

📚

3.11k

The secrets to building world-class LLMs

liked 2 models 6 months ago

inclusionAI/LLaDA-MoE-7B-A1B-Instruct

7B • Updated Oct 28, 2025 • 963 • 70

inclusionAI/LLaDA2.0-mini-preview

Text Generation • 16B • Updated Dec 19, 2025 • 301 • 90

upvoted a collection 6 months ago

LLaDA 2.0

Collection

7 items • Updated 23 days ago • 41

updated a Space 7 months ago

MorningMind NewsCards 🌱

🐳

Flip through news flashcards to stay informed

published a Space 7 months ago

MorningMind NewsCards 🌱

🐳

Flip through news flashcards to stay informed

liked a model 7 months ago

SJTU-DENG-Lab/D2F_LLaDA_Instruct_8B_Lora

Text Generation • Updated Aug 14, 2025 • 5

liked a Space 8 months ago

Qwen Image Edit

✒

823

Edit and enhance images based on descriptive instructions

New activity in GSAI-ML/LLaDA-1.5 8 months ago

期待demo

#1 opened 10 months ago by

zzzgry

liked a model 8 months ago

deepseek-ai/DeepSeek-V3.1

Text Generation • Updated Sep 5, 2025 • 150k • • 819

Dazhi Jiang

AI & ML interests

Recent Activity

Organizations

thuzhizhi's activity

OpenHands Index

The Optimal Architecture for Small Language Models

The Smol Training Playbook

MorningMind NewsCards 🌱

MorningMind NewsCards 🌱

Qwen Image Edit

期待demo