Shangziqi Zhao

zhaoshangziqi

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 3 days ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published 4 days ago • 76

liked a dataset 9 days ago

PatronusAI/TRAIL

Viewer • Updated May 15, 2025 • 148 • 329 • 17

upvoted a paper 19 days ago

Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale

Paper • 2509.14008 • Published Sep 17, 2025 • 90

upvoted a paper about 1 month ago

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 58

liked a model about 2 months ago

Qwen/Qwen3-30B-A3B-Base

Text Generation • 31B • Updated Jul 26, 2025 • 48.5k • 70

upvoted a paper about 2 months ago

Query as Anchor: Scenario-Adaptive User Representation via Large Language Model

Paper • 2602.14492 • Published Feb 16 • 18

upvoted a paper 2 months ago

How Do Decoder-Only LLMs Perceive Users? Rethinking Attention Masking for User Representation Learning

Paper • 2602.10622 • Published Feb 11 • 28

upvoted 6 papers 7 months ago

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

Paper • 2509.25123 • Published Sep 29, 2025 • 22

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Paper • 2509.07894 • Published Sep 9, 2025 • 31

liked 2 datasets 8 months ago

furonghuang-lab/PHTest

Viewer • Updated Sep 24, 2024 • 3.27k • 81 • 3

wentingzhao/one-million-instructions

Viewer • Updated Sep 16, 2023 • 2.33M • 59 • 7

liked a dataset 9 months ago

bench-llm/or-bench

Viewer • Updated Dec 19, 2024 • 82.3k • 4.67k • 20

liked a dataset 10 months ago

ChilleD/CommonsenseQA

Viewer • Updated Jun 4, 2024 • 12.1k • 51 • 1

liked 3 datasets 12 months ago

meng-lab/AdaDecode-Llama-3.1-8B-Instruct-GSM8K

Viewer • Updated Sep 25, 2024 • 8.79k • 10 • 1

openai/gsm8k

Benchmark • Updated 26 days ago • 17.6k • 788k • 1.26k

fql/qwq_long_cot_math_gsm_v1

Viewer • Updated Dec 29, 2024 • 10.3k • 4 • 1
