Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

weiliu's picture

In a Training Loop 🔄

weiliu

thinkwee

Mi6paulino's profile picture

zen-E's profile picture

XingweiT's profile picture

·

https://thinkwee.top/about/

thinkwee2767
thinkwee
thinkwee

AI & ML interests

LLM reasoning, agents

Organizations

None yet

thinkwee 's collections 3

Deep Data Research Benchmark

Hunt Instead of Wait: Evaluating Deep Data Research on Large Language Models

Paper • 2602.02039 • Published Feb 2 • 5
Running

3

DDR Bench

🚀

3

Deep Data Research Benchmark
thinkwee/DDRBench_10K

Viewer • Updated Feb 3 • 3.16M • 98
thinkwee/DDRBench_10K_trajectory

Viewer • Updated Feb 4 • 50.9k • 31

General Reasoning datasets for training the NOVER model

NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning

Paper • 2505.16022 • Published May 21, 2025 • 4
thinkwee/NOVEReason_2k

Viewer • Updated Aug 6, 2025 • 24.3k • 49 • 1
thinkwee/NOVEReason_5k

Viewer • Updated Aug 6, 2025 • 36.3k • 57 • 1
thinkwee/NOVEReason_full

Viewer • Updated Aug 6, 2025 • 1.7M • 70 • 1

NOVER-series models for general reasoning

NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning

Paper • 2505.16022 • Published May 21, 2025 • 4
thinkwee/NOVER1-Qwen2.5-7B

Question Answering • 8B • Updated Aug 20, 2025 • 6 • 2
thinkwee/NOVER1-Qwen3-4B

Question Answering • 4B • Updated Aug 20, 2025 • 8 • 2

Deep Data Research Benchmark

Hunt Instead of Wait: Evaluating Deep Data Research on Large Language Models

Paper • 2602.02039 • Published Feb 2 • 5
Running

3

DDR Bench

🚀

3

Deep Data Research Benchmark
thinkwee/DDRBench_10K

Viewer • Updated Feb 3 • 3.16M • 98
thinkwee/DDRBench_10K_trajectory

Viewer • Updated Feb 4 • 50.9k • 31

NOVER-series models for general reasoning

NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning

Paper • 2505.16022 • Published May 21, 2025 • 4
thinkwee/NOVER1-Qwen2.5-7B

Question Answering • 8B • Updated Aug 20, 2025 • 6 • 2
thinkwee/NOVER1-Qwen3-4B

Question Answering • 4B • Updated Aug 20, 2025 • 8 • 2

General Reasoning datasets for training the NOVER model

NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning

Paper • 2505.16022 • Published May 21, 2025 • 4
thinkwee/NOVEReason_2k

Viewer • Updated Aug 6, 2025 • 24.3k • 49 • 1
thinkwee/NOVEReason_5k

Viewer • Updated Aug 6, 2025 • 36.3k • 57 • 1
thinkwee/NOVEReason_full

Viewer • Updated Aug 6, 2025 • 1.7M • 70 • 1

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs