Xiang Fu

craigxiangfu

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

YaRN: Efficient Context Window Extension of Large Language Models

liked a model 2 months ago

mistralai/Mistral-Large-Instruct-2411

liked a dataset 5 months ago

Anthropic/hh-rlhf

upvoted a paper about 2 months ago

YaRN: Efficient Context Window Extension of Large Language Models

Paper • 2309.00071 • Published Aug 31, 2023 • 82

upvoted a paper 6 months ago

Matryoshka Quantization

Paper • 2502.06786 • Published Feb 10, 2025 • 32

upvoted 4 collections 9 months ago

OLMo 2

Artifacts for the OLMo 2 release. • 35 items • Updated Mar 3 • 154

🧠 SmolLM3

Smol, multilingual, long-context reasoner • 14 items • Updated Oct 9, 2025 • 99

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 305

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5, 2025 • 247

upvoted 4 papers 10 months ago

RExBench: Can coding agents autonomously implement AI research extensions?

Paper • 2506.22598 • Published Jun 27, 2025 • 11

In-Context Learning Strategies Emerge Rationally

Paper • 2506.17859 • Published Jun 21, 2025 • 10

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Paper • 2506.06751 • Published Jun 7, 2025 • 71

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265

upvoted 4 papers 11 months ago

Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers

Paper • 2409.04109 • Published Sep 6, 2024 • 48

DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research

Paper • 2505.19253 • Published May 25, 2025 • 34

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28, 2025 • 132

NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification

Paper • 2505.16938 • Published May 22, 2025 • 121

upvoted 2 papers 12 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21, 2025 • 88

Could Thinking Multilingually Empower LLM Reasoning?

Paper • 2504.11833 • Published Apr 16, 2025 • 29

upvoted 2 papers about 1 year ago

NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions

Paper • 2502.13124 • Published Feb 18, 2025 • 8

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26, 2025 • 82

upvoted a collection about 1 year ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 43 items • Updated Mar 2 • 713

upvoted a paper about 1 year ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published Feb 26, 2025 • 28