jiachenluo

jiachenluo699

AI & ML interests

None yet

Recent Activity

upvoted a paper 23 days ago

A Survey of Reinforcement Learning for Large Reasoning Models

upvoted a paper 23 days ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

upvoted a paper 27 days ago

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Organizations

None yet

upvoted 2 papers 23 days ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 193

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 238

upvoted 5 papers 27 days ago

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Paper • 2512.23447 • Published Dec 29, 2025 • 99

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 157

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published Dec 3, 2025 • 159

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 265

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 304

upvoted a paper about 1 month ago

Advancing Open-source World Models

Paper • 2601.20540 • Published Jan 28 • 135

liked a model about 1 month ago

nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16

Text Generation • 124B • Updated 5 days ago • 518k • 330

upvoted a paper about 1 month ago

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published Feb 26 • 152

liked a model about 1 month ago

openai/whisper-large

Automatic Speech Recognition • 2B • Updated Feb 29, 2024 • 84.8k • 542

liked 9 models about 2 months ago

meta-llama/Llama-3.1-405B

Text Generation • 406B • Updated Sep 25, 2024 • 193k • 968

Qwen/Qwen2-Audio-7B

Audio-Text-to-Text • Updated Nov 20, 2024 • 7.39k • 167

FunAudioLLM/SenseVoiceSmall

Updated Jul 31, 2024 • 9.64k • 380

microsoft/Florence-2-large

Image-Text-to-Text • 0.8B • Updated Aug 4, 2025 • 1.27M • 1.8k

meta-llama/Meta-Llama-3-70B

Text Generation • 71B • Updated Sep 27, 2024 • 131k • 874

Qwen/Qwen1.5-MoE-A2.7B-Chat

Text Generation • Updated Apr 30, 2024 • 31k • 134

Qwen/Qwen3-Omni-30B-A3B-Captioner

Any-to-Any • 32B • Updated Sep 22, 2025 • 8k • 220

Qwen/Qwen3-Omni-30B-A3B-Instruct

Any-to-Any • 35B • Updated Sep 22, 2025 • 363k • 907

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1, 2025 • 64.4k • 3.57k