
Qwen Pilot

QwenPilot
  • qwenpilot

AI & ML interests

None yet

Recent Activity

updated a model 4 days ago
QwenPilot/FIPO_32B
new activity 14 days ago
QwenPilot/FIPO_32B:Add library_name and pipeline_tag
upvoted a paper 15 days ago
Quantile Advantage Estimation for Entropy-Safe Reasoning

Organizations

None yet

upvoted 4 papers 15 days ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 120

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Paper • 2603.22117 • Published 23 days ago • 29

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

Paper • 2603.22446 • Published 23 days ago • 10

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 26 days ago • 337