Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2509.13755

Machine Unlearning

TOFU: A Task of Fictitious Unlearning for LLMs

Paper • 2401.06121 • Published Jan 11, 2024 • 20
The Frontier of Data Erasure: Machine Unlearning for Large Language Models

Paper • 2403.15779 • Published Mar 23, 2024 • 1
Machine Unlearning of Pre-trained Large Language Models

Paper • 2402.15159 • Published Feb 23, 2024
Rethinking Machine Unlearning for Large Language Models

Paper • 2402.08787 • Published Feb 13, 2024 • 3

Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning

Paper • 2509.13755 • Published Sep 17, 2025 • 19

lusxvr/nanoVLM-222M

Image-Text-to-Text • 0.2B • Updated May 8, 2025 • 196 • 99
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12, 2025 • 39
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published May 30, 2025 • 97
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 88

Source Code Analysis

CVEfixes: Automated Collection of Vulnerabilities and Their Fixes from Open-Source Software

Paper • 2107.08760 • Published Jul 19, 2021
From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future

Paper • 2408.02479 • Published Aug 5, 2024
PurpCode: Reasoning for Safer Code Generation

Paper • 2507.19060 • Published Jul 25, 2025 • 2
Vulnerability Detection Using Two-Stage Deep Learning Models

Paper • 2305.09673 • Published May 8, 2023

Open Data Synthesis For Deep Research

Paper • 2509.00375 • Published Aug 30, 2025 • 72
Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training

Paper • 2509.03403 • Published Sep 3, 2025 • 23
LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Paper • 2509.03405 • Published Sep 3, 2025 • 24
SATQuest: A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs

Paper • 2509.00930 • Published Aug 31, 2025 • 5

Machine Unlearning

TOFU: A Task of Fictitious Unlearning for LLMs

Paper • 2401.06121 • Published Jan 11, 2024 • 20
The Frontier of Data Erasure: Machine Unlearning for Large Language Models

Paper • 2403.15779 • Published Mar 23, 2024 • 1
Machine Unlearning of Pre-trained Large Language Models

Paper • 2402.15159 • Published Feb 23, 2024
Rethinking Machine Unlearning for Large Language Models

Paper • 2402.08787 • Published Feb 13, 2024 • 3

Source Code Analysis

CVEfixes: Automated Collection of Vulnerabilities and Their Fixes from Open-Source Software

Paper • 2107.08760 • Published Jul 19, 2021
From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future

Paper • 2408.02479 • Published Aug 5, 2024
PurpCode: Reasoning for Safer Code Generation

Paper • 2507.19060 • Published Jul 25, 2025 • 2
Vulnerability Detection Using Two-Stage Deep Learning Models

Paper • 2305.09673 • Published May 8, 2023

Scrub It Out! Erasing Sensitive Memorization in Code Language Models via Machine Unlearning

Paper • 2509.13755 • Published Sep 17, 2025 • 19

Open Data Synthesis For Deep Research

Paper • 2509.00375 • Published Aug 30, 2025 • 72
Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training

Paper • 2509.03403 • Published Sep 3, 2025 • 23
LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations

Paper • 2509.03405 • Published Sep 3, 2025 • 24
SATQuest: A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs

Paper • 2509.00930 • Published Aug 31, 2025 • 5

lusxvr/nanoVLM-222M

Image-Text-to-Text • 0.2B • Updated May 8, 2025 • 196 • 99
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12, 2025 • 39
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published May 30, 2025 • 97
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 88

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs