Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2511.15848

Running

Agents

102

NSFW Uncensored - Text

⚡

102

NSFW Uncensored Novel Generator
NousResearch/nomos-1

Text Generation • Updated Jan 10 • 209 • 143
FILM6912/distill-whisper-small

Automatic Speech Recognition • 0.2B • Updated Jul 9, 2025 • 13 • 1
mradermacher/Psychological_Counseling_Model_for_Occupational_Anxiety_in_Obese_Patients-i1-GGUF

15B • Updated Jan 4 • 97 • 1

Step-Audio-R1 is the first audio language model to successfully unlock test-time compute scaling.

stepfun-ai/Step-Audio-R1.1

Audio-Text-to-Text • 33B • Updated Feb 14 • 278 • 172
Running

Agents

41

Step Audio R1.1

⚡

41

Step-Audio-R1.1
Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 58
stepfun-ai/Step-Audio-R1

Audio-Text-to-Text • Updated Dec 2, 2025 • 85 • 143

TabTune: A Unified Library for Inference and Fine-Tuning Tabular Foundation Models

Paper • 2511.02802 • Published Nov 4, 2025 • 16
Orion-MSP: Multi-Scale Sparse Attention for Tabular In-Context Learning

Paper • 2511.02818 • Published Nov 4, 2025 • 15
Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 58

SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation

Paper • 2405.18503 • Published May 28, 2024 • 9
DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation

Paper • 2405.20289 • Published May 30, 2024 • 11
LiveSpeech: Low-Latency Zero-shot Text-to-Speech via Autoregressive Modeling of Audio Discrete Codes

Paper • 2406.02897 • Published Jun 5, 2024 • 16
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning

Paper • 2406.03344 • Published Jun 5, 2024 • 22

Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 58
OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation

Paper • 2410.17799 • Published Oct 23, 2024 • 12
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Paper • 2507.04009 • Published Jul 5, 2025 • 54

Music Flamingo: Scaling Music Understanding in Audio Language Models

Paper • 2511.10289 • Published Nov 13, 2025 • 19
Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 58

A Definition of AGI

Paper • 2510.18212 • Published Oct 21, 2025 • 36
Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs

Paper • 2511.16664 • Published Nov 20, 2025 • 29
Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 58
Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story

Paper • 2511.15210 • Published Nov 19, 2025 • 91

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Paper • 2402.04252 • Published Feb 6, 2024 • 30
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Paper • 2402.03749 • Published Feb 6, 2024 • 15
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Paper • 2402.04615 • Published Feb 7, 2024 • 44
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Paper • 2402.05008 • Published Feb 7, 2024 • 23

Running

Agents

102

NSFW Uncensored - Text

⚡

102

NSFW Uncensored Novel Generator
NousResearch/nomos-1

Text Generation • Updated Jan 10 • 209 • 143
FILM6912/distill-whisper-small

Automatic Speech Recognition • 0.2B • Updated Jul 9, 2025 • 13 • 1
mradermacher/Psychological_Counseling_Model_for_Occupational_Anxiety_in_Obese_Patients-i1-GGUF

15B • Updated Jan 4 • 97 • 1

Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 58
OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation

Paper • 2410.17799 • Published Oct 23, 2024 • 12
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Paper • 2507.04009 • Published Jul 5, 2025 • 54

Step-Audio-R1 is the first audio language model to successfully unlock test-time compute scaling.

stepfun-ai/Step-Audio-R1.1

Audio-Text-to-Text • 33B • Updated Feb 14 • 278 • 172
Running

Agents

41

Step Audio R1.1

⚡

41

Step-Audio-R1.1
Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 58
stepfun-ai/Step-Audio-R1

Audio-Text-to-Text • Updated Dec 2, 2025 • 85 • 143

Music Flamingo: Scaling Music Understanding in Audio Language Models

Paper • 2511.10289 • Published Nov 13, 2025 • 19
Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 58

TabTune: A Unified Library for Inference and Fine-Tuning Tabular Foundation Models

Paper • 2511.02802 • Published Nov 4, 2025 • 16
Orion-MSP: Multi-Scale Sparse Attention for Tabular In-Context Learning

Paper • 2511.02818 • Published Nov 4, 2025 • 15
Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 58

A Definition of AGI

Paper • 2510.18212 • Published Oct 21, 2025 • 36
Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs

Paper • 2511.16664 • Published Nov 20, 2025 • 29
Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 58
Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story

Paper • 2511.15210 • Published Nov 19, 2025 • 91

SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation

Paper • 2405.18503 • Published May 28, 2024 • 9
DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation

Paper • 2405.20289 • Published May 30, 2024 • 11
LiveSpeech: Low-Latency Zero-shot Text-to-Speech via Autoregressive Modeling of Audio Discrete Codes

Paper • 2406.02897 • Published Jun 5, 2024 • 16
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning

Paper • 2406.03344 • Published Jun 5, 2024 • 22

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Paper • 2402.04252 • Published Feb 6, 2024 • 30
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Paper • 2402.03749 • Published Feb 6, 2024 • 15
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Paper • 2402.04615 • Published Feb 7, 2024 • 44
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Paper • 2402.05008 • Published Feb 7, 2024 • 23

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs