Collections

Discover the best community collections!

Collections including paper arxiv:2503.14456

Linear Attention

Higher-order Linear Attention

Paper • 2510.27258 • Published Oct 31, 2025 • 15
RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18, 2025 • 154
xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference

Paper • 2503.13427 • Published Mar 17, 2025 • 3
MoM: Linear Sequence Modeling with Mixture-of-Memories

Paper • 2502.13685 • Published Feb 19, 2025 • 36

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5, 2025 • 233
Transformers without Normalization

Paper • 2503.10622 • Published Mar 13, 2025 • 172
RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18, 2025 • 154
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14, 2025 • 148

RWKV-7 Goose related resources.

Goose-World/RWKV-World-v3

Viewer • Updated Apr 28, 2025 • 1.1M • 144 • 5
BlinkDL/rwkv-7-world

Text Generation • Updated Feb 12 • 108
BlinkDL/rwkv-7-pile

Updated Dec 19, 2024 • 16
RWKV 7 🌏

Sleeping • Agents • 2
best foundation model for its size!

Research Papers/Reviews/Literature

Daily research papers and reviews, including older relevant content.

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30, 2025 • 61
RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18, 2025 • 154
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning

Paper • 2503.15265 • Published Mar 19, 2025 • 46
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Paper • 2503.15558 • Published Mar 18, 2025 • 50

Interesting shit 1

hexgrad/Kokoro-82M

Text-to-Speech • Updated Apr 10, 2025 • 9.7M • 6.03k
RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18, 2025 • 154

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published Mar 18, 2025 • 154

RuCCoD: Towards Automated ICD Coding in Russian

Paper • 2502.21263 • Published Feb 28, 2025 • 133
Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7, 2025 • 124
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Paper • 2503.05179 • Published Mar 7, 2025 • 46
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published Mar 7, 2025 • 27

interesting papers

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 154
Agency Is Frame-Dependent

Paper • 2502.04403 • Published Feb 6, 2025 • 23
Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12, 2025 • 47
LLM Pretraining with Continuous Concepts

Paper • 2502.08524 • Published Feb 12, 2025 • 30

RWKV7 models 🪿

fla-hub/rwkv7-7.2B-g0a

Text Generation • 7B • Updated Aug 30, 2025 • 159 • 3
fla-hub/rwkv7-7.2B-g0

Text Generation • 7B • Updated Aug 6, 2025 • 62 • 3
fla-hub/rwkv7-2.9B-g1

Text Generation • 3B • Updated Aug 6, 2025 • 260 • 3
fla-hub/rwkv7-2.9B-world

Text Generation • 3B • Updated May 7, 2025 • 278 • 4

interesting architecture

FAN: Fourier Analysis Networks

Paper • 2410.02675 • Published Oct 3, 2024 • 29
Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11, 2025 • 90
Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published Jan 31, 2025 • 25
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling

Paper • 2502.09509 • Published Feb 13, 2025 • 9
