Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2509.08721

Foundational & Applied AI Models

A curated set of influential AI models across research and production, including open and closed-source systems - Agentic AI & Gen AI

llm-semantic-router/halugate-sentinel

Text Classification • 0.1B • Updated Dec 4, 2025 • 1k • 10
Qwen/Qwen-Image-Layered

Image-Text-to-Image • Updated Dec 19, 2025 • 23.2k • 1.05k
Gemma 3

Collection

A collection of lightweight, state-of-the-art open models built from the same research and technology that powers the Gemini 2.0 models • 42 items • Updated Aug 14, 2025 • 33
GPT 5 Codex

Collection

Distilled models and datasets for GPT 5 Codex • 7 items • Updated Dec 20, 2025 • 5

Sentinel Sentience Station

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Image-Text-to-Text • 28B • Updated 15 days ago • 567k • 2.74k
TuringEnterprises/Open-RL

Viewer • Updated Mar 4 • 40 • 230 • 179
Running on Zero

MCP

2.26k

Wan2.2 14B Preview

🐌

2.26k

generate a video from an image with a text prompt
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 665

How to crack the CTET 2026 exam in 7 days?

How to crack the CTET 2026 exam in 7 days?

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 665

Open-Source Foundations for Modern AI Systems

open-source libraries that form the infrastructure layer of modern AI systems, spanning model dev, retrieval, orchestration, evaluation, and MLOPS.

A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale

Paper • 2309.06497 • Published Sep 12, 2023 • 7
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 628
Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 251
Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 951

collections TEST Org

collections TEST ORG

PrivateXXOrganization/jjjjjj

Updated Jan 2
Qwen/Qwen-Image-Layered

Image-Text-to-Image • Updated Dec 19, 2025 • 23.2k • 1.05k
zai-org/GLM-4.7

Text Generation • 358B • Updated Jan 29 • 117k • • 2.03k
bigai/TongSIM-Asset

Updated Dec 29, 2025 • 4.49k • 215

Foundational & Modern AI Research (Curated)

A curated selection of foundational and modern AI research papers that meaningfully influence how real-world AI systems are designed, evaluated, and g

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 121
Scaling Laws for Neural Language Models

Paper • 2001.08361 • Published Jan 23, 2020 • 10
Training Compute-Optimal Large Language Models

Paper • 2203.15556 • Published Mar 29, 2022 • 11
Analogy Generation by Prompting Large Language Models: A Case Study of InstructGPT

Paper • 2210.04186 • Published Oct 9, 2022

Local Model In-Weight Memory Experiments

Resources pertaining to experimentation with file-less in-weight memory systems for locally run models begging at 1B parameters with hopes to scale.

nohurry/Opus-4.6-Reasoning-3000x-filtered

Viewer • Updated 21 days ago • 2.33k • 9.79k • 557
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 665
Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 951
zai-org/GLM-5

Text Generation • 754B • Updated 16 days ago • 478k • • 2.07k

Reinforcement Learning

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 238
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published Jan 13 • 150
Transformers in Reinforcement Learning: A Survey

Paper • 2307.05979 • Published Jul 12, 2023 • 1
Comparing DPO with IPO and KTO

Collection

A collection of chat models to explore the differences between three alignment techniques: DPO, IPO, and KTO. • 56 items • Updated Jan 8, 2025 • 32

papers-most-view-by-month

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 304
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19, 2025 • 233
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 550
Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 513

TradingAgents: Multi-Agents LLM Financial Trading Framework

Paper • 2412.20138 • Published Dec 28, 2024 • 48
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 665
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 304
Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 157

Foundational & Applied AI Models

A curated set of influential AI models across research and production, including open and closed-source systems - Agentic AI & Gen AI

llm-semantic-router/halugate-sentinel

Text Classification • 0.1B • Updated Dec 4, 2025 • 1k • 10
Qwen/Qwen-Image-Layered

Image-Text-to-Image • Updated Dec 19, 2025 • 23.2k • 1.05k
Gemma 3

Collection

A collection of lightweight, state-of-the-art open models built from the same research and technology that powers the Gemini 2.0 models • 42 items • Updated Aug 14, 2025 • 33
GPT 5 Codex

Collection

Distilled models and datasets for GPT 5 Codex • 7 items • Updated Dec 20, 2025 • 5

Foundational & Modern AI Research (Curated)

A curated selection of foundational and modern AI research papers that meaningfully influence how real-world AI systems are designed, evaluated, and g

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 121
Scaling Laws for Neural Language Models

Paper • 2001.08361 • Published Jan 23, 2020 • 10
Training Compute-Optimal Large Language Models

Paper • 2203.15556 • Published Mar 29, 2022 • 11
Analogy Generation by Prompting Large Language Models: A Case Study of InstructGPT

Paper • 2210.04186 • Published Oct 9, 2022

Sentinel Sentience Station

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Image-Text-to-Text • 28B • Updated 15 days ago • 567k • 2.74k
TuringEnterprises/Open-RL

Viewer • Updated Mar 4 • 40 • 230 • 179
Running on Zero

MCP

2.26k

Wan2.2 14B Preview

🐌

2.26k

generate a video from an image with a text prompt
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 665

Local Model In-Weight Memory Experiments

Resources pertaining to experimentation with file-less in-weight memory systems for locally run models begging at 1B parameters with hopes to scale.

nohurry/Opus-4.6-Reasoning-3000x-filtered

Viewer • Updated 21 days ago • 2.33k • 9.79k • 557
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 665
Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 951
zai-org/GLM-5

Text Generation • 754B • Updated 16 days ago • 478k • • 2.07k

How to crack the CTET 2026 exam in 7 days?

How to crack the CTET 2026 exam in 7 days?

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 665

Reinforcement Learning

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 238
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published Jan 13 • 150
Transformers in Reinforcement Learning: A Survey

Paper • 2307.05979 • Published Jul 12, 2023 • 1
Comparing DPO with IPO and KTO

Collection

A collection of chat models to explore the differences between three alignment techniques: DPO, IPO, and KTO. • 56 items • Updated Jan 8, 2025 • 32

Open-Source Foundations for Modern AI Systems

open-source libraries that form the infrastructure layer of modern AI systems, spanning model dev, retrieval, orchestration, evaluation, and MLOPS.

A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale

Paper • 2309.06497 • Published Sep 12, 2023 • 7
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 628
Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 251
Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 951

papers-most-view-by-month

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 304
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19, 2025 • 233
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 550
Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 513

collections TEST Org

collections TEST ORG

PrivateXXOrganization/jjjjjj

Updated Jan 2
Qwen/Qwen-Image-Layered

Image-Text-to-Image • Updated Dec 19, 2025 • 23.2k • 1.05k
zai-org/GLM-4.7

Text Generation • 358B • Updated Jan 29 • 117k • • 2.03k
bigai/TongSIM-Asset

Updated Dec 29, 2025 • 4.49k • 215

TradingAgents: Multi-Agents LLM Financial Trading Framework

Paper • 2412.20138 • Published Dec 28, 2024 • 48
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 665
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 304
Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 157

Previous
1
2
3
4
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs