Ashish Mishra's picture

Ashish Mishra

ashbuilds

·

ashbuilds

AI & ML interests

None yet

Recent Activity

liked a model about 20 hours ago

LiquidAI/LFM2.5-VL-450M

liked a model 5 days ago

google/gemma-4-31B-it

upvoted a paper 7 days ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

View all activity

Organizations

None yet

upvoted a paper 7 days ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published 9 days ago • 106

upvoted a paper about 2 months ago

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Paper • 2602.12099 • Published Feb 12 • 61

upvoted a paper 4 months ago

WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion

Paper • 2512.19678 • Published Dec 22, 2025 • 31

upvoted an article 4 months ago

Article

Codex is Open Sourcing AI models

Dec 11, 2025

•

80

upvoted a paper 4 months ago

What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards

Paper • 2512.00425 • Published Nov 29, 2025 • 53

upvoted an article 7 months ago

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

+3

Sep 23, 2025

•

138

upvoted a collection 7 months ago

Granite Docling Models

Models for parsing complex PDFs and structured documents, designed to complement Docling. • 4 items • Updated 13 days ago • 60

upvoted an article 7 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

+5

Sep 11, 2025

•

186

upvoted an article 9 months ago

Article

Creating custom kernels for the AMD MI300

Jul 9, 2025

•

54

upvoted a collection 10 months ago

Holo1

Vision-Language Action Model for use in Surfer-H web navigation agent • 6 items • Updated Jun 10, 2025 • 49

upvoted an article 10 months ago

Article

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

Jun 3, 2025

•

71

upvoted a collection 12 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 10 items • Updated Mar 2 • 561

upvoted a paper 12 months ago

CoRAG: Collaborative Retrieval-Augmented Generation

Paper • 2504.01883 • Published Apr 2, 2025 • 9

upvoted 4 papers about 1 year ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published Mar 6, 2025 • 72

InterFeedback: Unveiling Interactive Intelligence of Large Multimodal Models via Human Feedback

Paper • 2502.15027 • Published Feb 20, 2025 • 7

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9, 2025 • 55

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Paper • 2501.06282 • Published Jan 10, 2025 • 53

upvoted a collection over 1 year ago

DeepSeek-V3

4 items • Updated Nov 27, 2025 • 284

upvoted a paper over 1 year ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 48