
Paul O’Mahony

pauleta

AI & ML interests

None yet

Recent Activity

upvoted an article 19 days ago

KV Caching Explained: Optimizing Transformer Inference Efficiency

upvoted an article 19 days ago

The ML Engineer's Guide to Protein AI

upvoted a paper 25 days ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey


Organizations

None yet

upvoted 2 articles 19 days ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025 • 293

Article

The ML Engineer's Guide to Protein AI

Mar 3

upvoted 3 papers 25 days ago

upvoted a paper 2 months ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 202

updated a model 2 months ago

pauleta/qwen-aws_16bit

Text Generation • 4B • Updated Jan 29 • 1

published a model 2 months ago

pauleta/qwen-aws_16bit

Text Generation • 4B • Updated Jan 29 • 1

upvoted 3 papers 3 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 230

Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling

Paper • 2512.23959 • Published Dec 30, 2025 • 111

Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

Paper • 2601.05432 • Published Jan 8 • 170

upvoted an article 3 months ago

Article

OVHcloud on Hugging Face Inference Providers 🔥

Nov 24, 2025

upvoted 2 papers 3 months ago

Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting

Paper • 2601.02151 • Published Jan 5 • 113

InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields

Paper • 2601.03252 • Published Jan 6 • 104

liked a dataset 3 months ago

wikimedia/wikipedia

Viewer • Updated Jan 9, 2024 • 61.6M • 103k • 1.19k

updated a model 5 months ago

pauleta/qwen-instruct-4B-4bit

Text Generation • 4B • Updated Nov 22, 2025 • 15

published a model 5 months ago

pauleta/qwen-instruct-4B-4bit

Text Generation • 4B • Updated Nov 22, 2025 • 15

liked a dataset 6 months ago

interstellarninja/hermes_reasoning_tool_use

Viewer • Updated Dec 26, 2025 • 51k • 508 • 159

upvoted a paper 6 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 513

liked a model 6 months ago

Manojb/Qwen3-4B-toolcalling-gguf-codex

Text Generation • 4B • Updated Sep 21, 2025 • 6.06k • 50
