50 1

Fadhil Akbar Cariearsa

fadhilakbar

https://fadhil.dev

AI & ML interests

None yet

Recent Activity

upvoted a paper about 23 hours ago

FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios

upvoted a paper about 23 hours ago

WildDet3D: Scaling Promptable 3D Detection in the Wild

updated a dataset about 23 hours ago

fadhilakbar/daily_streak

View all activity

Organizations

None yet

upvoted 2 papers about 23 hours ago

FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios

Paper • 2604.07413 • Published 6 days ago • 85

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published 5 days ago • 221

updated a dataset about 23 hours ago

fadhilakbar/daily_streak

Viewer • Updated about 23 hours ago • 126 • 151 • 1

upvoted 2 papers 2 days ago

MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping

Paper • 2604.08364 • Published 5 days ago • 92

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published 5 days ago • 249

upvoted an article 3 days ago

Article

Multimodal Embedding & Reranker Models with Sentence Transformers

5 days ago

•

upvoted 2 papers 3 days ago

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published 6 days ago • 162

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 6 days ago • 306

upvoted 2 papers 4 days ago

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published 5 days ago • 271

When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models

Paper • 2604.08546 • Published 5 days ago • 109

upvoted an article 5 days ago

Article

ALTK‑Evolve: On‑the‑Job Learning for AI Agents

6 days ago

•

upvoted 2 papers 5 days ago

Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

Paper • 2604.04746 • Published 6 days ago • 67

RAGEN-2: Reasoning Collapse in Agentic RL

Paper • 2604.06268 • Published 7 days ago • 60

upvoted 2 papers 6 days ago

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Paper • 2604.06132 • Published 7 days ago • 113

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 8 days ago • 231

upvoted 2 papers 7 days ago

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published 8 days ago • 116

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 8 days ago • 200

upvoted 2 papers 8 days ago

Self-Distilled RLVR

Paper • 2604.03128 • Published 11 days ago • 158

A Simple Baseline for Streaming Video Understanding

Paper • 2604.02317 • Published 12 days ago • 72

upvoted an article 9 days ago

Article

TRL v1.0: Post-Training Library Built to Move with the Field

14 days ago

•

Fadhil Akbar Cariearsa

AI & ML interests

Recent Activity

Organizations

fadhilakbar's activity

Multimodal Embedding & Reranker Models with Sentence Transformers

ALTK‑Evolve: On‑the‑Job Learning for AI Agents

TRL v1.0: Post-Training Library Built to Move with the Field