David K. Johansson's picture

9

David K. Johansson

dkjo8

·

https://www.dkjo8.xyz/

AI & ML interests

Energy based reasoning, gradient based optimization, world models

Recent Activity

reacted to BibbyResearch's post with 🔥 5 days ago

Bibby AI is now #3 rank and live on ProductHunt. It's time to support the AI co-author for research you wish existed. Producthunt - https://www.producthunt.com/products/bibby-ai Please upvote, comment, give critical feedback. The research community has shown immense trust.

published a Space 5 days ago

PolishedSnow/README

upvoted a paper 5 days ago

How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings

View all activity

Organizations

upvoted 9 papers 5 days ago

How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings

Paper • 2604.04323 • Published 8 days ago • 38

Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning

Paper • 2604.05404 • Published 7 days ago • 41

ACES: Who Tests the Tests? Leave-One-Out AUC Consistency for Code Generation

Paper • 2604.03922 • Published 9 days ago • 52

GBQA: A Game Benchmark for Evaluating LLMs as Quality Assurance Engineers

Paper • 2604.02648 • Published 11 days ago • 44

ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement

Paper • 2604.01591 • Published 12 days ago • 38

Learning to Retrieve from Agent Trajectories

Paper • 2604.04949 • Published 15 days ago • 67

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Paper • 2604.06132 • Published 7 days ago • 111

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 8 days ago • 230

Reasoning as Energy Minimization over Structured Latent Trajectories

Paper • 2603.28248 • Published 15 days ago • 1