3 27 6

Makise Kurisu

kurisu0306

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

upvoted a paper 13 days ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

upvoted a paper 29 days ago

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

View all activity

Organizations

None yet

upvoted a paper 9 days ago

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 10 days ago • 200

upvoted a paper 13 days ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published 20 days ago • 352

upvoted a paper 29 days ago

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

Paper • 2602.13367 • Published Feb 13 • 35

liked a model 29 days ago

Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated 22 days ago • 336k • • 1.04k

upvoted a paper about 1 month ago

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Paper • 2603.12201 • Published Mar 12 • 53

upvoted a paper about 2 months ago

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published Feb 9 • 263

upvoted 6 papers 2 months ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published Feb 4 • 99

daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently

Paper • 2602.02619 • Published Feb 2 • 53

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Paper • 2602.03048 • Published Feb 3 • 32

upvoted 2 papers 3 months ago

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published Jan 23 • 180

Not All Correct Answers Are Equal: Why Your Distillation Source Matters

Paper • 2505.14464 • Published May 20, 2025 • 10

liked 2 models 3 months ago

Emperorizzis/ASTRA-14B-Thinking-v1

Text Generation • Updated Feb 2 • 11 • 9

Emperorizzis/ASTRA-32B-Thinking-v1

Text Generation • 33B • Updated Feb 2 • 22 • 7

upvoted 2 collections 3 months ago

ASTRA Dataset

Collection

2 items • Updated Jan 21 • 5

ASTRA Models

Collection

2 items • Updated Jan 22 • 2

upvoted 2 papers 3 months ago

Toward Efficient Agents: Memory, Tool learning, and Planning

Paper • 2601.14192 • Published Jan 20 • 57

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

Paper • 2601.11655 • Published Jan 15 • 63

Makise Kurisu

AI & ML interests

Recent Activity

Organizations

kurisu0306's activity