In a Training Loop 🔄

Akshat Dwivedi

Akshat-Dwivedi · Akshat-Dwivedi-52

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

Qwen/Qwen3.6-35B-A3B

liked a model 4 days ago

Rta-AILabs/Nandi-Mini-150M-Instruct

liked a model 5 days ago

opendatalab/MinerU2.5-Pro-2604-1.2B

Organizations

upvoted a paper 10 days ago

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

Paper • 2604.05091 • Published 13 days ago • 45

upvoted a collection 30 days ago

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 15 items • Updated 4 days ago • 269

upvoted 2 collections about 2 months ago

pplx-embed

Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated Feb 26 • 96

Embeddings datasets ⚡️

This collection gathers datasets for embeddings pre-training and fine-tuning. • 19 items • Updated 12 days ago • 4

upvoted a collection 2 months ago

SWE-bench

SWE-bench is a benchmark for evaluating Language Models and AI Systems on their ability to resolve real-world GitHub Issues. • 4 items • Updated Mar 8, 2025 • 9

upvoted a collection 3 months ago

MMFineReason

High-quality STEM reasoning dataset for Multimodal LLM post-training. • 8 items • Updated 18 days ago • 22

upvoted a paper 3 months ago

Step-DeepResearch Technical Report

Paper • 2512.20491 • Published Dec 23, 2025 • 87

upvoted an article 4 months ago

The Optimal Architecture for Small Language Models

Article • Dec 26, 2025 • 120

upvoted a collection 4 months ago

SYNTHETIC-2

12 items • Updated Oct 7, 2025 • 20

upvoted a paper 4 months ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published Dec 8, 2025 • 79

upvoted a collection 4 months ago

OpenThinker-Agent

5 items • Updated Dec 6, 2025 • 8

upvoted a collection 5 months ago

Ministral 3

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 165