Ujjwal Tyagi's picture

Building on HF

Ujjwal Tyagi

Ujjwal-Tyagi

·

AI & ML interests

Chief Scientist at Shirova AI, focused on advancing open-source AI, Experienced in LLM fine-tuning, model architecture, and research, with a strong interest in building scalable and efficient models

Recent Activity

liked a dataset about 17 hours ago

kaiyuyue/sphere-encoder-fid-artifacts

upvoted a collection about 17 hours ago

upvoted a paper about 18 hours ago

Seedance 2.0: Advancing Video Generation for World Complexity

View all activity

Organizations

upvoted a collection about 17 hours ago

image

456 items • Updated Mar 9 • 10

upvoted 8 papers about 18 hours ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published 4 days ago • 136

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published 10 days ago • 278

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published 11 days ago • 316

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 29 days ago • 338

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published 20 days ago • 340

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published 23 days ago • 354

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published 16 days ago • 361

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published 17 days ago • 480

upvoted a paper about 20 hours ago

Nucleus-Image: Sparse MoE for Image Generation

Paper • 2604.12163 • Published 5 days ago • 9

upvoted a changelog 3 days ago

Hugging Face Changelog

Introducing Kernels

3 days ago

• 137

upvoted a paper 5 days ago

Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch

Paper • 2311.03099 • Published Nov 6, 2023 • 32

upvoted an article 14 days ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

+5

17 days ago

•

864

upvoted a paper 16 days ago

Refusal in Language Models Is Mediated by a Single Direction

Paper • 2406.11717 • Published Jun 17, 2024 • 9

upvoted 4 papers 19 days ago

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published Mar 17 • 248

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 210

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 308

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 424

upvoted 2 collections 19 days ago

Coding Datasets

These are the best coding corpuses to make the LLM more stronger to surpass proprietary ones, basically it can be used in both post and pre training. • 15 items • Updated 19 days ago • 1

Distillation Datasets

These are the datasets that can be used to finetune small LLMs to reach the level of the closed models and large open LLMs • 41 items • Updated 16 days ago • 2