Dmaraj1258's Collections
Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance
Fields using Geometry-Guided Text-to-Image Diffusion Model
Paper
• 2309.03550
• Published • 12
Memory Augmented Language Models through Mixture of Word Experts
Paper
• 2311.10768
• Published • 19
GAIA: a benchmark for General AI Assistants
Paper
• 2311.12983
• Published • 247
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via
Blender-Oriented GPT Planning
Paper
• 2311.12631
• Published • 14
Analyzing and Improving the Training Dynamics of Diffusion Models
Paper
• 2312.02696
• Published • 33
Magicoder: Source Code Is All You Need
Paper
• 2312.02120
• Published • 82
Code Llama: Open Foundation Models for Code
Paper
• 2308.12950
• Published • 29
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric
Algorithm-System Co-Design
Paper
• 2401.14112
• Published • 20
Taiyi-Diffusion-XL: Advancing Bilingual Text-to-Image Generation with
Large Vision-Language Model Support
Paper
• 2401.14688
• Published • 14
EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models
Paper
• 2401.11739
• Published • 17
From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on
Generalizability, Trustworthiness and Causality through Four Modalities
Paper
• 2401.15071
• Published • 37
Transfer Learning for Text Diffusion Models
Paper
• 2401.17181
• Published • 17
Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with
Prototypical Embedding
Paper
• 2401.15708
• Published • 12
Transforming and Combining Rewards for Aligning Large Language Models
Paper
• 2402.00742
• Published • 12
Dolma: an Open Corpus of Three Trillion Tokens for Language Model
Pretraining Research
Paper
• 2402.00159
• Published • 65
AnimateLCM: Accelerating the Animation of Personalized Diffusion Models
and Adapters with Decoupled Consistency Learning
Paper
• 2402.00769
• Published • 22
StepCoder: Improve Code Generation with Reinforcement Learning from
Compiler Feedback
Paper
• 2402.01391
• Published • 43
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open
Language Models
Paper
• 2402.03300
• Published • 145
Training-Free Consistent Text-to-Image Generation
Paper
• 2402.03286
• Published • 67
V-IRL: Grounding Virtual Intelligence in Real Life
Paper
• 2402.03310
• Published • 16
ChemLLM: A Chemical Large Language Model
Paper
• 2402.06852
• Published • 32
Mixtures of Experts Unlock Parameter Scaling for Deep RL
Paper
• 2402.08609
• Published • 36
Linear Transformers with Learnable Kernel Functions are Better
In-Context Models
Paper
• 2402.10644
• Published • 81
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Paper
• 2402.12226
• Published • 45
VideoElevator: Elevating Video Generation Quality with Versatile
Text-to-Image Diffusion Models
Paper
• 2403.05438
• Published • 20
Language Models as Compilers: Simulating Pseudocode Execution Improves
Algorithmic Reasoning in Language Models
Paper
• 2404.02575
• Published • 50
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models
Paper
• 2407.09025
• Published • 140
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in
Virtual 3D Spaces
Paper
• 2501.12909
• Published • 74
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper
• 2502.14499
• Published • 195
DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Paper
• 2503.14476
• Published • 146
ELTEX: A Framework for Domain-Driven Synthetic Data Generation
Paper
• 2503.15055
• Published • 6
Neuro-Symbolic Query Compiler
Paper
• 2505.11932
• Published • 18
VideoREPA: Learning Physics for Video Generation through Relational
Alignment with Foundation Models
Paper
• 2505.23656
• Published • 25
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical
Reasoning
Paper
• 2506.09513
• Published • 103
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based
Reinforcement Learning
Paper
• 2507.05920
• Published • 12
π^3: Scalable Permutation-Equivariant Visual Geometry Learning
Paper
• 2507.13347
• Published • 67
GenCompositor: Generative Video Compositing with Diffusion Transformer
Paper
• 2509.02460
• Published • 26