andreydelpozo 's Collections explorations
updated
teknium/OpenHermes-2.5-Mistral-7B
Text Generation
• Updated • 141k
• 895
Text-to-Image
• Updated • 158k
• • 2.14k
Text Generation
• 9B • Updated • 44.5k
• 1.24k
dphn/dolphin-2.2.1-mistral-7b
Text Generation
• 7B • Updated • 1.03k
• 198
dphn/dolphin-2.5-mixtral-8x7b
Text Generation
• 47B • Updated • 2.63k
• 1.24k
dphn/dolphin-2.6-mistral-7b-dpo-laser
Text Generation
• 7B • Updated • 165
• 120
ise-uiuc/Magicoder-Evol-Instruct-110K
Viewer
• Updated • 111k • 5.51k
• 174
Generative Inbetweening: Adapting Image-to-Video Models for Keyframe
Interpolation
Paper
• 2408.15239
• Published • 30
WebShaper: Agentically Data Synthesizing via Information-Seeking
Formalization
Paper
• 2507.15061
• Published • 60
Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation
Paper
• 2510.01284
• Published • 37
OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot
Paper
• 2510.06751
• Published • 22
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement
Learning
Paper
• 2509.24372
• Published • 12
Paper
• 2508.10104
• Published • 305
MATRIX: Mask Track Alignment for Interaction-aware Video Generation
Paper
• 2510.07310
• Published • 36
Real-Time Object Detection Meets DINOv3
Paper
• 2509.20787
• Published • 11
A Survey of Reinforcement Learning for Large Reasoning Models
Paper
• 2509.08827
• Published • 193
A Survey of Context Engineering for Large Language Models
Paper
• 2507.13334
• Published • 263
Scaling RL to Long Videos
Paper
• 2507.07966
• Published • 162
T-LoRA: Single Image Diffusion Model Customization Without Overfitting
Paper
• 2507.05964
• Published • 121
SingLoRA: Low Rank Adaptation Using a Single Matrix
Paper
• 2507.05566
• Published • 116
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM
Fine-Tuning Data from Unstructured Documents
Paper
• 2507.04009
• Published • 54
Radial Attention: O(nlog n) Sparse Attention with Energy Decay for
Long Video Generation
Paper
• 2506.19852
• Published • 42
KV Cache Steering for Inducing Reasoning in Small Language Models
Paper
• 2507.08799
• Published • 40
PartCrafter: Structured 3D Mesh Generation via Compositional Latent
Diffusion Transformers
Paper
• 2506.05573
• Published • 82
Qwen3 Embedding: Advancing Text Embedding and Reranking Through
Foundation Models
Paper
• 2506.05176
• Published • 81
ComfyUI-R1: Exploring Reasoning Models for Workflow Generation
Paper
• 2506.09790
• Published • 53
SpatialLM: Training Large Language Models for Structured Indoor Modeling
Paper
• 2506.07491
• Published • 51
Paper
• 2505.09388
• Published • 339
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture,
Training and Dataset
Paper
• 2505.09568
• Published • 99
Flow-GRPO: Training Flow Matching Models via Online RL
Paper
• 2505.05470
• Published • 88
Distilling LLM Agent into Small Models with Retrieval and Code Tools
Paper
• 2505.17612
• Published • 81
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
Paper
• 2505.04588
• Published • 65
dx8152/Qwen-Edit-2509-Multiple-angles
Image-to-Image
• Updated • 87.8k
• • 929
Qwen3-TTS Technical Report
Paper
• 2601.15621
• Published • 74