explorations - a andreydelpozo Collection

andreydelpozo 's Collections

explorations

updated Mar 1

random things

teknium/OpenHermes-2.5-Mistral-7B

Text Generation • Updated Feb 19, 2024 • 141k • 895
ByteDance/SDXL-Lightning

Text-to-Image • Updated Apr 3, 2024 • 158k • • 2.14k
google/gemma-7b-it

Text Generation • 9B • Updated Aug 14, 2024 • 44.5k • 1.24k
dphn/dolphin-2.2.1-mistral-7b

Text Generation • 7B • Updated May 20, 2024 • 1.03k • 198
dphn/dolphin-2.5-mixtral-8x7b

Text Generation • 47B • Updated May 21, 2024 • 2.63k • 1.24k
dphn/dolphin-2.6-mistral-7b-dpo-laser

Text Generation • 7B • Updated Mar 4, 2024 • 165 • 120
ise-uiuc/Magicoder-Evol-Instruct-110K

Viewer • Updated Dec 28, 2023 • 111k • 5.51k • 174
Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation

Paper • 2408.15239 • Published Aug 27, 2024 • 30
WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization

Paper • 2507.15061 • Published Jul 20, 2025 • 60
Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation

Paper • 2510.01284 • Published Sep 30, 2025 • 37
OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot

Paper • 2510.06751 • Published Oct 8, 2025 • 22
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning

Paper • 2509.24372 • Published Sep 29, 2025 • 12
DINOv3

Paper • 2508.10104 • Published Aug 13, 2025 • 305
MATRIX: Mask Track Alignment for Interaction-aware Video Generation

Paper • 2510.07310 • Published Oct 8, 2025 • 36
Real-Time Object Detection Meets DINOv3

Paper • 2509.20787 • Published Sep 25, 2025 • 11
A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 193
A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 263
Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 162
T-LoRA: Single Image Diffusion Model Customization Without Overfitting

Paper • 2507.05964 • Published Jul 8, 2025 • 121
SingLoRA: Low Rank Adaptation Using a Single Matrix

Paper • 2507.05566 • Published Jul 8, 2025 • 116
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Paper • 2507.04009 • Published Jul 5, 2025 • 54
Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation

Paper • 2506.19852 • Published Jun 24, 2025 • 42
KV Cache Steering for Inducing Reasoning in Small Language Models

Paper • 2507.08799 • Published Jul 11, 2025 • 40
PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Paper • 2506.05573 • Published Jun 5, 2025 • 82
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published Jun 5, 2025 • 81
ComfyUI-R1: Exploring Reasoning Models for Workflow Generation

Paper • 2506.09790 • Published Jun 11, 2025 • 53
SpatialLM: Training Large Language Models for Structured Indoor Modeling

Paper • 2506.07491 • Published Jun 9, 2025 • 51
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 339
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14, 2025 • 99
Flow-GRPO: Training Flow Matching Models via Online RL

Paper • 2505.05470 • Published May 8, 2025 • 88
Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23, 2025 • 81
ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published May 7, 2025 • 65
dx8152/Qwen-Edit-2509-Multiple-angles

Image-to-Image • Updated Nov 28, 2025 • 87.8k • • 929
Qwen3-TTS Technical Report

Paper • 2601.15621 • Published Jan 22 • 74