Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

omar-ah
/

ViL-DLM-0.6B

Image-Text-to-Text

vision-language

masked-diffusion

Model card Files Files and versions

1.95 MB

Ctrl+K

Ctrl+K

1 contributor

History: 27 commits

omar-ah's picture

Remove final model artifact from repo

516d0c0 13 days ago

code
Add timestep-aware sparse KD weighting 13 days ago
external
Add Vision Transformer and utility functions for sequence processing 14 days ago
.gitattributes

1.52 kB
Remove final model artifact from repo 13 days ago
.gitignore

38 Bytes
Add Vision Transformer and utility functions for sequence processing 14 days ago
README.md

8.62 kB
Add timestep-aware sparse KD weighting 13 days ago
pyproject.toml

720 Bytes
Implement stage-aware real-run training pipeline 14 days ago
train_production.py

172 Bytes
Update model configuration and training scripts with new vision backbone support and dependencies 15 days ago