WildDet3D Collection This is the collection of WildDet3D artifacts, including demos, model checkpoints and data. https://github.com/allenai/WildDet3D • 7 items • Updated 5 days ago • 13
view article Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs 5 days ago • 40
Falcon Perception Collection Falcon-Perception and Falcon-OCR model: early-fusion, natively multimodal, dense Autoregressive Transformer models. • 5 items • Updated 7 days ago • 13
view article Article SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation 20 days ago • 16
view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face Feb 11, 2025 • 116
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 11 days ago • 822
SigLino: Vision Foundation Models (SigLIP2 + DINOv3) Collection Vision encoders distilled from DINOv3 and SigLIP2 (MoE & Dense). CVPR 2026. • 6 items • Updated 2 days ago • 16
view article Article Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline about 1 month ago • 39
view article Article TRL v1.0: Post-Training Library Built to Move with the Field +2 13 days ago • 47
view article Article How I contributed a new model to the Transformers library using Codex 13 days ago • 45
MolmoWeb Collection This is the collection of MolmoWeb artifacts, including model checkpoints and data. • 6 items • Updated 2 days ago • 24
view article Article Introducing Cohere-transcribe: state-of-the-art speech recognition 17 days ago • 36