Puristan_Spaces Runtime error 3 Indic ParlerTTS Urdu 🦀 3 IndicParler_TTS for Urdu_Punjabi & Sindhi
MultiModal Models Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023 • 95
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023 • 95
Pspets multimodal GPT-4V(ision) is a Generalist Web Agent, if Grounded Paper • 2401.01614 • Published Jan 3, 2024 • 22
GPT-4V(ision) is a Generalist Web Agent, if Grounded Paper • 2401.01614 • Published Jan 3, 2024 • 22
Document Understanding Models vidore/colpali Visual Document Retrieval • Updated Nov 24, 2025 • 4.15k • 476
Puristan_Spaces Runtime error 3 Indic ParlerTTS Urdu 🦀 3 IndicParler_TTS for Urdu_Punjabi & Sindhi
Pspets multimodal GPT-4V(ision) is a Generalist Web Agent, if Grounded Paper • 2401.01614 • Published Jan 3, 2024 • 22
GPT-4V(ision) is a Generalist Web Agent, if Grounded Paper • 2401.01614 • Published Jan 3, 2024 • 22
MultiModal Models Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023 • 95
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper • 2311.06242 • Published Nov 10, 2023 • 95
Document Understanding Models vidore/colpali Visual Document Retrieval • Updated Nov 24, 2025 • 4.15k • 476