MilkyMikey1104 's Collections
Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for
Large-Scale Speech Generation
Paper
• 2407.05361
• Published • 2
allenai/pixmo-ask-model-anything
Viewer
• Updated • 162k • 267
• 5
Viewer
• Updated • 272k • 145
• 9
Viewer
• Updated • 717k • 1.17k
• 39
Viewer
• Updated • 195k • 1.49k
• 30
google/paligemma2-3b-pt-896
Image-Text-to-Text
• 3B • Updated • 985
• 26
FunAudioLLM/SenseVoiceSmall
Updated • 9.82k
• 380
Text Generation
• 3B • Updated • 516
• 891
Viewer
• Updated • 84.1k • 33
• 3
laion/laion-audio-preview
Viewer
• Updated • 4.15M • 870
• 11
laion/relaion-high-resolution
Viewer
• Updated • 166M • 723
• 116
Text Generation
• 8B • Updated • 2.96k
• • 712
product-science/xlam-function-calling-60k-raw-augmented
Viewer
• Updated • 89.8k • 105
• 2
Trust but Verify: Programmatic VLM Evaluation in the Wild
Paper
• 2410.13121
• Published • 3
Salesforce/xlam-function-calling-60k
Viewer
• Updated • 60k • 10.2k
• 599