Ashiedu/Synesthesia
Image-Text-to-Text
Transformers
TF-Keras
ONNX
Safetensors
gemma3n
automatic-speech-recognition
automatic-speech-translation
audio-text-to-text
video-text-to-text
conversational
arxiv: 17 papers
License: gemma
main: Synesthesia/musiccoca (1.1 kB)
2 contributors
History: 1 commit
Ashiedu  sync: push ML models as-is 2026-04-03  5ecf91a (verified)  16 days ago
File                                     Scan  Size       Last commit                            Updated
musiccoca_audio_encoder_audio_meta.json  Safe  318 Bytes  sync: push ML models as-is 2026-04-03  16 days ago
musiccoca_rvq_quantizer_rvq_meta.json    Safe  316 Bytes  sync: push ML models as-is 2026-04-03  16 days ago
musiccoca_text_encoder_text_meta.json    Safe  470 Bytes  sync: push ML models as-is 2026-04-03  16 days ago