Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Ashiedu
/
Synesthesia
like
0
Image-Text-to-Text
Transformers
TF-Keras
ONNX
Safetensors
gemma3n
automatic-speech-recognition
automatic-speech-translation
audio-text-to-text
video-text-to-text
conversational
arxiv:
17 papers
License:
gemma
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Synesthesia
/
processor_config.json
Ashiedu
(Trained with Unsloth)
4951729
verified
15 days ago
raw
Copy download link
history
blame
contribute
delete
Safe
98 Bytes
{
"audio_seq_length"
:
188
,
"image_seq_length"
:
256
,
"processor_class"
:
"Gemma3nProcessor"
}