The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning
Paper • 2411.11758 • Published
Configuration Parsing Warning:In adapter_config.json: "peft.task_type" must be a string
This model is a fine-tuned version of llava-hf/llava-1.5-13b-hf on an unknown dataset.
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Base model
llava-hf/llava-1.5-13b-hf