๐ The Ghost 8B Beta model outperforms prominent models such as Llama 3 8B Instruct, GPT 3.5 Turbo in the lc_winrate score. In addition, it also outperforms Claude 3 Opus, Claude 3 Sonnet, GPT-4, and Mistral Large when comparing the winrate score of AlpacaEval 2.0.
Ghost 8B Beta is a large language model developed with goals that include excellent multilingual support, superior knowledge capabilities, and cost-effectiveness. The model comes in two context length versions, 8k and 128k, along with multilingual function tools support by default. The languages supported are ๐บ๐ธ English, ๐ซ๐ท French, ๐ฎ๐น Italian, ๐ช๐ธ Spanish, ๐ต๐น Portuguese, ๐ฉ๐ช German, ๐ป๐ณ Vietnamese, ๐ฐ๐ท Korean and ๐จ๐ณ Chinese.
New open Vision Language Model by @Google: PaliGemma ๐๐ค
๐ Comes in 3B, pretrained, mix and fine-tuned models in 224, 448 and 896 resolution ๐งฉ Combination of Gemma 2B LLM and SigLIP image encoder ๐ค Supported in transformers
PaliGemma can do.. ๐งฉ Image segmentation and detection! ๐คฏ ๐ Detailed document understanding and reasoning ๐ Visual question answering, captioning and any other VLM task!
๐๐ค๐ New Research Alert - CVPR 2024! ๐๐ค๐ ๐ Title: RoHM: Robust Human Motion Reconstruction via Diffusion ๐
๐ Description: RoHM is a diffusion-based approach for robust 3D human motion reconstruction from monocular RGB(-D) videos, effectively handling noise and occlusions to produce complete and coherent motions. This method outperforms current techniques in various tasks and is faster at test time.
๐ฅ Authors: Siwei Zhang et al.
๐ Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA ๐บ๐ธ