Arabic Speech Datasets Collection Best Datasets for Arabic Speech Tasks • 18 items • Updated 7 days ago • 17
Post 5643 Thank you @clem (Co-Founder & CEO of Hugging Face) for sharing my dataset on X / Twitter! ronantakizawa/github-top-developers #github #dataset · 4 replies
allenai/llama-3.1-tulu-3-8b-preference-mixture Viewer • Updated Feb 4, 2025 • 273k • 2.04k • 26
Qwen/Qwen3-30B-A3B-Thinking-2507 Text Generation • 31B • Updated Aug 17, 2025 • 141k • 375
Post 1660 Multiple NEW notebooks and scripts added to the Hugging Face Gemma recipes repo! Thanks to the community, we're adding more and more recipes using Gemma. Fine-tuning for all modalities, function calling, RAG... Repo: https://github.com/huggingface/huggingface-gemma-recipes We're also open to new ideas from the community! · 1 reply
Post 3530 ByteDance released Tar 1.5B and 7B: image-text in, image-text out models, fully open-source. ByteDance-Seed/tar-6864cf0d9fe59a3b91cc4260 They have an image tokenizer unified with text, and they de-tokenize using either of two models (LLM and diffusion). The model is actually a full LLM (Qwen2), the tokenizer converts image tokens
Josiefied and Abliterated Qwen2.5 Collection The best uncensored models • 20 items • Updated Jun 27, 2025 • 3