Community Blog & Articles

guide, nlp, synthetic-data

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

114
March 20, 2024
galore, peft, llm

GaLore: Advancing Large Model Training on Consumer-grade Hardware

32
March 20, 2024
partnerships, hardware, nvidia

Easily Train Models with H100 GPUs on NVIDIA DGX Cloud

13
March 18, 2024
guide, quantization, transformers

Quanto: a PyTorch quantization backend for Optimum

45
March 18, 2024
nlp, intel, quantization

CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG

14
March 15, 2024
nlp, cv, data

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

13
March 15, 2024
leaderboard, collaboration, research

Introducing ConTextual: How well can your Multimodal model jointly reason over text and image in text-rich scenes?

4
March 5, 2024
community, data, collaboration

Data is better together: Enabling communities to collectively build better datasets together using Argilla and Hugging Face Spaces

8
March 4, 2024
habana, partnerships, hardware

Text-Generation Pipeline on Intel® Gaudi® 2 AI Accelerator

1
February 29, 2024
nlp, community, research

StarCoder2 and The Stack v2

9
February 28, 2024
leaderboard, arena, collaboration

TTS Arena: Benchmarking Text-to-Speech Models in the Wild

72
February 27, 2024
ethics, research, nlp

AI Watermarking 101: Tools and Techniques

27
February 26, 2024
nlp, community, research

Fine-Tuning Gemma Models in Hugging Face

44
February 23, 2024
leaderboard, guide, collaboration

Introducing the Red-Teaming Resistance Leaderboard

13
February 23, 2024