SLED
Generating Benchmarks for Factuality Evaluation of Language Models • Paper • 2307.06908 • Published Jul 13, 2023 • 8
text generation
microsoft/biogpt • Text Generation • Updated Feb 3, 2023 • 163k • 301
microsoft/MediPhi • Text Generation • 4B • Updated Dec 15, 2025 • 2.89k • 20
google/medgemma-4b-it • Image-Text-to-Text • Updated Oct 28, 2025 • 202k • 941
mistralai/Mistral-7B-v0.3 • 7B • Updated Jul 24, 2025 • 313k • 572