jina-embeddings-v4 Collection Universal Embeddings for Multimodal Multilingual Retrieval • 11 items • Updated 22 days ago • 4
TADA Collection TADA: A Generative Framework for Speech Modeling via Text-Acoustic Dual Alignment | https://huggingface.co/papers/2602.23068 • 7 items • Updated 22 days ago • 70
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated Feb 26 • 96
jina-embeddings-v5-text Collection Our 5th-gen embeddings: two lightweight multilingual models with SOTA performance in retrieval, matching, clustering, and classification. • 29 items • Updated Feb 27 • 38
Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch Paper • 2602.03183 • Published Feb 3 • 11
LightOnOCR-2 🦉 Collection LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated 8 days ago • 23
Waypoint-1 Collection The first real time diffusion world model designed for consumer hardware • 3 items • Updated Jan 30 • 8
Baichuan-M3 Collection Modeling Clinical Inquiry for Reliable Medical Decision-Making • 6 items • Updated Mar 2 • 17
Health AI Developer Foundations (HAI-DEF) Collection Groups models released for use in health AI by Google. Read more about HAI-DEF at http://goo.gle/hai-def • 22 items • Updated Mar 12 • 210
MedCalc-Bench Collection Evaluating Large Language Models for Medical Calculations • 4 items • Updated Mar 10 • 2
Teacher Logits Collection Logits captured from large models to act as the teacher for distillation • 3 items • Updated Dec 15, 2025 • 11
VulnLLM-R Collection Data and model for VulnLLM-R: Specialized Reasoning LLM with Agent Scaffold for Vulnerability Detection • 9 items • Updated Dec 17, 2025 • 9
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated about 5 hours ago • 267