Curated models for AI infrastructure, LLM deployment, and edge computing. Optimized for NVIDIA DGX Spark and Docker Swarm clusters.
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 1.15M • • 2k -
sentence-transformers/all-MiniLM-L6-v2
Sentence Similarity • 22.7M • Updated • 197M • • 4.68k -
BAAI/bge-large-en-v1.5
Feature Extraction • 0.3B • Updated • 8.69M • • 648 -
meta-llama/Llama-3.3-70B-Instruct
Text Generation • 71B • Updated • 449k • • 2.7k