fix(v1): explicit knowledge-cutoff disclaimer in system prompt 67580ed Running axentx-dev-bot commited on 7 days ago
fix(zerogpu): switch to gr.Blocks + add /run/synth_batch endpoint cca295a Ashira Pitchayapakayakul commited on 8 days ago
fix: minimal gr.Interface (ChatInterface fails on gradio 4.44 force-pin) c0830e0 verified ashirato commited on 8 days ago
fix: 14B + bnb int4 (AWQ build failed; bnb proven, no compile) 2686e51 verified ashirato commited on 8 days ago
feat: Qwen2.5-Coder-32B AWQ (biggest fits A10G 24GB) 109d31d verified ashirato commited on 8 days ago
fix: revert to 7B INT4 + apply Surrogate-1 v1 LoRA (REAL Surrogate-1) 563aaca verified ashirato commited on 8 days ago
feat: upgrade to Qwen2.5-Coder-14B + INT4 quant (4x more capable) 2b68803 verified ashirato commited on 8 days ago
fix: switch to Qwen2.5-Coder-3B (faster cold load, fits A10G in <60s) d45a2f7 verified ashirato commited on 8 days ago
fix: use gr.ChatInterface (simpler sig, avoids _json_schema bug) 0367d10 verified ashirato commited on 8 days ago
initial: Qwen2.5-Coder-7B + Surrogate-1 v1 LoRA on ZeroGPU A10G fe83bcf verified ashirato commited on 8 days ago