Commit History

fix(v1): explicit knowledge-cutoff disclaimer in system prompt
67580ed
Running

axentx-dev-bot commited on

fix(zerogpu): switch to gr.Blocks + add /run/synth_batch endpoint
cca295a

Ashira Pitchayapakayakul commited on

fix: minimal gr.Interface (ChatInterface fails on gradio 4.44 force-pin)
c0830e0
verified

ashirato commited on

fix: 14B + bnb int4 (AWQ build failed; bnb proven, no compile)
2686e51
verified

ashirato commited on

feat: Qwen2.5-Coder-32B AWQ (biggest fits A10G 24GB)
109d31d
verified

ashirato commited on

fix: revert to 7B INT4 + apply Surrogate-1 v1 LoRA (REAL Surrogate-1)
563aaca
verified

ashirato commited on

feat: upgrade to Qwen2.5-Coder-14B + INT4 quant (4x more capable)
2b68803
verified

ashirato commited on

fix: switch to Qwen2.5-Coder-3B (faster cold load, fits A10G in <60s)
d45a2f7
verified

ashirato commited on

fix: lazy load (avoid Space init OOM with 7B+LoRA)
0535836
verified

ashirato commited on

fix: use gr.ChatInterface (simpler sig, avoids _json_schema bug)
0367d10
verified

ashirato commited on

initial: Qwen2.5-Coder-7B + Surrogate-1 v1 LoRA on ZeroGPU A10G
fe83bcf
verified

ashirato commited on