Commit History

fix(v1): explicit knowledge-cutoff disclaimer in system prompt
67580ed

axentx-dev-bot committed on

fix(zerogpu): switch to gr.Blocks + add /run/synth_batch endpoint
cca295a

Ashira Pitchayapakayakul committed on

fix: minimal gr.Interface (ChatInterface fails on gradio 4.44 force-pin)
c0830e0
verified

ashirato committed on

bnb requirements
9c4bec1
verified

ashirato committed on

fix: 14B + bnb int4 (AWQ build failed; bnb proven, no compile)
2686e51
verified

ashirato committed on

autoawq instead of bitsandbytes
03211d1
verified

ashirato committed on

feat: Qwen2.5-Coder-32B AWQ (biggest fits A10G 24GB)
109d31d
verified

ashirato committed on

fix: revert to 7B INT4 + apply Surrogate-1 v1 LoRA (REAL Surrogate-1)
563aaca
verified

ashirato committed on

add bitsandbytes for INT4
b3d849d
verified

ashirato committed on

feat: upgrade to Qwen2.5-Coder-14B + INT4 quant (4x more capable)
2b68803
verified

ashirato committed on

fix: switch to Qwen2.5-Coder-3B (faster cold load, fits A10G in <60s)
d45a2f7
verified

ashirato committed on

fix: lazy load (avoid Space init OOM with 7B+LoRA)
0535836
verified

ashirato committed on

fix: use gr.ChatInterface (simpler sig, avoids _json_schema bug)
0367d10
verified

ashirato committed on

fix: pin huggingface_hub<0.26 (HfFolder), don't redeclare gradio (HF forces 4.44.0)
16a3ce3
verified

ashirato committed on

fix: pin gradio>=5.0 + huggingface_hub<0.30 (HfFolder removed in 0.30+)
dfc6374
verified

ashirato committed on

initial: Qwen2.5-Coder-7B + Surrogate-1 v1 LoRA on ZeroGPU A10G
fe83bcf
verified

ashirato committed on

initial commit
8bdeff6
verified

ashirato committed on