axentx-dev-bot commited on
Commit
67580ed
·
1 Parent(s): cca295a

fix(v1): explicit knowledge-cutoff disclaimer in system prompt

Browse files

User reported v1 confidently denied AWS ap-southeast-7 (Bangkok region,
launched Jan 2025 — AFTER Qwen2.5-Coder Sept-2024 cutoff). Old prompt
'Say IDK rather than confabulate' wasn't strong enough; the model
asserted wrong info anyway.

New prompt:
- explicit knowledge cutoff = Sept 2024
- explicit instruction: don't deny post-2024 things, trust user
- Thai-language IDK template

Real fix is v1.5 (training on Kaggle, Qwen3.5-9B + Phase 88 anti-
confabulation Be-Your-Own-Red-Teamer).

Files changed (1) hide show
  1. app.py +16 -2
app.py CHANGED
@@ -26,8 +26,22 @@ BASE_MODEL = os.environ.get("BASE_MODEL", "Qwen/Qwen2.5-Coder-7B-Instruct")
26
  LORA_REPO = os.environ.get("LORA_REPO", "axentx/surrogate-1-coder-7b-lora-v1")
27
  HF_TOKEN = os.environ.get("HF_TOKEN", "")
28
 
29
- SYSTEM = ("You are Surrogate-1, expert DevSecOps + SRE + coding agent. "
30
- "Cite real APIs only. Say IDK rather than confabulate.")
 
 
 
 
 
 
 
 
 
 
 
 
 
 
31
 
32
  DOMAIN_HINTS = {
33
  "code-python": "Python coding tasks, idiomatic, type-hinted",
 
26
  LORA_REPO = os.environ.get("LORA_REPO", "axentx/surrogate-1-coder-7b-lora-v1")
27
  HF_TOKEN = os.environ.get("HF_TOKEN", "")
28
 
29
+ SYSTEM = (
30
+ "You are Surrogate-1 v1 (Qwen2.5-Coder-7B + LoRA), expert DevSecOps + "
31
+ "SRE + coding agent.\n\n"
32
+ "CRITICAL — knowledge cutoff: your training data ends Sept 2024. "
33
+ "Anything launched AFTER Sept 2024 (cloud regions, framework versions, "
34
+ "API changes, model releases) is OUTSIDE your knowledge. When the user "
35
+ "asks about post-2024 facts, ALWAYS say:\n"
36
+ " 'ผมไม่แน่ใจครับ — knowledge cutoff Sept 2024. โปรดเช็ค official docs.'\n"
37
+ "Do NOT deny the existence of newer things — the user almost certainly "
38
+ "knows more than you about post-Sept-2024 changes. If user says X "
39
+ "exists and X sounds plausible (e.g. AWS region named ap-southeast-N, "
40
+ "a new model release, a new framework version), TRUST THE USER, don't "
41
+ "argue. Reply with what you know about adjacent context.\n\n"
42
+ "Cite real APIs only. Say IDK rather than confabulate. When in doubt, "
43
+ "ask the user to verify rather than asserting wrong info."
44
+ )
45
 
46
  DOMAIN_HINTS = {
47
  "code-python": "Python coding tasks, idiomatic, type-hinted",