Spaces:
Running on Zero
Running on Zero
axentx-dev-bot commited on
Commit ·
67580ed
1
Parent(s): cca295a
fix(v1): explicit knowledge-cutoff disclaimer in system prompt
Browse filesUser reported v1 confidently denied AWS ap-southeast-7 (Bangkok region,
launched Jan 2025 — AFTER Qwen2.5-Coder Sept-2024 cutoff). Old prompt
'Say IDK rather than confabulate' wasn't strong enough; the model
asserted wrong info anyway.
New prompt:
- explicit knowledge cutoff = Sept 2024
- explicit instruction: don't deny post-2024 things, trust user
- Thai-language IDK template
Real fix is v1.5 (training on Kaggle, Qwen3.5-9B + Phase 88 anti-
confabulation Be-Your-Own-Red-Teamer).
app.py
CHANGED
|
@@ -26,8 +26,22 @@ BASE_MODEL = os.environ.get("BASE_MODEL", "Qwen/Qwen2.5-Coder-7B-Instruct")
|
|
| 26 |
LORA_REPO = os.environ.get("LORA_REPO", "axentx/surrogate-1-coder-7b-lora-v1")
|
| 27 |
HF_TOKEN = os.environ.get("HF_TOKEN", "")
|
| 28 |
|
| 29 |
-
SYSTEM = (
|
| 30 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 31 |
|
| 32 |
DOMAIN_HINTS = {
|
| 33 |
"code-python": "Python coding tasks, idiomatic, type-hinted",
|
|
|
|
| 26 |
LORA_REPO = os.environ.get("LORA_REPO", "axentx/surrogate-1-coder-7b-lora-v1")
|
| 27 |
HF_TOKEN = os.environ.get("HF_TOKEN", "")
|
| 28 |
|
| 29 |
+
SYSTEM = (
|
| 30 |
+
"You are Surrogate-1 v1 (Qwen2.5-Coder-7B + LoRA), expert DevSecOps + "
|
| 31 |
+
"SRE + coding agent.\n\n"
|
| 32 |
+
"CRITICAL — knowledge cutoff: your training data ends Sept 2024. "
|
| 33 |
+
"Anything launched AFTER Sept 2024 (cloud regions, framework versions, "
|
| 34 |
+
"API changes, model releases) is OUTSIDE your knowledge. When the user "
|
| 35 |
+
"asks about post-2024 facts, ALWAYS say:\n"
|
| 36 |
+
" 'ผมไม่แน่ใจครับ — knowledge cutoff Sept 2024. โปรดเช็ค official docs.'\n"
|
| 37 |
+
"Do NOT deny the existence of newer things — the user almost certainly "
|
| 38 |
+
"knows more than you about post-Sept-2024 changes. If user says X "
|
| 39 |
+
"exists and X sounds plausible (e.g. AWS region named ap-southeast-N, "
|
| 40 |
+
"a new model release, a new framework version), TRUST THE USER, don't "
|
| 41 |
+
"argue. Reply with what you know about adjacent context.\n\n"
|
| 42 |
+
"Cite real APIs only. Say IDK rather than confabulate. When in doubt, "
|
| 43 |
+
"ask the user to verify rather than asserting wrong info."
|
| 44 |
+
)
|
| 45 |
|
| 46 |
DOMAIN_HINTS = {
|
| 47 |
"code-python": "Python coding tasks, idiomatic, type-hinted",
|