Ashira Pitchayapakayakul committed on
Commit ff4b1b7 · 1 Parent(s): 52df1e3

fix: 4 dead agents — heredoc-pipe bug, missing CLI, token pool


Three runtime fixes catching 'silent agent death' patterns found in a log audit:

1. surrogate-self-ingest.sh — heredoc/pipe redirection conflict
The original 'sed | python3 - "$INDEX" <<PYEOF' was a black hole:
bash applies the heredoc redirection after the pipe is set up, so the
heredoc replaced python3's stdin. Because 'python3 -' takes its script
from stdin, python3 consumed the heredoc as the script and then hit EOF,
while sed's actual jsonl output had nowhere to go. The result was the
dead-quiet 'inserted=0 skipped_parse=0 skipped_empty=0' loop we saw 84 times.
Fix: write the inline python to a mktemp file, then pipe sed -> python3 file.
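
The failure mode reproduces in a few lines of bash. A minimal standalone demo (hypothetical, not from the repo) of the broken and fixed patterns:

```shell
# Broken pattern: the heredoc redirection overrides the pipe on python3's
# stdin, so 'python3 -' consumes the heredoc as its script and then sees
# EOF. The piped data never arrives.
broken=$(printf 'a\nb\nc\n' | python3 - <<'PYEOF'
import sys
print(sum(1 for _ in sys.stdin), end="")
PYEOF
)
echo "broken pattern saw $broken data lines"   # 0

# Fixed pattern: script lives in a temp file, so stdin stays bound to the pipe.
tmp=$(mktemp)
cat > "$tmp" <<'PYEOF'
import sys
print(sum(1 for _ in sys.stdin), end="")
PYEOF
fixed=$(printf 'a\nb\nc\n' | python3 "$tmp")
rm -f "$tmp"
echo "fixed pattern saw $fixed data lines"     # 3
```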

2. surrogate-research-loop.sh — $HOME/.local/bin/surrogate doesn't exist
The CLI binary was never installed on this Space, so every cycle was
failing silently in 0s ('research done in 0s' = no work). Replaced it
with direct OpenAI-compatible calls that fall back through Cerebras →
Groq → OpenRouter, using the keys already configured as Space secrets.
Also fixed the optional notify-discord.sh call to skip cleanly if
that script isn't installed either.
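
One detail of the direct-call replacement worth noting: the request body is built with python3's json.dumps rather than shell string interpolation, so quotes and newlines in the prompt cannot corrupt the JSON. A minimal standalone sketch of that pattern (model name and prompt here are placeholders):

```shell
# Build the JSON payload via json.dumps. The prompt is passed as argv,
# never spliced into the JSON text, so special characters are escaped safely.
PROMPT='He said "hi",
then left.'
body=$(python3 -c "import json,sys; print(json.dumps({'model':sys.argv[1],'messages':[{'role':'user','content':sys.argv[2]}],'max_tokens':4000}))" "demo-model" "$PROMPT")

# Round-trip check: parse the payload back and compare against the original.
roundtrip=$(printf '%s' "$body" | python3 -c "import json,sys; print(json.load(sys.stdin)['messages'][0]['content'] == sys.argv[1])" "$PROMPT")
echo "round-trip intact: $roundtrip"   # True
```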

3. GITHUB_TOKEN_POOL HF Space secret expanded
Was repeatedly hitting the 5000 search-req/hr ceiling. Pushed the 4 usable
PATs (arkashira + midnightcrisis-1 + midnightcrisis-2 + ashiradevops-alt;
ashirap excluded per user mandate), raising the ceiling to 20K req/hr.
Pushed via the HF API (recorded in operator history), not committed to the repo.
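
The arithmetic behind the pool: GitHub rate limits are per token, so rotating requests across 4 PATs multiplies the hourly budget by 4. A hypothetical round-robin picker (token values, pool format, and the next_token helper are illustrative assumptions, not from the repo):

```shell
# Hypothetical sketch: rotate through a comma-separated token pool so each
# PAT's per-hour budget adds up (4 tokens x 5000 req/hr = 20K req/hr).
GITHUB_TOKEN_POOL="tokA,tokB,tokC,tokD"   # placeholder values, not real PATs
N=$(echo "$GITHUB_TOKEN_POOL" | awk -F',' '{print NF}')
i=0
next_token() {
  TOKEN=$(echo "$GITHUB_TOKEN_POOL" | cut -d',' -f$(( i % N + 1 )))
  i=$(( i + 1 ))
}
# Each search request would then use: curl -H "Authorization: Bearer $TOKEN" ...
next_token; first=$TOKEN
next_token; second=$TOKEN
next_token; next_token; next_token; wrapped=$TOKEN   # 5th pick wraps around
echo "picked: $first $second ... $wrapped"
```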

skill-synthesis daemon left as-is for now — its 'total skills=0' is
because the source dirs (/tmp/agentic-discovery, workspace/projects)
are still being populated by the agentic-crawler. Will pick up once
those have content.

Tag: i have control... arios gundam. Going to take a look now.

bin/surrogate-research-loop.sh CHANGED
@@ -65,17 +65,51 @@ Then write a 1-line action TODO to ${RESEARCH_DIR}/queue.txt for each quick-win,
 
 Be selective — quality > quantity."
 
-# ── Run research via surrogate CLI ──────────────────────────────────────────
+# ── Run research via cloud LLM API directly ────────────────────────────────
+# Original $HOME/.local/bin/surrogate CLI was never installed on this Space,
+# so every cycle was failing silently in 0s. Replaced with direct calls to
+# whichever cloud LLM key is set (Cerebras → Groq → OpenRouter) with
+# automatic fallback if a backend is rate-limited or unavailable.
 START=$(date +%s)
-"$HOME/.local/bin/surrogate" -p --max-steps 8 "$PROMPT" 2>&1 | head -100 >> "$LOG"
+RESEARCH_RESPONSE=""
+for backend in cerebras groq openrouter; do
+  case "$backend" in
+    cerebras)   url="https://api.cerebras.ai/v1/chat/completions"; key="${CEREBRAS_API_KEY:-}"; model="qwen-3-coder-480b" ;;
+    groq)       url="https://api.groq.com/openai/v1/chat/completions"; key="${GROQ_API_KEY:-}"; model="qwen/qwen3-32b" ;;
+    openrouter) url="https://openrouter.ai/api/v1/chat/completions"; key="${OPENROUTER_API_KEY:-}"; model="qwen/qwen3-coder:free" ;;
+  esac
+  [[ -z "$key" ]] && { echo "  [$backend] no key — skip" >> "$LOG"; continue; }
+  RESEARCH_RESPONSE=$(curl -sS --max-time 90 "$url" \
+    -H "Authorization: Bearer $key" \
+    -H "Content-Type: application/json" \
+    -d "$(python3 -c "import json,sys; print(json.dumps({'model':sys.argv[1],'messages':[{'role':'user','content':sys.argv[2]}],'max_tokens':4000,'temperature':0.4}))" "$model" "$PROMPT")" 2>>"$LOG" \
+    | python3 -c "import json,sys; d=json.load(sys.stdin); print(d.get('choices',[{}])[0].get('message',{}).get('content',''))" 2>>"$LOG" || true)
+  if [[ -n "$RESEARCH_RESPONSE" ]]; then
+    echo "  [$backend] response: $(echo "$RESEARCH_RESPONSE" | wc -c) chars" >> "$LOG"
+    break
+  fi
+  echo "  [$backend] empty/error — try next" >> "$LOG"
+done
+
+if [[ -n "$RESEARCH_RESPONSE" ]]; then
+  {
+    echo "# Research cycle: $FOCUS ($CYCLE_TS)"
+    echo ""
+    echo "$RESEARCH_RESPONSE"
+  } > "$OUT"
+  # Extract any 'apply ...' lines into the queue
+  echo "$RESEARCH_RESPONSE" | grep -E "^apply " >> "$RESEARCH_DIR/queue.txt" 2>/dev/null || true
+fi
+
 DUR=$(( $(date +%s) - START ))
 echo "[$(date +%H:%M:%S)] research done in ${DUR}s" | tee -a "$LOG"
 
 # ── Discord notify if new findings worth attention ─────────────────────────
 if [[ -f "$OUT" ]] && [[ -s "$OUT" ]]; then
   QUICK_WINS=$(grep -c "^apply " "$RESEARCH_DIR/queue.txt" 2>/dev/null || echo 0)
-  "$HOME/.local/bin/notify-discord.sh" 2>/dev/null info "🔬 Research cycle done" \
-    "Focus: $FOCUS · ${DUR}s · $(wc -l < "$OUT") lines · $QUICK_WINS quick-wins queued" || true
+  [[ -x "$HOME/.local/bin/notify-discord.sh" ]] && \
+    "$HOME/.local/bin/notify-discord.sh" info "🔬 Research cycle done" \
+    "Focus: $FOCUS · ${DUR}s · $(wc -l < "$OUT") lines · $QUICK_WINS quick-wins queued" 2>/dev/null || true
 fi
 
 echo "[$(date +%H:%M:%S)] cycle done" | tee -a "$LOG"
bin/surrogate-self-ingest.sh CHANGED
@@ -42,7 +42,16 @@ TAKE=$NEW
 [[ $TAKE -gt $BATCH_SIZE ]] && TAKE=$BATCH_SIZE
 echo "[$(date +%H:%M:%S)] processing $TAKE / $NEW (batch_size=$BATCH_SIZE)" | tee -a "$LOG"
 
-sed -n "$((PREV + 1)),$((PREV + TAKE))p" "$SRC" | python3 - "$INDEX" >> "$LOG" 2>&1 <<'PYEOF'
+# Bug fix: previously `sed | python3 - "$INDEX" <<'PYEOF'` had a redirection
+# conflict — bash's heredoc binds to python3's stdin AFTER the pipe, so the
+# script body (PYEOF block) was being read as stdin (and consumed once for
+# 'python3 -'), leaving sed's actual jsonl output unreachable. Result was
+# `inserted=0 skipped_parse=0 skipped_empty=0` — a silent black hole.
+#
+# Fix: write the inline python to a temp file, then run with sed piped in.
+# Now stdin = the actual jsonl lines, exactly as intended.
+INGEST_PY=$(mktemp -t self-ingest-XXXXXX.py)
+cat > "$INGEST_PY" <<'PYEOF'
 import sys, json, sqlite3
 db = sys.argv[1]
 con = sqlite3.connect(db)
@@ -59,8 +68,6 @@ for line in sys.stdin:
     ts = d.get("ts", 0)
     prompt = (d.get("prompt") or "")[:4000]
     response = (d.get("response") or "")[:8000]
-    # Relaxed filter: index anything with both fields present (was 50-char min)
-    # Even short pairs are useful for tag-based retrieval
    if not prompt or not response:
        skipped_short += 1
        continue
@@ -76,6 +83,9 @@ con.commit()
 print(f"  inserted={n} skipped_parse={skipped_parse} skipped_empty={skipped_short}", flush=True)
 PYEOF
 
+sed -n "$((PREV + 1)),$((PREV + TAKE))p" "$SRC" | python3 "$INGEST_PY" "$INDEX" >> "$LOG" 2>&1
+rm -f "$INGEST_PY"
+
 # Advance offset by what we actually processed
 NEW_OFFSET=$(( PREV + TAKE ))
 echo "$NEW_OFFSET" > "$OFFSET_FILE"