HuggingClaw

Running

somratpro Claude Opus 4.7 commited on 27 days ago

Commit

fe9380c

1 Parent(s): db85721

Harden config build, speed up startup, add healthcheck

start.sh:
- CLOUDFLARE_PROXY_DEBUG defaults to false (Gemini path verified working;
per-request "Redirecting" log is no longer needed by default).
- Replace shell-interpolated jq calls with --arg/--argjson for
GATEWAY_TOKEN, LLM_MODEL, OPENCLAW_PASSWORD, BROWSER_EXECUTABLE_PATH,
SPACE_HOST, TELEGRAM_USER_ID. Prevents JSON breakage and jq filter
injection if those values contain quotes/backslashes/dollar signs.
- Combine ~6 sequential jq invocations (token + model + logging,
plugin allow/deny + entry toggles, controlUi + password) into single
pipelines. Saves several hundred ms of subprocess overhead on cold start.
- Document why device-pair/phone-control/talk-voice are pre-allowed and
why lmstudio/xai PLUGINS (not the xai model provider) are denied.
- Replace `sleep 3 && kill -0 $!` readiness check with a TCP poll on
127.0.0.1:7860 (configurable via GATEWAY_READY_TIMEOUT, default 90s)
that also bails out if the pipeline died. Old check tested the tee PID
in the pipeline, not openclaw, so late crashes went undetected.
- Drop remaining emoji prefixes from error/warning prints.

cloudflare-proxy.js:
- Tighten the require() hook so undici patching only fires for the bare
"undici" id or paths whose final package segment is undici. The old
substring check `id.includes("/undici/")` would also catch unrelated
packages like "super-undici-x".

cloudflare-proxy-setup.py:
- Always write CF_PROXY_ENV_FILE when CLOUDFLARE_PROXY_URL is provided,
even with an empty CLOUDFLARE_PROXY_SECRET, so start.sh's
`. $CF_PROXY_ENV_FILE` reliably exports the URL. Print a warning when
the URL is set without a secret (silent 401 is the worst failure mode).
- chmod 0600 the env file explicitly (umask 0077 should already cover
it; this is belt-and-suspenders since the file holds the worker
shared secret).

workspace-sync.py:
- After restoring WhatsApp credentials, walk the directory and chmod
0700 on dirs / 0600 on files so session secrets aren't world-readable.

Dockerfile:
- Add HEALTHCHECK against http://localhost:7861/health (health-server
proxies to the gateway). 90s start-period covers cold-start plugin
install.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Files changed (5) hide show

Dockerfile +5 -0
cloudflare-proxy-setup.py +23 -7
cloudflare-proxy.js +7 -1
start.sh +111 -61
workspace-sync.py +10 -0

Dockerfile CHANGED Viewed

@@ -85,4 +85,9 @@ WORKDIR /home/node/app
 EXPOSE 7861
 CMD ["/home/node/app/start.sh"]

 EXPOSE 7861
+# health-server.js exposes /health on 7861 and proxies to the gateway on 7860.
+# 90s start period covers OpenClaw's plugin install + gateway boot on cold start.
+HEALTHCHECK --interval=30s --timeout=5s --start-period=90s \
+  CMD curl -fsS http://localhost:7861/health || exit 1
 CMD ["/home/node/app/start.sh"]

cloudflare-proxy-setup.py CHANGED Viewed

@@ -154,6 +154,12 @@ def write_env(proxy_url: str, proxy_secret: str) -> None:
         + "\n",
         encoding="utf-8",
     )
 def main() -> int:
@@ -162,10 +168,20 @@ def main() -> int:
     api_token = os.environ.get("CLOUDFLARE_WORKERS_TOKEN", "").strip()
     if existing_url:
-      if existing_secret:
         write_env(existing_url, existing_secret)
-      print(f"☁️ Using configured Cloudflare proxy: {existing_url}")
-      return 0
     if not api_token:
         return 0
@@ -214,22 +230,22 @@ def main() -> int:
         proxy_url = f"https://{worker_name}.{subdomain}.workers.dev"
         write_env(proxy_url, proxy_secret)
-        print(f"☁️ Cloudflare proxy ready: {proxy_url}")
         return 0
     except urllib.error.HTTPError as error:
         detail = error.read().decode("utf-8", errors="replace")
         if error.code == 403 and '"code":9109' in detail:
             print(
-                "☁️ Cloudflare proxy setup failed: invalid Workers token. "
                 "Use a Cloudflare API Token in CLOUDFLARE_WORKERS_TOKEN "
                 "(not a Global API Key, tunnel token, or worker secret). "
                 "For auto-setup, it should have account-level 'Workers Scripts: Edit'. "
                 "The setup can auto-discover your account; CLOUDFLARE_ACCOUNT_ID is not required."
             )
-        print(f"☁️ Cloudflare proxy setup failed: HTTP {error.code} {detail}")
         return 1
     except Exception as error:
-        print(f"☁️ Cloudflare proxy setup failed: {error}")
         return 1

         + "\n",
         encoding="utf-8",
     )
+    # Belt-and-suspenders: even with umask 0077 on the parent shell, force
+    # 0600 since the file holds the worker shared secret.
+    try:
+        ENV_FILE.chmod(0o600)
+    except OSError:
+        pass
 def main() -> int:
     api_token = os.environ.get("CLOUDFLARE_WORKERS_TOKEN", "").strip()
     if existing_url:
+        # Always write the env file so downstream `. $CF_PROXY_ENV_FILE` in
+        # start.sh has CLOUDFLARE_PROXY_URL set even when no secret was
+        # supplied. Empty secret means we send no x-proxy-key header — that
+        # only works if the deployed worker also has no secret baked in.
         write_env(existing_url, existing_secret)
+        if not existing_secret:
+            print(
+                "Warning: CLOUDFLARE_PROXY_URL is set but CLOUDFLARE_PROXY_SECRET "
+                "is empty. Requests will succeed only if the deployed worker "
+                "was built without PROXY_SHARED_SECRET; otherwise you'll see "
+                "401 Unauthorized."
+            )
+        print(f"Using configured Cloudflare proxy: {existing_url}")
+        return 0
     if not api_token:
         return 0
         proxy_url = f"https://{worker_name}.{subdomain}.workers.dev"
         write_env(proxy_url, proxy_secret)
+        print(f"Cloudflare proxy ready: {proxy_url}")
         return 0
     except urllib.error.HTTPError as error:
         detail = error.read().decode("utf-8", errors="replace")
         if error.code == 403 and '"code":9109' in detail:
             print(
+                "Cloudflare proxy setup failed: invalid Workers token. "
                 "Use a Cloudflare API Token in CLOUDFLARE_WORKERS_TOKEN "
                 "(not a Global API Key, tunnel token, or worker secret). "
                 "For auto-setup, it should have account-level 'Workers Scripts: Edit'. "
                 "The setup can auto-discover your account; CLOUDFLARE_ACCOUNT_ID is not required."
             )
+        print(f"Cloudflare proxy setup failed: HTTP {error.code} {detail}")
         return 1
     except Exception as error:
+        print(f"Cloudflare proxy setup failed: {error}")
         return 1

cloudflare-proxy.js CHANGED Viewed

@@ -337,11 +337,17 @@ if (PROXY_URL) {
       patchUndiciInstance(undici);
     } catch (e) {}
     const Module = require("module");
     const originalRequire = Module.prototype.require;
     Module.prototype.require = function (id) {
       const exports = originalRequire.apply(this, arguments);
-      if (id === "undici" || id.includes("/undici/")) {
         try { patchUndiciInstance(exports); } catch (e) {}
       }
       return exports;

       patchUndiciInstance(undici);
     } catch (e) {}
+    // Hook require() to patch any undici instance the moment it loads.
+    // Match either the bare "undici" id or paths whose final package
+    // segment IS undici (e.g. "/foo/node_modules/undici/index.js"). The
+    // earlier substring check `id.includes("/undici/")` would also match
+    // unrelated packages like "super-undici-x".
     const Module = require("module");
     const originalRequire = Module.prototype.require;
+    const UNDICI_PATH_RE = /(?:^|\/)node_modules\/undici(?:\/|$)/;
     Module.prototype.require = function (id) {
       const exports = originalRequire.apply(this, arguments);
+      if (id === "undici" || UNDICI_PATH_RE.test(id)) {
         try { patchUndiciInstance(exports); } catch (e) {}
       }
       return exports;

start.sh CHANGED Viewed

@@ -38,13 +38,13 @@ echo ""
 # ── Validate required secrets ──
 ERRORS=""
 if [ -z "$LLM_API_KEY" ]; then
-  ERRORS="${ERRORS}  ❌ LLM_API_KEY is not set\n"
 fi
 if [ -z "$LLM_MODEL" ]; then
-  ERRORS="${ERRORS}  ❌ LLM_MODEL is not set (e.g. google/gemini-2.5-flash, anthropic/claude-sonnet-4-5, openai/gpt-4)\n"
 fi
 if [ -z "$GATEWAY_TOKEN" ]; then
-  ERRORS="${ERRORS}  ❌ GATEWAY_TOKEN is not set (generate: openssl rand -hex 32)\n"
 fi
 if [ -n "$ERRORS" ]; then
   echo "Missing required secrets:"
@@ -73,7 +73,7 @@ fi
 # Auto-correct Gemini models to use google/ prefix if anthropic/ was mistakenly used
 if [[ "$LLM_MODEL" == "anthropic/gemini"* ]]; then
   LLM_MODEL=$(echo "$LLM_MODEL" | sed 's/^anthropic\//google\//')
-  echo "⚠️  Corrected model from anthropic/gemini* to google/gemini*"
 fi
 # Extract provider prefix from model name (e.g. "google/gemini-2.5-flash" → "google")
@@ -146,7 +146,9 @@ export CLOUDFLARE_WORKERS_TOKEN
 CF_PROXY_ENV_FILE="/tmp/huggingclaw-cloudflare-proxy.env"
 if [ -n "${CLOUDFLARE_WORKERS_TOKEN:-}" ] || [ -n "${CLOUDFLARE_PROXY_URL:-}" ]; then
   export CLOUDFLARE_PROXY_DOMAINS="${CLOUDFLARE_PROXY_DOMAINS:-api.telegram.org,web.whatsapp.com,googleapis.com}"
-  export CLOUDFLARE_PROXY_DEBUG="${CLOUDFLARE_PROXY_DEBUG:-true}"
   echo "Preparing Cloudflare outbound proxy..."
   python3 /home/node/app/cloudflare-proxy-setup.py || true
   if [ -f "$CF_PROXY_ENV_FILE" ]; then
@@ -183,12 +185,20 @@ CONFIG_JSON=$(cat <<'CONFIGEOF'
 CONFIGEOF
 )
-# Gateway token
-CONFIG_JSON=$(echo "$CONFIG_JSON" | jq ".gateway.auth.token = \"$GATEWAY_TOKEN\"")
-# Model configuration at top level
-CONFIG_JSON=$(echo "$CONFIG_JSON" | jq ".agents.defaults.model = \"$LLM_MODEL\"")
-CONFIG_JSON=$(echo "$CONFIG_JSON" | jq ".logging.level = \"$OPENCLAW_FILE_LOG_LEVEL\" | .logging.consoleLevel = \"$OPENCLAW_CONSOLE_LOG_LEVEL\" | .logging.consoleStyle = \"$OPENCLAW_CONSOLE_LOG_STYLE\"")
 # Optional: dynamic custom OpenAI-compatible provider registration
 CUSTOM_PROVIDER_NAME="${CUSTOM_PROVIDER_NAME:-}"
@@ -206,29 +216,29 @@ if [ -n "$CUSTOM_PROVIDER_NAME" ] || [ -n "$CUSTOM_BASE_URL" ] || [ -n "$CUSTOM_
   CUSTOM_PROVIDER_OK=true
   if [ -z "$CUSTOM_PROVIDER_NAME" ] || [ -z "$CUSTOM_BASE_URL" ] || [ -z "$CUSTOM_MODEL_ID" ]; then
-    echo "⚠️  Custom provider skipped: set CUSTOM_PROVIDER_NAME, CUSTOM_BASE_URL, and CUSTOM_MODEL_ID together."
     CUSTOM_PROVIDER_OK=false
   fi
   case "$CUSTOM_PROVIDER_NORMALIZED" in
     anthropic|openai|openai-codex|google|google-vertex|deepseek|opencode|opencode-go|openrouter|kilocode|vercel-ai-gateway|zai|z-ai|z.ai|zhipu|moonshot|kimi-coding|minimax|qwen|modelstudio|xiaomi|volcengine|volcengine-plan|byteplus|byteplus-plan|qianfan|mistral|mistralai|xai|x-ai|nvidia|cohere|groq|together|huggingface|cerebras|venice|synthetic|github-copilot)
-      echo "⚠️  Custom provider skipped: CUSTOM_PROVIDER_NAME='$CUSTOM_PROVIDER_NAME' conflicts with a built-in provider."
       CUSTOM_PROVIDER_OK=false
       ;;
   esac
   if [[ "$CUSTOM_BASE_URL_NORMALIZED" == */chat/completions ]] || [[ "$CUSTOM_BASE_URL_NORMALIZED" == */completions ]]; then
-    echo "⚠️  Custom provider skipped: CUSTOM_BASE_URL should be the API base URL, not a completions endpoint."
     CUSTOM_PROVIDER_OK=false
   fi
   if ! [[ "$CUSTOM_CONTEXT_WINDOW" =~ ^[0-9]+$ ]] || ! [[ "$CUSTOM_MAX_TOKENS" =~ ^[0-9]+$ ]]; then
-    echo "⚠️  Custom provider skipped: CUSTOM_CONTEXT_WINDOW and CUSTOM_MAX_TOKENS must be whole numbers."
     CUSTOM_PROVIDER_OK=false
   fi
   if [ "$CUSTOM_PROVIDER_OK" = "true" ]; then
-    echo "🔧 Registering custom provider: $CUSTOM_PROVIDER_NAME → $CUSTOM_BASE_URL_NORMALIZED"
     CONFIG_JSON=$(jq \
       --arg provider "$CUSTOM_PROVIDER_NAME" \
       --arg baseUrl "$CUSTOM_BASE_URL_NORMALIZED" \
@@ -252,7 +262,7 @@ if [ -n "$CUSTOM_PROVIDER_NAME" ] || [ -n "$CUSTOM_BASE_URL" ] || [ -n "$CUSTOM_
        }' <<<"$CONFIG_JSON")
     if [[ "$LLM_MODEL" != "$CUSTOM_PROVIDER_NAME/"* ]]; then
-      echo "⚠️  Custom provider registered, but LLM_MODEL='$LLM_MODEL' does not start with '$CUSTOM_PROVIDER_NAME/'."
     fi
   fi
 fi
@@ -273,7 +283,15 @@ elif [ "$BROWSER_PLUGIN_MODE" = "auto" ] && [ -n "$BROWSER_EXECUTABLE_PATH" ] &&
   BROWSER_SHOULD_ENABLE=true
 fi
-# Restrict bundled plugin loading on HF Spaces so unrelated broken plugins do not crash the gateway after startup.
 PLUGIN_ALLOW_JSON='["device-pair","phone-control","talk-voice"]'
 if [ "$ACP_PLUGIN_MODE" = "enabled" ] || [ "$ACP_PLUGIN_MODE" = "auto" ]; then
   PLUGIN_ALLOW_JSON=$(jq '. + ["acpx"]' <<<"$PLUGIN_ALLOW_JSON")
@@ -287,41 +305,52 @@ fi
 if [ "$WHATSAPP_ENABLED_NORMALIZED" = "true" ]; then
   PLUGIN_ALLOW_JSON=$(jq '. + ["whatsapp"]' <<<"$PLUGIN_ALLOW_JSON")
 fi
-CONFIG_JSON=$(echo "$CONFIG_JSON" | jq ".plugins.allow = $PLUGIN_ALLOW_JSON")
-CONFIG_JSON=$(echo "$CONFIG_JSON" | jq '.plugins.deny = ["lmstudio","xai"]')
-CONFIG_JSON=$(echo "$CONFIG_JSON" | jq '.plugins.entries.lmstudio.enabled = false | .plugins.entries.xai.enabled = false')
-if [ "$ACP_PLUGIN_MODE" = "disabled" ]; then
-  CONFIG_JSON=$(echo "$CONFIG_JSON" | jq '.plugins.entries.acpx.enabled = false')
-fi
-if [ "$BROWSER_SHOULD_ENABLE" != "true" ]; then
-  CONFIG_JSON=$(echo "$CONFIG_JSON" | jq '.plugins.entries.browser.enabled = false | .browser.enabled = false')
-fi
 if [ "$BROWSER_SHOULD_ENABLE" = "true" ]; then
-  CONFIG_JSON=$(echo "$CONFIG_JSON" | jq \
-    ".browser = {
-      \"enabled\": true,
-      \"defaultProfile\": \"openclaw\",
-      \"headless\": true,
-      \"noSandbox\": true,
-      \"executablePath\": \"$BROWSER_EXECUTABLE_PATH\"
-    } | .agents.defaults.sandbox.browser.allowHostControl = true")
-fi
-# Control UI origin (allow HF Space URL for web UI access)
-if [ -n "${SPACE_HOST:-}" ]; then
-  CONFIG_JSON=$(echo "$CONFIG_JSON" | jq ".gateway.controlUi.allowedOrigins = [\"https://${SPACE_HOST}\"]")
-fi
-# Disable device auth (pairing) for headless Docker — token-only auth
-CONFIG_JSON=$(echo "$CONFIG_JSON" | jq ".gateway.controlUi.dangerouslyDisableDeviceAuth = true")
-# Password auth (optional — simpler alternative to token for casual users)
-if [ -n "${OPENCLAW_PASSWORD:-}" ]; then
-  CONFIG_JSON=$(echo "$CONFIG_JSON" | jq ".gateway.auth.mode = \"password\" | .gateway.auth.password = \"$OPENCLAW_PASSWORD\"")
-fi
 # Trusted proxies (optional — fixes "Proxy headers detected from untrusted address" on HF Spaces)
 # Set TRUSTED_PROXIES as comma-separated IPs/CIDRs, e.g. "10.20.31.87,10.20.26.157"
@@ -365,12 +394,16 @@ if [ -n "${TELEGRAM_BOT_TOKEN:-}" ]; then
   ')
   if [ -n "${TELEGRAM_USER_IDS:-}" ]; then
-    # Convert comma-separated IDs to JSON array
     IDS_JSON=$(echo "$TELEGRAM_USER_IDS" | tr ',' '\n' | sed 's/^ *//;s/ *$//' | jq -R . | jq -s .)
-    CONFIG_JSON=$(echo "$CONFIG_JSON" | jq ".channels.telegram += {\"dmPolicy\": \"allowlist\", \"allowFrom\": $IDS_JSON}")
   elif [ -n "${TELEGRAM_USER_ID:-}" ]; then
-    # Single user (backward compatible)
-    CONFIG_JSON=$(echo "$CONFIG_JSON" | jq ".channels.telegram += {\"dmPolicy\": \"allowlist\", \"allowFrom\": [\"$TELEGRAM_USER_ID\"]}")
   fi
 fi
@@ -445,13 +478,13 @@ warmup_browser() {
     for attempt in 1 2 3 4 5; do
       if openclaw browser --browser-profile openclaw start >/dev/null 2>&1; then
         openclaw browser --browser-profile openclaw open about:blank >/dev/null 2>&1 || true
-        echo "  ✅ Managed browser ready"
         return 0
       fi
       sleep 2
     done
-    echo "  ⚠️ Managed browser warm-up did not complete; first browser action may need a retry"
   ) &
 }
@@ -470,15 +503,32 @@ if [ "${GATEWAY_VERBOSE:-0}" = "1" ]; then
   echo "Gateway verbose logging enabled (GATEWAY_VERBOSE=1)"
 fi
-# Use stdbuf -oL -eL to ensure logs are not buffered and appear immediately in the console
 stdbuf -oL -eL openclaw "${GATEWAY_ARGS[@]}" 2>&1 | tee -a /home/node/.openclaw/gateway.log &
 GATEWAY_PID=$!
-# Wait a moment for startup errors
-sleep 3
-if ! kill -0 $GATEWAY_PID 2>/dev/null; then
   echo ""
-  echo "❌ Gateway failed to start. Last 30 lines of log:"
   echo "────────────────────────────────────────────"
   tail -30 /home/node/.openclaw/gateway.log
   exit 1

 # ── Validate required secrets ──
 ERRORS=""
 if [ -z "$LLM_API_KEY" ]; then
+  ERRORS="${ERRORS}  - LLM_API_KEY is not set\n"
 fi
 if [ -z "$LLM_MODEL" ]; then
+  ERRORS="${ERRORS}  - LLM_MODEL is not set (e.g. google/gemini-2.5-flash, anthropic/claude-sonnet-4-5, openai/gpt-4)\n"
 fi
 if [ -z "$GATEWAY_TOKEN" ]; then
+  ERRORS="${ERRORS}  - GATEWAY_TOKEN is not set (generate: openssl rand -hex 32)\n"
 fi
 if [ -n "$ERRORS" ]; then
   echo "Missing required secrets:"
 # Auto-correct Gemini models to use google/ prefix if anthropic/ was mistakenly used
 if [[ "$LLM_MODEL" == "anthropic/gemini"* ]]; then
   LLM_MODEL=$(echo "$LLM_MODEL" | sed 's/^anthropic\//google\//')
+  echo "Note: corrected model from anthropic/gemini* to google/gemini*"
 fi
 # Extract provider prefix from model name (e.g. "google/gemini-2.5-flash" → "google")
 CF_PROXY_ENV_FILE="/tmp/huggingclaw-cloudflare-proxy.env"
 if [ -n "${CLOUDFLARE_WORKERS_TOKEN:-}" ] || [ -n "${CLOUDFLARE_PROXY_URL:-}" ]; then
   export CLOUDFLARE_PROXY_DOMAINS="${CLOUDFLARE_PROXY_DOMAINS:-api.telegram.org,web.whatsapp.com,googleapis.com}"
+  # Default debug off for production. Set CLOUDFLARE_PROXY_DEBUG=true in HF
+  # Space secrets to surface per-request "Redirecting" + error-cause logs.
+  export CLOUDFLARE_PROXY_DEBUG="${CLOUDFLARE_PROXY_DEBUG:-false}"
   echo "Preparing Cloudflare outbound proxy..."
   python3 /home/node/app/cloudflare-proxy-setup.py || true
   if [ -f "$CF_PROXY_ENV_FILE" ]; then
 CONFIGEOF
 )
+# Apply gateway token, model, and logging in a single jq pass.
+# Uses --arg so values containing quotes/backslashes can't break the JSON or
+# inject jq filters (relevant for OPENCLAW_PASSWORD/GATEWAY_TOKEN below too).
+CONFIG_JSON=$(jq \
+  --arg token "$GATEWAY_TOKEN" \
+  --arg model "$LLM_MODEL" \
+  --arg fileLevel "$OPENCLAW_FILE_LOG_LEVEL" \
+  --arg consoleLevel "$OPENCLAW_CONSOLE_LOG_LEVEL" \
+  --arg consoleStyle "$OPENCLAW_CONSOLE_LOG_STYLE" \
+  '.gateway.auth.token = $token
+   | .agents.defaults.model = $model
+   | .logging.level = $fileLevel
+   | .logging.consoleLevel = $consoleLevel
+   | .logging.consoleStyle = $consoleStyle' <<<"$CONFIG_JSON")
 # Optional: dynamic custom OpenAI-compatible provider registration
 CUSTOM_PROVIDER_NAME="${CUSTOM_PROVIDER_NAME:-}"
   CUSTOM_PROVIDER_OK=true
   if [ -z "$CUSTOM_PROVIDER_NAME" ] || [ -z "$CUSTOM_BASE_URL" ] || [ -z "$CUSTOM_MODEL_ID" ]; then
+    echo "Warning: custom provider skipped: set CUSTOM_PROVIDER_NAME, CUSTOM_BASE_URL, and CUSTOM_MODEL_ID together."
     CUSTOM_PROVIDER_OK=false
   fi
   case "$CUSTOM_PROVIDER_NORMALIZED" in
     anthropic|openai|openai-codex|google|google-vertex|deepseek|opencode|opencode-go|openrouter|kilocode|vercel-ai-gateway|zai|z-ai|z.ai|zhipu|moonshot|kimi-coding|minimax|qwen|modelstudio|xiaomi|volcengine|volcengine-plan|byteplus|byteplus-plan|qianfan|mistral|mistralai|xai|x-ai|nvidia|cohere|groq|together|huggingface|cerebras|venice|synthetic|github-copilot)
+      echo "Warning: custom provider skipped: CUSTOM_PROVIDER_NAME='$CUSTOM_PROVIDER_NAME' conflicts with a built-in provider."
       CUSTOM_PROVIDER_OK=false
       ;;
   esac
   if [[ "$CUSTOM_BASE_URL_NORMALIZED" == */chat/completions ]] || [[ "$CUSTOM_BASE_URL_NORMALIZED" == */completions ]]; then
+    echo "Warning: custom provider skipped: CUSTOM_BASE_URL should be the API base URL, not a completions endpoint."
     CUSTOM_PROVIDER_OK=false
   fi
   if ! [[ "$CUSTOM_CONTEXT_WINDOW" =~ ^[0-9]+$ ]] || ! [[ "$CUSTOM_MAX_TOKENS" =~ ^[0-9]+$ ]]; then
+    echo "Warning: custom provider skipped: CUSTOM_CONTEXT_WINDOW and CUSTOM_MAX_TOKENS must be whole numbers."
     CUSTOM_PROVIDER_OK=false
   fi
   if [ "$CUSTOM_PROVIDER_OK" = "true" ]; then
+    echo "Registering custom provider: $CUSTOM_PROVIDER_NAME -> $CUSTOM_BASE_URL_NORMALIZED"
     CONFIG_JSON=$(jq \
       --arg provider "$CUSTOM_PROVIDER_NAME" \
       --arg baseUrl "$CUSTOM_BASE_URL_NORMALIZED" \
        }' <<<"$CONFIG_JSON")
     if [[ "$LLM_MODEL" != "$CUSTOM_PROVIDER_NAME/"* ]]; then
+      echo "Warning: custom provider registered, but LLM_MODEL='$LLM_MODEL' does not start with '$CUSTOM_PROVIDER_NAME/'."
     fi
   fi
 fi
   BROWSER_SHOULD_ENABLE=true
 fi
+# Plugin allow/deny rationale:
+#   ALLOW: device-pair, phone-control, talk-voice are the minimum bundled
+#          plugins that the Control UI/dashboard needs to render correctly
+#          on HF Spaces. Without these the UI shows blank panels.
+#          telegram/whatsapp/browser/acpx are added conditionally below.
+#   DENY:  lmstudio crashes on boot when no local server is reachable;
+#          xai PLUGIN (separate from the xai model PROVIDER) is broken in
+#          current OpenClaw releases and prevents gateway start. Disabling
+#          the plugin does NOT affect xai-as-a-model-provider.
 PLUGIN_ALLOW_JSON='["device-pair","phone-control","talk-voice"]'
 if [ "$ACP_PLUGIN_MODE" = "enabled" ] || [ "$ACP_PLUGIN_MODE" = "auto" ]; then
   PLUGIN_ALLOW_JSON=$(jq '. + ["acpx"]' <<<"$PLUGIN_ALLOW_JSON")
 if [ "$WHATSAPP_ENABLED_NORMALIZED" = "true" ]; then
   PLUGIN_ALLOW_JSON=$(jq '. + ["whatsapp"]' <<<"$PLUGIN_ALLOW_JSON")
 fi
+# Apply plugin allow/deny + per-entry toggles in one jq pass.
+ACPX_DISABLED=false
+if [ "$ACP_PLUGIN_MODE" = "disabled" ]; then ACPX_DISABLED=true; fi
+BROWSER_DISABLED=true
+if [ "$BROWSER_SHOULD_ENABLE" = "true" ]; then BROWSER_DISABLED=false; fi
+CONFIG_JSON=$(jq \
+  --argjson allow "$PLUGIN_ALLOW_JSON" \
+  --argjson acpxDisabled "$ACPX_DISABLED" \
+  --argjson browserDisabled "$BROWSER_DISABLED" \
+  '.plugins.allow = $allow
+   | .plugins.deny = ["lmstudio","xai"]
+   | .plugins.entries.lmstudio.enabled = false
+   | .plugins.entries.xai.enabled = false
+   | (if $acpxDisabled then .plugins.entries.acpx.enabled = false else . end)
+   | (if $browserDisabled then
+        .plugins.entries.browser.enabled = false | .browser.enabled = false
+      else . end)' <<<"$CONFIG_JSON")
 if [ "$BROWSER_SHOULD_ENABLE" = "true" ]; then
+  CONFIG_JSON=$(jq \
+    --arg execPath "$BROWSER_EXECUTABLE_PATH" \
+    '.browser = {
+       "enabled": true,
+       "defaultProfile": "openclaw",
+       "headless": true,
+       "noSandbox": true,
+       "executablePath": $execPath
+     }
+     | .agents.defaults.sandbox.browser.allowHostControl = true' <<<"$CONFIG_JSON")
+fi
+# Control UI origin (allow HF Space URL for web UI access).
+# Disable device auth (pairing) for headless Docker — token-only auth.
+# Combined into one jq pass; --arg keeps password/host injection-safe.
+CONFIG_JSON=$(jq \
+  --arg spaceHost "${SPACE_HOST:-}" \
+  --arg password "${OPENCLAW_PASSWORD:-}" \
+  '.gateway.controlUi.dangerouslyDisableDeviceAuth = true
+   | (if $spaceHost != "" then
+        .gateway.controlUi.allowedOrigins = ["https://" + $spaceHost]
+      else . end)
+   | (if $password != "" then
+        .gateway.auth.mode = "password" | .gateway.auth.password = $password
+      else . end)' <<<"$CONFIG_JSON")
 # Trusted proxies (optional — fixes "Proxy headers detected from untrusted address" on HF Spaces)
 # Set TRUSTED_PROXIES as comma-separated IPs/CIDRs, e.g. "10.20.31.87,10.20.26.157"
   ')
   if [ -n "${TELEGRAM_USER_IDS:-}" ]; then
+    # Convert comma-separated IDs to JSON array (already safe — jq -R parses).
     IDS_JSON=$(echo "$TELEGRAM_USER_IDS" | tr ',' '\n' | sed 's/^ *//;s/ *$//' | jq -R . | jq -s .)
+    CONFIG_JSON=$(jq \
+      --argjson ids "$IDS_JSON" \
+      '.channels.telegram += {"dmPolicy": "allowlist", "allowFrom": $ids}' <<<"$CONFIG_JSON")
   elif [ -n "${TELEGRAM_USER_ID:-}" ]; then
+    # Single user (backward compatible). --arg keeps quotes/odd chars safe.
+    CONFIG_JSON=$(jq \
+      --arg userId "$TELEGRAM_USER_ID" \
+      '.channels.telegram += {"dmPolicy": "allowlist", "allowFrom": [$userId]}' <<<"$CONFIG_JSON")
   fi
 fi
     for attempt in 1 2 3 4 5; do
       if openclaw browser --browser-profile openclaw start >/dev/null 2>&1; then
         openclaw browser --browser-profile openclaw open about:blank >/dev/null 2>&1 || true
+        echo "Managed browser ready."
         return 0
       fi
       sleep 2
     done
+    echo "Warning: managed browser warm-up did not complete; first browser action may need a retry."
   ) &
 }
   echo "Gateway verbose logging enabled (GATEWAY_VERBOSE=1)"
 fi
+# Use stdbuf -oL -eL to ensure logs are not buffered and appear immediately
+# in the console. NOTE: $! captures the LAST pipeline element (tee), not
+# openclaw — fine for passing to `wait` (waits for the whole pipeline to
+# finish), but kill -0 on it is uninformative. We probe TCP instead.
 stdbuf -oL -eL openclaw "${GATEWAY_ARGS[@]}" 2>&1 | tee -a /home/node/.openclaw/gateway.log &
 GATEWAY_PID=$!
+# Poll for the gateway to start listening on 7860. OpenClaw can take 20-30s
+# on cold start (plugin install + auto-restore). Bail out early if the
+# pipeline died.
+GATEWAY_READY_TIMEOUT="${GATEWAY_READY_TIMEOUT:-90}"
+ready=false
+for ((i=0; i<GATEWAY_READY_TIMEOUT; i++)); do
+  if (echo > /dev/tcp/127.0.0.1/7860) 2>/dev/null; then
+    ready=true
+    break
+  fi
+  if ! kill -0 "$GATEWAY_PID" 2>/dev/null; then
+    break
+  fi
+  sleep 1
+done
+if [ "$ready" != "true" ]; then
   echo ""
+  echo "Gateway failed to start. Last 30 lines of log:"
   echo "────────────────────────────────────────────"
   tail -30 /home/node/.openclaw/gateway.log
   exit 1

workspace-sync.py CHANGED Viewed

@@ -161,7 +161,17 @@ def restore_embedded_state() -> None:
             shutil.rmtree(WHATSAPP_CREDS_DIR, ignore_errors=True)
             WHATSAPP_CREDS_DIR.parent.mkdir(parents=True, exist_ok=True)
             shutil.copytree(WHATSAPP_BACKUP_DIR, WHATSAPP_CREDS_DIR)
             os.chmod(OPENCLAW_HOME / "credentials", 0o700)
             print("WhatsApp credentials restored.")
         else:
             print(f"Warning: saved WhatsApp credentials incomplete ({file_count} files), skipping restore.")

             shutil.rmtree(WHATSAPP_CREDS_DIR, ignore_errors=True)
             WHATSAPP_CREDS_DIR.parent.mkdir(parents=True, exist_ok=True)
             shutil.copytree(WHATSAPP_BACKUP_DIR, WHATSAPP_CREDS_DIR)
+            # Lock down dir tree: 0700 on directories, 0600 on every file
+            # so the WhatsApp session secrets can't be read by other users.
             os.chmod(OPENCLAW_HOME / "credentials", 0o700)
+            for path in WHATSAPP_CREDS_DIR.rglob("*"):
+                try:
+                    if path.is_dir():
+                        os.chmod(path, 0o700)
+                    elif path.is_file():
+                        os.chmod(path, 0o600)
+                except OSError:
+                    pass
             print("WhatsApp credentials restored.")
         else:
             print(f"Warning: saved WhatsApp credentials incomplete ({file_count} files), skipping restore.")