tao-shen Claude Opus 4.6 commited on
Commit
4da041a
·
1 Parent(s): b0f06a1

fix: remove tao-shen/HuggingClaw-data refs to prevent unwanted linked spaces

Browse files

HF scans README content to detect dataset references, causing duplicated
Spaces to appear as "linked" to the original dataset. Replace all specific
dataset repo names with generic examples and remove huggingface-dataset tag.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Files changed (3) hide show
  1. .env.example +1 -1
  2. README.md +2 -3
  3. scripts/sync_hf.py +1 -1
.env.example CHANGED
@@ -42,7 +42,7 @@ HF_TOKEN=hf_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
42
 
43
  # Target Dataset repository for data backup.
44
  # Format: your-username/repo-name
45
- # Example: tao-shen/HuggingClaw-data
46
  #
47
  # Manual mode (default): create the repo yourself, then set this variable.
48
  # Auto mode (AUTO_CREATE_DATASET=true): if not set, HuggingClaw derives
 
42
 
43
  # Target Dataset repository for data backup.
44
  # Format: your-username/repo-name
45
+ # Example: your-name/YourSpace-data
46
  #
47
  # Manual mode (default): create the repo yourself, then set this variable.
48
  # Auto mode (AUTO_CREATE_DATASET=true): if not set, HuggingClaw derives
README.md CHANGED
@@ -10,7 +10,6 @@ short_description: Deploy OpenClaw on HuggingFace Spaces
10
  app_port: 7860
11
  tags:
12
  - huggingface
13
- - huggingface-dataset
14
  - openrouter
15
  - chatbot
16
  - llm
@@ -79,7 +78,7 @@ Go to **Settings → Repository secrets** and configure:
79
  | Secret | Status | Description | Example |
80
  |--------|:------:|-------------|---------|
81
  | `HF_TOKEN` | **Required** | HF Access Token with write permission ([create one](https://huggingface.co/settings/tokens)) | `hf_AbCdEfGhIjKlMnOpQrStUvWxYz` |
82
- | `OPENCLAW_DATASET_REPO` | See below | Dataset repo for backup — format: `username/repo-name`. Required in manual mode; optional in auto mode (see [Data Persistence](#data-persistence)) | `tao-shen/HuggingClaw-data` |
83
  | `OPENAI_API_KEY` | Recommended | OpenAI (or any [OpenAI-compatible](https://openclawdoc.com/docs/reference/environment-variables)) API key | `sk-proj-xxxxxxxxxxxx` |
84
  | `OPENROUTER_API_KEY` | Optional | [OpenRouter](https://openrouter.ai) API key (200+ models, free tier available) | `sk-or-v1-xxxxxxxxxxxx` |
85
  | `ANTHROPIC_API_KEY` | Optional | Anthropic Claude API key | `sk-ant-xxxxxxxxxxxx` |
@@ -102,7 +101,7 @@ HuggingClaw syncs `~/.openclaw` (conversations, settings, credentials) to a priv
102
  1. Set `AUTO_CREATE_DATASET` = `true` in your Space secrets
103
  2. Set `HF_TOKEN` with write permission
104
  3. (Optional) Set `OPENCLAW_DATASET_REPO` if you want a custom repo name
105
- 4. On first startup, HuggingClaw automatically creates a **private** Dataset repo. If `OPENCLAW_DATASET_REPO` is not set, it derives the name from your HF username + Space name: `your-username/SpaceName-data` (e.g. `tao-shen/HuggingClaw-data`). Each Space gets its own dataset, so duplicating a Space won't cause conflicts
106
 
107
  > **Security note:** `AUTO_CREATE_DATASET` defaults to `false` — the system will not create repos on your behalf unless you explicitly opt in.
108
 
 
10
  app_port: 7860
11
  tags:
12
  - huggingface
 
13
  - openrouter
14
  - chatbot
15
  - llm
 
78
  | Secret | Status | Description | Example |
79
  |--------|:------:|-------------|---------|
80
  | `HF_TOKEN` | **Required** | HF Access Token with write permission ([create one](https://huggingface.co/settings/tokens)) | `hf_AbCdEfGhIjKlMnOpQrStUvWxYz` |
81
+ | `OPENCLAW_DATASET_REPO` | See below | Dataset repo for backup — format: `username/repo-name`. Required in manual mode; optional in auto mode (see [Data Persistence](#data-persistence)) | `your-name/YourSpace-data` |
82
  | `OPENAI_API_KEY` | Recommended | OpenAI (or any [OpenAI-compatible](https://openclawdoc.com/docs/reference/environment-variables)) API key | `sk-proj-xxxxxxxxxxxx` |
83
  | `OPENROUTER_API_KEY` | Optional | [OpenRouter](https://openrouter.ai) API key (200+ models, free tier available) | `sk-or-v1-xxxxxxxxxxxx` |
84
  | `ANTHROPIC_API_KEY` | Optional | Anthropic Claude API key | `sk-ant-xxxxxxxxxxxx` |
 
101
  1. Set `AUTO_CREATE_DATASET` = `true` in your Space secrets
102
  2. Set `HF_TOKEN` with write permission
103
  3. (Optional) Set `OPENCLAW_DATASET_REPO` if you want a custom repo name
104
+ 4. On first startup, HuggingClaw automatically creates a **private** Dataset repo. If `OPENCLAW_DATASET_REPO` is not set, it derives the name from your HF username + Space name: `your-username/SpaceName-data` (e.g. `your-name/YourSpace-data`). Each Space gets its own dataset, so duplicating a Space won't cause conflicts
105
 
106
  > **Security note:** `AUTO_CREATE_DATASET` defaults to `false` — the system will not create repos on your behalf unless you explicitly opt in.
107
 
scripts/sync_hf.py CHANGED
@@ -81,7 +81,7 @@ SYNC_INTERVAL = int(os.environ.get("SYNC_INTERVAL", "60"))
81
  AUTO_CREATE_DATASET = os.environ.get("AUTO_CREATE_DATASET", "false").lower() in ("true", "1", "yes")
82
 
83
  # Dataset repo: always auto-derive from SPACE_ID when not explicitly set.
84
- # Format: {username}/{SpaceName}-data (e.g. "tao-shen/HuggingClaw-data")
85
  # This ensures each duplicated Space gets its own dataset automatically.
86
  HF_REPO_ID = os.environ.get("OPENCLAW_DATASET_REPO", "")
87
  if not HF_REPO_ID and SPACE_ID:
 
81
  AUTO_CREATE_DATASET = os.environ.get("AUTO_CREATE_DATASET", "false").lower() in ("true", "1", "yes")
82
 
83
  # Dataset repo: always auto-derive from SPACE_ID when not explicitly set.
84
+ # Format: {username}/{SpaceName}-data (e.g. "your-name/YourSpace-data")
85
  # This ensures each duplicated Space gets its own dataset automatically.
86
  HF_REPO_ID = os.environ.get("OPENCLAW_DATASET_REPO", "")
87
  if not HF_REPO_ID and SPACE_ID: