Upload production/.env.minimal
production/.env.minimal CHANGED +17 -8
@@ -6,20 +6,29 @@
 CLOUDFLARE_API_KEY=sk-your-cloudflare-api-key
 CLOUDFLARE_ACCOUNT_ID=your-account-id-here
 
-# OPTIONAL but recommended: NVIDIA NIM (primary -
+# OPTIONAL but recommended: NVIDIA NIM (primary - fastest cloud)
 NVIDIA_API_KEY=nvapi-your-nvidia-api-key
 
+# OPTIONAL: Google Gemini (tertiary fallback - great for coding/reasoning)
+GEMINI_API_KEY=your-gemini-api-key-here
+
 # =====================================
 # That's it. Everything else auto-configures:
-# • Primary:
-# •
-# •
-# •
-# •
-# •
+# • Primary: NIM (if API key set)
+# • Secondary: Cloudflare Workers AI
+# • Tertiary: Google Gemini
+# • Local: MLX (off by default - enable below)
+# • Cache: Redis (auto-launched)
+# • DB: Postgres (auto-launched)
+# • Metrics: Prometheus + Grafana
 # =====================================
 
 # ─── Optional: Enable MLX local on M2 Pro Max ───
-#
+# For Gemma 4 local (best Claude Opus alternative):
 # MLX_ENABLED=true
 # MLX_API_BASE=http://host.docker.internal:8000/v1
+#
+# Recommended Gemma 4 MLX models for 96GB:
+# • gemma-4-31b-bf16 (~65GB runtime) - highest quality, best Opus alternative
+# • gemma-4-26b-a4b-it-bf16 (~55GB) - slightly smaller, still excellent
+# • gemma-4-e4b-it (~12GB) - fast, efficient, good for quick tasks
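
The comments in this file describe a fallback order: NIM first if its key is set, then Cloudflare Workers AI, then Google Gemini, with MLX local off unless enabled. The diff does not show how the application resolves that order, so the following is only a minimal sketch of one way it might be derived from the variables defined here. The function name `resolve_provider_chain` and the provider labels are hypothetical. Requiring both Cloudflare variables, and placing MLX last when enabled, are assumptions rather than behavior confirmed by the source.

```python
from typing import List, Mapping


def resolve_provider_chain(env: Mapping[str, str]) -> List[str]:
    """Return hypothetical provider labels in fallback order.

    A provider is included only when the variables it needs are
    present and non-empty in `env`.
    """
    def is_set(name: str) -> bool:
        return bool(env.get(name, "").strip())

    chain: List[str] = []

    # Primary: NVIDIA NIM, only if an API key is set.
    if is_set("NVIDIA_API_KEY"):
        chain.append("nim")

    # Secondary: Cloudflare Workers AI.
    # Assumption: both the API key and the account ID are required.
    if is_set("CLOUDFLARE_API_KEY") and is_set("CLOUDFLARE_ACCOUNT_ID"):
        chain.append("cloudflare")

    # Tertiary: Google Gemini.
    if is_set("GEMINI_API_KEY"):
        chain.append("gemini")

    # Local MLX is off by default.
    # Assumption: when enabled, it is tried after the cloud providers.
    if env.get("MLX_ENABLED", "").strip().lower() == "true":
        chain.append("mlx")

    return chain
```

For example, `resolve_provider_chain({"CLOUDFLARE_API_KEY": "x", "CLOUDFLARE_ACCOUNT_ID": "y", "GEMINI_API_KEY": "z"})` returns `["cloudflare", "gemini"]`, since no NIM key is set and MLX is not enabled. Passing `os.environ` would apply the same logic to the live environment.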