raazkumar commited on
Commit
e979df8
Β·
verified Β·
1 Parent(s): 77f8256

Upload production/.env.minimal

Browse files
Files changed (1) hide show
  1. production/.env.minimal +17 -8
production/.env.minimal CHANGED
@@ -6,20 +6,29 @@
6
  CLOUDFLARE_API_KEY=sk-your-cloudflare-api-key
7
  CLOUDFLARE_ACCOUNT_ID=your-account-id-here
8
 
9
- # OPTIONAL but recommended: NVIDIA NIM (primary β€” faster)
10
  NVIDIA_API_KEY=nvapi-your-nvidia-api-key
11
 
 
 
 
12
  # =====================================
13
  # That's it. Everything else auto-configures:
14
- # β€’ Primary: NIM (if API key set)
15
- # β€’ Fallback: Cloudflare Workers AI
16
- # β€’ Cache: Redis (auto-launched)
17
- # β€’ DB: Postgres (auto-launched)
18
- # β€’ Metrics: Prometheus + Grafana
19
- # β€’ MLX: Off by default (enable below for local M2)
 
20
  # =====================================
21
 
22
  # ─── Optional: Enable MLX local on M2 Pro Max ───
23
- # Uncomment to run 70B models locally via Metal GPU:
24
  # MLX_ENABLED=true
25
  # MLX_API_BASE=http://host.docker.internal:8000/v1
 
 
 
 
 
 
6
  CLOUDFLARE_API_KEY=sk-your-cloudflare-api-key
7
  CLOUDFLARE_ACCOUNT_ID=your-account-id-here
8
 
9
+ # OPTIONAL but recommended: NVIDIA NIM (primary β€” fastest cloud)
10
  NVIDIA_API_KEY=nvapi-your-nvidia-api-key
11
 
12
+ # OPTIONAL: Google Gemini (tertiary fallback β€” great for coding/reasoning)
13
+ GEMINI_API_KEY=your-gemini-api-key-here
14
+
15
  # =====================================
16
  # That's it. Everything else auto-configures:
17
+ # β€’ Primary: NIM (if API key set)
18
+ # β€’ Secondary: Cloudflare Workers AI
19
+ # β€’ Tertiary: Google Gemini
20
+ # β€’ Local: MLX (off by default β€” enable below)
21
+ # β€’ Cache: Redis (auto-launched)
22
+ # β€’ DB: Postgres (auto-launched)
23
+ # β€’ Metrics: Prometheus + Grafana
24
  # =====================================
25
 
26
  # ─── Optional: Enable MLX local on M2 Pro Max ───
27
+ # For Gemma 4 local (best Claude Opus alternative):
28
  # MLX_ENABLED=true
29
  # MLX_API_BASE=http://host.docker.internal:8000/v1
30
+ #
31
+ # Recommended Gemma 4 MLX models for 96GB:
32
+ # β€’ gemma-4-31b-bf16 (~65GB runtime) β€” highest quality, best Opus alternative
33
+ # β€’ gemma-4-26b-a4b-it-bf16 (~55GB) β€” slightly smaller, still excellent
34
+ # β€’ gemma-4-e4b-it (~12GB) β€” fast, efficient, good for quick tasks