File size: 3,800 Bytes
664512d | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 | # Toto Weather Forecasting Demo β Plan
A public Hugging Face Space that pulls live data from a personal Ecowitt GW3000 weather station, runs Datadog's Toto 2.0 (smallest variant) to forecast the next 24h of temperature/humidity/pressure, and shows it next to the National Weather Service forecast.
**Hook:** "Language models predict the next token. What if you could predict the future with the same technology?"
## Architecture (one file, one process)
```
Ecowitt Cloud API v3 βββ
ββββΊ app.py (Gradio Blocks) βββΊ HF Space (CPU basic)
NWS API /forecastHourly β β
ββ Toto 2.0 small (HF Hub, ~4M params, CPU)
ββ Plotly figs (gr.Plot Γ 3)
ββ TTL cache (1h) on fetches + inference
```
## Repo layout
```
time-series-ai-weather-forecast/
βββ app.py # Gradio entry point
βββ requirements.txt
βββ README.md # HF Space frontmatter + description
βββ .env.example
βββ .gitignore
βββ docs/
β βββ plan.md # this file
βββ src/
βββ ecowitt.py # Ecowitt API client
βββ nws.py # NWS forecast client
βββ forecast.py # Toto load + inference
βββ plotting.py # Plotly figure builders
βββ cache.py # TTL cache decorator
```
Single `app.py` is also fine; splitting into `src/` keeps each concern testable.
## Build order
1. **Ecowitt client** (current focus) β fetch real-time + last 7 days history, return a clean hourly `pandas.DataFrame` with columns `temp_f`, `humidity`, `pressure_inhg`, indexed by UTC timestamp.
2. **NWS client** β `/points/{lat},{lon}` β `forecastHourly` URL β 24h hourly forecast aligned to Ecowitt's cadence.
3. **Toto inference** β load smallest Toto 2.0 from HF Hub, univariate forecast per metric, return median + p10 + p90 over a 24h horizon.
4. **Plotting** β one Plotly figure per metric: past actuals (solid), Toto median (dashed) + p10βp90 band (shaded), NWS forecast (dashed, distinct color), vertical "now" marker.
5. **Gradio app** β `gr.Blocks`, title + hook, "Refresh" button, three `gr.Plot` outputs. `demo.load` runs once on visit; cache prevents repeat inference.
6. **Local smoke test** β `python app.py`, verify all three plots render with real data.
7. **Push to HF** β set secrets in Space settings, watch build, verify public URL.
## Ecowitt API v3 reference (verified URLs)
Base: `https://api.ecowitt.net/api/v3`
- `GET /device/real_time` β current snapshot. Params: `application_key`, `api_key`, `mac`, `call_back=all`.
- `GET /device/history` β historical data. Params: `application_key`, `api_key`, `mac`, `start_date`, `end_date`, `cycle_type`, `call_back`. Cycle types per Ecowitt's storage tiers: `5min` (last 90 days), `30min` (last year), `240min` (last 2 years), `auto`. Date format and exact `call_back` values to be confirmed against the live API on first call.
- `GET /device/info` β sanity check that creds + MAC are valid.
For the demo we want hourly cadence over the last 7 days, so `cycle_type=30min` and we resample to 1h locally. (5min would also work; 30min is lighter.)
## Secrets / config
Local `.env` (gitignored):
```
ECOWITT_APPLICATION_KEY=...
ECOWITT_API_KEY=...
ECOWITT_DEVICE_MAC=...
LAT=...
LON=...
```
On HF Space β Settings β Variables and Secrets:
- Secrets: `ECOWITT_APPLICATION_KEY`, `ECOWITT_API_KEY`
- Variables: `ECOWITT_DEVICE_MAC`, `LAT`, `LON`
## Out of scope
- Multivariate Toto inference
- Fine-tuning
- Auth, rate limiting, monitoring
- Metrics beyond temp/humidity/pressure
|