
Riprap — Demo Video Transcript

AMD × lablab.ai Developer Hackathon · May 4–10 2026

Target: ~5 minutes


[SLIDE 1 — Title card] · ~0:00–0:10

SCREEN: Slide 1. Riprap logo. "Citation-grounded NYC flood-exposure briefings, on AMD MI300X."

Climate risk is one of the most consequential datasets in real estate and urban planning right now. But the tools that exist today give you a score. A number from one to ten. No explanation. No sources. Just a black box. We built Riprap to be the audit trail behind that number.


[SLIDE 2 — The problem] · ~0:10–0:30

SCREEN: Slide 2. "Climate risk data is a black box." Two boxes: market scores vs Zillow pulling climate data.

First Street gives you a flood factor. ClimateCheck gives you a percentile. Jupiter charges enterprise rates for a proprietary model. In November 2025, Zillow removed climate risk scores from listings entirely — under pressure from the real-estate industry. When a number meets resistance, the only defense is the audit trail. Riprap is the audit trail.


[SLIDE 3 — Solution] · ~0:30–0:40

SCREEN: Slide 3. Screenshot of the Riprap UI — briefing prose with citation chips, map panel, stone trace.

Type any address in New York City. Get back a written briefing where every numeric claim — every flood depth, every complaint count, every risk percentage — links to its primary public-record source. Federal data. City data. Apache-2.0 models. Nothing proprietary.


[SLIDE 4 — Civic-tech case] · ~0:40–1:00

SCREEN: Slide 4. Four boxes: NY Disclosure Law, DEP Stormwater Plan, EJNYC FVI, No commercial APIs.

New York's property disclosure law — March 2024 — requires sellers to disclose flood history. Riprap is the citable narrative that makes that disclosure meaningful. The DEP's $30 billion stormwater priority list covers 86 sites. Riprap provides the per-neighborhood evidence layer that backs up that ranking. And because every model is Apache-2.0 and every dataset is public record, environmental justice advocates can audit the same system that a developer uses. No commercial gatekeeping.


[SLIDE 5 — Architecture] · ~1:00–1:30

SCREEN: Slide 5. "Five Stones fan out. One cited briefing comes back." Four evidence cards (Cornerstone, Keystone, Touchstone, Lodestone) + Capstone bar at bottom.

The architecture is called Five Stones. A natural-language query hits the Planner — Granite 4.1 3B — which classifies intent and selects a specialist roster. Each Stone is a class of evidence. Cornerstone reads the hazard record: Sandy inundation zones, FEMA flood maps, USGS high-water marks, Prithvi satellite imagery. Keystone reads what's exposed: MTA stations, schools, hospitals, building footprints from our TerraMind NYC fine-tune. Touchstone reads what's happening now: live FloodNet sensors, 311 flood complaints, NOAA tide gauges. Lodestone looks forward: NPCC4 sea-level projections, our Granite TTM Battery surge nowcast. Then Capstone — Granite 4.1 8B on vLLM — synthesizes everything into a four-section briefing. Every numeric claim must cite its source, or the Mellea rejection sampler rerolls it. The briefing doesn't publish until all four grounding checks pass.
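As a rough sketch of the control flow just described — a planner-selected fan-out followed by rejection sampling at synthesis — the loop looks roughly like this. All names, rosters, and the reroll limit here are illustrative; the planner and Capstone models are abstracted as plain callables, and this is not the production code:

```python
from dataclasses import dataclass

# Illustrative Stone rosters per intent (the real planner picks a
# specialist roster dynamically; these are stand-ins).
ROSTERS = {
    "single_address": ["cornerstone", "keystone", "touchstone", "lodestone"],
    "neighborhood":   ["cornerstone", "touchstone", "lodestone"],
    "compare":        ["cornerstone", "keystone", "lodestone"],
}

@dataclass
class Briefing:
    text: str
    attempt: int

def run_query(intent, synthesize, grounding_checks, max_attempts=3):
    """Fan out to the Stone roster for this intent, then synthesize with
    rejection sampling: reroll until every grounding check passes."""
    roster = ROSTERS[intent]
    # Stand-in for the real probes each Stone would run.
    evidence = {stone: f"<evidence from {stone}>" for stone in roster}
    for attempt in range(1, max_attempts + 1):
        draft = synthesize(evidence)
        if all(check(draft, evidence) for check in grounding_checks):
            return Briefing(text=draft, attempt=attempt)
    raise RuntimeError("briefing failed grounding checks after max attempts")
```

The key property is that a briefing is never returned unless all checks pass; a failed check costs another synthesis attempt, not a silently published number.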


[SLIDE 6 — Fine-tuning] · ~1:30–1:50

SCREEN: Slide 6. Three fine-tune cards: Prithvi-EO-2.0-NYC-Pluvial · TerraMind-NYC-Adapters · Granite-TTM-r2-Battery-Surge.

We trained three NYC-specialized models on AMD MI300X hardware, all published Apache-2.0 on Hugging Face Hub. Prithvi-EO-2.0-NYC-Pluvial detects pluvial flooding from Sentinel-2 imagery — 0.60 IoU on the Ida test set, a 6× lift over the baseline. TerraMind-NYC-Adapters adds LoRA adapters for building-footprint and land-use classification, gaining 6 points of mIoU after 18 minutes of training. And Granite TTM r2, fine-tuned on the Battery tide gauge, gives us a 9.6-hour surge-residual nowcast at 35% lower RMSE than persistence. These aren't experiments. They're in production in every briefing.
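For context on that last claim, "persistence" is the standard do-nothing baseline for nowcasts: it forecasts that the current residual simply holds for the whole horizon, and a model earns its keep by beating it on RMSE. A minimal sketch with made-up numbers (not actual Battery gauge data):

```python
import math

def rmse(pred, obs):
    """Root-mean-square error between a forecast and observations."""
    return math.sqrt(sum((p - o) ** 2 for p, o in zip(pred, obs)) / len(obs))

# Illustrative surge residuals over a forecast horizon, in metres.
obs = [0.10, 0.18, 0.25, 0.31, 0.30, 0.22]

# Persistence: "nothing changes" — repeat the latest observation.
persistence = [obs[0]] * len(obs)

baseline = rmse(persistence, obs)
```

A fine-tuned nowcast with 35% lower RMSE would need to score below `0.65 * baseline` on the same held-out window.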


[SLIDE 7 — Demo intro] · ~1:50–2:00

SCREEN: Slide 7. "Live demo." Query text: "I'm thinking about renting an apartment at 80 Pioneer Street, Brooklyn. Should I worry?"

Let's run it live. Three queries, three different intents.


[DEMO CLIP 1 — Pioneer Street, single address] · ~2:00–2:40

SCREEN: Cut to recording riprap-demo-20260506-234537.webm at t≈62s.

  • Left panel: briefing fully rendered. Title "Flood-exposure briefing · 80 Pioneer Street, Red Hook."
  • Sections 01 Status through 04 Policy context visible with inline [1] [2] [3] citation chips.
  • Right panel: Sandy flood map showing Pioneer Street pinned inside the inundation zone (blue overlay).
  • Status bar: intent: single_address · 19 specialists · attempt 1 · done

Thirteen seconds end-to-end. Nineteen specialists fired. The briefing tells you: Pioneer Street sits inside Hurricane Sandy's 2012 inundation zone, 0.82 metres above the nearest drainage channel, in the 78th percentile for water accumulation risk. FloodNet sensor FN-BK-018 — two blocks away — has logged four flood events since 2023. The DEP's high-intensity scenario puts the site under six inches of standing water. Every number has a footnote. Every footnote resolves to a named public dataset.

SCREEN: Slow scroll of left briefing panel while voiceover continues. Citation chips [1] [2] [3] visible inline. Bottom of panel shows section 04 "Policy context" with RAG passages from NPCC4.

The map on the right isn't decorative — it's live. The layers are grouped by Stone, so you can see exactly which evidence tier each visual comes from.


[DEMO CLIP 2 — Mellea 4/4 grounding card] · ~2:40–3:05

SCREEN: Recording at t≈270s. Right panel scrolled to Capstone section.

  • Capstone card: "grounding checks: 4/4 passed", rerolls=0, passed=4, attempt=1.
  • Four check items: numerics_grounded · no_placeholder_tokens · citations_dense · citations_resolve

Here's the proof. Mellea ran four grounding checks on the completed briefing: every non-trivial number appears verbatim in a source document; no template fragments leaked through; every number has a citation in the same sentence; every cited ID resolves to an actual input document. Four of four. First attempt. Zero rerolls. This is what "every number cites its source" looks like as a machine-verifiable claim, not a marketing promise.
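The four checks named on the card are simple enough to sketch. These are illustrative stand-ins showing the shape of each machine-verifiable claim, not the actual Mellea validator implementations:

```python
import re

PLACEHOLDERS = ("{", "}", "TODO", "[INSERT")  # template-leak tokens (illustrative)

def numerics_grounded(briefing, sources):
    """Every non-trivial number in the briefing appears verbatim in a source."""
    corpus = " ".join(sources.values())
    numbers = re.findall(r"\d+(?:\.\d+)?", briefing)
    return all(n in corpus for n in numbers if len(n) > 1 or "." in n)

def no_placeholder_tokens(briefing):
    """No template fragments leaked into the final text."""
    return not any(tok in briefing for tok in PLACEHOLDERS)

def citations_dense(briefing):
    """Every sentence that contains a number also carries a [n] citation chip."""
    sentences = re.split(r"(?<=[.!?])\s+", briefing)
    return all("[" in s for s in sentences if re.search(r"\d", s))

def citations_resolve(briefing, sources):
    """Every cited ID maps to an actual input document."""
    cited = set(re.findall(r"\[(\d+)\]", briefing))
    return cited <= set(sources)
```

A briefing that passes all four is the "4/4" on screen; any failure sends Capstone back for a reroll.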


[DEMO CLIP 3 — Hollis, Queens · neighborhood intent] · ~3:05–3:30

SCREEN: Recording at t≈510s. New query: "Hollis, Queens."

  • Status bar: intent: neighborhood · 9 specialists · attempt 1 · done
  • Left panel: neighborhood briefing — NTA-level statistics, DEP stormwater scenario percentages, 311 flood complaint counts.
  • Right panel: Cornerstone section with Sandy inundation percentage for the NTA + FEMA layer.

Same system, different intent. "Hollis, Queens" is a neighborhood query — nine specialists instead of nineteen, NTA-level aggregates instead of point data. The planner classified it in under a second and dispatched the right Stone roster automatically. Hollis is a stormwater-flooding neighborhood, not a coastal one. The briefing reflects that: Sandy inundation is low; the DEP moderate-intensity scenario covers 22% of impervious surface; 311 flood complaints cluster around the 180th Street drainage corridor. Different geography, different risk profile, same citation standard.


[DEMO CLIP 4 — Compare · Pioneer vs Gold Street] · ~3:30–4:00

SCREEN: Screenshot compare-hf.jpg — the live HF Space compare result.

  • Title: "COMPARE 80 PIONEER STREET BROOKLYN TO 100 GOLD STREET MANHATTAN"
  • Key differences bar at top: Status: 80 vs 100 · Empirical: 65 vs 26 · Modeled Drainage (HAND): 3.81m vs 38.2m
  • Side-by-side Status sections — Pioneer: "exposed to flood risk, Sandy inundation zone, TWI 14.79." Gold St: "moderate flood exposure, HAND 6.42m, mid-slope position."
  • Status bar: intent: compare · 11 specialists · attempt 1 · done

One more. "Compare 80 Pioneer Street Brooklyn to 100 Gold Street Manhattan." The planner routes this as a compare intent — two full specialist runs, results merged side by side. The key differences bar surfaces the contrast immediately: Pioneer Street sits 3.81 metres above its nearest drainage channel. Gold Street at 100 is 38.2 metres. Pioneer has 65 empirical flood signals in the record; Gold Street has 26. Same city. Same storm history. Radically different exposure. This is the query a developer, an insurer, or a disclosure attorney actually wants to run.
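The "key differences" bar falls out of a simple merge: run the full pipeline once per address, pair up the shared numeric metrics, and lead with the largest relative gaps. A hypothetical sketch — metric names are made up, values taken from the on-screen card:

```python
def key_differences(a, b):
    """Pair shared numeric metrics from two briefing runs, ordered so the
    largest relative gap leads the compare view."""
    shared = set(a) & set(b)
    def rel_gap(k):
        scale = max(abs(a[k]), abs(b[k])) or 1.0
        return abs(a[k] - b[k]) / scale
    return {k: (a[k], b[k]) for k in sorted(shared, key=rel_gap, reverse=True)}

pioneer = {"status": 80, "empirical_signals": 65, "hand_m": 3.81}
gold    = {"status": 100, "empirical_signals": 26, "hand_m": 38.2}
diffs = key_differences(pioneer, gold)
```

Ranking by relative rather than absolute gap is what pushes the drainage-height contrast (3.81 m vs 38.2 m) to the top of the bar ahead of the raw score difference.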


[SLIDE 8 — What's next] · ~4:00–4:20

SCREEN: Slide 8. Three boxes: Break out the Stones · Other flood-impacted cities · Historical-event mode.

The architecture is NYC-specific by data choice, not by code. The five-Stone pattern generalizes: Houston, Miami, Jakarta — swap the probe sets and RAG corpus, the FSM is the same. Each Stone is already isolated enough to ship as a standalone package. And we want to add historical-event mode: re-run the FSM against snapshot data from before Sandy, before Ida. Validation against measured outcomes as a first-class feature, not an afterthought.


[SLIDE 9 — CTA] · ~4:20–4:30

SCREEN: Slide 9. Dark background. "github.com/msradam/riprap-nyc" large. "Apache-2.0 · public data · AMD MI300X · IBM Granite 4.1 · Mellea grounding."

Everything is open: Apache-2.0 code, public data, MIT- and Apache-licensed models, running on AMD MI300X. Try it at the link in the description.


Segment map

| Segment | Source | Timestamp / asset |
| --- | --- | --- |
| Slides 1–7 | slides/deck.pdf | screen-record slide deck |
| Demo clip 1 — Pioneer briefing + map | assets/video/riprap-demo-20260506-234537.webm | t≈62–90s |
| Demo clip 2 — Mellea 4/4 card | assets/video/riprap-demo-20260506-234537.webm | t≈265–290s |
| Demo clip 3 — Hollis neighborhood | assets/video/riprap-demo-20260506-234537.webm | t≈505–545s |
| Demo clip 4 — Compare result | compare-hf.jpg (static screenshot or re-record) | n/a |
| Slides 8–9 | slides/deck.pdf | screen-record slide deck |

Total runtime estimate

~4:30 — comfortable under 5 min with natural pauses.