| # Riprap: landscape research |
|
|
| Captured 2026-05-06 as part of the AMD x lablab.ai hackathon polish |
| phase. This document underpins the pitch deck (`slides/deck.md`) and |
| the demo-script choices. Re-validate against the live web before |
| re-using any specific figure. |
|
|
| --- |
|
|
| ## What Riprap is, distinctly |
|
|
| A citation-grounded LLM that writes audit-quality flood-exposure |
| briefings for NYC addresses by fusing live, historical, modeled, and |
| projected data sources. Mellea rejection sampling refuses to publish |
| a numeric claim it can't cite. The output isn't a score. It's a |
| four-section prose briefing with `[doc_id]` citations on every |
| numeric assertion, where each `doc_id` resolves to one specific |
| dataset (Sandy 2012 zone, NYC DEP scenario, USGS HWM, Sentinel-2 |
| chip, NOAA gauge reading, NPCC4 SLR projection). |
|
|
| Granite 4.1 8B drives the prose. Granite Embedding 278M plus GLiNER |
| drive policy-doc retrieval. Prithvi-EO 2.0, TerraMind LULC and |
| Buildings, and Granite TTM r2 drive the EO and forecast probes, |
| with three Apache-2.0 NYC fine-tunes trained on AMD MI300X published |
| on HF Hub. |
|
|
| Architectural commitments other tools don't make: |
|
|
| 1. **Silence over confabulation.** When a probe returns no data, the |
| briefing omits the section rather than papering over it. |
| 2. **Five-stone epistemic structure.** The user can see what's |
| empirical vs modeled vs proxy vs synthetic. |
| 3. **Fully open-source pipeline.** Apache-2.0 end-to-end on public- |
| record data, no commercial APIs touched at runtime. |
| 4. **Deployable on either local Ollama or AMD MI300X via vLLM** with |
| auto-failover. |
|
|
| Stack as of 2026-05-06: SvelteKit UI on HF Spaces (cpu-basic) at the |
| AMD-hackathon org, FastAPI agent FSM, two-container droplet (vLLM |
| plus riprap-models) on MI300X, full address probe suite at 5/5 PASS |
| in 5.8 to 13.1 s end-to-end. |
|
|
| --- |
|
|
| ## Landscape map |
|
|
| ### Direct comps: score-based property risk tools |
|
|
| | Tool | What it gives | Who it serves | Hidden cost | |
| |---|---|---|---| |
| | **First Street Risk Factor** (Flood Factor) | Score 1 to 10 plus 30-yr risk narrative; powers Redfin, Realtor.com (until Dec 2025 also Zillow) | Homebuyers; some lenders | Closed model; commercial partnerships; Zillow removed it under industry pressure in Dec 2025 | |
| | **ClimateCheck** | Score 1 to 100 plus around 30-page property report; 2050 projections | Homeowners plus REIT/PE diligence | Subscription tiers; methodology behind paywall | |
| | **Jupiter ClimateScore Global** | Enterprise SaaS / API; financial metrics (CapEx, OpEx, credit risk) | Banks, insurers, asset managers | Enterprise pricing; not consumer-facing | |
| | **Cervest / Climate X / ICEYE** | Variants of above for ESG / reinsurance | Corporate finance and insurance | Same | |
|
|
| Score-based tools all converge on the same shape: one number, one |
| chart, an explainer paragraph. None show what claim is grounded in |
| which dataset. None expose the audit trail. |
|
|
| ### NYC-specific government tools |
|
|
| - **FloodHelpNY** (City plus State, IDEO-designed). Address lookup |
| to flood-zone label plus insurance plus free resiliency audit. |
| Forms-based, consumer-facing, doesn't fuse live signals. |
| - **NYC Flood Hazard Mapper.** ArcGIS web map of FEMA, NPCC, Sandy, |
| and future scenarios. Static visualization, no narrative. |
| - **NYC OEM Flood Maps page.** Index of the above. |
| - **EJNYC Flood Vulnerability Index** (released 2024-04 by Mayor's |
| Office of Climate and EJ). First-ever city FVI, used to direct |
| spending under NY's "Disadvantaged Communities" framework (35% of |
| climate spend by law). |
| - **FloodNet NYC** (NYU plus CUNY plus city). Over 350 ultrasonic |
| sensors at 1-min cadence, growing to 500 by end-2026. Has a public |
| dashboard but no narrative layer. |
|
|
| ### Federal / authoritative |
|
|
| - **FEMA Flood Map Service Center / NFHL.** Official; covers 90%+ |
| of population; static GIS layer plus PDFs. The disclosure-of- |
| record but not a synthesis tool. |
|
|
| ### Real-estate platforms (the volatile zone) |
|
|
| - **Redfin.** Still shows First Street Flood Factor on every |
| listing. |
| - **Realtor.com.** Still shows it on 110M+ listings. |
| - **Zillow.** Removed climate risk display in December 2025 under |
| California Regional MLS pressure. Still links out, but it's |
| hidden. This created a vacuum that an open citation-grounded |
| alternative could fill. |
|
|
| ### Closest academic / AI comps |
|
|
| - **Flood-LLM** (Brisbane, MDPI Sustainability 2026). Multi-source |
| LLM for property-level flood risk, validated on Brisbane against |
| official labels. Academic, not deployed; no Mellea-style citation |
| discipline; no live signals. |
| - **GIS-Integrated Flood LLM** (Tandfonline 2024). LLM constrained |
| by a flood knowledge graph plus GIS interaction. Research artefact. |
| - **FloodLense** (arXiv 2024). UNet/RDN/ViT plus LLM for satellite |
| flood detection. Research; image-only. |
|
|
| --- |
|
|
| ## Where Riprap fits: differentiators that demo well |
|
|
| Ranked by visibility in a 3-minute demo: |
|
|
| 1. **Citation prose vs scores.** Riprap returns *"Hurricane Sandy |
| flooded this address on October 29 to 30, 2012, according to the |
| empirical inundation zone [sandy]. 19 flood-related 311 service |
| requests were logged within 200 m over five years [nyc311]."* |
| Every number cites a doc; each doc resolves to a footer source |
| row. First Street returns "Flood Factor 8/10". This gap is the |
| demo. |
| 2. **Live, historical, modeled, projected: in one paragraph.** Sandy |
| 2012 (empirical), DEP 2080 stormwater scenarios (modeled), 311 |
| last 5 years (proxy), FloodNet last 3 years (empirical, |
| hyperlocal), NPCC4 SLR (projected), Granite TTM r2 surge nowcast |
| (96-h forecast). No comp combines all four temporal modes. |
| 3. **Open-source NYC fine-tunes.** Three Apache-2.0 models |
| (`Prithvi-EO-2.0-NYC-Pluvial`, `TerraMind-NYC-Adapters`, |
| `Granite-TTM-r2-Battery-Surge`) trained on AMD MI300X. Anyone can |
| reproduce, fork to other cities, or audit. First Street's model |
| is closed; ClimateCheck's methodology is behind a paywall. |
| 4. **AMD hardware story.** The whole stack runs on MI300X via vLLM |
| (LLM) plus a sibling ROCm container (probes). All Apache-2.0. |
| This is the AMD hackathon track's preferred narrative: open |
| models, open infra, open data, real GPU acceleration. |
| 5. **Mellea grounding receipts.** The four checks |
| (`numerics_grounded`, `no_placeholder_tokens`, `citations_dense`, |
| `citations_resolve`) are the audit. The meta card surfaces "4/4 |
| grounding checks passed, 1 reroll". That's audit credibility no |
| consumer comp shows. |
| 6. **Self-aware silence.** Touchstone shows "FloodNet sensor: 0 |
| events in 3 years" with `silent_by_design`. Lodestone shows "TTM |
| Battery surge forecast: peak |residual| < 0.3 m, omitted." Most |
| tools always render a value. Riprap's silence is a feature. |
|
|
| --- |
|
|
| ## Stakeholder demos to craft |
|
|
| Six concrete personas, each with a query that exercises a different |
| part of the system. These are the demo arcs to rehearse. |
|
|
| ### 1. Resident / homebuyer (the FloodHelpNY swap-in) |
|
|
| > *"I'm thinking about renting an apartment at 80 Pioneer Street, |
| > Brooklyn. Should I worry?"* |
|
|
| **Demo arc.** Type the address. Watch the planner classify |
| `single_address`, then 19 step events fire across the four data |
| Stones in around 13 s. Briefing names Sandy 2012 inundation, 65 311 |
| complaints, 2 FloodNet sensors with 4 events including a 51 mm peak |
| on a specific date, Ida 2021 HWM 130 m away, microtopo HAND 3.81 m |
| plus TWI 14.79 (very high saturation propensity). Footer shows 7+ |
| named primary sources. |
|
|
| **Demo hook.** "Compare what we just generated to First Street's |
| number-and-bar-chart for the same address. Which would you trust to |
| make a $4,000/month decision?" |
|
|
| ### 2. Real-estate attorney / disclosure compliance |
|
|
| > *"Does 100 Gold Street, Manhattan need to disclose flood risk |
| > under RPL §462(2)?"* |
|
|
| **Demo arc.** Same single_address path. Briefing produces a citable |
| narrative covering FEMA designation, prior flood claims (where |
| present), terrain, recent complaints. Mellea grounding check is the |
| qualifier: "this prose is grounded against four invariants and |
| passed 4/4." |
| |
| **Demo hook.** New York's March-2024 amended Property Condition |
| Disclosure Statement requires sellers to disclose flood history and |
| FEMA-floodplain status. RPL §231-b requires every residential lease |
| to disclose prior flood damage. Riprap is the citable narrative |
| tool. Show how the briefing maps line-by-line to the disclosure |
| requirements. |
| |
| ### 3. NYC OEM / DEP planner |
| |
| > *"Hollis, Queens"* |
| |
| **Demo arc.** Neighborhood intent fires (9 step events), produces an |
| NTA-level briefing. 434 flood-related 311 over 3 years (87 catch- |
| basin clogged, 42 street-flooding), 4.3% of neighborhood projected |
| to flood under DEP moderate-2050 scenario, 25% of cells with HAND<1 |
| m. RAG retrieval pulls relevant DEP/NPCC4 policy paragraphs. |
| |
| **Demo hook.** DEP just announced a $30B stormwater priority list |
| (86 locations) and a $68M Brooklyn Bluebelt expansion in Prospect |
| Park. Riprap supports the prioritization argument with citable per- |
| NTA evidence. Pair with the EJNYC Flood Vulnerability Index for the |
| EJ-spending overlay (35%-to-disadvantaged-communities legal |
| mandate). |
| |
| ### 4. Insurance underwriter / actuary |
| |
| > *"442 East Houston Street, Manhattan"* |
| |
| **Demo arc.** Same as resident demo, but emphasize the **provenance |
| trace** UI. Every Stone row, every doc_id, every source URL, |
| vintage, and tier glyph. |
|
|
| **Demo hook.** When an underwriter writes a risk memo, the audit |
| chain matters. First Street's "we used a proprietary catastrophe |
| model" doesn't survive a regulator review the way "we used FEMA |
| Sandy 2012 polygon, NYC DEP 2021 stormwater scenario, USGS Ida HWM |
| Event 312, NOAA gauge 8518750, NWS station KNYC, Granite TTM r2 |
| fine-tune (test MAE 0.1091 m vs 0.1467 zero-shot, citable)" does. |
|
|
| ### 5. Climate journalist / advocacy |
|
|
| > *"Coney Island, Brooklyn"* |
|
|
| **Demo arc.** Neighborhood briefing. 87.5% of NTA in 2012 Sandy |
| zone, 382 flood complaints over 3 years, 7.8% projected flooded |
| under 2050 moderate, 38.9% of DEM cells with HAND<1 m, DEP extreme- |
| 2080 at 44.2% flooded. |
|
|
| **Demo hook.** ProPublica/NYTimes/THE CITY-style data journalism. |
| Every claim in a Riprap briefing is reproducible. Anyone can paste |
| the same query and get a near-identical narrative. The journalist |
| can publish the briefing as the methods section. |
|
|
| ### 6. Architect / developer |
|
|
| > *"What are they building in Gowanus and is it risky"* |
|
|
| **Demo arc.** Planner classifies `development_check`. FSM pulls DOB |
| filings plus flood layers for the project sites. Briefing comments |
| on which proposed buildings sit inside Sandy 2012, which intersect |
| DEP extreme-2080, what the microtopo says. |
|
|
| **Demo hook.** Pre-design siting check. The Gowanus rezoning is one |
| of NYC's largest active development zones, well known to flood. Show |
| how the tool surfaces flood concerns before architects pour |
| concrete. |
|
|
| --- |
|
|
| ## Lateral and unexpected use cases |
|
|
| Ten bets, ordered roughly from most-buildable to most-speculative. |
|
|
| 1. **Pre-storm cohort briefings.** Subscribe Riprap to NWS flood- |
| watch alerts. When a watch lands, fan out one briefing per |
| affected NTA plus push to OEM, press, and advocacy lists. Citable |
| evidence on demand for the press cycle that follows. |
| 2. **Climate-grant evidence sections.** HUD CDBG-DR and FEMA BRIC |
| applications need an auditable evidence base. Riprap auto- |
| generates the "vulnerability assessment" section so a community |
| group can apply for resilience funding without hiring a |
| consultant. |
| 3. **Local Law disclosure boilerplate.** Plug into a brokerage's |
| listing flow. When an agent enters an address, auto-generate the |
| NY RPL §231-b lease addendum or §462(2) disclosure draft. ROI is |
| high since the law took effect 2024 and many landlords are still |
| figuring out compliance. |
| 4. **MTA station-hardening prioritization.** Riprap already has the |
| MTA-entrance probe (KEY-001 in the demo). Run the FSM across all |
| subway entrances; rank by exposure × ridership. The MTA's |
| October-2025 Climate Resilience Roadmap Update is the policy |
| hook. |
| 5. **DOE school siting.** When DOE reviews proposed school sites or |
| selects schools for retrofit, Riprap briefings (with `expect_311_ge` |
| plus Sandy plus DEP overlays) would catch flood exposure that |
| form-style screens miss. |
| 6. **Time-machine variant.** Re-run the FSM with snapshot data from |
| a past date. *"What would Riprap have said about Hollis on August |
| 31, 2021, the day before Ida?"* Useful for retrospective analysis, |
| expert testimony, and stress-testing the system. |
| 7. **Cross-city scaffold.** The architecture is NYC-specific by data |
| choice, not by code. Port to Houston (post-Harvey plus Hurricane |
| Beryl 2024), Miami (king tides), Boston (CSO floods), Charleston |
| (chronic tidal), with a per-city probe set plus RAG corpus. |
| 8. **Federation with FloodNet alerts.** When a sensor triggers a |
| flood event NOW, fire a Riprap live_now briefing for the |
| surrounding NTA: *"what's at stake in the next 6 hours."* |
| Connects FloodNet's hyperlocal sensor reads to the OEM decision |
| loop. |
| 9. **EJNYC × Riprap pairing.** Rank all 188 NTAs by Riprap-detected |
| exposure, intersect with state DAC designations. Output: a map of |
| "underserved plus underwater". The most underfunded high-exposure |
| neighborhoods. |
| 10. **Court testimony / expert witness.** Citable, reproducible |
| flood narrative as a court exhibit. The Mellea passes-record |
| plus provenance trace are the kind of artefact a regulator or |
| judge can audit. Especially relevant after the December-2025 |
| Zillow controversy created public discussion of climate-data |
| integrity. |
| |
| --- |
| |
| ## Risks and framing |
| |
| - **Real-estate industry pushback.** December 2025: Zillow removed |
| First Street's climate scores under MLS pressure because the data |
| was hurting transaction volume. A free, citation-grounded |
| alternative could face the same reflex. Riprap's defence is that |
| it's a narrative tool for professional analytical work, not a |
| buy/don't-buy verdict. Keep the disclaimer footer prominent. |
| - **Redlining hazard.** Exposure narratives can be misused by |
| landlords or insurers to discriminate against high-flood-risk |
| (often disproportionately disadvantaged) neighborhoods. |
| Mitigations: (a) the citation transparency makes biased reasoning |
| auditable, (b) the EJNYC pairing in lateral-use #9 reframes |
| exposure data as a tool *for* affected communities, not against |
| them, (c) keep "for professional analytical work, not personal |
| property decisions" front and center. |
| - **Disclosure-status liability.** A briefing is *evidence* but |
| probably not *the* §462(2) disclosure under New York real-estate |
| law. Don't position it as legal disclosure-of-record without a |
| real-estate-attorney review. |
| - **Cold-start latency.** First query after droplet redeploy is |
| around 30 s while models warm. For demos, ping the Space and run |
| one warm-up query 5 minutes before showtime. |
| - **Geocoder edge cases.** "PS 188, Lower East Side" geocoded to a |
| Brooklyn PS 188 in our test suite. For demos, pick fully-qualified |
| street addresses; document the disambiguation behavior. |
| |
| --- |
| |
| ## Polish punch-list (deck-driven) |
| |
| Concrete polish items the research surfaces, ranked by demo value: |
| |
| 1. **Sample-query pills on landing.** Six clickable pills below the |
| search bar, one per persona above. Let the audience demo |
| themselves. |
| 2. **A "What this is" bar at the top of the landing.** Three lines: |
| *"Citation-grounded NYC flood briefings. Every number cites a |
| primary source. Open-source, public data, audit-grade synthesis."* |
| 3. **Compare-mode link from the briefing.** Once Riprap delivers a |
| single_address briefing, surface "compare with another address" |
| as a one-click affordance. The compare intent already exists in |
| the planner. |
| 4. **EJNYC-FVI overlay** on the map sidebar (#9 above). Riprap's |
| exposure × DAC designation, two clicks to a powerful editorial |
| demo. |
| 5. **First-query warm-up message** during the cold start: *"loading |
| probes on AMD MI300X. First query after redeploy takes around 30 |
| s; subsequent queries 5 to 13 s."* |
|
|
| --- |
|
|
| ## Sources |
|
|
| - [First Street Foundation: Flood Factor methodology](https://firststreet.org/methodology/flood) |
| - [FloodHelpNY: NYC and IDEO consumer tool](https://www.floodhelpny.org/en) |
| - [ClimateCheck: flood risk methodology](https://climatecheck.com/risks/flood) |
| - [Jupiter Intelligence: ClimateScore Global / FloodScore](https://www.jupiterintel.com/climatescore-global) |
| - [FEMA Flood Map Service Center](https://msc.fema.gov/) |
| - [NY State: RPL §231-b residential lease flood disclosure (2023)](https://www.nysenate.gov/legislation/bills/2021/S5472) |
| - [NYSBA: Property Condition Disclosure flood-risk amendment (Mar 2024)](https://nysba.org/breaking-news-new-rules-on-property-condition-disclosure-and-flood-risk-go-into-effect-today/) |
| - [CNN: Zillow removes climate risk data under industry pressure (Dec 2025)](https://www.cnn.com/2025/12/02/climate/zillow-climate-data-extreme-weather-first-street-redfin) |
| - [NYC Stormwater Resiliency Plan](https://www.nyc.gov/assets/orr/pdf/publications/stormwater-resiliency-plan.pdf) |
| - [FloodNet NYC: methodology and sensor network](https://www.floodnet.nyc/methodology) |
| - [FloodNet WRR 2024: peer-reviewed sensor paper](https://agupubs.onlinelibrary.wiley.com/doi/full/10.1029/2023WR036806) |
| - [EJNYC Report: Mayor's Office of Climate and Environmental Justice](https://climate.cityofnewyork.us/ejnyc-report/the-state-of-environmental-justice-in-nyc/) |
| - [Flood-LLM: Brisbane case study (MDPI 2026)](https://www.mdpi.com/2071-1050/18/6/2957) |
| - [GIS-Integrated Flood LLM (Tandfonline 2024)](https://www.tandfonline.com/doi/full/10.1080/13658816.2024.2306167) |
| - [THE CITY: Disadvantaged Communities flood funding (NY Climate Law)](https://www.thecity.nyc/2022/05/02/billions-ny-climate-law-disadvantaged-communities-flood/) |
| - [Inman: Redfin First Street integration](https://www.inman.com/2021/02/18/redfin-starts-displaying-flood-risk-data-on-listings/) |
| - [FACTUM: citation-hallucination detection in long-form RAG](https://arxiv.org/pdf/2601.05866) |
| - [AMD x lablab.ai Developer Hackathon (May 4 to 10, 2026)](https://lablab.ai/ai-hackathons/amd-developer) |
|
|