# Riprap scoring methodology > Riprap produces a **flood-exposure tier (1–4) per NYC address**, not > a calibrated damage probability. The tier is a deterministic > literature-grounded composite of public-data signals; the language > model writes the citing prose around it but does not score. ## 1. Why this design Closed-methodology scores (First Street, Jupiter, Fathom) are useful products but uncitable in civic work. A NYCEM grant writer can't quote "0.73" in a FEMA BRIC sub-application without a defensible audit trail. At the same time, an LLM-emitted score would be non-reproducible and uncalibrated, with documented LLM-as-judge pathologies (Zheng et al. 2023; Wang et al. 2024). The honest middle: **a deterministic rubric a planner can argue with**. The tier is computed in `app/score.py` and mirrored in `web/static/app.js`. Both implementations are kept in sync; the Python side is authoritative for register builds and CLI exports. ## 2. Methodology pedigree The composite construction follows a well-trodden path in the multi- indicator vulnerability/exposure literature: - **Cutter, Boruff & Shirley (2003)**, *Social Science Quarterly* 84(2): 242–261. The SoVI hazards-of-place pattern: group indicators thematically; sum factors with equal weights because there is no defensible theoretical basis for differential weighting. - **Tate (2012)**, *Natural Hazards* 63: 325–347. Explicit Monte Carlo sensitivity analysis showing that hierarchical equal-weighted composites are the most rank-stable. This is why we use equal weights *within* sub-indices. - **Balica, Wright & van der Meulen (2012)**, *Natural Hazards* 64: 73–105. Coastal City Flood Vulnerability Index, multiplicative (Exposure × Susceptibility / Resilience). We adopt only the override-behavior of multiplicative form, as a "max-empirical floor" (§4 below), because we have no resilience term. - **Kim et al. (2019)**, *Scientific Reports* 9:18564. Additive vs geometric aggregation; additive is more transparent and reproducible *if* sub-indices are pre-grouped thematically. Done. NPCC4 (2024) Ch. 3 (Rosenzweig et al., *Annals of the New York Academy of Sciences* 1539) and the NYC Hazard Mitigation Plan 2024 supply the NYC-specific tiering hierarchy that informs which scenarios get higher weights inside the Regulatory sub-index. ## 3. Sub-index structure Three thematic sub-indices, each normalized to [0, 1] by dividing the weighted sum by the maximum possible weight in the group. The composite is the simple sum of the three sub-indices (range 0–3). ### 3.1 Regulatory sub-index Binary "inside zone" indicators with weights ordered by agency tiering: | Indicator | Weight | Citation | |---------------------------------|-------:|----------| | FEMA NFHL 1% (SFHA) | 1.00 | FEMA NFHL. Regulatory mandate threshold | | FEMA NFHL 0.2% | 0.50 | FEMA NFHL. Tail scenario | | NYC DEP Moderate-2050 + 2.5 ft | 0.75 | NYC DEP Stormwater Maps 2021; NPCC4 Ch.3 | | NYC DEP Extreme-2080 + SLR | 0.50 | NYC DEP Stormwater Maps 2021. Explicitly tail | | NYC DEP Tidal-2050 | 0.75 | NPCC4 Ch.3 coastal projection | Why DEP-2050 outranks DEP-2080: NPCC4 designates the 2080 extreme scenario as a **tail** projection. Closer-horizon coastal/pluvial maps. Those a current planner can act on. Get the higher weight. ### 3.2 Hydrological sub-index Continuous terrain measures, banded into 4 levels (1.0 / 0.66 / 0.33 / 0): | Indicator | Weight | Bands | Citation | |---|---:|---|---| | HAND (m) | 1.00 | <1, 1–3, 3–10, ≥10 | Nobre et al., 2011, *J. Hydrology* 404: 13–29 | | TWI quartile | 0.50 | ≥12, 10–12, 8–10, <8 | Beven & Kirkby, 1979; Sørensen et al., 2006, *HESS* 10 | | Elev pct (200 m, inv) | 0.50 | <10, 10–25, 25–50, ≥50 | Standard geomorphometric proxy | | Elev pct (750 m, inv) | 0.50 | <10, 10–25, 25–50, ≥50 | Standard geomorphometric proxy | | Basin relief (m) | 0.25 | ≥8, 4–8, 2–4, <2 | Supporting variable, Nobre 2011 | TWI is half-weighted relative to HAND because TWI is documented as noisier in flat urban DEMs (Sørensen 2006 explicitly states TWI is site-specific and best percentile-binned). HAND remains the canonical hydrology indicator (Aristizabal et al. 2023, *WRR* 59, NOAA NWM). ### 3.3 Empirical sub-index Mix of binary observed-extent flags and banded count signals: | Indicator | Weight | Citation | |----------------------------|-------:|----------| | Sandy 2012 inundation | 1.00 + **floor** | NYC OD `5xsi-dfpx`; NYC HMP 2024 | | USGS Ida HWM within 100 m | 1.00 + **floor** | USGS STN Event 312 | | USGS Ida HWM within 800 m | 0.50 | USGS STN Event 312 | | Prithvi-EO 2.0 Ida polygon | 0.75 | Jakubik et al., 2025 (NASA/IBM Prithvi-EO 2.0); semi-empirical | | 311 complaint count band | 0.75 | NYC OD `erm2-nwe9`; NYC 311-as-flood-proxy literature | | FloodNet trigger (3 yr) | 0.75 | FloodNet NYC; NPCC4 Ch.3 references | The 311 and FloodNet weights are capped at 0.75 because both signals have documented coverage and reporting bias. 311 reflects civic engagement as well as flooding, FloodNet has uneven spatial coverage. Sandy and HWMs are 1.0 because they're engineered ground-truth observations. Bands for 311 count (200 m buffer, 5-year window): | Count | Value | |---------|------:| | ≥10 | 1.00 | | 3–9 | 0.66 | | 1–2 | 0.33 | | 0 | 0 | ## 4. Max-empirical floor If **Sandy 2012 inundation** OR **a USGS Ida HWM within 100 m** fired, the tier is capped at **2 (Elevated)**. It cannot be worse, regardless of the additive composite. This recovers the *important* multiplicative behaviour Balica 2012 argues for: empirical, ground-truth observations should not be cancelled out by terrain or modeled scenarios. We implement it as a floor (a `min(tier, 2)` after composition) rather than a full multiplicative form so the composite remains additive and auditable. The 100 m radius is chosen because USGS HWM positional uncertainty is typically 5–30 m horizontal. 100 m gives ~3σ headroom for a confident "this address was inundated" signal. ## 5. Composite → tier mapping The composite is the sum of the three normalized sub-indices (range 0–3): | Composite | Tier | Label | |-----------|-----:|----------------------| | ≥ 1.50 | 1 | High exposure | | ≥ 1.00 | 2 | Elevated exposure | | ≥ 0.50 | 3 | Moderate exposure | | > 0 | 4 | Limited exposure | | 0 | 0 | No flagged exposure | Then floor: `Sandy or HWM<100m → tier ≤ 2`. ## 6. Live signals are NOT in the score NWS active alerts, NOAA tide residual (surge), and NWS hourly precip are **not** part of the static tier. Per **IPCC AR6 WG II** glossary and **NPCC4** Ch. 3, exposure is a quasi-stationary property of place; event occurrence is time-varying. Mixing the two would produce a tier that flickers every six minutes and that residents could interpret as neither "is my building exposed?" nor "is it flooding right now?". Live signals are surfaced separately in the UI as a **"Current conditions"** badge, with their own provenance (NOAA station ID, NWS alert URL, ASOS station code), and they expire on their own cadence. Static tier is unaffected. This mirrors how First Street separates Flood Factor (static, 30-yr horizon) from event-day Flood Lab products, and how Fathom separates Global Flood Map from real-time intelligence. ## 7. Honest scope Riprap's tier is **not**: - A flood-damage probability or expected loss. - A flood-insurance rating. For that, see **FEMA Risk Rating 2.0** (FEMA 2021), which uses claims-driven GLMs over decades of labeled outcome data we do not have. - A vulnerability assessment. Engineering fragility (foundation type, electrical hardening, drainage), social capacity, and financial absorption are out of scope. - A prediction. Future-scenario layers (DEP 2050/2080, FEMA 0.2%) are bounding scenarios, not forecasts. It **is**: - An exposure prior. A literature-grounded, deterministic, reproducible index of how many publicly-documented flood signals overlap this address. - Auditable end-to-end: every term has a published source; every weight has a rationale; the floor rule has a stated motivation; the tier breakpoints are documented above. - Forkable: a researcher who disagrees with any weight can edit `app/score.py` and rerun. The UI methodology panel makes this invitation explicit. ## 8. Caveats foregrounded in UI copy These appear next to the tier badge and in the methodology disclosure: > **Riprap tiers are not flood-damage probabilities.** They reflect > publicly-documented exposure indicators only. > **311 counts are influenced by neighborhood reporting habits** and > may under-represent flooding in lower-engagement areas > (Agonafir et al. and the broader 311-as-civic-engagement literature). > **DEP 2050/2080 and FEMA 0.2% are bounding scenarios, not forecasts.** > The tier reads them as "if this scenario materialized, this address > would be inside its footprint". Not "this is the expected future." > **Compound flooding is not separately modeled.** Concurrence of rain > + tide + groundwater is the residual research frontier (NPCC4 Ch. 3). ## 9. Sensitivity / future work - **Tate-style Monte Carlo perturbation** of weights to characterize how sensitive each tier assignment is to weight choice. Not yet implemented; would be a natural next research output. - **Calibration exercise** if a labeled dataset emerges (FEMA assistance records, building-level damage from Sandy/Ida insurance claims). Until then, "calibrated" is a word we do not use. - **Block- or NTA-level aggregation** for neighborhood-grade scoring, with each indicator computed as an areal aggregate rather than a point sample. ## References Aristizabal, F. et al. (2023). "Improving Continental Hydrologic Modeling Using Height Above Nearest Drainage." *Water Resources Research* 59. Balica, S., Wright, N., & van der Meulen, F. (2012). "A Flood Vulnerability Index for Coastal Cities and Its Use in Assessing Climate Change Impacts." *Natural Hazards* 64: 73–105. Beven, K. J., & Kirkby, M. J. (1979). "A Physically Based, Variable Contributing Area Model of Basin Hydrology." *Hydrological Sciences Bulletin* 24(1): 43–69. Cutter, S. L., Boruff, B. J., & Shirley, W. L. (2003). "Social Vulnerability to Environmental Hazards." *Social Science Quarterly* 84(2): 242–261. FEMA (2021). *NFIP Risk Rating 2.0 Methodology and Data Sources.* Jakubik, J. et al. (2025). "Prithvi-EO 2.0: A Versatile Multi-Temporal Foundation Model for Earth Observation Applications." NASA/IBM. Kim, S. et al. (2019). "Assessment of Aggregation Frameworks for Composite Indicators in Measuring Flood Vulnerability to Climate Change." *Scientific Reports* 9:18564. Nobre, A. D. et al. (2011). "Height Above the Nearest Drainage. A Hydrologically Relevant New Terrain Model." *Journal of Hydrology* 404(1–2): 13–29. NYC HMP (2024). *NYC Hazard Mitigation Plan 2024.* NYC Emergency Management. NYC NPCC4 (2024). *4th NYC Climate Assessment.* New York City Panel on Climate Change. Including Rosenzweig et al., Ch. 3, *Annals NYAS* 1539. Sørensen, R., Zinko, U., & Seibert, J. (2006). "On the Calculation of the Topographic Wetness Index." *Hydrology and Earth System Sciences* 10: 101–112. Tate, E. (2012). "Social Vulnerability Indices: A Comparative Assessment Using Uncertainty and Sensitivity Analysis." *Natural Hazards* 63: 325–347.