Applications · Section 2
Risk profiling: the post-model report
How a cat modeller turns a cedent's loss sets into the post-model report an underwriter reads before structuring a deal — loss-set reconciliation, exposure summary, and the subject loss risk profile.
The business problem
Section titled “The business problem”It is renewal season at Helios Re. SunCoast Insurance — the Florida-based cedent behind three of the five contracts in the portfolio — wants its reinsurance program renewed, and its broker has sent the submission: the package of exposure data, historical losses, and desired structure that opens every placement.
The catastrophe modeller has run SunCoast’s exposure data through the vendor models. Now comes the deliverable this story is about: the post-model report — the write-up of model output that characterizes the cedent’s exposure and subject loss for the underwriter, before any contract is applied. Names for this artifact vary by organization — modelling summary, account analytics report — but every reinsurer with a modelling team produces some version of it.
The underwriter will not rerun the models. They will read this report and decide how to structure and price the renewal. That makes the report’s job precise: answer the questions the underwriter would otherwise have to guess at.
- What arrived, and does it behave? Which loss sets, from which models, and do they reconcile?
- Where is the book exposed? The book is the set of risks SunCoast has underwritten — the policies whose losses this submission is asking Helios Re to help carry. Which perils, geographies, and lines of business carry the risk?
- What does the subject loss look like? Expected loss, volatility, and — above all — the tail.
- What drives the tail? One giant event, or many medium ones? Which peril, which geography?
Everything in this story uses tools the Analytics Toolkit already built: trials and TELTs, EP curves, and the metrics EL, VaR, and TVaR. This is the Toolkit exercised end to end on a real task — measuring the object before anything transforms it.
The pipeline at a glance
Section titled “The pipeline at a glance”Everything left of the loss sets — exposure data preparation, geocoding, model configuration — is the catastrophe modelling process itself, and stays out of scope here (the trial worldview section sketches where trials come from; model internals are a discipline of their own). Our pipeline starts where the model output arrives: one loss set per peril, each a TELT.
This split is the normal state of affairs, not an inconvenience. Hurricane and earthquake come from different models — often different vendors — with different event catalogs. A submission covering two perils arrives as two loss sets, and nothing guarantees they agree on schema, units, or conventions until you check.
Step 1: validate and reconcile the loss sets
Section titled “Step 1: validate and reconcile the loss sets”Before any metric is quoted, the analyst answers a quieter question: do these loss sets behave? The workflow is the same everywhere, even when the tooling differs:
- Validate each loss set at the ingestion boundary — schema, sign convention, trial range. One assert per invariant.
- Compute standalone metrics per loss set — each peril’s distribution on its own.
- Combine and recompute — the same metrics on the union of loss sets.
- Reconcile — check that standalone and combined views relate the way arithmetic says they must.
The reconciliation step has teeth because expected loss and the tail metrics behave differently under combination — and only one of them gives an exact check. Expected loss is a mean, and trial by trial the combined loss simply is the hurricane loss plus the earthquake loss, so the means add exactly:
That identity is the check: if the combined EL is anything other than the sum of the parts, something is structurally wrong — duplicated rows, a dropped trial, a join gone bad. A quantile carries no such identity. The combined is not the sum of the standalone VaRs, and — as the numbers below show — the difference is not an error to chase but the first sight of diversification.
The output, as a table:
| Loss set | Rows | EL | Std | VaR(90%) | TVaR(90%) | Max |
|---|---|---|---|---|---|---|
| HU | 400 | $254.6M | $133.0M | $471.8M | $532.4M | $593.0M |
| EQ | 1,257 | $21.0M | $6.9M | $31.2M | $32.7M | $34.2M |
| Sum | $275.6M | $503.0M | $565.1M | |||
| Combined | 1,657 | $275.6M | $135.3M | $489.1M | $552.6M | $616.2M |
The EL check passes exactly: $254.6M + $21.0M = $275.6M, matching the combined book and the average loss the Toolkit established for SunCoast.
The tail is the more interesting half. The standalone VaR(90%)s sum to $503.0M, but the combined VaR(90%) is $489.1M — $14M lower, and the reason is concrete. With trials the 90% tail is the second-worst year (); hurricane’s second-worst year is trial 2, earthquake’s is trial 12 — different years. The $503.0M sum implicitly prices a year in which both perils hit their second-worst at once, which never happens in the data. The combined VaR instead reads the second-worst total year, trial 2, where earthquake contributed only $17.3M rather than its own $31.2M peak. That $14M shortfall is diversification: the perils reach their tails in different trials, so the book’s tail comes in under the sum of its parts.
The sum of standalone VaRs is worth naming, because it is the no-diversification benchmark — the tail the book would carry if the two perils always peaked together (the comonotonic case, in which VaR genuinely does add). The distance from that benchmark down to the combined tail is the first quantitative glimpse of diversification benefit; the portfolio roll-up story measures it properly, across many contracts at once. One honest caveat on the benchmark: plain VaR is not guaranteed to fall below the standalone sum the way it does here — it is the reason the site prefers TVaR, which is sub-additive and can never exceed the sum of standalone TVaRs. Here VaR behaves; the analysis can proceed.
Step 2: the exposure picture
Section titled “Step 2: the exposure picture”With the loss sets verified, the report turns to the book itself. The TELT’s tiv column — the total insured value each occurrence touched — lets the report characterize the book from the model-output side along three axes the underwriter will structure against:
- Concentration — which perils, geographies, and lines of business the loss falls on. We measure this as each slice’s expected annual loss: its share of the $275.6M book EL, computed by averaging the slice’s loss across the 20 trials. Summed over any one dimension, these contributions add back to the book EL, so the chart below is a decomposition of expected loss, not a pile of unrelated totals.
- Exposed value — how much insured value stands behind each slice (the TIV the events touched), which says whether a concentration of loss reflects a large book or an unusually severe one.
- Severity — how hard events damage what they touch, read as the aggregate damage ratio, over the slice. Severity that is uniform across slices is reassuring; an outlier slice is where a data problem would surface.
SunCoast expected annual loss by geography — each geography's mean contribution to a year's loss, the seven bars summing to the book's $275.6M expected loss. Labels show each geography's share; hover for the aggregate damage ratio. Texas — not Florida — carries the largest share.
The same view in numbers, by peril — expected annual loss is each peril’s contribution to the book EL, so these are exactly the per-loss-set ELs from the reconciliation table above:
| Peril | Expected annual loss | Share | Damage ratio |
|---|---|---|---|
| Hurricane (FL, GA, MS, LA, TX) | $254.6M | 92.4% | 11.6% |
| Earthquake (CA, AZ) | $21.0M | 7.6% | 12.2% |
Three observations carry into the report:
- This is a hurricane book. 92% of expected loss is hurricane, on an exposed value many times the earthquake book’s. The earthquake exposure is genuinely secondary — SunCoast writes it at roughly a third of market scale (the Helios Re data notes record the 0.35 scaling), and the loss sets reflect that.
- The footprint is Gulf-wide, not Florida-shaped. Texas carries 36.4% of expected loss — more than twice Florida’s 15.9%. A reader who knows SunCoast as “the Florida cedent” learns something here, and so does the underwriter renewing a Florida-only layer: most of this book’s expected loss falls outside that layer’s geography.
- Severity is homogeneous. Aggregate damage ratios sit between 10.1% and 13.4% across every peril, geography, and line of business. No slice shows the implausibly high or low severity that usually signals an exposure-data problem. For a report whose reader cannot rerun the model, this is the data-quality signal that matters.
Step 3: the subject loss profile
Section titled “Step 3: the subject loss profile”The core of the report is the distribution itself. One chart and one table characterize the subject loss the way the Toolkit taught: per-peril and combined AEP curves, then the headline metrics with return periods.
SunCoast subject loss AEP curves per loss set (20 trials, log return-period axis). The earthquake curve hugs the bottom of the chart at this scale — that flatness is the finding: the tail of the combined book is a hurricane tail.
| Metric | Value | Reading |
|---|---|---|
| EL | $275.6M | Mean annual loss across all 20 trials |
| Std (CV) | $135.3M (0.49) | Volatility around the mean |
| AEP VaR(80%) | $378.2M | 1-in-5 annual loss |
| AEP VaR(90%) | $489.1M | 1-in-10 annual loss |
| AEP TVaR(90%) | $552.6M | Average of the worst 10% of years |
| AEP VaR(95%) | $616.2M | 1-in-20 annual loss — the worst trial |
| OEP VaR(90%) | $174.1M | 1-in-10 largest single occurrence |
With trials, the VaR machinery is concrete enough to check by eye. Writing for the -th largest annual loss (the order-statistic notation from the Toolkit), at the tail holds trials, so — the second-largest annual loss — and averages the worst two. At the tail is a single trial, so VaR, TVaR, and the maximum coincide at $616.2M: the 20-trial dataset has hit its resolution limit, a point the report must state rather than bury (more on this below).
Two structural findings ride on the chart:
- The tail is a hurricane tail. Toggle the earthquake curve and watch it hug the axis: its worst year is $34.2M, smaller than the average hurricane year. In the two tail trials that define TVaR(90%), hurricane losses are 96% of the total. Earthquake matters in the body of the distribution — a steady $10–34M every year — but contributes nothing material to the tail.
- Bad years are many-storm years. The 1-in-10 occurrence loss is $174.1M while the 1-in-10 annual loss is $489.1M. That $315M gap is multi-occurrence risk — the OEP vs. AEP distinction doing real work. SunCoast’s worst years are bad because several storms make landfall, not because a single storm is exceptionally large: the worst trial holds 29 occurrences. And the OEP curve’s flat top tells its own story — a single catalog event, a $174.1M Texas hurricane, recurs in 7 of 20 trials and single-handedly sets the occurrence tail.
The gap between the two curves below is the visual form of that finding — the aggregate (annual) curve sits well above the occurrence (single-largest) curve at every return period, and the distance between them is exactly the multi-occurrence risk:
SunCoast subject loss EP curves (20 trials). Move your cursor over the chart for details. The dashed AEP curve is always at or above the solid OEP curve — the gap reflects multi-occurrence risk.
The profile itself composes the pieces the Toolkit already provides — build the AEP and OEP curves, read off the metrics, attribute the tail:
What the underwriter reads off this page
Section titled “What the underwriter reads off this page”The report exists to drive decisions, so let’s close the loop: here is what each finding means to the person structuring the renewal.
| Finding | Underwriting implication |
|---|---|
| Tail is 96% hurricane | Tail-protecting structures should attach on the hurricane book; earthquake needs no occurrence protection of its own |
| Bad years are many-storm years ($315M OEP–AEP gap) | A single occurrence layer will be hit repeatedly in bad years — reinstatement terms and aggregate protections are where the negotiation lives |
| Texas carries 36% of expected loss; Florida 16% | A Florida-only layer leaves most of the book’s expected loss outside its geography — the renewal conversation should ask whether that is intentional |
| One catalog event sets the OEP plateau | The occurrence tail is concentrated, not smooth — per-event terms are sensitive to a handful of catalog entries |
| Damage ratios homogeneous (10.1–13.4%) | No exposure-data red flags visible from model output |
| 20 trials → resolution floor at 1-in-20 | Quotes beyond VaR(90%) need a larger trial set before anyone prices off them |
None of this is a structuring decision — attachment points, limits, and prices belong to the next story in this chapter, where contract terms start transforming this distribution. The post-model report’s job ends where it should: the underwriter now knows what the object looks like before anything reshapes it.
At production scale
Section titled “At production scale”The 20-trial walkthrough is the real workflow in miniature; three things change with the zeros.
Trial and row counts. Production loss sets run 10,000 to 1,000,000+ trials, and the row count is the trial count times the events per trial — which depends on the peril’s frequency. A hurricane book might see a handful of occurrences per trial; a high-frequency peril like earthquake can average on the order of a thousand. At a million trials, that is around 10⁹ rows for a single loss set. The metrics do not change shape — each is a groupby plus a sort, and — and at 10⁷ rows the pandas library runs them in seconds in memory. At 10⁹ rows on one loss set, in-memory pandas is no longer the right tool: this is where columnar formats, chunked or out-of-core aggregation, and pushing the groupby into a query engine start to matter. What the extra trials buy is not speed but resolution: the 1-in-200 and 1-in-500 quotes that capital decisions need and 20 trials cannot see.
Loss-set multiplicity makes it a big-data problem. Two loss sets here, for one cedent’s submission; a reinsurer’s portfolio fans out far wider. A single reinsurer might write on the order of a thousand programs across its cedents, each program covering several peril-and-LOB combinations and modelled across multiple vendors, model versions, and climate assumptions — tens of thousands of loss sets to track across the portfolio, before any are combined. Reconciliation stops being a sanity check and becomes the backbone of the workflow: every loss set validated, every combination accounted for, and the “which loss sets entered this report?” question answered precisely. That makes loss sets named, versioned inputs, not files on a share — and it is combining dozens of them into the portfolio view, not profiling any one, where performance actually bites.
Reproducibility. This chapter’s standing principle applies with no modification: the report is a pure function of (loss sets, parameters). Same inputs, same report, byte for byte. When the vendor ships a model update and the loss sets regenerate, the report regenerates with them — automatically, not by an analyst re-running notebook cells in the right order. The moment a post-model report is hand-assembled, two reports for the same submission can disagree, and the underwriter is back to guessing which one to trust.
That is the first application story: the Toolkit’s metrics, composed into the artifact that starts every deal. The next story hands this risk profile to the underwriter’s structuring problem, where financial terms begin transforming the distribution the report just measured.