Data Quality Analytics

RMS Data Quality Analytics empower insurers and reinsurers to assess exposure data quality, measure its impact, and target data improvement where it matters most for catastrophe risk models. Data Quality Analytics deliver objective and independent insight into the main elements of exposure data—where it is (location), what it is (vulnerability attributes), and how much it is (valuation)—allowing insurers and reinsurers to assess the quality of exposure data input into catastrophe models.

Key Components

Data Quality Analytics account for severity (hazard and hazard gradient), the relative importance of attributes, and the loss contributions, to reveal insights that go beyond simple measures such as percentages of street-level geocoding or known/unknown attributes—conventional, but incomplete metrics that can lead to a false sense of security about data quality. Data Quality Analytics consist of completeness scores and accuracy assessments. These transparent, actionable, and easy-to-communicate metrics are used in conjunction with the RMS ExposureSource database to evaluate data quality and inform portfolio management and underwriting decisions.

Completeness Scores

Weighted by peril, hazard severity, hazard gradient, line of business, and other factors, completeness scores measure:

  • Geocoding Resolution Score (GRS): Assesses the variability in modeled losses resulting from the level of geocoding resolution

  • Vulnerability Completeness Score (VCS): Measures the amount of variability in modeled losses resulting from missing primary vulnerability attributes, such as occupancy, construction class, year built, and number of stories

  • Completeness Score (CS): Uses a unique algorithm that combines the GRS and VCS scores based on the relative importance of each.

  • VCS Improvement Potential: Quantifies the relative importance of the missing building characteristics on the VCS—improvement potentials are based on extensive sensitivity analyses

  • Variation in Loss Cost: Measures the potential variation in model loss estimates resulting from the percentage of low resolution geocoding and unknown vulnerability attribute information in a data set. For example, in a case where there are 10 buildings—8 with known construction type, 2 with unknown construction type—the variation in loss cost indicates the maximum variation (+/-) due to the 2 unknown construction types if they were coded to the best case and worst case attributes.

Accuracy Assessment

Accuracy is assessed using rules and algorithms that assess the credibility and objectivity of the data, and by comparing data against trusted in-house and third-party sources.

  • Validation Heuristics: Rules that identify inconsistent or illogical combinations of geocoding, vulnerability, valuation, and financial attributes, and investigate suspicious patterns in the data suggesting bulk coding

  • Comparison against RMS ExposureSource Database:  Data is validated against the ExposureSource database of location-specific commercial and residential property exposure data, developed specifically to provide exposure data and optimized for use with catastrophe models for the insurance industry

  • Comparison against Unknown Portfolio: Industry comparison metrics indicate how aggressively or conservatively the portfolio has been coded compared to the underlying industry inventory—i.e., how loss results for the portfolio as coded would compare to results if occupancy is known but everything else is unknown

Data Quality Analytics provide objective and independent insight into the quality of exposure data input into catastrophe models, by providing transparent (intuitive and easily understandable), actionable and portable (easy to communicate and transmit across insurance value chain) metrics and insights to inform portfolio management and underwriting decisions.

Data Quality Analytics Brochure

The ExposureSource Database

The ExposureSource database is a comprehensive database of location-specific, U.S. commercial and residential property exposure data. Developed specifically to provide exposure data optimized for use with catastrophe modeling for the insurance industry, the ExposureSource database provides a robust source of exposure data for improved risk analysis and management.

ExposureSource Database Brochure
 

Related Information

Best's Review: "C-Level Agenda"
Global Reinsurance"Data Quality Matters"