← back to blog

Pricing Sophistication @ Insurance

Chaïmae SritiOctober 2025
Commercial Auto$1B Business BookStatus: Live5 Coverages

1. Problem Statement

By 2021, Company [X] IRCA (Individually Rated Commercial Auto) book was showing persistent deterioration. Loss ratios were stuck near 79% on a $1B portfolio, despite repeated tactical fixes. Pricing models had grown stale, refresh cycles were unreliable, and risk segmentation lagged behind competitors. The result was a widening performance gap: good risks churned away, bad risks concentrated, and each regulatory filing became a painful, error-prone exercise. Leadership recognized that without a step-change in pricing sophistication and operational discipline, both profitability and regulatory credibility were at risk.

Scope & Timeline

6 quarters total; 4 for model development, extensive cross-team review, and stakeholder acceptance; 2 for building automated monitoring and refresh tooling, ensuring end-to-end reconciliation.

Who's the audience/customer?

  • Primary: Pricing actuaries/analysts (rate adequacy, profitability, regulatory compliance).
  • Secondary: Underwriters (quoting, risk selection, strategy).
  • Business/Product: Product managers, IT, and executive leadership (cycle time, regulatory risk, portfolio performance).

Why is this problem critical?

  • For actuaries: Without robust, granular, explainable models, pricing falls out of sync with risk, raising regulatory and financial risk.
  • For underwriters: Old models missed key risk factors, causing adverse selection; good risks churned, bad risks concentrated.
  • For leadership: Persistent 79% loss ratios on a $1B book meant more than $17M/year in losses, regulatory exposure, and unsustainable market position.

What insights prompted the work?

  • Internal review: more than 200 key variables ignored in prior models, ad-hoc/non-reusable codebases, high manual effort.
  • Competitor analysis: Rivals filed with 3–5x more granular segmentation; we were behind on risk selection and pricing accuracy.
  • IT post-mortems: Lack of modularity/pipeline discipline caused refreshes to fail, introduced data errors, and slowed every regulatory cycle.

What was uniquely challenging/engineering issue?

  • First time at Company [X]: A pipeline was built for modularity and reusability; previous work was one-off, ad-hoc, not scalable.
  • Data complexity: Needed to reconcile and standardize more than 200 features from fragmented sources for multiple coverages, including long-tailed risks.
  • Technical-regulatory balance: Had to balance state-of-the-art ML lift with strict actuarial and regulatory interpretability; no model could go to production unless fully explainable.
  • Cultural resistance: Faced cultural and technical resistance from teams used to their own workflows and toolkits.

2. Solution Options & Insurance Rationale

Solution Exploration Process:

We evaluated each approach against strict insurance/actuarial requirements: full regulatory explainability, real non-linear segmentation, operational maintainability, and future-proof modularity.

GLM / Rule-based (actuarial expert rules)
Why Considered: Industry standard, trusted, easy to file.
Decision: Not enough segmentation power; can't handle non-linearity in risk factors; manual upkeep. Used as compliance baseline only.
Single-shot XGBoost + SHAP
Why Considered: Captures non-linear risk, strong lift, attractive for ML benchmarking.
Rejected: Not fully explainable; even with SHAP, still black-box for regulators. Rejected for direct production pricing.
Ensemble XGBoost + SHAP
Why Considered: Highest predictive lift, captures complex non-linearities, great for deep analytics.
Rejected: Same issues as above; not maintainable by actuarial; business/regulatory acceptance too low for filings.
GAM (with non-linear splines) + Ensemble XGBoost (support/analytics)
Why Considered: GAM with non-linear splines: achieves interpretable, regulator-accepted modeling and true non-linear risk segmentation. Ensemble XGBoost run in parallel (shadow): used for advanced analytics, drift detection, segment profiling.
Selected: Only option combining high segmentation power with full explainability, auditability, and operational handoff to actuarial. GAM for pricing and filings; ensemble XGBoost as analytics-only "copilot" (never in direct rating logic).

Convergence Rationale:

  • GAM with non-linear splines handled all regulatory and interpretability requirements, while allowing us to model complex effects (e.g., age, tenure, exposure) that traditional GLMs or rule sets missed.
  • Ensemble XGBoost (support model) was integrated for analytics, risk profiling, and ongoing monitoring; surfacing feature interactions, drift, and emerging segments. Never used for customer-facing rating.
  • Demos, validation, and pilot runs showed this was the only setup that both actuaries and regulators could trust, and that delivered the risk lift we needed.

Why this worked: The combination of GAM (for non-linear splines, interpretability, and filings) and ensemble XGBoost (for analytics, drift, and business intelligence) raised segmentation sophistication, while fully satisfying regulatory and operational needs.

Irreversible decisions: Locked in modular, parameterized GAM with splines for all production models and regulatory filings. Established ensemble XGBoost as a support-only analytics module, wired into dashboards for business and actuarial insight. Automated all monitoring, retrain, and documentation for seamless refreshes.

System Design:

PIPELINE OVERVIEW
Data Sources
Internal
Claims
Historical data
Policies
Coverage terms
Losses
Financial impact
Vehicles
VIN & specs
External
Industry Bureau
Benchmarks
EDA & Cleaning
Processing
• Binning & capping
• Grouping
• Missing data
• Outlier handling
Feature Engineering
Creation
• Cross-variables
• Polynomial terms
• Ratios & transforms
• Domain expertise
Selection
Dimensionality reduction
Model Selection
GAM Models
• Constrained GAM
• Unconstrained GAM
GBM Models
• XGBoost
• LightGBM
Optimization
Grid & Bayesian
Performance
Metrics & scoring
Validation
• Time-based CV
• Holdout testing
• Business validation
Production
Monitoring
• Drift detection
• Performance tracking
• Business KPIs
• Data quality
Auto-Refresh
• Monthly retraining
• Trigger updates
• A/B validation
• Safe rollbacks
Primary Component
Standard Component
Critical Path

3. Technical Considerations

  • Aligned actuarial, pricing, and product; surfaced KPIs for every team.
  • Engineered a modular pipeline; automated EDA, feature engineering, and one-click retrain, all re-usable for future projects.
  • Delivered compliant GAM in production, XGBoost for advanced risk slicing.
  • Institutionalized monitoring pipeline with drift triggers and retrain logic.
  • Negotiated tough trade-offs around explainability vs. predictive lift.

Implementation Trade-offs:

Modularity vs. Delivery Speed
Investing in modular, reusable code slowed first delivery but cut onboarding time for new coverages by 70%. Standardized data ingestion, EDA, feature engineering, training, and monitoring modules.
Interpretability vs. Predictive Power
Only GAM/GLM allowed for full audit trails, regulator approval, and actuarial sign-off. ML models used for analytics, not direct pricing.
Data Quality & Reconciliation
Built configurable, testable pipeline to reconcile more than200 features, handle missingness, schema drift, and source mismatch.
Monitoring Frequency vs. IT Overhead
Monthly retraining and scoring, fully automated (one-click), reduced risk of model drift, and removed IT bottlenecks.
Scalability vs. Cost
Accepted higher short-term infra cost for pipeline parallelization; justified by cross-line scaling and regulatory agility.

4. Measuring Success

Value Delivered:

  • Cut loss ratio from 79% to 65% ($12M+ benefit).
  • Model refresh cycle reduced from yearly to monthly (fully automated).
  • Lifted normalized Gini from 0.18 to 0.34 (risk segmentation).
  • Reduced new coverage onboarding from 12 to 4 months.
  • System handles both short- and long-tail lines, with robust interim and long-term monitoring.

Success Metrics:

Metric
Baseline
Target
Achieved
Loss Ratio (%)
79%
70%
65%
Normalized Gini
0.18
0.34
0.34
Model Refresh Frequency
Yearly
Quarterly
Monthly
New LOB Onboarding Time
12 months
4 months
4 months
Regulator Approval
2+ cycles
1st pass
1st pass

Model Development:

  • Used out-of-time validation (training on 2015–2019, validating on 2021–2022) to ensure true generalizability and prevent overfitting to historical trends.
  • Applied cross-validation with time-based splits to account for temporal drift, seasonality, and to simulate real-world performance on unseen data.
  • Tracked lift, Gini, and loss ratio metrics across each split, confirming that improvements were not due to data leakage or overfitting.

Pre/Post Deployment:

  • Dashboards monitored live KPIs: loss ratios, normalized Gini, frequency, severity, and ultimate loss.
  • Segment profitability and drift detection were tracked monthly; flagged outliers were escalated for root-cause analysis.

Regulatory Acceptance:

  • All validation and monitoring evidence was documented for regulatory review; achieved first-pass approval with no back-and-forth.

5. Key Learnings

  • Modularity compounds: Standardized, reusable components let new coverages onboard in weeks instead of quarters. It eliminated repeated reconciliation errors and made monitoring reliable across lines.
  • Engineer feedback loops for drift: Long-tailed claims meant true loss performance could take years to surface. We built interim signals such as quote-to-bind ratios, early claim triggers, and feature distribution drift checks to detect model drift months earlier, before it showed up as deteriorating loss ratios on the P&L.
  • Stakeholder alignment is technical work: Early and frequent demos with actuaries, IT, and regulators prevented late-stage surprises, reduced rework, and built trust in the system's reliability.
  • Interpretability as a first-class requirement: Pairing ML lift (GAM splines, XGBoost support models) with fully explainable GLMs ensured we gained predictive power without sacrificing regulatory approval or business buy-in.