1Two production AI agents, gated on evaluation, inside a regulated insurer
Director of Climate Risk Products and Applied AI · CarbonPool
- Days → under 10 minutes
- ~80% of team on AI-augmented workflows
- 2 agents in production
Problem
Technical project assessment and counterparty due diligence took several days of manual analyst work, and in an insurance context the output carries liability, so speed could not come at the cost of correctness.
What I did
Architected, built, and deployed two production LLM agents with Claude Code: an applied research agent for technical project assessment, and a due-diligence / KYC agent for project developers, investors, and documentation. Each was gated on the model-evaluation pipeline before any client use. I also rolled out Claude Code to the modelling team and Cowork to the C-suite.
Outcome
Manual turnaround cut from several days to under 10 minutes. Around 80% of the team onboarded onto AI-augmented workflows, with time saved measured per workflow.
Skills
- Production agent development
- Evaluation-first governance
- AI adoption across a regulated org
- Tech-to-non-tech translation