Skip to content
QUANTIFYING VALUE

Benchmark Results

Not all AI models are created equal — especially on complex enterprise insurance work. Benchmarking is how we measure what actually delivers in underwriting and claims, so you can see where accuracy holds up and where rework starts. Pick a workflow below to see the field-level results.


Built by Insurance Experts, for Insurance Experts 

InsurGPT™ is the generative AI model built by insurance operators, for insurance operators. Trained on millions of insurance-specific data points, it understands the unique language and complexities of the insurance industry. Experience AI tailored to solve your challenges, delivering accuracy and intelligent automation like never before.

Insurance experts

Roots Outperforms General Knowledge LLMs 

98%+ Accuracy Guaranteed

Pricing-mobile

 

Bevaya Benchmark Results

Claims extraction

Field-level accuracy on common claims data extraction. Bevaya's fine-tuned model is shown with and without a 0.9 confidence threshold.

At a 0.9 confidence threshold, Bevaya reaches 98–100% accuracy across every claims field tested.
Field GPT-4% Accuracy Mistral 7B PE% Accuracy Bevaya FTNo threshold Bevaya FTThreshold > 0.9
Claim Number52326898
Claimant Name988899100
Date of Report87729298
Date of Service78577698

Last updated: December 2025