Skip to content
Platform

Document Intelligence

Read any insurance document. Know where every answer came from.
Bevaya's Document Intelligence handles hundreds of carrier formats, scanned or digital, and traces every extracted value back to its exact location on the page.

Extracted fields
Claim numberCLM-2024-118827
0.99
ClaimantDaniel R. Smith
0.98
Date of loss11/14/2024
0.97
Demand summary
Total demanded$385,000
Reserve$118,500
Field confidence
Claim #99%
Claimant98%
Demand $98%
Specials93%
Doc info
FileDemand_Letter.pdf
TypeLegal · FNOD
Pages4
Document summary
Plaintiff demands $385,000 for medical specials, lost wages, and pain & suffering arising from a 11/14/2024 motor-vehicle incident. Counsel cites three medical providers and a 30-day response window. Recommended action: route to claims-litigation team for reserve review.
Extracted fields
InsuredStoltz Trucking LLC
0.99
Policy numberCGL-PA-7711-04
0.99
Coverage period2020 – 2024
0.97
Loss totals
5-yr incurred$172,200
Open reserves$118,000
Field confidence
Insured99%
Policy #99%
Claims96%
Paid $94%
Doc info
FileLoss_Run_2024.pdf
TypeLoss run · 5-yr
CarrierTravelers
Document summary
Insured Stoltz Trucking LLC shows 4 closed claims and 1 open claim across the 2020–2024 period, with $172,200 total incurred. The open 2024 claim accounts for $118,000 in reserves. Frequency trending upward in years 3–4. Recommended action: review for renewal pricing.
Extracted fields
PatientDaniel R. Smith
0.99
Provider NPI1417834592
0.99
Account number88-4471
0.98
Charge total
Total charges$7,120
Patient resp.$1,420
Field confidence
Patient99%
NPI99%
CPT codes97%
Charges98%
Doc info
FileUB04_StJude.pdf
TypeMedical · UB-04
Visits4
Document summary
Patient Daniel R. Smith received 4 itemized services at St. Jude Medical Center between 11/19 and 12/03/2024 totaling $7,120 in charges, including ER visit and MRI lumbar. Codes verified against AMA CPT directory. Recommended action: link to claim CLM-2024-118827.
Extracted fields
ApplicantStoltz Trucking LLC
0.99
FEIN83-2884420
0.99
Effective date06/01/2024
0.98
Business profile
SIC 4213 · trucking11 yrs
Employees42
Field confidence
Applicant99%
FEIN99%
SIC97%
Coverages95%
Doc info
FileACORD_125.pdf
TypeCommercial app
ProducerPenn Brokerage
Document summary
Applicant Stoltz Trucking LLC applying for commercial coverage effective 06/01/2024. Business classified as SIC 4213 (trucking, except local) with 11 years in operation and 42 employees. Loss disclosure flagged for prior carrier comparison. Recommended action: route to underwriting queue.
Parsing Demand letter
0%
OCR Classify Extract Ground Verify
Built for insurance complexity

Reads what generic AI and legacy IDP can't.

Loss runs in hundreds of carrier formats. ACORD 125s, 126s, and 130s. SOVs sprawling across multi-tab spreadsheets. CMS-1500s. Handwritten claim notes. Scanned demand letters with coffee stains. Document Intelligence handles them with a mosaic of specialized InsurGPT™ models, each one purpose-built for a specific document type.

  • Mosaic of specialized models. The right model is selected automatically for each task. Loss runs go to a loss run model. ACORDs go to an ACORD model. No prompt engineering required.
  • Hundreds of carrier formats. Trained on 300M+ real insurance documents, not public web data. The complex financial fields like reserves, total incurred, and recoveries are exactly where the gap is largest against general AI.
  • Document splitting and classification. A 40-page submission package becomes individual ACORD forms, supplemental applications, loss runs, and schedules. Each one routed to the right workflow treatment.
  • Verification layer on every output. A secondary model checks the primary model's work before it leaves the system. The double-check is why Bevaya reaches accuracy levels general AI cannot.
Classify & Split
Extract
Verify
PDF
Submission_Package_NorthBridge_Q2.pdf
Classifying
 
ACORD model Loss Run model SOV model Supplemental model
Source package
 
 
 
 
40 pages
 
InsurGPT™ classifier
ACORD 125
0 pages
Loss Run
0 pages
SOV
0 pages
Supplemental
0 pages
Verification layer · secondary model double-checks every classification · 0% accuracy
Insights
Review
PDF
Demand_Letter.pdf
Highlights
 
CLAIM NUMBER
CLM-2024-118827
0.99
page 1 · ¶3 · x:412 y:286
CLAIMANT NAME
Daniel R. Smith
0.98
page 1 · ¶2 · x:188 y:248
DEMAND AMOUNT
$385,000.00
0.92
page 1 · ¶5 · x:296 y:412
Page 1 of 1 100%
WEXLER LAW FIRM
Attorneys at Law · Personal Injury · Pennsylvania & New Jersey
1224 Walnut Street, Suite 800 · Philadelphia, PA 19107 · (215) 555-0142
 
May 2, 2026
VIA EMAIL: claims-intake@bevaya-demo.com
Bevaya Insurance Company · Claims Department
 
RE: Daniel R. Smith v. Stoltz Trucking & Logistics LLC
Date of Loss: November 14, 2024
Claim Number: CLM-2024-118827
Policy Number: CGL-PA-7711-04
 
Dear Adjuster:
Pursuant to the above-referenced matter, please consider
this letter our formal demand in the total amount of
$385,000.00 for resolution of all claims.
 
 
 
Awaiting field selection…
Grounded by GutenOCR

Every value traceable to the page. Down to the coordinate.

Bevaya's proprietary GutenOCR technology ties every extracted data point back to its exact source. Not just the page. The paragraph, the line, even the X and Y coordinates on the page. Reviewers verify in seconds. Auditors get a citation, not a guess. This level of grounding sets Document Intelligence apart from every other IDP and OCR tool on the market.

  • X-Ray Mode source highlighting. Click any field. The original page opens and highlights the exact location the AI used to extract that value.
  • Field-level confidence scoring. Every extracted field carries a score between 0 and 100. Set the threshold per field, per flow. Above the bar auto-processes; below it routes for human review.
  • Explainable outputs. Field-level rationale, source citations, and confidence indicators ship with every extraction. You see not just what was extracted, but why and from where.
  • Validation against external sources. Drug codes checked against national databases. Addresses normalized. The platform catches errors a human reviewer might miss.

The Benchmarks

When put to the test, InsurGPT™ beats general AI on every insurance task.

Specialized models outperform general-purpose ones on the work your team actually handles. Here's the head-to-head.

On claims accuracy, InsurGPT™ scored 99%. The strongest general model managed 62%.

A 37-point gap on the work your team handles every day — claims indexing, FNOL, demand letters, medical bills.

Source: Roots benchmark tests, December 2025. View methodology

Claims Accuracy

InsurGPT™
0%
Mistral AI
62%
GPT-5.0
58%
Gemini 3.0 Pro
55%
Built for iteration

Test changes in seconds, not days.

Real-time document testing — configure extraction logic, test against a sample document, and see results instantly.
Selective recompute — fix one field's extraction without rerunning the entire flow.
Feedback-driven learning — every human correction is captured, audited, and routed back to the models.
Managed for drift — Bevaya's continuous management team monitors accuracy and retrains models as your document mix shifts.
Confidence scoring on every field so reviewers know where to focus first.
Configurable per document type without retraining the underlying model.

Document understanding | Capability

Hundreds of carrier formats handled out of the box. Specialized models for every insurance document type, from ACORD forms to handwritten claim notes.

  • Hundreds of carrier-specific formats supported out of the box
  • ACORD 125, 126, 130, and full ACORD library coverage
  • Loss runs across hundreds of carrier templates
  • Statements of Values (SOV) across multi-tab spreadsheets
  • CMS-1500 and other medical bill formats
  • Handwritten notes, checkboxes, complex tables
  • Scanned PDFs, digital PDFs, email attachments, image files
  • Email and document ingestion (Outlook, SFTP, API)
  • Multi-page document handling at production volume

Grounded extraction | Capability

Every extracted value carries a citation to the page, paragraph, line, and coordinate. Powered by Bevaya’s proprietary GutenOCR technology.

  • GutenOCR proprietary grounded OCR engine
  • Page, paragraph, line, and X/Y coordinate traceability
  • X-Ray Mode source highlighting on every field
  • Field-level confidence scoring (0 to 100)
  • Configurable confidence thresholds per field, per flow
  • Explainable outputs with field-level rationale
  • Source citations attached to every extracted value
  • External data validation (drug codes, addresses, policy data)
  • Duplicate detection across documents and submissions

Classification & orchestration | Capability

Multi-document packages split, classified, and routed to the right specialized model automatically. No manual triage required.

  • Automatic document type identification
  • Intelligent document splitting on multi-document packages
  • Routing to the right specialized model per document type
  • 60+ claim document types recognized in production
  • Document comparison across versions
  • Document validation and cross-checking
  • Document summarization for adjusters and underwriters
  • Workflow context preserved across multi-document submissions

Build, test, and improve | Capability

Configure, test, and tune extractions in real time. Every correction feeds back to the models. Bevaya’s team manages model drift continuously.

  • Real-time document testing in a single interface
  • Selective recompute on a single field
  • Verification layer (secondary model double-check)
  • Feedback-driven continuous learning
  • Federated learning across the platform
  • Active model drift monitoring and retraining
  • Turnkey models reducing fine-tuning needs
  • Per-field override tracking to surface model issues
 
Resources & insights

More on Document Intelligence.

Case Study - claims
Research

Page stream segmentation with LLMs

How Bevaya Labs approaches a foundational problem in insurance document AI.

Case Study - claims
Case Study

Workers' comp carrier processes claims 100x faster

How indexing automation delivered 432% ROI in 12 months.

2026.06.02-library-webinar-registration-how-to-establish-clear-ai-ownership-in-your-insurance-organization
Architecture

Inside the Bevaya platform architecture

How specialized models, HITL controls, and integrations come together in production.

Trust & Security

Trust by design

Built for an industry where data security isn't optional.

Data ownership

Your data stays yours.

Never shared with other customers or vendors. Bevaya doesn't train shared models on your data.

Visit the Trust Center
Your tenant
No training
Logical isolation
Role-based access
SSO + SCIM
Customer-managed keys

Your data · only your team sees it

Compliance

Encrypted end-to-end.

256-bit AES encryption, in transit and at rest. Independent third-party audits conducted annually.

Visit the Trust Center
SOC 2 Type 2
HIPAA
GDPR
CCPA
23 NYCRR 500
AES-256

Audited annually · independent third party

Deployment

Runs in Azure.

Enterprise-grade infrastructure, hosted where insurance organizations already trust their data.

Visit the Trust Center
Microsoft Azure Azure Marketplace
AWS Private VPC
Google Cloud GCP-native
Azure Marketplace
Guidewire Marketplace

Deploy where your stack already lives

Oversight

Every decision audited.

Immutable audit logs. Confidence scoring. Human-in-the-Loop review on low-confidence items.

Visit the Trust Center
AI extracted limits from ACORD 125 98% conf.
Reviewer confirmed coverage Approved
Endorsement flagged for review 62% · HITL
Policy match validated 95% conf.
Audit log written · immutable Sealed

Immutable trail · every decision, every reviewer

FAQ

Inside Document Intelligence.

Hundreds of carrier formats handled out of the box. Specialized models for every insurance document type, from ACORD forms to handwritten claim notes. Includes hundreds of carrier-specific formats supported out of the box; ACORD 125, 126, 130, and full ACORD library coverage; loss runs across hundreds of carrier templates; Statements of Values (SOV) across multi-tab spreadsheets; CMS-1500 and other medical bill formats; handwritten notes, checkboxes, complex tables; scanned PDFs, digital PDFs, email attachments, image files; email and document ingestion (Outlook, SFTP, API); and multi-page document handling at production volume.

Every extracted value carries a citation to the page, paragraph, line, and coordinate. Powered by Bevaya's proprietary GutenOCR technology. Includes the GutenOCR proprietary grounded OCR engine; page, paragraph, line, and X/Y coordinate traceability; X-Ray Mode source highlighting on every field; field-level confidence scoring (0 to 100); configurable confidence thresholds per field, per flow; explainable outputs with field-level rationale; source citations attached to every extracted value; external data validation (drug codes, addresses, policy data); and duplicate detection across documents and submissions.

Multi-document packages split, classified, and routed to the right specialized model automatically. No manual triage required. Includes automatic document type identification; intelligent document splitting on multi-document packages; routing to the right specialized model per document type; 60+ claim document types recognized in production; document comparison across versions; document validation and cross-checking; document summarization for adjusters and underwriters; and workflow context preserved across multi-document submissions.

Configure, test, and tune extractions in real time. Every correction feeds back to the models. Bevaya's team manages model drift continuously. Includes real-time document testing in a single interface; selective recompute on a single field; a verification layer (secondary model double-check); feedback-driven continuous learning; federated learning across the platform; active model drift monitoring and retraining; turnkey models reducing fine-tuning needs; and per-field override tracking to surface model issues.

Get Started

Ready to design, deploy, and govern
your AI workforce?

Bevaya AI Agents can help you triage, analyze, and recommend across underwriting, claims, and policy servicing. Let's connect and show you how it works..