Cheque OCR and Content Extraction

Cheque OCR and Data Extraction Software

Extract amount, date, payee, MICR, account and routing data, signatures, endorsements, and confidence scores from cheque images — including handwritten fields on personal cheques. Chequedb turns cheque OCR output into structured fields that can move into review queues, APIs, exports, and approval workflows.

For developer integration, start with the check OCR API page. For operational review, route extracted fields into the cheque management portal.

Printed AND handwritten

97%+ accuracy on handwritten cheques

Standard OCR vendors work with printed fields only. Chequedb reads the fields that appear on personal cheques — the payee name, numerical amount, legal amount in words, and date — all written by hand. If you process personal cheques, that distinction determines your actual automation rate.

97.8%
Numerical amount (CAR)
97.1%
Legal amount in words (LAR)
96.5%
Payee name
Why handwriting recognition is technically different

Generic OCR vs Cheque-Specific Extraction

Generic OCR returns text. Cheque OCR needs field positions, MICR interpretation, validation, confidence, and a workflow destination.

Generic OCR

  • Reads visible text without cheque field context
  • Leaves amount, payee, MICR, and date validation to downstream code
  • Does not create review-ready exception states

Cheque OCR Software

  • Returns structured cheque fields with confidence scores
  • Combines OCR, ICR, MICR, image quality, and amount validation
  • Sends exceptions into approval, reconciliation, and audit workflows

The Extraction Pipeline

A complete pipeline from image capture to structured cheque data, review, and export

1. Capture

High-resolution image capture (200-300 DPI) via mobile, scanner, or kiosk

2. Preprocess

Deskew, denoise, binarize, and remove security backgrounds

3. Extract

OCR/ICR recognition for all fields with confidence scoring

4. Validate

Cross-field validation, checksum verification, fraud scoring

5. Output

Structured JSON output with confidence scores for each field

Extraction Technologies

How Chequedb reads printed fields, handwriting, and MICR in a single extraction pass

MICR Reading

Magnetic Ink Character Recognition for the MICR line. Reads routing numbers, account numbers, and cheque serial numbers with near-perfect accuracy.

99.9%
Accuracy Rate

OCR

Optical Character Recognition for printed fields—bank names, addresses, date stamps, and pre-printed account holder information.

99%+
Accuracy Rate

ICR

Intelligent Character Recognition for handwritten text — payee names, amounts in words, and memo fields. Field-specific models trained on cheque handwriting.

97%+
Accuracy Rate

Deep Learning

End-to-end neural networks for layout analysis, field localization, and multi-modal fusion across all extraction methods.

<200ms
Processing Time

Complete Data Extraction

Extract every relevant field from cheques with confidence scoring for each element. Our system handles standard layouts, variations, and edge cases.

MICR Line: Routing number, account number, cheque serial number
Amount Fields: Numeric (courtesy) and written (legal) amounts with cross-validation
Date: Issue date with validity checking
Payee: Handwritten or printed payee name extraction
Signature: Signature region extraction for verification
Bank Information: Bank name, branch, and address details
json
{
  "extraction_id": "ext_20240214120000_abc123",
  "confidence": 0.94,
  "fields": {
    "routing_number": {
      "value": "021000021",
      "confidence": 0.99,
      "source": "micr"
    },
    "account_number": {
      "value": "1234567890",
      "confidence": 0.98,
      "source": "micr"
    },
    "amount": {
      "numeric": 1500.00,
      "written": "One thousand five hundred",
      "confidence": 0.89,
      "mismatch": false
    },
    "payee": {
      "value": "John Smith",
      "confidence": 0.87,
      "needs_review": false
    },
    "date": {
      "value": "2024-02-14",
      "confidence": 0.95,
      "valid": true
    }
  }
}

API Integration

Submit an image, get structured fields and confidence scores.

curl
curl -X POST \
  https://api.chequedb.com/v1/extract \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -F "image=@cheque.jpg" \
  -F "fields=all" \
  -F "confidence_threshold=0.85"

Synchronous API

For real-time processing with immediate response. Ideal for mobile cheque deposit and interactive applications. Average response time under 2 seconds.

Asynchronous API

For high-volume batch processing. Submit jobs and receive webhook notifications when extraction completes. Handles thousands of cheques per minute.

SDK Support

Native SDKs for iOS, Android, and Web with pre-built UI components. View SDK documentation.

Need the broader deposit workflow view first? Review the bank check OCR API page.

Advanced Features

Built for teams that need field-level accuracy, exception control, and review traceability

Confidence Scoring

Every extracted field includes a confidence score (0.0-1.0). Set thresholds for automatic acceptance or manual review routing.

Cross-Field Validation

Automatically verify that numeric and written amounts match. Validate routing number checksums and date ranges.

Fraud Detection

Integrated fraud scoring during extraction. Detects altered amounts, suspicious patterns, and duplicate cheques.

Human-in-the-Loop

Automatic routing of low-confidence extractions to review queues. Side-by-side image and data comparison interface.

Batch Processing

Process thousands of cheques in parallel. Ideal for back-office operations and end-of-day processing.

Real-time Processing

Sub-second response times for mobile and interactive applications. Webhooks for status updates.

Extraction Accuracy by Field Type

Printed Fields

MICR Line (Routing/Account)99.9%
Bank Information99.5%
Date (Printed)98.5%

Handwritten Fields

Amount (Numeric / CAR)97.8%
Amount (Written / LAR)97.1%
Payee Name96.5%

Accuracy measured across 10M+ production cheques including personal, business, and consumer items. Low-confidence handwritten fields route to human review automatically — not posted with uncertain values. How handwriting recognition works.

Common Use Cases

Mobile Cheque Deposit

Enable customers to deposit cheques via mobile apps with instant data extraction and validation. Learn more.

Bulk Processing

Process thousands of cheques from lockbox services, mailrooms, and back-office operations. Learn more.

Fraud Prevention

Combine extraction with fraud detection to identify altered amounts, forged signatures, and suspicious patterns. Learn more.

Frequently Asked Questions

What is cheque data extraction?

Cheque data extraction is the automated process of capturing and converting information from paper cheques into structured, machine-readable formats. Using technologies like OCR and MICR, the system reads routing numbers, account numbers, amounts, dates, and payee names from cheque images. Modern solutions also use AI and deep learning to handle handwritten fields with high accuracy.

How accurate is automated check data extraction?

Chequedb achieves 99%+ accuracy for printed fields (MICR line, bank details) and 97%+ for handwritten fields including the numerical amount, legal amount in words, and payee name. The accuracy depends on image quality, handwriting legibility, and the extraction technology used. Unlike standard OCR vendors that only handle printed text, Chequedb uses field-specific ICR models trained on cheque handwriting, plus cross-validation between the numeric and written amounts to catch mismatches before posting.

What data can be extracted from a check?

Check data extraction captures: (1) MICR line data, including routing number, account number, and check serial number; (2) Amount fields, including both numeric (courtesy amount) and written (legal amount); (3) Date of issue; (4) Payee name; (5) Memo field; (6) Signature for verification; and (7) Bank name and branch information. Advanced systems also extract security feature data for fraud detection.

How does the Chequedb extraction API work?

The check processing API accepts check images via REST endpoints, processes them through our extraction pipeline, and returns structured JSON data with confidence scores. The pipeline includes image preprocessing (deskewing, denoising), field localization, OCR/ICR recognition, data validation, and fraud scoring. Results are typically returned in under 2 seconds, with webhook notifications for asynchronous processing.

What's the difference between OCR and MICR for check processing?

MICR (Magnetic Ink Character Recognition) reads the special magnetic ink used for routing and account numbers at the bottom of checks. It is highly accurate, but it only works for that specific line. OCR (Optical Character Recognition) reads printed text visually and can extract all fields. Modern systems combine both: MICR for the bottom line, OCR for printed fields, and ICR (Intelligent Character Recognition) for handwritten text.

Can check data extraction handle handwritten checks?

Yes — and this is a key differentiator. Chequedb uses field-specific ICR (Intelligent Character Recognition) models trained on cheque handwriting, achieving 97%+ accuracy on handwritten amounts, 97.1% on the legal amount in words, and 96.5% on payee names. Standard OCR vendors handle printed fields well but cannot reliably extract handwritten text. For operations that process personal cheques — where the payee, amounts, and date are all handwritten — a printed-only extraction system leaves the most important fields unautomated. Low-confidence extractions route to human review automatically rather than being posted with uncertain values.

Ready to Replace Manual Cheque Handling?

Book a live walkthrough to see extraction, exception routing, and approvals in context. Then validate your field requirements before moving to sandbox testing.

Frequently Asked Questions

What is cheque data extraction?

Cheque data extraction is the process of automatically reading cheque fields using MICR, OCR, and ICR technologies. It captures the MICR line (routing, account, cheque number), printed fields (payee, amount, date), and handwritten fields (legal amount, signature). The extracted data is validated against business rules before posting.

What fields can cheque data extraction software read?

A production extraction system reads: MICR line (routing number, account number, cheque number), date, payee name, courtesy amount (numeric), legal amount (written), memo, endorsement text, and bank stamps. Each field includes a confidence score so low-confidence reads can be flagged for review.

How accurate is cheque data extraction?

Accuracy depends on the technology and image quality. MICR reading is typically 99%+ for the control line. OCR/ICR accuracy varies: printed text can reach 97%+, handwritten fields 90-95% depending on legibility. A good system provides confidence scores per field rather than a single aggregate number.

Can cheque data extraction read handwritten cheques?

Yes, with ICR (Intelligent Character Recognition). ICR is purpose-built for handwriting — it handles variations in cursive, print, and mixed handwriting styles. Combined with MICR for the control line and OCR for printed fields, ICR enables full automation for personal and business cheques.

How does cheque data extraction handle low-confidence reads?

Low-confidence reads are not silently accepted. The system flags them with a reason code (blurry image, missing field, handwriting not recognized) and routes them to a review queue. The reviewer sees the original image crop and confidence score before making a decision.

What is the difference between a cheque data extraction API and a general OCR API?

A cheque data extraction API is purpose-built for cheques — it understands where each field is located, which recognition method to apply (MICR vs OCR vs ICR), and how to validate the extracted data against business rules. A general OCR API just returns text from an image without cheque-specific field understanding.