Automated Cheque Data Extraction

Cheque Data Extraction with 99%+ Accuracy

Extract MICR, amounts, dates, endorsements, and payee data automatically from cheque images. Our AI-powered OCR, ICR, and deep learning technology delivers bank-grade accuracy for financial institutions and fintechs.

For the high-level workflow and buyer view, start with the bank check OCR API page, then use this route for extraction-specific implementation detail.

The Extraction Pipeline

A complete pipeline from image capture to structured cheque data output

1. Capture

High-resolution image capture (200-300 DPI) via mobile, scanner, or kiosk

2. Preprocess

Deskew, denoise, binarize, and remove security backgrounds

3. Extract

OCR/ICR recognition for all fields with confidence scoring

4. Validate

Cross-field validation, checksum verification, fraud scoring

5. Output

Structured JSON output with confidence scores for each field

Extraction Technologies

Multi-modal approach combining proven and cutting-edge technologies

MICR Reading

Magnetic Ink Character Recognition for the MICR line. Reads routing numbers, account numbers, and cheque serial numbers with near-perfect accuracy.

99.9%
Accuracy Rate

OCR

Optical Character Recognition for printed fields—bank names, addresses, date stamps, and pre-printed account holder information.

99%+
Accuracy Rate

ICR

Intelligent Character Recognition for handwritten text—payee names, amounts in words, and memo fields using advanced ML models.

85-95%
Accuracy Rate

Deep Learning

End-to-end neural networks for layout analysis, field localization, and multi-modal fusion across all extraction methods.

<200ms
Processing Time

Complete Data Extraction

Extract every relevant field from cheques with confidence scoring for each element. Our system handles standard layouts, variations, and edge cases.

MICR Line: Routing number, account number, cheque serial number
Amount Fields: Numeric (courtesy) and written (legal) amounts with cross-validation
Date: Issue date with validity checking
Payee: Handwritten or printed payee name extraction
Signature: Signature region extraction for verification
Bank Information: Bank name, branch, and address details
json
{
  "extraction_id": "ext_20240214120000_abc123",
  "confidence": 0.94,
  "fields": {
    "routing_number": {
      "value": "021000021",
      "confidence": 0.99,
      "source": "micr"
    },
    "account_number": {
      "value": "1234567890",
      "confidence": 0.98,
      "source": "micr"
    },
    "amount": {
      "numeric": 1500.00,
      "written": "One thousand five hundred",
      "confidence": 0.89,
      "mismatch": false
    },
    "payee": {
      "value": "John Smith",
      "confidence": 0.87,
      "needs_review": false
    },
    "date": {
      "value": "2024-02-14",
      "confidence": 0.95,
      "valid": true
    }
  }
}

Simple API Integration

Extract data from cheques with a single API call

curl
curl -X POST \
  https://api.chequedb.com/v1/extract \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -F "image=@cheque.jpg" \
  -F "fields=all" \
  -F "confidence_threshold=0.85"

Synchronous API

For real-time processing with immediate response. Ideal for mobile cheque deposit and interactive applications. Average response time under 2 seconds.

Asynchronous API

For high-volume batch processing. Submit jobs and receive webhook notifications when extraction completes. Handles thousands of cheques per minute.

SDK Support

Native SDKs for iOS, Android, and Web with pre-built UI components.View SDK documentation.

Need the broader deposit workflow view first? Review the bank check OCR API page.

Advanced Features

Enterprise-grade capabilities for production deployments

Confidence Scoring

Every extracted field includes a confidence score (0.0-1.0). Set thresholds for automatic acceptance or manual review routing.

Cross-Field Validation

Automatically verify that numeric and written amounts match. Validate routing number checksums and date ranges.

Fraud Detection

Integrated fraud scoring during extraction. Detects altered amounts, suspicious patterns, and duplicate cheques.

Human-in-the-Loop

Automatic routing of low-confidence extractions to review queues. Side-by-side image and data comparison interface.

Batch Processing

Process thousands of cheques in parallel. Ideal for back-office operations and end-of-day processing.

Real-time Processing

Sub-second response times for mobile and interactive applications. Webhooks for status updates.

Extraction Accuracy by Field Type

Printed Fields

MICR Line (Routing/Account)99.9%
Bank Information99.5%
Date (Printed)98.5%

Handwritten Fields

Amount (Numeric)95.2%
Amount (Written)87.3%
Payee Name84.6%

Accuracy rates based on production data from over 10 million processed cheques. Handwritten field accuracy varies with image quality and legibility.

Common Use Cases

Mobile Cheque Deposit

Enable customers to deposit cheques via mobile apps with instant data extraction and validation. Learn more.

Bulk Processing

Process thousands of cheques from lockbox services, mailrooms, and back-office operations. Learn more.

Fraud Prevention

Combine extraction with fraud detection to identify altered amounts, forged signatures, and suspicious patterns. Learn more.

Frequently Asked Questions

What is cheque data extraction?

Cheque data extraction is the automated process of capturing and converting information from paper cheques into structured, machine-readable formats. Using technologies like OCR and MICR, the system reads routing numbers, account numbers, amounts, dates, and payee names from cheque images. Modern solutions also use AI and deep learning to handle handwritten fields with high accuracy.

How accurate is automated check data extraction?

Modern check data extraction achieves 99%+ accuracy for printed fields (MICR line, bank details) and 85-95% for handwritten amounts. The accuracy depends on image quality, handwriting legibility, and the extraction technology used. Chequedb's system uses multiple validation layers, cross-referencing numeric and written amounts, verifying routing number checksums, and confidence scoring to ensure data integrity before processing.

What data can be extracted from a check?

Check data extraction captures: (1) MICR line data, including routing number, account number, and check serial number; (2) Amount fields, including both numeric (courtesy amount) and written (legal amount); (3) Date of issue; (4) Payee name; (5) Memo field; (6) Signature for verification; and (7) Bank name and branch information. Advanced systems also extract security feature data for fraud detection.

How does the Chequedb extraction API work?

The check processing API accepts check images via REST endpoints, processes them through our extraction pipeline, and returns structured JSON data with confidence scores. The pipeline includes image preprocessing (deskewing, denoising), field localization, OCR/ICR recognition, data validation, and fraud scoring. Results are typically returned in under 2 seconds, with webhook notifications for asynchronous processing.

What's the difference between OCR and MICR for check processing?

MICR (Magnetic Ink Character Recognition) reads the special magnetic ink used for routing and account numbers at the bottom of checks. It is highly accurate, but it only works for that specific line. OCR (Optical Character Recognition) reads printed text visually and can extract all fields. Modern systems combine both: MICR for the bottom line, OCR for printed fields, and ICR (Intelligent Character Recognition) for handwritten text.

Can check data extraction handle handwritten checks?

Yes. ICR (Intelligent Character Recognition) technology uses machine learning to read handwritten text on checks. While more challenging than printed text, modern deep learning models achieve 85-95% accuracy on handwritten amounts and 80-90% on payee names. The system flags low-confidence extractions for manual review, ensuring accuracy while maximizing automation rates.

Start Extracting Data Today

Book a live walkthrough, validate your extraction requirements, and move into sandbox testing for check data extraction in minutes.