Customer Story
How Hitachi streamlined Bank Statement Reconciliation using Docsumo
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
About the customer
A 100% subsidiary of Hitachi Ltd, Hitachi Payment Services empowers financial institutions and merchant aggregatorswith a comprehensive array of technology-led payment solutions that includes installation of ATMs and POS devices throughout India.
Industry
Financial Services
Company size
2,000+ Employees
Portfolio Units
1,000+ per Month
Document Processed
3,000+ per Month
The case study: In a nutshell
Before
Manually scanning bank statements of varying structures
A team of manual data entry operators and underwriters process 3,000+ bank statements on a monthly basis
Scanning data from 50+ bank statement types manually is cumbersome
Little to no validation done on captured data
All documents had to undergo double manual entry
After
Capture data from unstructured documents with smart AI-based APIs
Employees review only exceptions
All the variations in the layout are taken care by ML-based smart data extraction API
Docsumo's algorithms auto-classify letters and validate data with custom rules in real-time
95%+ straight through processing

The Challenge
Process unstructured bank statements
- Hitachi collects 3,000+ digital and scanned bank statement copies per month to reconcile payouts to ATM operators with withdrawals from the ATM.
Identify & classify bank statements
- A clean or contaminated bank statement is determined based on the transactions recorded. In order to classify a bank statement as clean/contaminated, each transaction recorded in it needs to be categorized accordingly.
- Data to extract includes transaction details and category.
Capture data from bank statements with 50+ layouts
- Not only did the structures vary for different bank statements but the position of data to capture varies for these documents
- Some of them were in tabular formats.
Categorize & derive attributes from extracted data
- The manual extraction lacked a logical validation of payment and transaction details.
The Docsumo Solution
Ingesting bank statements
- API-based direct integration that seamlessly ingests Bank Statements onto Docsumo.
Pre-processing and getting ready for data extraction
- Inbuilt document pre-processors identified the letter formats (JPG, PDF, PNG etc.) and queued them up for data extraction.
Data extraction from unstructured text
- Docsumo's OCR module used the vectorized position reference in a letter to extract data.
- The OCR not only parsed through letters with varying fonts, layouts, image quality, and resolution; it even extracted data from the tables with 95%+ accuracy.
Intelligent categorization of key value pairs
- Our proprietary NLP-based classification framework started rapidly learning from all the documents. It was trained to categorize key value pairs and line items.
- Another algorithm started making intelligent predictions to identify the data within a bank statements.
Rule-based data validation
- Once the data is extracted, a rule-based validation engine applied contextual data validation and correction algorithms.
Integration with downstream software
- The data was extracted in a JSON format that was easily integrated into client's database via APIs and iframe.

Result: 99%+ Data extraction accuracy
10x
Faster processing of unstructured data
95%
Touchless processing using smart validation rules
99%
Data accuracy with intelligent automation

Ready to automate your data extraction?
Let's talk.
Docsumo's intelligent document processing enables you to extract data easily, efficiently, and accurately.
Fill up the form to speak with an automation expert.
Fill up the form to speak with an automation expert.
