With Docsumo, we are now saving more than 500 hours per month.
“With Docsumo, we are now able to assign barcodes in less than 2 mins. The same process used to take us 20 mins previously. We are now saving hundreds of hours a month generating Advanced Shipment Notifications. It has reduced manual errors drastically.”
About the customer
The case study: In a nutshell
Process unstructured bill of lading
- Biagibros scans and extract data from bill of lading documents to generate barcodes.
Identify & classify documents
- Biagibros needs to classify different types of bill of lading and queue for manual data extraction
- Data to extract includes customer order information and careeir details.
Capture data from bill of lading to generate barcodes
- Not only did the structures vary for different bill of lading documents but the position of data to capture varies
- Some of this data was in tabular formats.
Categorize & derive attributes from extracted data
- The manual extraction lacked a logical validation to ensure accuracy.
The Docsumo Solution
Ingesting bill of lading
- API-based direct integration that seamlessly ingests bill of lading onto Docsumo.
Pre-processing and getting ready for data extraction
- Inbuilt document pre-processors identified the letter formats (JPG, PDF, PNG etc.) and queued them up for data extraction.
Data extraction from unstructured text
- Docsumo's OCR module used the vectorized position reference in a letter to extract data.
- The OCR not only parsed through letters with varying fonts, layouts, image quality, and resolution; it even extracted data from the tables with 99%+ accuracy.
Intelligent categorization of key value pairs
- Our proprietary NLP-based classification framework started rapidly learning from all the documents. It was trained to categorize key value pairs and line items.
- Another algorithm started making intelligent predictions to identify the data within a document.
Rule-based data validation
- Once the data is extracted, a rule-based validation engine applied contextual data validation and correction algorithms.
Integration with downstream software
- The data was extracted in a JSON format that was easily integrated into DellBoomi and Highjumpinto.
Result: 99%+ Data extraction accuracy
Fill up the form to speak with an automation expert.