Intelligent Document Processing

Automating data extraction for income verification documents for mortgage loans

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Automating data extraction for income verification documents for mortgage loans

Mortgage bankers face huge consumer demand to provide better customer experiences and make them seamless. Today, it is important to ensure that transactions processed are secure, verified, and audited. Income verification documents are used to check mortgage payees’ credentials and ensure they provide genuine details.  Banks struggle to meet compliance requirements when processing customer data and personally identifiable information. This is where emerging technologies such as automated data extraction and intelligent OCR solutions come in.

Requirement of income verification documents in BFSI

Document fraud is a common problem faced by bankers and lenders who give mortgages to prospective buyers. The mortgage loan origination process entails various stages which a buyer goes through in order to get a loan from banks for buying homes, leading all the way up to the keys being handed over.

While a majority of submissions are genuine, sometimes users fabricate information in these documents to get loan approvals for mortgages. Fraudsters can hire third-party counterfeiting services to commit document fraud and many don’t need design skills to get past data security checks.

Changing the formatting of checks is a minor error which can be spotted but more sophisticated alterations such as edits to fonts, styles, spacing, etc., are often missed by the human eye.

A classic example is this black line for the activity summary. Notice how it doesn’t reach all the way to the end in the second image. 

Activity summary

When processing huge volumes of documents, spotting these minute details becomes a challenge which is why mortgage employment verification Is needed. Many mortgage documents are photocopied and sent to banks and fintech companies. Sometimes the level of reproduction is such that it’s not visible to the eye that they’re not originals. 

Another common type of fraud is robo signatures where officials who have worked at various banks end up signing multiple documents. These electronic signatures are stolen and used for producing false notaries, forgeries, and millions of fraudulent documents. 

There are cases of non-authentic signatures being stamped on loan documents which malicious agents use to trick clients. Banks use employment verification for mortgages and analyze customer documents using AI, OCR, and machine learning solutions to spot such alterations and cross-reference details with original documents. 

Types of income verification documents

1. IRS Forms

IRS Form I-9 proves the identity of employers and a recipient’s income status. Name, address, and social security number are the key details enlisted in these. Permanent Resident Cards, scanned passport pages, and employment authorization documents are needed as supporting documents while filling these forms up. Freelancers and self-employed individuals submit IRS Form 1099 Miscellaneous Income for mortgage applications.

2. Loss of income Documents

Occasionally, banks require individuals to furnish a loss of income form to show changes in work status. Sometimes an employee gets fired, loses a job, or goes through a period of unemployment before getting hired again. Loss of income form is used for verification of employment and Section 2 of these forms is the area that explains unemployment reasons and details.

3. Paystub

Banks and NBFCs require paystubs to be attested by current or previous employers since forging pay stubs is a common practice by criminals. Paystubs provide proof of employee earnings in a company or organization.

4. Income verification letter

Income verification letters are used to show history of payments, salary, employment details, and additional references. Verifier’s name, company information, zip code, address, employment start and end dates, etc., are contained in it. Some states have a state-specific income verification form which banks request at the time of mortgage applications when these letters are not furnished. Income verification letters alone aren’t enough and they must be attested by a notary. Additional documents have to be submitted as proof of income for verification purposes such as tax returns, bank statements, and worker’s compensation letter. 

Automated data extraction from income verification documents for analysis

Near-replica forgeries of documents used for fannie mae verification of mortgage make it possible for criminals to apply for loans and mortgages using fake identities and get away with financial crimes. Banks use automated data extraction from income verification documents for deeper analysis and get insights related to different forgeries committed.

By using intelligent OCR data capture and machine learning algorithms, it is possible to identify nuances in forgeries, examine document layouts, and spot missing details. AI technology combined with data extraction can help lenders prevent document frauds by marking false positives and cross-referencing databases that have analyzed original documents in the past.

Mortgage underwriters find it challenging to go through piles of documents and are usually responsible for approving or denying loan applications. As part of mortgage employment verification reviews, they have to do a cash flow analysis and verification modules can be created using automated document extraction software to streamline and speed up the process. 

Docsumo makes it easy to extract data from Freddie Mac Form 91 and Fannie Mae Form 1084 and automatically enter those details into cash flow analysis forms. Here is a step-by-step overview of how to automate data extraction from income verification documents for analysis using Docsumo:

  1. Log in to and enter your user credentials to access the platform
  1. In order to extract data from your income verification forms, you have to create an API and train it. To do this, go to Document Types and hit Create New Document Type
  1. Upload your first income verification form and set key value pairs. Keys are the main fields and values are the values associated with them. By doing this, you help the machine learning model get familiarized with the structure and layout of your verification form. 
  1. Set the appropriate data types for the values of each field. You can add sections as a document is made up of multiple sections. Add key value pairs to each section and work your way in that direction. Use the ‘Add Section,’ and ‘Add Key Value Pair,’ buttons to do this. Once your sections and key value pairs are set, you’re ready for the next step
  1. Click on Save and Close. Docsumo will confirm if you want to apply the changes to all your new documents or to existing and new ones. Select your preference and close.
  1. At this point, your income verification form has been annotated. Now go to Document Types and upload at least 20 income verification forms. You will find extracted key value pairs shown by the API.  Click on ‘Review,’ to see if any values are missing. You can guide the API and help the software capture missing fields during the review process and assist the model in getting smarter
  1. After all the fields are correctly annotated, you can go ahead and click ‘Approve.’ You can proceed to ‘Approve,’ the remaining documents if everything is in order and the model is reading accurately.
  1. Next time you want to extract data from your IRS forms automatically, simply enable the income Verification API under APIs & Services


If you’re trying to automate data extraction from income verifications forms for quick and accurate analysis, using software like Docsumo is a reliable choice. Sign up for a free demo today to see how it works and get hands-on experience!

Suggested Case Study
Automating Portfolio Management for Westland Real Estate Group
The portfolio includes 14,000 units across all divisions across Los Angeles County, Orange County, and Inland Empire.
Thank you! You will shortly receive an email
Oops! Something went wrong while submitting the form.
Pankaj Tripathi
Written by
Pankaj Tripathi

Helping enterprises capture data for analytics and decisioning

Is document processing becoming a hindrance to your business growth?
Join Docsumo for recent Doc AI trends and automation tips. Docsumo is the Document AI partner to the leading lenders and insurers in the US.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.