How Intelligent Process Automation Addresses the AI Data Problem

Companies looking to make effective use of artificial intelligence (AI) face a big problem: data – as in, AI solutions often require too much of it to create effective models.

Let’s say you’re trying to create a model to automate the mortgage underwriting process. The process typically involves a human looking over lots of documents to assess an applicant’s creditworthiness. Those documents are likely to include tax returns, credit reports, W-2 wage forms, bank statements and more. To create an accurate classification or extraction model for this use case, you’d need over 100,000 sample data points or more.

Unfortunately, this puts AI out of reach for most organizations save the Googles and Amazons of the world.

Rule-based AI: Insatiable demand for data

One way to try to automate a mortgage underwriting workflow and avoid the need for huge data sets, is to use a templated approach, which involves creating a series of rules to extract key bits of relevant data from each type of document. Such data may include adjusted gross income from tax returns for multiple years, salary data from W-2s, and lots of data around credit card balances, auto loans, and other types of debt.

These rules would have to define exactly where on each type of document the relevant data can be found. That’s no mean feat given the variation among the documents in question. Consider just tax returns. One applicant may file a 1040, while another uses 1040A and a third 1040EZ. Bank statements, of course, will vary depending on the bank in question as will credit reports from the three major reporting firms.

While a rules-based approach might seem like a viable solution to avoid the need for thousands of sample data points, you’d find yourself presented with another challenge. Beyond the variation in format, there’s also plenty of judgment calls to be made about which data to extract. Writing rules to cover every possible permutation of what an underwriter may care about is an exercise in futility.

It’s worth noting that robotic process automation (RPA) isn’t a likely solution to this problem. RPA is great at automating predefined steps that never vary. For example, if you know that a salary figure shows up in the same place on the same document every time, you can use RPA to automate the process of highlighting that figure, copying it and pasting it into a downstream system the underwriter uses for credit evaluations. But given all the variation inherent in the mortgage underwriting process, that’s not a viable approach.

Adding intelligence to process automation

What’s required is an AI solution with more emphasis on “intelligence,” which is where the concept of intelligent process automation (IPA) comes in.

IPA tools are able to understand document context and learn what a given value looks like and find it no matter where on a document it may be. For example, any mortgage underwriting process involves assessing the total outstanding debt an applicant is carrying. That means combing over those credit reports to find balances on credit cards, auto loans and the like.

By examining only about 200 examples of what a debt figure looks like, a good intelligent automation tool can then find debt figures on any relevant document, no matter if it’s from Equifax, Experian or TransUnion. That’s because the IPA tool learns and can understand the surrounding context of the document, so it can discern a debt figure from, say, income. It can also be trained to extract data showing whether an applicant is typically on time with loan payments or chronically late.

The key value proposition behind IPA is that it works with just those 200 or so examples of the value in question, a capability known as “low training data.” Training data is the gating factor that stymies so many AI projects; most companies simply don’t have enough data to accurately train the AI tool. But IPA makes use of AI technologies including machine learning, transfer learning and natural language processing to overcome that issue and produce models that are extremely accurate.

And IPA tools can be applied to plenty of use cases besides mortgage underwriting, including insurance claims processing, customer onboarding, title and deed processing, financial document analysis and more.

[addtoany]

Increase intake capacity. Drive top line revenue growth.

Schedule Demo

Unstructured Unlocked podcast

April 24, 2024 | E45

Unstructured Unlocked episode 45 with Daniel Faggella, Head of Research, CEO at Emerj Artificial Intelligence Research

Listen Now

April 10, 2024 | E44

Unstructured Unlocked episode 44 with Tom Wilde, Indico Data CEO, and Robin Merttens, Executive Chairman of InsTech

Listen Now

March 27, 2024 | E43

Unstructured Unlocked episode 43 with Sunil Rao, Chief Executive Officer at Tribble

Listen Now

View All

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Get Started

Industry

Use Cases

Get Started

Resources

Documentation

Customer Stories

Get Started

Get Started

Get Started

Indico Named as Major Contender and Star Performer in Everest Group's PEAK Matrix® for Intelligent Document Processing (IDP)

BLOG

How Intelligent Process Automation Addresses the AI Data Problem

Rule-based AI: Insatiable demand for data

Adding intelligence to process automation

Increase intake capacity. Drive top line revenue growth.

Related Posts

Artificial Intelligence, Insurance Underwriting

Risk assessment redefined: The role of automation in insurance underwriting

Artificial Intelligence, Insurance

Indico CEO Tom Wilde Discusses AI’s Role in Insurance at Insurtech Insights EU 2024

Artificial Intelligence, Digital Transformation, Unstructured Unlocked

How to adopt AI with intention and quality: tips from Sunil Rao

Unstructured Unlocked podcast

Unstructured Unlocked episode 45 with Daniel Faggella, Head of Research, CEO at Emerj Artificial Intelligence Research

Unstructured Unlocked episode 44 with Tom Wilde, Indico Data CEO, and Robin Merttens, Executive Chairman of InsTech

Unstructured Unlocked episode 43 with Sunil Rao, Chief Executive Officer at Tribble

Get started with Indico

Schedule1-1 Demo

Resources

Blog

Gain insights from experts in automation, data, machine learning, and digital transformation.

Unstructured Unlocked

Enterprise leaders discuss how to unlock value from unstructured data.

YouTube Channel

Check out our YouTube channel to see clips from our podcast and more.

Get our best content on intelligent automation sent to your inbox weekly!

Schedule
1-1 Demo