You are logged in as: Free Trial User
Author: Nicolas Boucher, April 2024
Finance teams often deal with a large volume of documents (invoices, contracts, purchase orders, sales orders, credit notes…, etc.).
Unfortunately, in many companies, these documents are processed manually and require significant human labor.
In addition, it could create misplaced invoices, human errors in data entry, and delayed approvals, which add up to lost man hours and inefficiencies.
This article is an overview of a tool, Microsoft Azure AI Document Intelligence (formerly known as Form Recognizer), that can reduce manual document processing by automating the extraction and analysis of financial documents.
Document Intelligence streamlines operations, reduces errors, and accelerates decision-making.
Table of Contents
Azure AI Document Intelligence is a cloud-based task-specific Azure AI service that uses machine-learning models to automate the processing of your data in applications and workflows.
Azure AI Document Intelligence leverages the power of machine learning, deep learning, Optical Character Recognition (OCR) and Supervised learning for document processing.
It has a wide range of applications for process automation, knowledge mining, and other industry-specific applications.
The following section provides a stepwise breakdown of the workflow, outlining the role of each component and the technical aspects of data extraction with Azure AI Document Intelligence.
Collecting Documents (unstructured data):
The starting point involves various unstructured documents in file formats supported by Azure Blob Storage, typically including PDF, JPG, PNG, Handwritten notes, and TIFF.
Data Storage in Azure Blob:
Azure Blob Storage is a cloud-based repository for these unstructured documents. It offers scalable and cost-effective storage for large volumes of data. The URLs of the uploaded documents can be fed to cognitive services for the next step.
Azure Function to Invoke Document Intelligence Service:
An Azure function acts as a serverless compute unit triggered by events. This workflow’s trigger is designed to respond to new document uploads within Azure Blob Storage.
The triggered Azure function utilizes the Azure Functions SDK to interact with other Azure services. The Document Intelligence service uses its REST API or a client library (available in various programming languages) to initiate document processing.
Document Intelligence:
Upon receiving the request from the Azure function, Azure AI Document Intelligence performs document layout analysis, analyzes the data type,
Structured Data for Onwards Processing/Analysis:
The Azure function receives the response from Azure AI Document Intelligence, which contains the extracted data in a structured format like JSON. This structured data is readily usable for further processing and analysis based on your needs, i.e., database storage, system integration, reporting, or logic app to initiate further processing (Payment in case of AP).
The diagram below presents the workflow of Azure Document Intelligence, which automates document processing and converts unstructured data into valuable insights for your applications.
See Azure AI Document Intelligence in action:
Now that we know how it works, it is important to understand the different modalities of Azure AI Document Intelligence.
Document intelligence needs a context to extract the correct data from a document.
The type of model provides this context.
Fundamentally, there are two document analysis models: the prebuilt and the custom models.
A brief overview of both is below:
Prebuilt models are ready-to-use models for common document processing tasks.
These models are trained on a diverse dataset and can recognize common document types such as invoices, receipts, business cards, and forms.
They come pre-trained to extract critical fields and information from these documents accurately.
The most relevant prebuilt models for finance are “Invoices”, “Receipts” and “Contracts”.
Custom models allow users to train their models tailored to their specific document processing needs.
Users can upload their dataset of documents and annotate them with the relevant fields they want to extract.
The custom model then learns from this annotated data to accurately recognize and extract the specified fields.
This is how the model works:
This offers greater control and accuracy for:
The possibilities with Microsoft Azure spread across all finance domains, from understanding tax compliances for different regions or industries to reducing manual processing.
Here are the most commonly used cases:
The most common use case of Azure document intelligence is reducing manual data entry through its OCR model.
See the full documentation on the invoice model: here
Extract key-value pairs from financial statements using General Layout API for consolidations, reporting, or Audit.
We can set up logical sequences in Azure to create a robotic payment automation routine. It can be done in the following steps:
Through intelligent search indexing, essential information from vendor bids or supplier contracts can be extracted to analyze and compare different bids.
If you want to continue to learn more, here is a great webinar from Microsoft:
Here’s a comparison table highlighting key features of Azure Document Intelligence and its prominent alternatives:
Feature | Deployment | Document Formats | Extraction Capabilities | Community & Support | Customization | Advanced Features | Accuracy | Pricing | Ease of Use | Security & Compliance |
---|---|---|---|---|---|---|---|---|---|---|
Azure AI Document Intelligence | Cloud-based | PDF, JPG, PNG, TIFF, BMP, Handwritten etc. | Key-value pairs, tables, text, custom models | Large community, good documentation | Yes, build custom models | Layout analysis, confidence scores | High, depends on document complexity | Free tier, pay-per-use | Easy to use with SDKs and libraries | Compliant with various industry standards |
Amazon Comprehend | Cloud-based | PDF, JPG, PNG, etc. | Key-value pairs, entities, sentiment analysis | Large community, good documentation | Yes, custom entity recognizers | Redaction, multi-language support | High, varies based on document type | Free tier, pay-per-use | Easy to use with SDKs | Compliant with various standards |
Google Cloud Natural Language API | Cloud-based | Various file formats | Text extraction, entity recognition | Large community, good documentation | Limited | Syntax analysis, topic modelling | Moderate | Pay-per-use | Easy to use with client libraries | Compliant with various standards |
IBM Datacap | Cloud-based | Various | Forms, data extraction, workflow automation | Large community, extensive support | Yes, extensive customization | Batch processing, document routing | High | Subscription-based | More complex setup | Supports industry compliance |
ABBYY Vantage | Cloud-based | Various | Invoices, documents, data extraction | Large community, good support | Yes, custom classifiers | Document classification | High | Subscription-based | More complex setup | Supports industry compliance |
Tesseract (Open-Source in Python) | Open-source | Images (JPG, PNG, etc.) | Text extraction (OCR) | Active community, moderate support | Yes through Python | N/A | Lower accuracy compared to commercial options | Free | Requires programming knowledge | N/A |
OpenCV (Open-Source in Python) | Open-source | Images (various) | Text extraction (OCR) | Active community, moderate support | Yes through Python | N/A | Can be customized through Python code | Free | Requires programming knowledge | N/A |
Step-by-step instructions on how to implement Azure AI Document Intelligence for finance document processing
1. Sign up on Azure using the following link: https://signup.azure.com/
You can try it for free for 30 days but you do need to provide your credit card.
There is no installation needed to start using it as you can access Azure through your browser.
Video explaining the sign-up:
2. Setting up Azure Form Recognizer service in Azure Portal
3. Pricing Tier & Connection Details
4. Setting up extraction models using Document Intelligence studio
Here is a short explanatory video:
Azure Document Intelligence offers integrations with various services and tools to streamline your document processing workflow.
Document intelligence can be easily integrated with other Azure services for further processing. Some of the commonly used Azure services with document intelligence are
Third-party applications can be integrated with ERPs and CRMs using several integration methods, including APIs, SDKs, or custom integrations using platforms like Celigo.
Azure Document Intelligence offers a flexible pricing model. It has three different pricing structures.
Pay as You Go: Pay only for what you use.
Commitment Tiers: Pay an upfront monthly fee for high-volume usage at a discount.
Commitment Tiers: Disconnected container
To learn more about the pricing you can also visit the following link:
https://azure.microsoft.com/en-us/pricing/details/ai-document-intelligence/
Considering the nature of financial data, ensuring security and compliance with best practices is important.
Azure Document Intelligence prioritizes encrypting data in transit and at rest, adhering to strict access controls, and following industry standards like SOC 2, HIPAA, and GDPR.
This ensures that your sensitive financial information, like invoice amounts and account numbers, remains protected throughout the processing pipeline.
Additionally, you have control over where your data is stored, allowing you to choose a region that aligns with your financial regulatory requirements.
For all features, the input data and results are deleted within 24 hours and not used for any other purpose.
For customer-trained models, customers can delete their models and associated metadata at any time by using the API.
While Azure Document Intelligence offers a powerful solution for automating data extraction from financial documents, it’s essential to consider its limitations and challenges:
In conclusion, Azure Document Intelligence emerges as a powerful tool for document processing and analysis within the finance sector.
Leveraging advanced machine learning algorithms and cognitive services, Azure AI Document Intelligence can extract structured data from unstructured documents such as invoices, receipts, and forms.
Organizations can significantly improve efficiency, accuracy, and decision-making by automating tedious document processing tasks.
The greatest advantage is to benefit from the integration of Azure AI Document Intelligence with existing workflows and systems, especially if you are using the Microsoft environment, facilitating streamlined operations and enhanced productivity.
STARTS IN:
— NEXT MASTERCLASS IS ON JANUARY 24TH —
12:30 PM - 2:00 PM (New York),
9:30 AM - 11:00 AM (Los Angeles),
6:30 PM - 8:00 PM (Berlin),
5:30 PM - 7:00 PM (London),
4:30 AM - 6:00 AM (next day, Sydney)
How Coca Cola is Using AI to Speed Up the Forecasting Process
Zoom Meeting Link:
https://us06web.zoom.us/j/87403043001?pwd=lG5PB37DMhyjdPRlSWL4zNAHRpxPqU.1
Join the masterclass at the scheduled time and date.
(Your Google Calendar invite has been sent. Please let us know if you haven’t received it.)