SlideShare una empresa de Scribd logo
1 de 42
Learn What Is Intelligent Document and Data Capture
and Get Started
The Paperless Office…
Chasing the Impossible?
In a now famous (or infamous) 1975
issue of BusinessWeek titled “The Office
of the Future” technologists describe
“The Paperless Office.”
“Vincent E. Giuliano of Arthur D. Little,
Inc., figures that the use of paper in
business for records and correspondence
should be declining by 1980, ‘and by 1990,
most record-handling will be electronic.’”
I think we can all agree that we’re
not there yet.
How about we agree that what we really
want is “The Nearly Paperless Office”?
The first part of any Document or
Content Management System is capture.
What is Intelligent
Document and Data
Capture?
To keep it simple let’s stick with AIIM’s (Association
for Information and Image Management) definition.
AIIM is a nonprofit, serving information and image
professionals.
“Document capture and data capture are not the
same thing. Document capture is the conversion of a
paper document into an electronic image of that
document. Data capture extracts data from a
business form”.
We’ll interpret “form” here as
any paper or electronic source.
Why intelligent or
automated?
Reduce Labor
Speed Processing and
Information Delivery
Comply with
Regulations
Reduce Errors
So what is the capture process?
So what is the capture process?
There are many models, from broad three-
step processes to more specific five-step
processes.
So what is the capture process?
There are many models, from broad three-
step processes to more specific five-step
processes.
Let’s go with the five-step.
1. Capture
Paper Sources: Electronic Sources:
Captured with scanners or
MFP devices.
Network directories, emails,
electronic forms, print streams,
faxes…anything made of 1’s and 0s.
2. Classify/Organize/Categorize
Identifying what the document or information is in
order to correctly process and deliver the document
and extract the information.
2. Classify/Organize/Categorize
Identifying what the document or information is in
order to correctly process and deliver the document
and extract the information.
Invoice ContractTax Form
Patient
Record
?
How should it be processed? Where should it be
routed and stored?
3. Extract or Mine
Capturing data for the index or other purposes.
May be data such as
customer number, freight
tracking number, invoice
number, supplier name
etc.
Or, full-text indexing may
be required where all
text on the documents
are captured. See What
is Document Indexing.
4. Validate
Using technology or manual inspection to ensure that
a document is classified and processed correctly
4. Validate
With technology this may mean automatically validating
against data sources or employing business rules.
For instance if an inventory item should contain three alpha
characters followed by five numbers, all documents not
following that scheme may be tagged for manual inspection
before further processing is done.
PEN21096
CAP36581
INV98453
PA568793
5. Deliver or Integrate
…to or with a search and retrieval or content
management system.
Obviously, without a system to
locate documents or data, a system
is useless.
Henry Schein,
Dentrix, Dentrix
Enterprise
Dentrix Ascend,
Easy Dental
Viive,
DentalVision,
axiUm
5. Deliver or Integrate
Often index information is sent to the document
management system via an XML or CSV file where it can be
made immediately available to the user.
Systems such as SharePoint, Epic, Laserfiche and other
ECM, EMR, EHR systems have various ways of accepting
data feeds
Filenet
Laserfiche
Documentum
MyMedicalRecords
Eaglesoft
Allscripts
Epic
Dentrix
CSV or XML
So how do we get that pig to
Today we have proven and developing
technologies propelling us to The Nearly
Paperless Office.
Barcode recognition (BCR) offers
the most trustworthy recognition
technology for data capture.
• Split Files
• Classify Documents
• Route Files
• Index
• Name Files
• Bookmark PDFs
Use Barcodes to …
Learn more at What Can Barcodes Do For Me?
OCR is another mature data capture technology to...
• Digitize text images so that they can be electronically
edited, searched, and stored
• Make image-based files fully text-searchable or extract
data from a zone for indexing
• Identify document areas for automatic OCR capture
(zonal OCR)
• Drag-and-drop highlighted document text which is
automatically OCR'd and dropped into index fields (drag
and drop OCR or rubber band OCR)
• Use extracted data to split, name, route, validate, etc.
Other Recognition Technologies For Data
Capture
• Handwriting recognition
• Not as accurate as OCR, limited role in some capture systems
ICR (Intelligent Character
Recognition)
• Capturing human-marked data from document forms such as
surveys and tests.
• Like ICR, lower accuracy, limited application within data capture
OMR (Optical Mark Recognition)
• Uses BCR, OCR, ICR and OMR in a structured data capture format
• Typically templates are designed to instruct the capture software
where to look for information and how to process the information
Forms Recognition
Data or Text Mining
(Often using Regular Expressions (regex))
A fast and powerful method to search, extract and
replace specific data found within scanned documents.
• Essentially a special text string for
describing a search pattern.
• Extremely flexible and patterns can be
constructed to match almost anything.
• Use data identified with regex to
classify, split, name and route files.
Learn more at Using Regular Expressions for Automated Data Capture and Extraction.
Data or Text Mining
(Often using Regular Expressions (regex))
…simply processing a large volume of
documents, generally into a few files
or one file and using intelligent
capture software to process.
Some products process folders of
documents on demand or “watch”
folders for files to process.
Batch Document
Processing
Learn more at What is Batch Document Processing?
Image Enhancement
• Adaptive thresholding
• Deskew
• Despeckle
• Remove blank pages or
separator sheets
• Auto rotate
• Remove lines
To improve usability and increase accuracy of OCR and other
recognition technologies, image enhancement is required.
Learn more at Improving OCR Accuracy with Cleanup and Enhancement.
Where is intelligent
document and data
capture going?
Cloud Computing
Increased cloud computing will bring easily
accessible resources and repositories for
documents.
See Docs in the Clouds.
“The use of cloud computing is growing,
and by 2016 this growth will increase to
become the bulk of new IT spend.”
Gartner, Inc. Oct. 2013
Security Focus
Couple the increasing number of documents being
stored with the growing ways to access them, and
security concerns will continue to increase.
Improved Data Mining and
Classification
The increased used of data mining and better
classification will increase OCR demands and
lower the use of barcodes and separator pages.
Increased Mobility
Increased mobility demands in business impacts
all information technology. Users want all
information available from all platforms, no
matter when or where.
Don’t be caught napping,
JUST GET STARTED.
No one data capture product can “do it
all”, but there is no better time to get
started than now. ”The Nearly Paperless
Office” can be yours.
Learn More about Document Imaging and Capture
For more on:
• Watching folder,
• Monitoring folder,
• Watching folders,
• Batch Processing,
• Bulk scanning,
• Split files with barcodes,
• Barcode splitting,
• How to batch process,
• Batch process folders,
• Docufi,
• Imageramp,
• Watch folders,
• Data capture,
• Scanning to folders,
• Scanning to folder,
• Scan to Folder,
• Batch Splitting
• Migration to document
management
Contact Us
DocuFi
30 years’ experience in the Document Imaging market
Capture Solutions www.docufi.com
Copyright ©2014
makers of ImageRamp,
Document Management
Capture Solution
Image Credits
• Christina Rutz, “When Pigs Fly”, http://bit.ly/1giOj05
• Nottsexminer , “Utopia”, http://bit.ly/1gnZTmS
• Kenny Louie, “One Way”, http://bit.ly/1iA7pxQ
• Spiffie, “Fujitsu ScanSnap S300M”, http://bit.ly/1ksdhhv
• Doctorwonder, “Stack O'Money!”, http://bit.ly/1fgxpko
• Maciej Lewandowski, “Pig on the wings”, http://bit.ly/N6lZCJ
• Sjsharktank, “Pigs fly, so now what?”, http://bit.ly/1g8UsYc
• Elvissa, “flyingpig”, http://bit.ly/1nLMzyB
• Jennicatpink, “Piglet Pile”, http://bit.ly/1cT6KUF
• Eddi, “phone”, http://bit.ly/1ftUezJ
• Martin Cathrae, “Cute Piggie“,http://bit.ly/1nLUDiT
• Sarah Beth Dwyer, “Jim's Pig”, http://bit.ly/Prl3dl

Más contenido relacionado

La actualidad más candente

Painless Document Scanning and Indexing with Alfresco
Painless Document Scanning and Indexing with AlfrescoPainless Document Scanning and Indexing with Alfresco
Painless Document Scanning and Indexing with AlfrescoBlueFishTX
 
DocuSolve Scanning Solutions
DocuSolve Scanning SolutionsDocuSolve Scanning Solutions
DocuSolve Scanning SolutionsGordon Bishop
 
Data warehouse and olap technology
Data warehouse and olap technologyData warehouse and olap technology
Data warehouse and olap technologyDataminingTools Inc
 
Data mining concepts and work
Data mining concepts and workData mining concepts and work
Data mining concepts and workAmr Abd El Latief
 
Data Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesData Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesFellowBuddy.com
 
Key aspects of big data storage and its architecture
Key aspects of big data storage and its architectureKey aspects of big data storage and its architecture
Key aspects of big data storage and its architectureRahul Chaturvedi
 
Data Mining Concepts
Data Mining ConceptsData Mining Concepts
Data Mining ConceptsDung Nguyen
 
Classification and prediction in data mining
Classification and prediction in data miningClassification and prediction in data mining
Classification and prediction in data miningEr. Nawaraj Bhandari
 

La actualidad más candente (20)

8 Document Capture Must Haves, a Document Management Tutorial
8 Document Capture Must Haves, a Document Management Tutorial8 Document Capture Must Haves, a Document Management Tutorial
8 Document Capture Must Haves, a Document Management Tutorial
 
An Introduction to Document Scanning, Understanding Your Requirements
An Introduction to Document Scanning, Understanding Your RequirementsAn Introduction to Document Scanning, Understanding Your Requirements
An Introduction to Document Scanning, Understanding Your Requirements
 
Batch Document Processing with ImageRamp Batch
Batch Document Processing with ImageRamp BatchBatch Document Processing with ImageRamp Batch
Batch Document Processing with ImageRamp Batch
 
Fujitsu ScanSnap Scanner, an overview of document data capture with barcodes,...
Fujitsu ScanSnap Scanner, an overview of document data capture with barcodes,...Fujitsu ScanSnap Scanner, an overview of document data capture with barcodes,...
Fujitsu ScanSnap Scanner, an overview of document data capture with barcodes,...
 
What is Document Indexing? A tutorial for intelligent data capture.
What is Document Indexing? A tutorial for intelligent data capture.What is Document Indexing? A tutorial for intelligent data capture.
What is Document Indexing? A tutorial for intelligent data capture.
 
Painless Document Scanning and Indexing with Alfresco
Painless Document Scanning and Indexing with AlfrescoPainless Document Scanning and Indexing with Alfresco
Painless Document Scanning and Indexing with Alfresco
 
Mobile Cloud Capture: Customize your Data Capture on Mobile Devices with Proc...
Mobile Cloud Capture: Customize your Data Capture on Mobile Devices with Proc...Mobile Cloud Capture: Customize your Data Capture on Mobile Devices with Proc...
Mobile Cloud Capture: Customize your Data Capture on Mobile Devices with Proc...
 
ChronoScan Document Scanning and Capture for Unparralleled Data Extraction an...
ChronoScan Document Scanning and Capture for Unparralleled Data Extraction an...ChronoScan Document Scanning and Capture for Unparralleled Data Extraction an...
ChronoScan Document Scanning and Capture for Unparralleled Data Extraction an...
 
Custom Capture Tool Development
Custom Capture Tool DevelopmentCustom Capture Tool Development
Custom Capture Tool Development
 
Improve OCR Accuracy, Clean Up and Enhance Scanned Images
Improve OCR Accuracy, Clean Up and Enhance Scanned ImagesImprove OCR Accuracy, Clean Up and Enhance Scanned Images
Improve OCR Accuracy, Clean Up and Enhance Scanned Images
 
PDF vs. TIFF, An Evaluation of Document Scanning File Formats
PDF vs. TIFF, An Evaluation of Document Scanning File FormatsPDF vs. TIFF, An Evaluation of Document Scanning File Formats
PDF vs. TIFF, An Evaluation of Document Scanning File Formats
 
DocuSolve Scanning Solutions
DocuSolve Scanning SolutionsDocuSolve Scanning Solutions
DocuSolve Scanning Solutions
 
Data warehouse and olap technology
Data warehouse and olap technologyData warehouse and olap technology
Data warehouse and olap technology
 
Data mining concepts and work
Data mining concepts and workData mining concepts and work
Data mining concepts and work
 
Big data
Big dataBig data
Big data
 
Data Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture NotesData Mining & Data Warehousing Lecture Notes
Data Mining & Data Warehousing Lecture Notes
 
Key aspects of big data storage and its architecture
Key aspects of big data storage and its architectureKey aspects of big data storage and its architecture
Key aspects of big data storage and its architecture
 
Data Mining Concepts
Data Mining ConceptsData Mining Concepts
Data Mining Concepts
 
Classification and prediction in data mining
Classification and prediction in data miningClassification and prediction in data mining
Classification and prediction in data mining
 
Data Warehouse
Data Warehouse Data Warehouse
Data Warehouse
 

Destacado

Scanning & document management
Scanning & document managementScanning & document management
Scanning & document managementGautam Ganguly
 
Why you need to use document scanning management system for business?
Why you need to use document scanning management system for business?Why you need to use document scanning management system for business?
Why you need to use document scanning management system for business?Digismartek
 
Scanning Document Types | Record Nations
Scanning Document Types | Record NationsScanning Document Types | Record Nations
Scanning Document Types | Record NationsRecord Nations
 
Apa itu soft copy
Apa itu soft copyApa itu soft copy
Apa itu soft copyjohnthj
 
Document scanning and capture (local, central, outsource) what's working best
Document scanning and capture (local, central, outsource) what's working bestDocument scanning and capture (local, central, outsource) what's working best
Document scanning and capture (local, central, outsource) what's working bestVander Loto
 

Destacado (8)

Image Scanning Services
Image Scanning ServicesImage Scanning Services
Image Scanning Services
 
Scanning & document management
Scanning & document managementScanning & document management
Scanning & document management
 
Why you need to use document scanning management system for business?
Why you need to use document scanning management system for business?Why you need to use document scanning management system for business?
Why you need to use document scanning management system for business?
 
What is Data Capture
What is Data CaptureWhat is Data Capture
What is Data Capture
 
RU
RURU
RU
 
Scanning Document Types | Record Nations
Scanning Document Types | Record NationsScanning Document Types | Record Nations
Scanning Document Types | Record Nations
 
Apa itu soft copy
Apa itu soft copyApa itu soft copy
Apa itu soft copy
 
Document scanning and capture (local, central, outsource) what's working best
Document scanning and capture (local, central, outsource) what's working bestDocument scanning and capture (local, central, outsource) what's working best
Document scanning and capture (local, central, outsource) what's working best
 

Similar a What is Intelligent Document and Data Capture? A look at the technologies to move to a "nearly" paperless office.

Modern Document Processing | Nanonets Blog.pdf
Modern Document Processing | Nanonets Blog.pdfModern Document Processing | Nanonets Blog.pdf
Modern Document Processing | Nanonets Blog.pdfDhanashreeBadhe
 
What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?ARC Document Solutions
 
Data Processing and its Types
Data Processing and its TypesData Processing and its Types
Data Processing and its TypesMuhammad Zubair
 
Proven Methods of Data Collection in Data Processing
Proven Methods of Data Collection in Data ProcessingProven Methods of Data Collection in Data Processing
Proven Methods of Data Collection in Data Processingloginworks software
 
ITGulfCoast: Technology Trends In The Legal Industry by Garrett LaBorde
ITGulfCoast: Technology Trends In The Legal Industry by Garrett LaBordeITGulfCoast: Technology Trends In The Legal Industry by Garrett LaBorde
ITGulfCoast: Technology Trends In The Legal Industry by Garrett LaBordeGarrett P. Laborde
 
UiPath Document Understanding_Day 2.pptx
UiPath Document Understanding_Day 2.pptxUiPath Document Understanding_Day 2.pptx
UiPath Document Understanding_Day 2.pptxRohitRadhakrishnan8
 
No Code Data Transformation for Insurance with Altair Monarch
No Code Data Transformation for Insurance with Altair MonarchNo Code Data Transformation for Insurance with Altair Monarch
No Code Data Transformation for Insurance with Altair MonarchAltair
 
Choosing the right IDP Solution
Choosing the right IDP SolutionChoosing the right IDP Solution
Choosing the right IDP SolutionProvectus
 
Accenture Insurance Data Capture
Accenture Insurance Data Capture Accenture Insurance Data Capture
Accenture Insurance Data Capture Accenture Insurance
 
Automation of document management paul fenton webinar
Automation of document management paul fenton webinarAutomation of document management paul fenton webinar
Automation of document management paul fenton webinarMontrium
 
Document Automation and Integration Webinar For CVision
Document Automation and Integration Webinar For CVisionDocument Automation and Integration Webinar For CVision
Document Automation and Integration Webinar For CVisionChris Riley ☁
 
Prescriptive Analytics-1.pptx
Prescriptive Analytics-1.pptxPrescriptive Analytics-1.pptx
Prescriptive Analytics-1.pptxKarthik132344
 

Similar a What is Intelligent Document and Data Capture? A look at the technologies to move to a "nearly" paperless office. (20)

Modern Document Processing | Nanonets Blog.pdf
Modern Document Processing | Nanonets Blog.pdfModern Document Processing | Nanonets Blog.pdf
Modern Document Processing | Nanonets Blog.pdf
 
DU_SERIES_Session1.pdf
DU_SERIES_Session1.pdfDU_SERIES_Session1.pdf
DU_SERIES_Session1.pdf
 
What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?What is Optical Character Recognition (OCR) Technology?
What is Optical Character Recognition (OCR) Technology?
 
Data Processing and its Types
Data Processing and its TypesData Processing and its Types
Data Processing and its Types
 
Proven Methods of Data Collection in Data Processing
Proven Methods of Data Collection in Data ProcessingProven Methods of Data Collection in Data Processing
Proven Methods of Data Collection in Data Processing
 
iot_module4.pdf
iot_module4.pdfiot_module4.pdf
iot_module4.pdf
 
ITGulfCoast: Technology Trends In The Legal Industry by Garrett LaBorde
ITGulfCoast: Technology Trends In The Legal Industry by Garrett LaBordeITGulfCoast: Technology Trends In The Legal Industry by Garrett LaBorde
ITGulfCoast: Technology Trends In The Legal Industry by Garrett LaBorde
 
IoT underthe hood
IoT underthe hoodIoT underthe hood
IoT underthe hood
 
UiPath Document Understanding_Day 2.pptx
UiPath Document Understanding_Day 2.pptxUiPath Document Understanding_Day 2.pptx
UiPath Document Understanding_Day 2.pptx
 
No Code Data Transformation for Insurance with Altair Monarch
No Code Data Transformation for Insurance with Altair MonarchNo Code Data Transformation for Insurance with Altair Monarch
No Code Data Transformation for Insurance with Altair Monarch
 
Choosing the right IDP Solution
Choosing the right IDP SolutionChoosing the right IDP Solution
Choosing the right IDP Solution
 
Document Parsing
Document ParsingDocument Parsing
Document Parsing
 
Accenture Insurance Data Capture
Accenture Insurance Data Capture Accenture Insurance Data Capture
Accenture Insurance Data Capture
 
Automation of document management paul fenton webinar
Automation of document management paul fenton webinarAutomation of document management paul fenton webinar
Automation of document management paul fenton webinar
 
Document Automation and Integration Webinar For CVision
Document Automation and Integration Webinar For CVisionDocument Automation and Integration Webinar For CVision
Document Automation and Integration Webinar For CVision
 
Machine Data Analytics
Machine Data AnalyticsMachine Data Analytics
Machine Data Analytics
 
Leveraging IOT and Latest Technologies
Leveraging IOT and Latest TechnologiesLeveraging IOT and Latest Technologies
Leveraging IOT and Latest Technologies
 
Abstract
AbstractAbstract
Abstract
 
Prescriptive Analytics-1.pptx
Prescriptive Analytics-1.pptxPrescriptive Analytics-1.pptx
Prescriptive Analytics-1.pptx
 
DU PPT (1).pptx
DU PPT (1).pptxDU PPT (1).pptx
DU PPT (1).pptx
 

Último

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 

Último (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 

What is Intelligent Document and Data Capture? A look at the technologies to move to a "nearly" paperless office.

  • 1. Learn What Is Intelligent Document and Data Capture and Get Started The Paperless Office… Chasing the Impossible?
  • 2. In a now famous (or infamous) 1975 issue of BusinessWeek titled “The Office of the Future” technologists describe “The Paperless Office.”
  • 3. “Vincent E. Giuliano of Arthur D. Little, Inc., figures that the use of paper in business for records and correspondence should be declining by 1980, ‘and by 1990, most record-handling will be electronic.’”
  • 4. I think we can all agree that we’re not there yet.
  • 5. How about we agree that what we really want is “The Nearly Paperless Office”?
  • 6. The first part of any Document or Content Management System is capture.
  • 7. What is Intelligent Document and Data Capture?
  • 8. To keep it simple let’s stick with AIIM’s (Association for Information and Image Management) definition. AIIM is a nonprofit, serving information and image professionals.
  • 9. “Document capture and data capture are not the same thing. Document capture is the conversion of a paper document into an electronic image of that document. Data capture extracts data from a business form”.
  • 10. We’ll interpret “form” here as any paper or electronic source.
  • 11. Why intelligent or automated? Reduce Labor Speed Processing and Information Delivery Comply with Regulations Reduce Errors
  • 12. So what is the capture process?
  • 13. So what is the capture process? There are many models, from broad three- step processes to more specific five-step processes.
  • 14. So what is the capture process? There are many models, from broad three- step processes to more specific five-step processes. Let’s go with the five-step.
  • 15. 1. Capture Paper Sources: Electronic Sources: Captured with scanners or MFP devices. Network directories, emails, electronic forms, print streams, faxes…anything made of 1’s and 0s.
  • 16. 2. Classify/Organize/Categorize Identifying what the document or information is in order to correctly process and deliver the document and extract the information.
  • 17. 2. Classify/Organize/Categorize Identifying what the document or information is in order to correctly process and deliver the document and extract the information. Invoice ContractTax Form Patient Record ? How should it be processed? Where should it be routed and stored?
  • 18. 3. Extract or Mine Capturing data for the index or other purposes. May be data such as customer number, freight tracking number, invoice number, supplier name etc. Or, full-text indexing may be required where all text on the documents are captured. See What is Document Indexing.
  • 19. 4. Validate Using technology or manual inspection to ensure that a document is classified and processed correctly
  • 20. 4. Validate With technology this may mean automatically validating against data sources or employing business rules. For instance if an inventory item should contain three alpha characters followed by five numbers, all documents not following that scheme may be tagged for manual inspection before further processing is done. PEN21096 CAP36581 INV98453 PA568793
  • 21. 5. Deliver or Integrate …to or with a search and retrieval or content management system. Obviously, without a system to locate documents or data, a system is useless.
  • 22. Henry Schein, Dentrix, Dentrix Enterprise Dentrix Ascend, Easy Dental Viive, DentalVision, axiUm 5. Deliver or Integrate Often index information is sent to the document management system via an XML or CSV file where it can be made immediately available to the user. Systems such as SharePoint, Epic, Laserfiche and other ECM, EMR, EHR systems have various ways of accepting data feeds Filenet Laserfiche Documentum MyMedicalRecords Eaglesoft Allscripts Epic Dentrix CSV or XML
  • 23. So how do we get that pig to
  • 24. Today we have proven and developing technologies propelling us to The Nearly Paperless Office.
  • 25. Barcode recognition (BCR) offers the most trustworthy recognition technology for data capture.
  • 26. • Split Files • Classify Documents • Route Files • Index • Name Files • Bookmark PDFs Use Barcodes to … Learn more at What Can Barcodes Do For Me?
  • 27. OCR is another mature data capture technology to... • Digitize text images so that they can be electronically edited, searched, and stored • Make image-based files fully text-searchable or extract data from a zone for indexing • Identify document areas for automatic OCR capture (zonal OCR) • Drag-and-drop highlighted document text which is automatically OCR'd and dropped into index fields (drag and drop OCR or rubber band OCR) • Use extracted data to split, name, route, validate, etc.
  • 28. Other Recognition Technologies For Data Capture • Handwriting recognition • Not as accurate as OCR, limited role in some capture systems ICR (Intelligent Character Recognition) • Capturing human-marked data from document forms such as surveys and tests. • Like ICR, lower accuracy, limited application within data capture OMR (Optical Mark Recognition) • Uses BCR, OCR, ICR and OMR in a structured data capture format • Typically templates are designed to instruct the capture software where to look for information and how to process the information Forms Recognition
  • 29. Data or Text Mining (Often using Regular Expressions (regex)) A fast and powerful method to search, extract and replace specific data found within scanned documents.
  • 30. • Essentially a special text string for describing a search pattern. • Extremely flexible and patterns can be constructed to match almost anything. • Use data identified with regex to classify, split, name and route files. Learn more at Using Regular Expressions for Automated Data Capture and Extraction. Data or Text Mining (Often using Regular Expressions (regex))
  • 31. …simply processing a large volume of documents, generally into a few files or one file and using intelligent capture software to process. Some products process folders of documents on demand or “watch” folders for files to process. Batch Document Processing Learn more at What is Batch Document Processing?
  • 32. Image Enhancement • Adaptive thresholding • Deskew • Despeckle • Remove blank pages or separator sheets • Auto rotate • Remove lines To improve usability and increase accuracy of OCR and other recognition technologies, image enhancement is required. Learn more at Improving OCR Accuracy with Cleanup and Enhancement.
  • 33. Where is intelligent document and data capture going?
  • 34. Cloud Computing Increased cloud computing will bring easily accessible resources and repositories for documents. See Docs in the Clouds. “The use of cloud computing is growing, and by 2016 this growth will increase to become the bulk of new IT spend.” Gartner, Inc. Oct. 2013
  • 35. Security Focus Couple the increasing number of documents being stored with the growing ways to access them, and security concerns will continue to increase.
  • 36. Improved Data Mining and Classification The increased used of data mining and better classification will increase OCR demands and lower the use of barcodes and separator pages.
  • 37. Increased Mobility Increased mobility demands in business impacts all information technology. Users want all information available from all platforms, no matter when or where.
  • 38. Don’t be caught napping, JUST GET STARTED.
  • 39. No one data capture product can “do it all”, but there is no better time to get started than now. ”The Nearly Paperless Office” can be yours.
  • 40. Learn More about Document Imaging and Capture
  • 41. For more on: • Watching folder, • Monitoring folder, • Watching folders, • Batch Processing, • Bulk scanning, • Split files with barcodes, • Barcode splitting, • How to batch process, • Batch process folders, • Docufi, • Imageramp, • Watch folders, • Data capture, • Scanning to folders, • Scanning to folder, • Scan to Folder, • Batch Splitting • Migration to document management Contact Us DocuFi 30 years’ experience in the Document Imaging market Capture Solutions www.docufi.com Copyright ©2014 makers of ImageRamp, Document Management Capture Solution
  • 42. Image Credits • Christina Rutz, “When Pigs Fly”, http://bit.ly/1giOj05 • Nottsexminer , “Utopia”, http://bit.ly/1gnZTmS • Kenny Louie, “One Way”, http://bit.ly/1iA7pxQ • Spiffie, “Fujitsu ScanSnap S300M”, http://bit.ly/1ksdhhv • Doctorwonder, “Stack O'Money!”, http://bit.ly/1fgxpko • Maciej Lewandowski, “Pig on the wings”, http://bit.ly/N6lZCJ • Sjsharktank, “Pigs fly, so now what?”, http://bit.ly/1g8UsYc • Elvissa, “flyingpig”, http://bit.ly/1nLMzyB • Jennicatpink, “Piglet Pile”, http://bit.ly/1cT6KUF • Eddi, “phone”, http://bit.ly/1ftUezJ • Martin Cathrae, “Cute Piggie“,http://bit.ly/1nLUDiT • Sarah Beth Dwyer, “Jim's Pig”, http://bit.ly/Prl3dl