SlideShare una empresa de Scribd logo
1 de 11
OPTICAL CHARACTER
RECOGNITION
Divyanshu Sagar
Ahmed Zaid Faizee
Vidyut Singhania
INTRO
1. Ingenious piece of software.
2. Involves the mechanical/electronic
conversion of scanned images of
typewritten/printed text into machine-
encoded/computer-readable text.
• 3. Heavily used in the
industry.
INTRO ii
• Common method of digitizing printed texts
• Subtle software which is as highly overlooked as it is simple.
• Numerous applications and uses – editing, scanning,
searching, comparison, compact storage and many more!
• OCR is a field of research in pattern recognition, artificial
intelligence and computer vision.
Problem Statement
Ever since Charles Babbage invented the computer back in the early 19th
century, Computer machines have held man's imagination for numerous reasons - the
primary being what all is this collection of nuts, bolts and wires capable of doing.
Character Recognition is one such concept which has beheld mankind’s attention. There
can be no greater testimony to the same than the fact that people were already working on
this idea - a few decades before John McCarthy even coined the term "Artificial
Intelligence".
Today, especially, Character Recognition plays a very important part of our daily lives as
they are incorporated so subtly that we even forget their presence. Some examples are
their implementation in Microsoft Word, Adobe Acrobat and even Pen computing.
Optical Character Recognition (OCR) is the mechanical or electronic conversion of scanned
or photoed images of typewritten or printed text into machine-encoded/computer-
readable text. This text can then be used in numerous ways - ranging from assisting the
visually impaired (text-to-speech), extracting information from the image, pen computing
and so on. Optical Character Recognition (OCR) is a result of cross-linking various avenues
of technology like Machine Learning, Artificial Intelligence and Neural Networks. We
propose to develop a system based on mathematical algorithms and principles which
involve all the aforementioned technologies. That being said, Optical Character Recognition
(OCR) also depends on a few other factors : the quality of the image taken, the orientation
of and the dialect being used. Our paper aims to address the aforementioned
problems, which enables its application in numerous new fields as well as the obvious &
established aspects of our surroundings.
Tech Jargon - I
• Pre-processing
Used to improve the successful
recognition of the image (include De-
skew, Layout analysis, Despeckle)
• Character/glyph recognition
• Post-processing
• Application specific optimization
Tweaking the system to better deal
with specific or different inputs.
Tech Jargon - II
Segmentation
Includes two important phases:
1) Obtaining training samples
2) Recognizing new images after
training
Feature Extraction
Feature of the character are extracted
and hence are compared with the glyph
Classification
After the extraction, neural network is
trained using the training data
Our Current Progress
• We started with the Neural Networks / Machine Learning
aspect of the project.
• We have implemented Univariate / Multivariate
Linear/Regularized Linear Regression, Gradient Descent for
Multiple Variables and Logistic/ Regularized Logistic
Regression.
• Currently, we are studying & working on the
implementation of Neural Nets using Forward Propogation.
• We plan on tackling character segmentation and feature
extraction next.
Technology to be used
• We are using the following technology
platforms :
– GNU Octave
To develop and test the OCR software.
– 5MP HD camera (720p @ 30fps)
To take images for detection
Timeline
Literature Review
• Microsoft One Note
• Adobe PDF scanner
• HP scanner
Optical Character Recognition (OCR)

Más contenido relacionado

La actualidad más candente

Handwriting Recognition Using Deep Learning and Computer Version
Handwriting Recognition Using Deep Learning and Computer VersionHandwriting Recognition Using Deep Learning and Computer Version
Handwriting Recognition Using Deep Learning and Computer Version
Naiyan Noor
 
offline character recognition for handwritten gujarati text
offline character recognition for handwritten gujarati textoffline character recognition for handwritten gujarati text
offline character recognition for handwritten gujarati text
Bhumika Patel
 
optical character recognition system
optical character recognition systemoptical character recognition system
optical character recognition system
Vijay Apurva
 
Optical character recognization word
Optical character recognization wordOptical character recognization word
Optical character recognization word
Dhana K
 

La actualidad más candente (20)

Optical Character Recognition( OCR )
Optical Character Recognition( OCR )Optical Character Recognition( OCR )
Optical Character Recognition( OCR )
 
Handwriting Recognition Using Deep Learning and Computer Version
Handwriting Recognition Using Deep Learning and Computer VersionHandwriting Recognition Using Deep Learning and Computer Version
Handwriting Recognition Using Deep Learning and Computer Version
 
OCR (Optical Character Recognition)
OCR (Optical Character Recognition) OCR (Optical Character Recognition)
OCR (Optical Character Recognition)
 
Character Recognition using Machine Learning
Character Recognition using Machine LearningCharacter Recognition using Machine Learning
Character Recognition using Machine Learning
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character Recognition
 
Final Report on Optical Character Recognition
Final Report on Optical Character Recognition Final Report on Optical Character Recognition
Final Report on Optical Character Recognition
 
Project report of OCR Recognition
Project report of OCR RecognitionProject report of OCR Recognition
Project report of OCR Recognition
 
offline character recognition for handwritten gujarati text
offline character recognition for handwritten gujarati textoffline character recognition for handwritten gujarati text
offline character recognition for handwritten gujarati text
 
Optical Character Recognition Using Python
Optical Character Recognition Using PythonOptical Character Recognition Using Python
Optical Character Recognition Using Python
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character Recognition
 
optical character recognition system
optical character recognition systemoptical character recognition system
optical character recognition system
 
Handwriting Recognition
Handwriting RecognitionHandwriting Recognition
Handwriting Recognition
 
Ocr abstract
Ocr abstractOcr abstract
Ocr abstract
 
ocr
ocrocr
ocr
 
Text extraction From Digital image
Text extraction From Digital imageText extraction From Digital image
Text extraction From Digital image
 
Optical Character Reader - Project Report BTech
Optical Character Reader - Project Report BTechOptical Character Reader - Project Report BTech
Optical Character Reader - Project Report BTech
 
Handwritten Character Recognition
Handwritten Character RecognitionHandwritten Character Recognition
Handwritten Character Recognition
 
Handwritten character recognition using artificial neural network
Handwritten character recognition using artificial neural networkHandwritten character recognition using artificial neural network
Handwritten character recognition using artificial neural network
 
Optical character recognization word
Optical character recognization wordOptical character recognization word
Optical character recognization word
 
OCR speech using Labview
OCR speech using LabviewOCR speech using Labview
OCR speech using Labview
 

Similar a Optical Character Recognition (OCR)

Intelligent image processing
Intelligent image processingIntelligent image processing
Intelligent image processing
Andrew Stewart
 

Similar a Optical Character Recognition (OCR) (20)

Face Recognition System
Face Recognition SystemFace Recognition System
Face Recognition System
 
Intelligent image processing
Intelligent image processingIntelligent image processing
Intelligent image processing
 
Traffic Violation Detector using Object Detection
Traffic Violation Detector using Object DetectionTraffic Violation Detector using Object Detection
Traffic Violation Detector using Object Detection
 
AIDC India - AI Vision Slides
AIDC India - AI Vision SlidesAIDC India - AI Vision Slides
AIDC India - AI Vision Slides
 
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine LearningMakine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
Makine Öğrenmesi ile Görüntü Tanıma | Image Recognition using Machine Learning
 
Computer architecture for vision system
Computer architecture for vision systemComputer architecture for vision system
Computer architecture for vision system
 
A Deep Learning Approach to Recognize Cursive Handwriting
A Deep Learning Approach to Recognize Cursive HandwritingA Deep Learning Approach to Recognize Cursive Handwriting
A Deep Learning Approach to Recognize Cursive Handwriting
 
IRJET- Sign Language Interpreter
IRJET- Sign Language InterpreterIRJET- Sign Language Interpreter
IRJET- Sign Language Interpreter
 
Using Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIsUsing Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIs
 
IRJET- Object Detection in an Image using Deep Learning
IRJET- Object Detection in an Image using Deep LearningIRJET- Object Detection in an Image using Deep Learning
IRJET- Object Detection in an Image using Deep Learning
 
Optical Recognition of Handwritten Text
Optical Recognition of Handwritten TextOptical Recognition of Handwritten Text
Optical Recognition of Handwritten Text
 
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
IRJET- Scandroid: A Machine Learning Approach for Understanding Handwritten N...
 
IRJET- Intelligent Character Recognition of Handwritten Characters
IRJET- Intelligent Character Recognition of Handwritten CharactersIRJET- Intelligent Character Recognition of Handwritten Characters
IRJET- Intelligent Character Recognition of Handwritten Characters
 
Saksham presentation
Saksham presentationSaksham presentation
Saksham presentation
 
AI GRPOUP 4 PRESENTATION.pptx
AI GRPOUP 4 PRESENTATION.pptxAI GRPOUP 4 PRESENTATION.pptx
AI GRPOUP 4 PRESENTATION.pptx
 
Utilization of Machine Learning in Computer Vision
Utilization of Machine Learning in Computer VisionUtilization of Machine Learning in Computer Vision
Utilization of Machine Learning in Computer Vision
 
IRJET- Text Recognization of Product for Blind Person using MATLAB
IRJET- Text Recognization of Product for Blind Person using MATLABIRJET- Text Recognization of Product for Blind Person using MATLAB
IRJET- Text Recognization of Product for Blind Person using MATLAB
 
4 Best Computer Vision Use Cases for Solving Business Challenges
4 Best Computer Vision Use Cases for Solving Business Challenges4 Best Computer Vision Use Cases for Solving Business Challenges
4 Best Computer Vision Use Cases for Solving Business Challenges
 
ARTIFICIAL INTELLIGENCE.pptx
ARTIFICIAL INTELLIGENCE.pptxARTIFICIAL INTELLIGENCE.pptx
ARTIFICIAL INTELLIGENCE.pptx
 
ARTIFICIAL INTELLIGENT ( ITS / TASK 6 ) done by Wael Saad Hameedi / P71062
ARTIFICIAL INTELLIGENT ( ITS / TASK 6 ) done by Wael Saad Hameedi / P71062ARTIFICIAL INTELLIGENT ( ITS / TASK 6 ) done by Wael Saad Hameedi / P71062
ARTIFICIAL INTELLIGENT ( ITS / TASK 6 ) done by Wael Saad Hameedi / P71062
 

Último

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 

Optical Character Recognition (OCR)

  • 2. INTRO 1. Ingenious piece of software. 2. Involves the mechanical/electronic conversion of scanned images of typewritten/printed text into machine- encoded/computer-readable text. • 3. Heavily used in the industry.
  • 3. INTRO ii • Common method of digitizing printed texts • Subtle software which is as highly overlooked as it is simple. • Numerous applications and uses – editing, scanning, searching, comparison, compact storage and many more! • OCR is a field of research in pattern recognition, artificial intelligence and computer vision.
  • 4. Problem Statement Ever since Charles Babbage invented the computer back in the early 19th century, Computer machines have held man's imagination for numerous reasons - the primary being what all is this collection of nuts, bolts and wires capable of doing. Character Recognition is one such concept which has beheld mankind’s attention. There can be no greater testimony to the same than the fact that people were already working on this idea - a few decades before John McCarthy even coined the term "Artificial Intelligence". Today, especially, Character Recognition plays a very important part of our daily lives as they are incorporated so subtly that we even forget their presence. Some examples are their implementation in Microsoft Word, Adobe Acrobat and even Pen computing. Optical Character Recognition (OCR) is the mechanical or electronic conversion of scanned or photoed images of typewritten or printed text into machine-encoded/computer- readable text. This text can then be used in numerous ways - ranging from assisting the visually impaired (text-to-speech), extracting information from the image, pen computing and so on. Optical Character Recognition (OCR) is a result of cross-linking various avenues of technology like Machine Learning, Artificial Intelligence and Neural Networks. We propose to develop a system based on mathematical algorithms and principles which involve all the aforementioned technologies. That being said, Optical Character Recognition (OCR) also depends on a few other factors : the quality of the image taken, the orientation of and the dialect being used. Our paper aims to address the aforementioned problems, which enables its application in numerous new fields as well as the obvious & established aspects of our surroundings.
  • 5. Tech Jargon - I • Pre-processing Used to improve the successful recognition of the image (include De- skew, Layout analysis, Despeckle) • Character/glyph recognition • Post-processing • Application specific optimization Tweaking the system to better deal with specific or different inputs.
  • 6. Tech Jargon - II Segmentation Includes two important phases: 1) Obtaining training samples 2) Recognizing new images after training Feature Extraction Feature of the character are extracted and hence are compared with the glyph Classification After the extraction, neural network is trained using the training data
  • 7. Our Current Progress • We started with the Neural Networks / Machine Learning aspect of the project. • We have implemented Univariate / Multivariate Linear/Regularized Linear Regression, Gradient Descent for Multiple Variables and Logistic/ Regularized Logistic Regression. • Currently, we are studying & working on the implementation of Neural Nets using Forward Propogation. • We plan on tackling character segmentation and feature extraction next.
  • 8. Technology to be used • We are using the following technology platforms : – GNU Octave To develop and test the OCR software. – 5MP HD camera (720p @ 30fps) To take images for detection
  • 10. Literature Review • Microsoft One Note • Adobe PDF scanner • HP scanner

Notas del editor

  1. In 1914, Emanuel Goldberg developed a machine that read characters and converted them into standard telegraph code. Around the same time, Edmund Fournied'Albe developed the Otophone, a handheld scanner that when moved across a printed page, produced tones that corresponded to specific letters or characters.