SlideShare una empresa de Scribd logo
1 de 24
“A Novel Text detection System Based on
Character and Link Energies”
Presented by: Arun Patel
Roll No.: 15EC65R18
M.Tech 1st year VIPES, IIT Kharagpur
1
Algorithm
• This algorithm can detect most text object in various condition including different lightening,
different colors, complex background and low contrast text.
• This method is robust to the font, size, color and orientation of text and discriminate text object
from others effectively.
Fig(1) Algorithm 2
Initialization of Candidate Text Objects
 Localize the candidate Part
 Euler number
 Let 𝑣𝑖 and 𝑣𝑗 be two candidate parts with widths W 𝑣𝑖
and Wvj , heights Hvi and Hvj , and
centroids Cvi and Cvj ;
dist.(Cvi ,Cvj)≤wd .min(max(Wvi ,Hvi), max(Wvj ,Hvj))
 Finally, the candidate character parts that are reachable by one another via one or more links are
grouped to form a candidate text.
3
Fig.2 Initialization of candidate text objects 4
Character Features
• One important characteristic that discriminate text object from other object is that character are made
up of strokes that typically have approximately uniform thickness resulting in two near parallel edges
sets in their boundaries.
• Two edges sets have high similarities in length, orientation and curvature.
• Similarities of two stroke edges is captured by gradient vector of each point on the boundary.
Fig.3 (a) edge pairs of strokes (b)Gradient vectors of ‘R’ 5
• For a character, it has two near parallel edges sets and the gradients of an edge point and its
corresponding point should have approximately opposite direction.
• Distance between the points and their corresponding are similar because the change of stroke
width is usually small.
Fig.4 Corresponding pairs and links 6
Average Angle Difference of Corres. Pairs(Dangle)
• Let N denote the number of edge points of a candidate part. P(i)(1 ≤ i ≤ N) is the ith edge point with
the corresponding point P(i) corr .The difference of the gradient directions of the corresponding
pair (P(i) , P(i)
corr.) is defined as:
𝑑 𝑎𝑛𝑔𝑙𝑒
(𝑖)
=abs(𝜃 𝑝
(𝑖)
-𝜃 𝑝 𝑐𝑜𝑟𝑟
(𝑖)
)
• Dangle measures the average gradient direction difference of all corresponding pairs of a candidate
part.
• Dangle =
1
𝑁∙𝜋 𝑖=1
𝑁
𝑑 𝑎𝑛𝑔𝑙𝑒
(𝑖)
• For an ideal character Dangle reaches the maximum value 1.
7
Fraction of non-noise pair (Fnon-noise)
• In some cases, however, a character may have a smaller 𝑑 𝑎𝑛𝑔𝑙𝑒
(𝑖)
due to noise or deformations.
We compute Fnon−noise to measure the noise and deformation levels of a part based on d (i)
angle .
• Fnon-noise =
1
𝑁 𝑖=1
𝑁
ℎ(𝑑 𝑎𝑛𝑔𝑙𝑒
𝑖
, β)
• h(𝑑 𝑎𝑛𝑔𝑙𝑒
(𝑖) ,β)=1 if d (i)
angle >β
=0 else
• Fnon-noise is the fraction of all pairs for which the angle difference d(i)
angle is greater than β.
Fig5.Noise connections and non-noise connections Ref.(1) 8
R A C E
Dangle 0.889 0.865 0.925 0.897
Fnon-noise 0.754 0.684 0.897 0.806
Fig.6 Dangle and Fnon-noise Ref. (1)
9
• we divide the non-noise connections into two types: stroke-length connection and stroke-width
connection.
• By doing so, we can separate circle like objects and compute the feature vector of stroke width.
• Let k(i)(1 ≤ i ≤ N) be one of N non-noise connections of a part and have Ik
(i) intersections with other
non-noise connections. We define stroke-length connection and stroke-width as follows:
• K(i)∈ stroke−length connection, if (
Ik
(i)
𝑁
)> TIS
stroke−width connection, otherwise
• For circle, every connection intersects with all other connections at its center. Hence, all non-
noise connection of a circle are stroke length connection.
• Character have much more stroke-width connection than the non-characters.
10
Fig 7. Percentages of stroke-width links of two example images. Ref.(1)
11
Vector of Stroke Width (𝑉 𝑤𝑖𝑑𝑡ℎ)
• The vector of stroke width Vwidth is defined as: 𝑉 𝑤𝑖𝑑𝑡ℎ=[𝑤 𝑑
(1)
, 𝑤 𝑑
(2)
].
• Characters typically have one or two dominating stroke widths depending on their fonts.
• Then, we estimate dominating stroke-width w(i)
d through a weighted average computation using w(i)
p and its
two immediately adjacent neighbors:
𝑤 𝑑
(𝑖)
= r1×( 𝑤 𝑝
(𝑖)
−1)+ 𝑤 𝑝
(𝑖)
+r2× ( 𝑤 𝑝
(𝑖)
+1)
r1+1+r2
Fig.8 Histogram of the lengths of stroke width connections Ref. (1) 12
Character Energy
• For a part vi , we consider that its 𝐷 𝑎𝑛𝑔𝑙𝑒
(𝑖)
and 𝐹𝑛𝑜𝑛−𝑛𝑜𝑖𝑠𝑒
(𝑖)
are equally important for text detection
and define the character energy 𝐸𝑐ℎ𝑎𝑟
(𝑖)
of vi as follows:
𝐸𝑐ℎ𝑎𝑟
(𝑖)
=
𝐷 𝑎𝑛𝑔𝑙𝑒
(𝑖)
+𝐹𝑛𝑜𝑛−𝑛𝑜𝑖𝑠𝑒
(𝑖)
2
,0≤ 𝐸𝑐ℎ𝑎𝑟
(𝑖)
≤1.
• It can be treated as a measure of the probability that vi is a character.
• Character have larger Echar can discriminate text objects from other objects and it is robust to the
font,size,color and orientation of characters.
• 𝐷 𝑎𝑛𝑔𝑙𝑒
(𝑖)
and 𝐹𝑛𝑜𝑛−𝑛𝑜𝑖𝑠𝑒
(𝑖)
are correlated.
13
(a) (b)
Fig.9 two character with different noise/deformation levels Ref. (1) 14
𝑫 𝒂𝒏𝒈𝒍𝒆 𝑭 𝒏𝒐𝒏_𝒏𝒐𝒊𝒔𝒆 𝑬 𝒄𝒉𝒂𝒓
(a) 0.8846 0.5950 0.5950
(b) 0.8847 0.5261 0.7054
Fig.10 character energy Ref. (1) 15
Link energy
• Link energy is computed for every candidate link to measure the probability that two parts
connected by the link are both characters.
• Link energy is computed by measuring two values:
1. Similarity in the properties of neighboring parts, such as the color, stroke width, and size.
2.Spatial consistency in the direction and distance between neighboring parts in a string of parts.
• For two connected parts vi and vj ,we use color, stroke width(Vwidth),character width, and character
height to capture similarities between them.
𝐸𝐿𝑖𝑛𝑘
(𝑖,𝑗)
=
1
4 𝑘=1
4
(𝑤 𝑘.𝑠𝑖,𝑗
(𝑘)
) 𝑤 𝑘= 0.25
• Higher the 𝐸𝐿𝑖𝑛𝑘
(𝑖,𝑗)
higher the similarities between two parts.
16
Similarity Computation Of two Character
Fig.11 Link energy Ref.(1) 17
colour 𝑆𝑖,𝑗
(1)
=
1
3 (𝐶=𝑅,𝐺,𝐵)(1 −
|Ci−Cj|
255
)
Vwidth
𝑆𝑖,𝑗
(2)
=
1
2 𝑘=1
𝑘=2
𝑆𝑖𝑚𝑖 𝑅𝑖,𝑗
𝑣
𝑘 , 𝑅𝑖,𝑗
𝑣
=
Vi (k)
Vj (k)
Character
width
𝑆𝑖,𝑗
(3)
= 𝑆𝑖𝑚𝑖 𝑅𝑖,𝑗
𝑤
, 𝑅𝑖,𝑗
𝑤
=
𝑊𝑖
𝑊 𝑗
Character
Height
𝑆𝑖,𝑗
(4)
=Simi(𝑅𝑖,𝑗
(𝐻)
), 𝑅𝑖,𝑗
𝑤
=
𝐻𝑖
𝐻 𝑗
Text Unit Energy
• For the text unit containing two parts vi and v j , the text unit energ 𝐸𝑡𝑒𝑥𝑡
(𝑖,𝑗)
is computed using
character energies 𝐸𝑐ℎ𝑎𝑟
(𝑖)
, 𝐸𝑐ℎ𝑎𝑟
(𝑗)
and link energy 𝐸𝑙𝑖𝑛𝑘
(𝑖,𝑗)
:
• 𝐸𝑡𝑒𝑥𝑡
(𝑖,𝑗)
=
1
2
[(
𝐸 𝑐ℎ𝑎𝑟
(𝑖)
+ 𝐸 𝑐ℎ𝑎𝑟
(𝑗)
2
)+ 𝐸𝑙𝑖𝑛𝑘
(𝑖,𝑗)
]
• To refine the detected text objects, text units whose text unit energies are smaller than a pre-
defined threshold Ttext are removed from the text objects.
• choice of this threshold depends upon the characteristic of the datasets, a threshold of of 0.7
worked well for several datasets used for testing this algorithm.
18
Fig.12 Text energy Ref(1) 19
Fig.13 Threshold Etext and txt detection outputs Ref(1) 20
Result on ICDAR 2003/2005 Dataset Objects
Fig.14 experimental outputs Ref.(2) 21
Evalution Results
Algorithm Precision
Ashida 0.55
Hinneck Becker 0.62
SWT 0.73
Novel Text detection 0.74
22
Recall
0.46
0.67
0.60
0.69
References:
• [1] Jing Zhang and Rangachar Kasturi ,“A novel text detection system based on characters
and link energies”, image processing, IEEE trans., vol.23, No.9, pp.4187-4198, September 2014.
• [2] S.M.Lucas, A. Panaretos, L.Sosa,A. Tang, S. Wong, and R. Young, “ICDAR 2003 robust reading
competitions”, in Proc. 7th Int. Conf. Document And Recognit.,vol.2,pp. 682,2003.
• [3]D,.Marr and Hildreth, “Theory of edge detection,” Proc.Roy.Soc. London B,vol.
207,No.1167,pp. 187-217,1980.
23
24
Thank You

Más contenido relacionado

La actualidad más candente

IRJET- Devnagari Text Detection
IRJET- Devnagari Text DetectionIRJET- Devnagari Text Detection
IRJET- Devnagari Text DetectionIRJET Journal
 
Texture features based text extraction from images using DWT and K-means clus...
Texture features based text extraction from images using DWT and K-means clus...Texture features based text extraction from images using DWT and K-means clus...
Texture features based text extraction from images using DWT and K-means clus...Divya Gera
 
Text Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text RegionsText Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text RegionsIJCSIS Research Publications
 
Self-Directing Text Detection and Removal from Images with Smoothing
Self-Directing Text Detection and Removal from Images with SmoothingSelf-Directing Text Detection and Removal from Images with Smoothing
Self-Directing Text Detection and Removal from Images with SmoothingPriyanka Wagh
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character RecognitionNitin Vishwari
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character Recognitionaavi241
 
Scene text recognition in mobile applications by character descriptor and str...
Scene text recognition in mobile applications by character descriptor and str...Scene text recognition in mobile applications by character descriptor and str...
Scene text recognition in mobile applications by character descriptor and str...eSAT Journals
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image RetrievalRayan Dasoriya
 
Path Finding Solutions For Grid Based Graph
Path Finding Solutions For Grid Based GraphPath Finding Solutions For Grid Based Graph
Path Finding Solutions For Grid Based Graphacijjournal
 
Cbir final ppt
Cbir final pptCbir final ppt
Cbir final pptrinki nag
 
AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...
AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...
AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...ijiert bestjournal
 
IRJET- Object Detection using Hausdorff Distance
IRJET-  	  Object Detection using Hausdorff DistanceIRJET-  	  Object Detection using Hausdorff Distance
IRJET- Object Detection using Hausdorff DistanceIRJET Journal
 
Tracking of objects with known color signature - ELITECH 20
Tracking of objects with known color signature - ELITECH 20Tracking of objects with known color signature - ELITECH 20
Tracking of objects with known color signature - ELITECH 20Lukas Tencer
 

La actualidad más candente (20)

IRJET- Devnagari Text Detection
IRJET- Devnagari Text DetectionIRJET- Devnagari Text Detection
IRJET- Devnagari Text Detection
 
C04741319
C04741319C04741319
C04741319
 
Texture features based text extraction from images using DWT and K-means clus...
Texture features based text extraction from images using DWT and K-means clus...Texture features based text extraction from images using DWT and K-means clus...
Texture features based text extraction from images using DWT and K-means clus...
 
40120140501009
4012014050100940120140501009
40120140501009
 
Ay32333339
Ay32333339Ay32333339
Ay32333339
 
Text Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text RegionsText Extraction System by Eliminating Non-Text Regions
Text Extraction System by Eliminating Non-Text Regions
 
Self-Directing Text Detection and Removal from Images with Smoothing
Self-Directing Text Detection and Removal from Images with SmoothingSelf-Directing Text Detection and Removal from Images with Smoothing
Self-Directing Text Detection and Removal from Images with Smoothing
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character Recognition
 
F045053236
F045053236F045053236
F045053236
 
Optical Character Recognition
Optical Character RecognitionOptical Character Recognition
Optical Character Recognition
 
Scene text recognition in mobile applications by character descriptor and str...
Scene text recognition in mobile applications by character descriptor and str...Scene text recognition in mobile applications by character descriptor and str...
Scene text recognition in mobile applications by character descriptor and str...
 
Text Detection and Recognition
Text Detection and RecognitionText Detection and Recognition
Text Detection and Recognition
 
Content Based Image Retrieval
Content Based Image RetrievalContent Based Image Retrieval
Content Based Image Retrieval
 
Path Finding Solutions For Grid Based Graph
Path Finding Solutions For Grid Based GraphPath Finding Solutions For Grid Based Graph
Path Finding Solutions For Grid Based Graph
 
Cbir final ppt
Cbir final pptCbir final ppt
Cbir final ppt
 
AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...
AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...
AN IMPLEMENTATION OF ADAPTIVE PROPAGATION-BASED COLOR SAMPLING FOR IMAGE MATT...
 
IRJET- Object Detection using Hausdorff Distance
IRJET-  	  Object Detection using Hausdorff DistanceIRJET-  	  Object Detection using Hausdorff Distance
IRJET- Object Detection using Hausdorff Distance
 
Ac03401600163.
Ac03401600163.Ac03401600163.
Ac03401600163.
 
Sentence generation
Sentence generationSentence generation
Sentence generation
 
Tracking of objects with known color signature - ELITECH 20
Tracking of objects with known color signature - ELITECH 20Tracking of objects with known color signature - ELITECH 20
Tracking of objects with known color signature - ELITECH 20
 

Destacado

[RakutenTechConf2013] [C4-1] Text detection in product images
[RakutenTechConf2013] [C4-1] Text detection in product images[RakutenTechConf2013] [C4-1] Text detection in product images
[RakutenTechConf2013] [C4-1] Text detection in product imagesRakuten Group, Inc.
 
Optical Character Recognition (OCR)
Optical Character Recognition (OCR)Optical Character Recognition (OCR)
Optical Character Recognition (OCR)Vidyut Singhania
 
See the Future, See the Stars Rakuten Institute of Technology
See the Future, See the Stars Rakuten Institute of TechnologySee the Future, See the Stars Rakuten Institute of Technology
See the Future, See the Stars Rakuten Institute of TechnologyRakuten Group, Inc.
 
Introduction to homography
Introduction to homographyIntroduction to homography
Introduction to homographyShih-Hsiang Lin
 
Presentation iwssip2012
Presentation iwssip2012Presentation iwssip2012
Presentation iwssip2012Bernhard Quehl
 
PPT Image Analysis(IRDE, DRDO)
PPT Image Analysis(IRDE, DRDO)PPT Image Analysis(IRDE, DRDO)
PPT Image Analysis(IRDE, DRDO)Nidhi Gopal
 
Scene Text Detection on Images using Cellular Automata
Scene Text Detection on Images using Cellular AutomataScene Text Detection on Images using Cellular Automata
Scene Text Detection on Images using Cellular AutomataKonstantinos Zagoris
 
text detection and reading of product label for blind persons
 text detection and reading of product label for blind persons text detection and reading of product label for blind persons
text detection and reading of product label for blind personsVivek Chamorshikar
 
Emotion detection from text using data mining and text mining
Emotion detection from text using data mining and text miningEmotion detection from text using data mining and text mining
Emotion detection from text using data mining and text miningSakthi Dasans
 
Optical Character Recognition( OCR )
Optical Character Recognition( OCR )Optical Character Recognition( OCR )
Optical Character Recognition( OCR )Karan Panjwani
 
optical character recognition system
optical character recognition systemoptical character recognition system
optical character recognition systemVijay Apurva
 
PORTABLE CAMERA-BASED ASSISTIVE TEXT AND PRODUCT LABEL READING FROM HAND- H...
PORTABLE CAMERA-BASED  ASSISTIVE TEXT AND PRODUCT  LABEL READING FROM HAND- H...PORTABLE CAMERA-BASED  ASSISTIVE TEXT AND PRODUCT  LABEL READING FROM HAND- H...
PORTABLE CAMERA-BASED ASSISTIVE TEXT AND PRODUCT LABEL READING FROM HAND- H...Sathmica K
 

Destacado (13)

[RakutenTechConf2013] [C4-1] Text detection in product images
[RakutenTechConf2013] [C4-1] Text detection in product images[RakutenTechConf2013] [C4-1] Text detection in product images
[RakutenTechConf2013] [C4-1] Text detection in product images
 
Optical Character Recognition (OCR)
Optical Character Recognition (OCR)Optical Character Recognition (OCR)
Optical Character Recognition (OCR)
 
See the Future, See the Stars Rakuten Institute of Technology
See the Future, See the Stars Rakuten Institute of TechnologySee the Future, See the Stars Rakuten Institute of Technology
See the Future, See the Stars Rakuten Institute of Technology
 
Introduction to homography
Introduction to homographyIntroduction to homography
Introduction to homography
 
Mob ocr
Mob ocrMob ocr
Mob ocr
 
Presentation iwssip2012
Presentation iwssip2012Presentation iwssip2012
Presentation iwssip2012
 
PPT Image Analysis(IRDE, DRDO)
PPT Image Analysis(IRDE, DRDO)PPT Image Analysis(IRDE, DRDO)
PPT Image Analysis(IRDE, DRDO)
 
Scene Text Detection on Images using Cellular Automata
Scene Text Detection on Images using Cellular AutomataScene Text Detection on Images using Cellular Automata
Scene Text Detection on Images using Cellular Automata
 
text detection and reading of product label for blind persons
 text detection and reading of product label for blind persons text detection and reading of product label for blind persons
text detection and reading of product label for blind persons
 
Emotion detection from text using data mining and text mining
Emotion detection from text using data mining and text miningEmotion detection from text using data mining and text mining
Emotion detection from text using data mining and text mining
 
Optical Character Recognition( OCR )
Optical Character Recognition( OCR )Optical Character Recognition( OCR )
Optical Character Recognition( OCR )
 
optical character recognition system
optical character recognition systemoptical character recognition system
optical character recognition system
 
PORTABLE CAMERA-BASED ASSISTIVE TEXT AND PRODUCT LABEL READING FROM HAND- H...
PORTABLE CAMERA-BASED  ASSISTIVE TEXT AND PRODUCT  LABEL READING FROM HAND- H...PORTABLE CAMERA-BASED  ASSISTIVE TEXT AND PRODUCT  LABEL READING FROM HAND- H...
PORTABLE CAMERA-BASED ASSISTIVE TEXT AND PRODUCT LABEL READING FROM HAND- H...
 

Similar a Text Detection From Image

Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...Tanjarul Islam Mishu
 
Lecture 5b graphs and hashing
Lecture 5b graphs and hashingLecture 5b graphs and hashing
Lecture 5b graphs and hashingVictor Palmar
 
240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptx
240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptx240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptx
240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptxthanhdowork
 
Bt9301, computer graphics
Bt9301, computer graphicsBt9301, computer graphics
Bt9301, computer graphicssmumbahelp
 
Lecture 2.3.1 Graph.pptx
Lecture 2.3.1 Graph.pptxLecture 2.3.1 Graph.pptx
Lecture 2.3.1 Graph.pptxking779879
 
Applications of graphs
Applications of graphsApplications of graphs
Applications of graphsTech_MX
 
Thesis : "IBBET : In Band Bandwidth Estimation for LAN"
Thesis : "IBBET : In Band Bandwidth Estimation for LAN"Thesis : "IBBET : In Band Bandwidth Estimation for LAN"
Thesis : "IBBET : In Band Bandwidth Estimation for LAN"Vishalkumarec
 
Semantic Data Management in Graph Databases
Semantic Data Management in Graph DatabasesSemantic Data Management in Graph Databases
Semantic Data Management in Graph DatabasesMaribel Acosta Deibe
 
Distributed coloring with O(sqrt. log n) bits
Distributed coloring with O(sqrt. log n) bitsDistributed coloring with O(sqrt. log n) bits
Distributed coloring with O(sqrt. log n) bitsSubhajit Sahu
 
FACE RECOGNITION ALGORITHM BASED ON ORIENTATION HISTOGRAM OF HOUGH PEAKS
FACE RECOGNITION ALGORITHM BASED ON ORIENTATION HISTOGRAM OF HOUGH PEAKSFACE RECOGNITION ALGORITHM BASED ON ORIENTATION HISTOGRAM OF HOUGH PEAKS
FACE RECOGNITION ALGORITHM BASED ON ORIENTATION HISTOGRAM OF HOUGH PEAKSijaia
 
Full n final prjct
Full n final prjctFull n final prjct
Full n final prjctpunu2602
 
Vectorise all the things
Vectorise all the thingsVectorise all the things
Vectorise all the thingsJodieBurchell1
 

Similar a Text Detection From Image (20)

Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
Dynamic time wrapping (dtw), vector quantization(vq), linear predictive codin...
 
Entropy based measures for graphs
Entropy based measures for graphsEntropy based measures for graphs
Entropy based measures for graphs
 
Lecture 5b graphs and hashing
Lecture 5b graphs and hashingLecture 5b graphs and hashing
Lecture 5b graphs and hashing
 
Data Mining Lecture_9.pptx
Data Mining Lecture_9.pptxData Mining Lecture_9.pptx
Data Mining Lecture_9.pptx
 
poster
posterposter
poster
 
240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptx
240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptx240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptx
240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptx
 
Bt9301, computer graphics
Bt9301, computer graphicsBt9301, computer graphics
Bt9301, computer graphics
 
Lecture 2.3.1 Graph.pptx
Lecture 2.3.1 Graph.pptxLecture 2.3.1 Graph.pptx
Lecture 2.3.1 Graph.pptx
 
Applications of graphs
Applications of graphsApplications of graphs
Applications of graphs
 
Thesis : "IBBET : In Band Bandwidth Estimation for LAN"
Thesis : "IBBET : In Band Bandwidth Estimation for LAN"Thesis : "IBBET : In Band Bandwidth Estimation for LAN"
Thesis : "IBBET : In Band Bandwidth Estimation for LAN"
 
VoxelNet
VoxelNetVoxelNet
VoxelNet
 
NLP_KASHK:Minimum Edit Distance
NLP_KASHK:Minimum Edit DistanceNLP_KASHK:Minimum Edit Distance
NLP_KASHK:Minimum Edit Distance
 
Semantic Data Management in Graph Databases
Semantic Data Management in Graph DatabasesSemantic Data Management in Graph Databases
Semantic Data Management in Graph Databases
 
Graph 1
Graph 1Graph 1
Graph 1
 
Distributed coloring with O(sqrt. log n) bits
Distributed coloring with O(sqrt. log n) bitsDistributed coloring with O(sqrt. log n) bits
Distributed coloring with O(sqrt. log n) bits
 
Buffon's needle exercises
Buffon's needle exercisesBuffon's needle exercises
Buffon's needle exercises
 
FACE RECOGNITION ALGORITHM BASED ON ORIENTATION HISTOGRAM OF HOUGH PEAKS
FACE RECOGNITION ALGORITHM BASED ON ORIENTATION HISTOGRAM OF HOUGH PEAKSFACE RECOGNITION ALGORITHM BASED ON ORIENTATION HISTOGRAM OF HOUGH PEAKS
FACE RECOGNITION ALGORITHM BASED ON ORIENTATION HISTOGRAM OF HOUGH PEAKS
 
Full n final prjct
Full n final prjctFull n final prjct
Full n final prjct
 
Vectorise all the things
Vectorise all the thingsVectorise all the things
Vectorise all the things
 
06 Vector Visualization
06 Vector Visualization06 Vector Visualization
06 Vector Visualization
 

Último

Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTbhaskargani46
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...SUHANI PANDEY
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...Call Girls in Nagpur High Profile
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueBhangaleSonal
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLManishPatel169454
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdfSuman Jyoti
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756dollysharma2066
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptDineshKumar4165
 

Último (20)

Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
VIP Model Call Girls Kothrud ( Pune ) Call ON 8005736733 Starting From 5K to ...
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
NFPA 5000 2024 standard .
NFPA 5000 2024 standard                                  .NFPA 5000 2024 standard                                  .
NFPA 5000 2024 standard .
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Double Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torqueDouble Revolving field theory-how the rotor develops torque
Double Revolving field theory-how the rotor develops torque
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
 
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELLPVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
PVC VS. FIBERGLASS (FRP) GRAVITY SEWER - UNI BELL
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 

Text Detection From Image

  • 1. “A Novel Text detection System Based on Character and Link Energies” Presented by: Arun Patel Roll No.: 15EC65R18 M.Tech 1st year VIPES, IIT Kharagpur 1
  • 2. Algorithm • This algorithm can detect most text object in various condition including different lightening, different colors, complex background and low contrast text. • This method is robust to the font, size, color and orientation of text and discriminate text object from others effectively. Fig(1) Algorithm 2
  • 3. Initialization of Candidate Text Objects  Localize the candidate Part  Euler number  Let 𝑣𝑖 and 𝑣𝑗 be two candidate parts with widths W 𝑣𝑖 and Wvj , heights Hvi and Hvj , and centroids Cvi and Cvj ; dist.(Cvi ,Cvj)≤wd .min(max(Wvi ,Hvi), max(Wvj ,Hvj))  Finally, the candidate character parts that are reachable by one another via one or more links are grouped to form a candidate text. 3
  • 4. Fig.2 Initialization of candidate text objects 4
  • 5. Character Features • One important characteristic that discriminate text object from other object is that character are made up of strokes that typically have approximately uniform thickness resulting in two near parallel edges sets in their boundaries. • Two edges sets have high similarities in length, orientation and curvature. • Similarities of two stroke edges is captured by gradient vector of each point on the boundary. Fig.3 (a) edge pairs of strokes (b)Gradient vectors of ‘R’ 5
  • 6. • For a character, it has two near parallel edges sets and the gradients of an edge point and its corresponding point should have approximately opposite direction. • Distance between the points and their corresponding are similar because the change of stroke width is usually small. Fig.4 Corresponding pairs and links 6
  • 7. Average Angle Difference of Corres. Pairs(Dangle) • Let N denote the number of edge points of a candidate part. P(i)(1 ≤ i ≤ N) is the ith edge point with the corresponding point P(i) corr .The difference of the gradient directions of the corresponding pair (P(i) , P(i) corr.) is defined as: 𝑑 𝑎𝑛𝑔𝑙𝑒 (𝑖) =abs(𝜃 𝑝 (𝑖) -𝜃 𝑝 𝑐𝑜𝑟𝑟 (𝑖) ) • Dangle measures the average gradient direction difference of all corresponding pairs of a candidate part. • Dangle = 1 𝑁∙𝜋 𝑖=1 𝑁 𝑑 𝑎𝑛𝑔𝑙𝑒 (𝑖) • For an ideal character Dangle reaches the maximum value 1. 7
  • 8. Fraction of non-noise pair (Fnon-noise) • In some cases, however, a character may have a smaller 𝑑 𝑎𝑛𝑔𝑙𝑒 (𝑖) due to noise or deformations. We compute Fnon−noise to measure the noise and deformation levels of a part based on d (i) angle . • Fnon-noise = 1 𝑁 𝑖=1 𝑁 ℎ(𝑑 𝑎𝑛𝑔𝑙𝑒 𝑖 , β) • h(𝑑 𝑎𝑛𝑔𝑙𝑒 (𝑖) ,β)=1 if d (i) angle >β =0 else • Fnon-noise is the fraction of all pairs for which the angle difference d(i) angle is greater than β. Fig5.Noise connections and non-noise connections Ref.(1) 8
  • 9. R A C E Dangle 0.889 0.865 0.925 0.897 Fnon-noise 0.754 0.684 0.897 0.806 Fig.6 Dangle and Fnon-noise Ref. (1) 9
  • 10. • we divide the non-noise connections into two types: stroke-length connection and stroke-width connection. • By doing so, we can separate circle like objects and compute the feature vector of stroke width. • Let k(i)(1 ≤ i ≤ N) be one of N non-noise connections of a part and have Ik (i) intersections with other non-noise connections. We define stroke-length connection and stroke-width as follows: • K(i)∈ stroke−length connection, if ( Ik (i) 𝑁 )> TIS stroke−width connection, otherwise • For circle, every connection intersects with all other connections at its center. Hence, all non- noise connection of a circle are stroke length connection. • Character have much more stroke-width connection than the non-characters. 10
  • 11. Fig 7. Percentages of stroke-width links of two example images. Ref.(1) 11
  • 12. Vector of Stroke Width (𝑉 𝑤𝑖𝑑𝑡ℎ) • The vector of stroke width Vwidth is defined as: 𝑉 𝑤𝑖𝑑𝑡ℎ=[𝑤 𝑑 (1) , 𝑤 𝑑 (2) ]. • Characters typically have one or two dominating stroke widths depending on their fonts. • Then, we estimate dominating stroke-width w(i) d through a weighted average computation using w(i) p and its two immediately adjacent neighbors: 𝑤 𝑑 (𝑖) = r1×( 𝑤 𝑝 (𝑖) −1)+ 𝑤 𝑝 (𝑖) +r2× ( 𝑤 𝑝 (𝑖) +1) r1+1+r2 Fig.8 Histogram of the lengths of stroke width connections Ref. (1) 12
  • 13. Character Energy • For a part vi , we consider that its 𝐷 𝑎𝑛𝑔𝑙𝑒 (𝑖) and 𝐹𝑛𝑜𝑛−𝑛𝑜𝑖𝑠𝑒 (𝑖) are equally important for text detection and define the character energy 𝐸𝑐ℎ𝑎𝑟 (𝑖) of vi as follows: 𝐸𝑐ℎ𝑎𝑟 (𝑖) = 𝐷 𝑎𝑛𝑔𝑙𝑒 (𝑖) +𝐹𝑛𝑜𝑛−𝑛𝑜𝑖𝑠𝑒 (𝑖) 2 ,0≤ 𝐸𝑐ℎ𝑎𝑟 (𝑖) ≤1. • It can be treated as a measure of the probability that vi is a character. • Character have larger Echar can discriminate text objects from other objects and it is robust to the font,size,color and orientation of characters. • 𝐷 𝑎𝑛𝑔𝑙𝑒 (𝑖) and 𝐹𝑛𝑜𝑛−𝑛𝑜𝑖𝑠𝑒 (𝑖) are correlated. 13
  • 14. (a) (b) Fig.9 two character with different noise/deformation levels Ref. (1) 14 𝑫 𝒂𝒏𝒈𝒍𝒆 𝑭 𝒏𝒐𝒏_𝒏𝒐𝒊𝒔𝒆 𝑬 𝒄𝒉𝒂𝒓 (a) 0.8846 0.5950 0.5950 (b) 0.8847 0.5261 0.7054
  • 15. Fig.10 character energy Ref. (1) 15
  • 16. Link energy • Link energy is computed for every candidate link to measure the probability that two parts connected by the link are both characters. • Link energy is computed by measuring two values: 1. Similarity in the properties of neighboring parts, such as the color, stroke width, and size. 2.Spatial consistency in the direction and distance between neighboring parts in a string of parts. • For two connected parts vi and vj ,we use color, stroke width(Vwidth),character width, and character height to capture similarities between them. 𝐸𝐿𝑖𝑛𝑘 (𝑖,𝑗) = 1 4 𝑘=1 4 (𝑤 𝑘.𝑠𝑖,𝑗 (𝑘) ) 𝑤 𝑘= 0.25 • Higher the 𝐸𝐿𝑖𝑛𝑘 (𝑖,𝑗) higher the similarities between two parts. 16
  • 17. Similarity Computation Of two Character Fig.11 Link energy Ref.(1) 17 colour 𝑆𝑖,𝑗 (1) = 1 3 (𝐶=𝑅,𝐺,𝐵)(1 − |Ci−Cj| 255 ) Vwidth 𝑆𝑖,𝑗 (2) = 1 2 𝑘=1 𝑘=2 𝑆𝑖𝑚𝑖 𝑅𝑖,𝑗 𝑣 𝑘 , 𝑅𝑖,𝑗 𝑣 = Vi (k) Vj (k) Character width 𝑆𝑖,𝑗 (3) = 𝑆𝑖𝑚𝑖 𝑅𝑖,𝑗 𝑤 , 𝑅𝑖,𝑗 𝑤 = 𝑊𝑖 𝑊 𝑗 Character Height 𝑆𝑖,𝑗 (4) =Simi(𝑅𝑖,𝑗 (𝐻) ), 𝑅𝑖,𝑗 𝑤 = 𝐻𝑖 𝐻 𝑗
  • 18. Text Unit Energy • For the text unit containing two parts vi and v j , the text unit energ 𝐸𝑡𝑒𝑥𝑡 (𝑖,𝑗) is computed using character energies 𝐸𝑐ℎ𝑎𝑟 (𝑖) , 𝐸𝑐ℎ𝑎𝑟 (𝑗) and link energy 𝐸𝑙𝑖𝑛𝑘 (𝑖,𝑗) : • 𝐸𝑡𝑒𝑥𝑡 (𝑖,𝑗) = 1 2 [( 𝐸 𝑐ℎ𝑎𝑟 (𝑖) + 𝐸 𝑐ℎ𝑎𝑟 (𝑗) 2 )+ 𝐸𝑙𝑖𝑛𝑘 (𝑖,𝑗) ] • To refine the detected text objects, text units whose text unit energies are smaller than a pre- defined threshold Ttext are removed from the text objects. • choice of this threshold depends upon the characteristic of the datasets, a threshold of of 0.7 worked well for several datasets used for testing this algorithm. 18
  • 19. Fig.12 Text energy Ref(1) 19
  • 20. Fig.13 Threshold Etext and txt detection outputs Ref(1) 20
  • 21. Result on ICDAR 2003/2005 Dataset Objects Fig.14 experimental outputs Ref.(2) 21
  • 22. Evalution Results Algorithm Precision Ashida 0.55 Hinneck Becker 0.62 SWT 0.73 Novel Text detection 0.74 22 Recall 0.46 0.67 0.60 0.69
  • 23. References: • [1] Jing Zhang and Rangachar Kasturi ,“A novel text detection system based on characters and link energies”, image processing, IEEE trans., vol.23, No.9, pp.4187-4198, September 2014. • [2] S.M.Lucas, A. Panaretos, L.Sosa,A. Tang, S. Wong, and R. Young, “ICDAR 2003 robust reading competitions”, in Proc. 7th Int. Conf. Document And Recognit.,vol.2,pp. 682,2003. • [3]D,.Marr and Hildreth, “Theory of edge detection,” Proc.Roy.Soc. London B,vol. 207,No.1167,pp. 187-217,1980. 23