SlideShare una empresa de Scribd logo
1 de 7
Descargar para leer sin conexión
Short Paper
ACEEE Int. J. on Signal & Image Processing, Vol. 4, No. 1, Jan 2013

Text-Image Separation in Document Images using
Boundary/Perimeter Detection
Priti P. Rege1, Chanchal A. Chandrakar2
College of Engineering,, Pune, India.
Email: 1 ppr.extc@coep.ac.in
2
chanch.chandra@gmail.com
bottom-up Approaches are used for page segmentation and
block identification [5], [7]. Yuan, Tan [2] designed method
that makes use of edge information to extract textual blocks
from gray scale document images. It aims at detecting only
textual regions on heavy noise infected newspaper images
and separate them from graphical regions. The White Tiles
Approach [3] described new approaches to page segmentation
and classification. In this method, once the white tiles of each
region have been gathered together and their total area is
estimated, and regions are classified as text or images. George
Nagy, Mukkai Krishnamoorthy [4] have proposed two
complementary methods for characterizing the spatial structure
of digitized technical documents and labeling various logical
components without using optical character recognition.
Projection profile method [6], [8] is used for separating the
text and images, which is only suitable for Devanagari
Documents (Hindi document). The main disadvantage of this
method is that the irregular shaped images with nonrectangular shaped text blocks may result in loss of some text.
They can be dealt with by adapting algorithms available for
Roman script. Kuo-Chin Fan, Chi-Hwa Liu, Yuan-Kai Wang
[9] have implemented a feature-based document analysis
system which utilizes domain knowledge to segment and
classify mixed text/graphics/image documents. This method
is only suitable for pure text or image document, i.e. a document
which has only text region or image region. This method is
good for text-image identification not for extraction. The
Constrained Run-Length Algorithm (CRLA) [10] is a wellknown technique for page segmentation. The algorithm is
very efficient for partitioning documents with Manhattan
layouts but not suited to deal with complex layout pages, e.g.
irregular graphics embedded in a text paragraph. Its main
drawback is the use of only local information during the
smearing stage, which may lead to erroneous linkage of text
and graphics. Kuo-Chin Fan, Liang-Shen Wang, Yuan-Kai
Wang [11] proposed an intelligent document analysis system
to achieve the document segmentation and identification goal.
The proposed system consists of two modules: block
segmentation and block identification. Two kinds of features,
connectivity histogram and multi resolution features are
extracted.

Abstract—Document analysis plays an important role in office
automation, especially in intelligent signal processing. The
proposed system consists of two modules: block segmentation
and block identification. In this approach, first a document is
segmented into several non-overlapping blocks by utilizing a
novel recursive segmentation technique, and then extracts
the features embedded in each segmented block are extracted.
Two kinds of features, connected components and image
boundary/perimeter features are extracted. Document with
text inside image pose limitations in earlier reported literature.
This is taken care of by applying additional pass of the Run
Length Smearing on the extracted image that contains text.
Proposed scheme is independent of type and language of the
document.
Index Terms— OCR, Segmantation, Connected components.

I. INTRODUCTION
Work in the field of document image processing covers
many different areas including preprocessing, layout analysis,
optical character recognition, graphics analysis, form
processing, and writer identification. Text parts are the main
information carriers in most applications. For that purpose, it
is necessary to locate text objects within the image, recognize
them, and extract the hidden information. The documents
may contain, besides text, graphics and images that overlap.
Since text lines are not always horizontally aligned, finding
text parts and locating characters, words, and lines are not
trivial tasks. Due to the tremendous reduction in the storage
space of the processed results, it is advantageous to
reproduce, transmit, and store the document in the processed
form. The extracted regions can then be processed by a
subsequent step according to their types, e.g. OCR for text
regions and compression for graphics and halftone images.
Several methods have been proposed to separate text
and image or text and graphics from the background. Two
broad categories of available methods are Connected
Component based (CC) and texture based algorithms. The
first category segments the image into a set of CCs and then
classifies each CC as either text or non text. CC based
algorithms are relatively simple The underlying assumption
is that texts in document images can be seen as sets of
separating connected components each of which has distinct
intensity or color distribution and linked edge contours. Up
to now several strategies have been tried to solve the problem
of segmentation. Techniques for page segmentation and
layout analysis are broadly divided in to three main categories:
top-down, bottom-up and hybrid techniques [1]. Many
© 2013 ACEEE
DOI: 01.IJSIP.4.1.1044

II. METHODOLOGY
A. Document Segmentation
The main aim of document segmentation is to segment a
document into several separate blocks with each block
29
Short Paper
ACEEE Int. J. on Signal & Image Processing, Vol. 4, No. 1, Jan 2013
representing one type of medium. The method we adopted in
segmenting documents is a combination of run-length
smearing algorithm and boundary perimeter detection
procedure. It first performs the smearing operation in both
the horizontal and vertical directions to generate blocks. Due
to the selection of an improper threshold in the run-length
smearing process, an intact paragraph might be segmented
into several consecutive horizontal stripes with each stripe
representing one horizontal text line. The stripe merging
procedure is thus devised to merge those text stripes that
belong to the same paragraph. The merit is that it generates
an intact text block instead of several smaller text stripes. By
this way, it not only saves storage space but also reduces
the burden in performing optical character recognition.

Figure 2. Image after horizontal run length smearing

B. Pre-processing
Preprocessing of document images is the way of using
mature image processing techniques to improve the quality
of images. Its purpose is to enhance and extract useful
information of images for later processing purposes. Two
preprocessing tasks, thresholding and noise removal, are
performed here.

Document image is usually captured as colored image (RGB)
and we convert it into Gray level image. image. Hence, a
binarization procedure converting a gray-level image into a
binary image is necessary. OTSU Document image is usually
captured as colored image (RGB) and we convert it into Gray
level image. Thresholding is utilized to accomplish the
conversion task. Figure 1, shows the original and binarized
image.
C. Block Extraction
With the pre-processing being done, a binarized image is
then obtained. In this binarized image, each meaningful block
which can be easily recognized by human beings is composed
of pixels. RLSA is an operation to connect two nonadjacent
runs into one merged run if the distance between these two
runs is smaller than a threshold.
It is assumed that white pixels are represented by 0 and
black pixels by 1. If the number of 0 between l’s is less than or
equal to a constant C, then 0 is replaced by 1. In other words,
two runs having distance smaller than the threshold C will be
merged into one run. Example
Before smearing:
1111111000001111111100011
After smearing:
1111111000001111111111111
The result generated by the smearing operation is several
l-runs in each horizontal row.
Two constants CHT (Constant Horizontal threshold
value) and CVT (Constant Vertical threshold value) are needed
due to the processing of Run Length Smearing algorithm
(RLSA) in two directions. CHT selected as half of CVT. Figure
2 and 3 illustrate the operation of RLSA in two directions i.e.
horizontal and vertical directions. The results generated by
horizontal and vertical smearing operations are then combined
by logical AND operation to produce the meaningful blocks.
In other words, a pixel in the resulting image is black if the
pixels at the same position of both vertical and horizontal
Run-length smearing images are black. Though run length
smearing can combine related pixels into meaningful blocks,
there are still some small blocks which should be further
combined. We repeatedly apply run length smearing with
increased runlengths to the output of AND operation to merge
these blochs. An image illustrating the result generated by

Figure 1. Original color image and Binarized image

© 2013 ACEEE
DOI: 01.IJSIP.4.1.1044

30
Short Paper
ACEEE Int. J. on Signal & Image Processing, Vol. 4, No. 1, Jan 2013
RLSA is shown in Figure 4. Result of next step, to detect
boundary after reapplying horizontal run length smearing,
is shown in Figure 5.
D. Boundary/Perimeter Detection
Since we have obtained meaningful blocks, the next step
is to find the outside boundary of smearing blocks. Outside
boundary is defined as the white pixels surrounding the
extracted boundary of a block. The traditional edge detection
methods are not adequate for detecting this kind of boundary.
For example, if traditional edge detection methods are
operated on a one-pixel-width line block open contour will be
generated.

Figure 6. Four possible search Direction

We have used 4-connectivity as well as 8-connectivity
for boundary extraction. It is observed that 4-connectivity
We have used 4-connectivity as well as 8-connectivity for
boundary extraction. It is observed that 4-connectivity gives
batter results. Figure 6 shows the four possible search directions. A pixel is considered to be a part of the perimeter, if it is
nonzero and it is connected to at least one nonzero value.
Connectivity can also be defined in a more general way
for any dimension by using connectivity for a 3-by-3 matrix
of 0’s and 1’s. The 1-valued elements define neighborhood
locations relative to the center element of connectivity. Note
that connectivity must be symmetric about its center element. Figure 7 shows the image after boundary detection.
E. Connected Component and Area Computation
Connected components are rectangular boxes bounding
together regions of connected black pixels. The objective of
CC stage is to form rectangles around distinct components
in the page. The algorithm used is a simple iterative procedure with successive scan lines of an image to determine
whether black pixels in any pair of scan lines are connected
together. Bounding rectangles are extended to enclose any
groupings of connected black pixels between successive
scan lines.
A bounding box stops growing in size only when there
are no more black pixels on the current scan line join black

Figure 3. Image after vertical run length smearing

Figure 4. Final Result of Run length smearing

Figure 7. Image after Boundary Detection

pixels from previous line. These bounding boxes then form
the skeleton for further analysis. After detection of boundary
from the image, we find the connected component. Then, with
the help of connected component we compute the area of

Figure 5. Image after reapplying horizontal run length smearing

© 2013 ACEEE
DOI: 01.IJSIP.4.1.1044

31
Short Paper
ACEEE Int. J. on Signal & Image Processing, Vol. 4, No. 1, Jan 2013
closed loop. From Figure 6 it is clear that the text which is
converted into uniform stripes has uniform area. When we
compare the area of the closed loop (stripes which form by
run length smearing) with open loop area then, it fills Zero in
smaller area. Figure 7 shows the smaller area which is filled
with zero or black pixel.
To find the connected component the algorithm uses the
following general procedure:
1. Scan all image pixels, assigning preliminary labels
to nonzero pixels and recording label equivalences
in a union-find table.
2. Resolve the equivalence classes using the unionfind algorithm.
3. Relabel the pixels based on the resolved
equivalence classes.

Figure 8. Smaller Area filled with Zero smearing

F. Image-Text Separation
The text areas are thus filled with zero or black pixel, which
is shown in Figure 8. By ORing the Original binary image
with image so obtained entire text is separated from the original images. This is shown in Figure 9. After extracting text,
we XOR this image with the image in Figure 3 to separated
image/graphics in the original binary document. The separated image is shown in Figure 10. Results for two more
samples are given in Figure 11 and Figure 12. Text and image
parts are clearly separated.
G. Multi pass smearing
For several images like the one shown in Figure 13, the
separated image part may contain some text. This is possible
mainly if some part of text is isolated and has a font much
bigger than the rest of the text. For such cases, we repeat the
run length smearing operation on the separated image. Figure
13 (a) is the original image. The text and image separated after
application of our scheme are shown in Figure 13 (b) and (c)
respectively. The image extracted in (c) shows some text
content. When the smearing is applied on this image the text
and image part is fully separated. This is shown in Figure 13
(d).

Figure 9. Text Extracted from given Document smearing

CONCLUSIONS
Choice of the segmentation is very important step in the
separation of newspaper and magazine document images.
This choice is based on the quality of the input image, the
required output quality and speed of processing. Run length
smearing algorithm using boundary detection techniques are
found to have good performance for the images having good
separation between the text and the images pixels. Run length
smearing algorithm and recursive segmentation technique are
adopted in the segmentation stage to extract non-overlapping blocks embedded in the document.
The Enhanced CRLA has been chosen to build a number
of document processing systems because of its advantages
compared with other page segmentation techniques. For
example, although the texture- analysis-based approaches
are powerful to handle various page layouts, they are
commonly time-consuming because of the pixel-level
© 2013 ACEEE
DOI: 01.IJSIP.4.1.1044

Figure 10. Image Extracted from given Document Image

classification. The methods based on connected component
analysis may have problems to extract large headline
characters and thus they suit to deal with documents of certain
character size. However, the original CRLA can only treat
Manhattan page format; the present work extends its
capability to cover both Manhattan and non-Manhattan
layout and thus expands its application domain significantly.
But the range of mean length of horizontal black runs (MBRL)
and White-black transition count per unit width (MTC) in
CRLA is not suitable for all type of document images.
To improve the performance of these techniques our
method is implemented and it works well in case of all type of
document images and works on text in several languages i.e.
32
Short Paper
ACEEE Int. J. on Signal & Image Processing, Vol. 4, No. 1, Jan 2013
Hindi, English, Bangla, Telugu, Chinese. This method is language independent. If we see Projection profile method [1],
[8] it is only applicable for Devanagari Document images. It
needs that the layout of newspaper document image is very
specific. Otherwise it will not separate the text-image region.
Run length smearing using stripe merging method is only
suitable when stripes formed during rum length smearing are
perfect rectangle.
In our text-image separation system using boundary
detection, there is no need of skew correction. Without skew
correction it can work properly. Also, there is no requirement
of labeling different sizes of text and images in the given
documents. It can easily separate the text and image from
document image. The method can be used recursively if the
separated image contains some residue text. This system
provides a good platform for implementing segmentation
techniques suitable for developing real time processing
techniques and processing large databases of newspaper
and magazine document images.

[11] Kuo-Chin Fan, Liang-Shen Wang, Yuan-Kai Wang , “Page
segmentation and identification for intelligent signal
processing”, Signal Processing, Elsevier, pp. 329-346, 1995.

REFERENCES
[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

Swapnil Khedekar Vemulapati Ramanaprasad Srirangaraj
Setlur, “Text - Image Separation in Devanagari Documents”,
Proceedings of the Seventh International Conference on
Document Analysis and Recognition (ICDAR’03), 2003
Q. Yuan, C.L. Tan, “Text Extraction from Gray Scale
Document Images Using Edge Information”. Proceedings of
the International Conference on Document Analysis and
Recognition, ICDAR’01, Seattle, USA, pp. 302-306
September 10- 13, 2001
A. Antonacopoulos and R T Ritchings “Segmentation and
Classification of Document Images”, The Institution of
Electrical Engineers, U.K. 1995
[G. Nagy, S. Seth, “Hierarchical Representation of Optically
Scanned Documents”. Proceedings of 7th Intl. Conf. on Pattern
Recognition, Montreal, Canada, pp. 347-349, 1984.
Kyong-Ho Lee, Student Yoon-Chul Choy, and Sung-Bae Cho,
“ Geometric Structure Analysis of Document Images: A
Knowledge-Based Approach”, IEEE transactions on Pattern
Analysis and Machine Intelligence, Vol. 22, No. 11, November
2000.
Jean Duong, Hubert Emptoz, “Features for Printed Document
Image Analysis”, IEEE Transaction on Document Image
Analysis Vol. 02, No. 17 November 2002.
P. Mitchell, H. Yan, “Newspaper document analysis featuring
connected line segmentation”, Proceedings of International
Conference on on Document Analysis and Recognition,
ICDAR’01, Seattle, USA, 2001.
Abdel Wahab Zramdini and Rolf Ingold, “Optical font
recognition from projection profiles”, Proceedings of
Electronic Publishing. Vol. 6(3), pp.249–260, September, 1993.
Kuo-Chin Fan, Chi-Hwa Liu, Yuan-Kai Wang, “Segmentation
and classification of mixed text/graphics/image documents”,
Pattern Recognition Letters, Elsevier, pp. 1201-1209,
December 1994.
Hung-Ming Sun, “Enhanced Constrained Run-Length
Algorithm for Complex Layout Document Processing”,
International Journal of Applied Science and Engineering pp.
297-309, 2006.

© 2013 ACEEE
DOI: 01.IJSIP.4.1.1044

Figure 11. Text image separated for sample 2

33
Short Paper
ACEEE Int. J. on Signal & Image Processing, Vol. 4, No. 1, Jan 2013

Figure 12. Text image separated for sample 3

(a) Original Image

(c) Image separated after first iteration (with some text)

(b) Text separated after first iteration

© 2013 ACEEE
DOI: 01.IJSIP.4.1.1044

34
Short Paper
ACEEE Int. J. on Signal & Image Processing, Vol. 4, No. 1, Jan 2013

(d) Text and image separation after second iteration of smearing on image in (c)
Figure 13. Text separation requiring two passes

© 2013 ACEEE
DOI: 01.IJSIP.4.1.1044

35

Más contenido relacionado

La actualidad más candente

Feature integration for image information retrieval using image mining techni...
Feature integration for image information retrieval using image mining techni...Feature integration for image information retrieval using image mining techni...
Feature integration for image information retrieval using image mining techni...
iaemedu
 
Hangul Recognition Using Support Vector Machine
Hangul Recognition Using Support Vector MachineHangul Recognition Using Support Vector Machine
Hangul Recognition Using Support Vector Machine
Editor IJCATR
 

La actualidad más candente (18)

H017416670
H017416670H017416670
H017416670
 
I017417176
I017417176I017417176
I017417176
 
Ac03401600163.
Ac03401600163.Ac03401600163.
Ac03401600163.
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
Modified Skip Line Encoding for Binary Image Compression
Modified Skip Line Encoding for Binary Image CompressionModified Skip Line Encoding for Binary Image Compression
Modified Skip Line Encoding for Binary Image Compression
 
Empirical Coding for Curvature Based Linear Representation in Image Retrieval...
Empirical Coding for Curvature Based Linear Representation in Image Retrieval...Empirical Coding for Curvature Based Linear Representation in Image Retrieval...
Empirical Coding for Curvature Based Linear Representation in Image Retrieval...
 
Feature integration for image information retrieval using image mining techni...
Feature integration for image information retrieval using image mining techni...Feature integration for image information retrieval using image mining techni...
Feature integration for image information retrieval using image mining techni...
 
Texture Classification
Texture ClassificationTexture Classification
Texture Classification
 
A comparative study on content based image retrieval methods
A comparative study on content based image retrieval methodsA comparative study on content based image retrieval methods
A comparative study on content based image retrieval methods
 
Image to text Converter
Image to text ConverterImage to text Converter
Image to text Converter
 
Hangul Recognition Using Support Vector Machine
Hangul Recognition Using Support Vector MachineHangul Recognition Using Support Vector Machine
Hangul Recognition Using Support Vector Machine
 
Comparative Analysis of Lossless Image Compression Based On Row By Row Classi...
Comparative Analysis of Lossless Image Compression Based On Row By Row Classi...Comparative Analysis of Lossless Image Compression Based On Row By Row Classi...
Comparative Analysis of Lossless Image Compression Based On Row By Row Classi...
 
A Survey on Tamil Handwritten Character Recognition using OCR Techniques
A Survey on Tamil Handwritten Character Recognition using OCR TechniquesA Survey on Tamil Handwritten Character Recognition using OCR Techniques
A Survey on Tamil Handwritten Character Recognition using OCR Techniques
 
Automatic dominant region segmentation for natural images
Automatic dominant region segmentation for natural imagesAutomatic dominant region segmentation for natural images
Automatic dominant region segmentation for natural images
 
Content based image retrieval based on shape with texture features
Content based image retrieval based on shape with texture featuresContent based image retrieval based on shape with texture features
Content based image retrieval based on shape with texture features
 
A CONCERT EVALUATION OF EXEMPLAR BASED IMAGE INPAINTING ALGORITHMS FOR NATURA...
A CONCERT EVALUATION OF EXEMPLAR BASED IMAGE INPAINTING ALGORITHMS FOR NATURA...A CONCERT EVALUATION OF EXEMPLAR BASED IMAGE INPAINTING ALGORITHMS FOR NATURA...
A CONCERT EVALUATION OF EXEMPLAR BASED IMAGE INPAINTING ALGORITHMS FOR NATURA...
 
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
Query Image Searching With Integrated Textual and Visual Relevance Feedback f...
 
C1803011419
C1803011419C1803011419
C1803011419
 

Similar a Text-Image Separation in Document Images using Boundary/Perimeter Detection

Enhancement and Segmentation of Historical Records
Enhancement and Segmentation of Historical RecordsEnhancement and Segmentation of Historical Records
Enhancement and Segmentation of Historical Records
csandit
 

Similar a Text-Image Separation in Document Images using Boundary/Perimeter Detection (20)

F045053236
F045053236F045053236
F045053236
 
Optical Character Recognition from Text Image
Optical Character Recognition from Text ImageOptical Character Recognition from Text Image
Optical Character Recognition from Text Image
 
AVC based Compression of Compound Images Using Block Classification Scheme
AVC based Compression of Compound Images Using Block Classification SchemeAVC based Compression of Compound Images Using Block Classification Scheme
AVC based Compression of Compound Images Using Block Classification Scheme
 
Header Based Classification of Journals Using Document Image Segmentation and...
Header Based Classification of Journals Using Document Image Segmentation and...Header Based Classification of Journals Using Document Image Segmentation and...
Header Based Classification of Journals Using Document Image Segmentation and...
 
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document SegmentationA Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
 
Image Segmentation using Otsu's Method - Computer Graphics (UCS505) Project R...
Image Segmentation using Otsu's Method - Computer Graphics (UCS505) Project R...Image Segmentation using Otsu's Method - Computer Graphics (UCS505) Project R...
Image Segmentation using Otsu's Method - Computer Graphics (UCS505) Project R...
 
Recognition Technology for Four Arithmetic Operations
Recognition Technology for Four Arithmetic OperationsRecognition Technology for Four Arithmetic Operations
Recognition Technology for Four Arithmetic Operations
 
International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...
 
A Survey of Image Processing and Identification Techniques
A Survey of Image Processing and Identification TechniquesA Survey of Image Processing and Identification Techniques
A Survey of Image Processing and Identification Techniques
 
Enhancement and Segmentation of Historical Records
Enhancement and Segmentation of Historical RecordsEnhancement and Segmentation of Historical Records
Enhancement and Segmentation of Historical Records
 
E017322833
E017322833E017322833
E017322833
 
A Review of Optical Character Recognition System for Recognition of Printed Text
A Review of Optical Character Recognition System for Recognition of Printed TextA Review of Optical Character Recognition System for Recognition of Printed Text
A Review of Optical Character Recognition System for Recognition of Printed Text
 
DEVELOPMENT OF AN ALPHABETIC CHARACTER RECOGNITION SYSTEM USING MATLAB FOR BA...
DEVELOPMENT OF AN ALPHABETIC CHARACTER RECOGNITION SYSTEM USING MATLAB FOR BA...DEVELOPMENT OF AN ALPHABETIC CHARACTER RECOGNITION SYSTEM USING MATLAB FOR BA...
DEVELOPMENT OF AN ALPHABETIC CHARACTER RECOGNITION SYSTEM USING MATLAB FOR BA...
 
An effective approach to offline arabic handwriting recognition
An effective approach to offline arabic handwriting recognitionAn effective approach to offline arabic handwriting recognition
An effective approach to offline arabic handwriting recognition
 
A novel predicate for active region merging in automatic image segmentation
A novel predicate for active region merging in automatic image segmentationA novel predicate for active region merging in automatic image segmentation
A novel predicate for active region merging in automatic image segmentation
 
A novel predicate for active region merging in automatic image segmentation
A novel predicate for active region merging in automatic image segmentationA novel predicate for active region merging in automatic image segmentation
A novel predicate for active region merging in automatic image segmentation
 
Segmentation and recognition of handwritten digit numeral string using a mult...
Segmentation and recognition of handwritten digit numeral string using a mult...Segmentation and recognition of handwritten digit numeral string using a mult...
Segmentation and recognition of handwritten digit numeral string using a mult...
 
Block Classification Scheme of Compound Images: A Hybrid Extension
Block Classification Scheme of Compound Images: A Hybrid ExtensionBlock Classification Scheme of Compound Images: A Hybrid Extension
Block Classification Scheme of Compound Images: A Hybrid Extension
 
A Survey Paper on Character Recognition
A Survey Paper on Character RecognitionA Survey Paper on Character Recognition
A Survey Paper on Character Recognition
 
MRI Image Segmentation Using Level Set Method and Implement an Medical Diagno...
MRI Image Segmentation Using Level Set Method and Implement an Medical Diagno...MRI Image Segmentation Using Level Set Method and Implement an Medical Diagno...
MRI Image Segmentation Using Level Set Method and Implement an Medical Diagno...
 

Más de IDES Editor

Optimal Placement of DG for Loss Reduction and Voltage Sag Mitigation in Radi...
Optimal Placement of DG for Loss Reduction and Voltage Sag Mitigation in Radi...Optimal Placement of DG for Loss Reduction and Voltage Sag Mitigation in Radi...
Optimal Placement of DG for Loss Reduction and Voltage Sag Mitigation in Radi...
IDES Editor
 
Line Losses in the 14-Bus Power System Network using UPFC
Line Losses in the 14-Bus Power System Network using UPFCLine Losses in the 14-Bus Power System Network using UPFC
Line Losses in the 14-Bus Power System Network using UPFC
IDES Editor
 
Cloud Security and Data Integrity with Client Accountability Framework
Cloud Security and Data Integrity with Client Accountability FrameworkCloud Security and Data Integrity with Client Accountability Framework
Cloud Security and Data Integrity with Client Accountability Framework
IDES Editor
 
Permutation of Pixels within the Shares of Visual Cryptography using KBRP for...
Permutation of Pixels within the Shares of Visual Cryptography using KBRP for...Permutation of Pixels within the Shares of Visual Cryptography using KBRP for...
Permutation of Pixels within the Shares of Visual Cryptography using KBRP for...
IDES Editor
 

Más de IDES Editor (20)

Power System State Estimation - A Review
Power System State Estimation - A ReviewPower System State Estimation - A Review
Power System State Estimation - A Review
 
Artificial Intelligence Technique based Reactive Power Planning Incorporating...
Artificial Intelligence Technique based Reactive Power Planning Incorporating...Artificial Intelligence Technique based Reactive Power Planning Incorporating...
Artificial Intelligence Technique based Reactive Power Planning Incorporating...
 
Design and Performance Analysis of Genetic based PID-PSS with SVC in a Multi-...
Design and Performance Analysis of Genetic based PID-PSS with SVC in a Multi-...Design and Performance Analysis of Genetic based PID-PSS with SVC in a Multi-...
Design and Performance Analysis of Genetic based PID-PSS with SVC in a Multi-...
 
Optimal Placement of DG for Loss Reduction and Voltage Sag Mitigation in Radi...
Optimal Placement of DG for Loss Reduction and Voltage Sag Mitigation in Radi...Optimal Placement of DG for Loss Reduction and Voltage Sag Mitigation in Radi...
Optimal Placement of DG for Loss Reduction and Voltage Sag Mitigation in Radi...
 
Line Losses in the 14-Bus Power System Network using UPFC
Line Losses in the 14-Bus Power System Network using UPFCLine Losses in the 14-Bus Power System Network using UPFC
Line Losses in the 14-Bus Power System Network using UPFC
 
Study of Structural Behaviour of Gravity Dam with Various Features of Gallery...
Study of Structural Behaviour of Gravity Dam with Various Features of Gallery...Study of Structural Behaviour of Gravity Dam with Various Features of Gallery...
Study of Structural Behaviour of Gravity Dam with Various Features of Gallery...
 
Assessing Uncertainty of Pushover Analysis to Geometric Modeling
Assessing Uncertainty of Pushover Analysis to Geometric ModelingAssessing Uncertainty of Pushover Analysis to Geometric Modeling
Assessing Uncertainty of Pushover Analysis to Geometric Modeling
 
Secure Multi-Party Negotiation: An Analysis for Electronic Payments in Mobile...
Secure Multi-Party Negotiation: An Analysis for Electronic Payments in Mobile...Secure Multi-Party Negotiation: An Analysis for Electronic Payments in Mobile...
Secure Multi-Party Negotiation: An Analysis for Electronic Payments in Mobile...
 
Selfish Node Isolation & Incentivation using Progressive Thresholds
Selfish Node Isolation & Incentivation using Progressive ThresholdsSelfish Node Isolation & Incentivation using Progressive Thresholds
Selfish Node Isolation & Incentivation using Progressive Thresholds
 
Various OSI Layer Attacks and Countermeasure to Enhance the Performance of WS...
Various OSI Layer Attacks and Countermeasure to Enhance the Performance of WS...Various OSI Layer Attacks and Countermeasure to Enhance the Performance of WS...
Various OSI Layer Attacks and Countermeasure to Enhance the Performance of WS...
 
Responsive Parameter based an AntiWorm Approach to Prevent Wormhole Attack in...
Responsive Parameter based an AntiWorm Approach to Prevent Wormhole Attack in...Responsive Parameter based an AntiWorm Approach to Prevent Wormhole Attack in...
Responsive Parameter based an AntiWorm Approach to Prevent Wormhole Attack in...
 
Cloud Security and Data Integrity with Client Accountability Framework
Cloud Security and Data Integrity with Client Accountability FrameworkCloud Security and Data Integrity with Client Accountability Framework
Cloud Security and Data Integrity with Client Accountability Framework
 
Genetic Algorithm based Layered Detection and Defense of HTTP Botnet
Genetic Algorithm based Layered Detection and Defense of HTTP BotnetGenetic Algorithm based Layered Detection and Defense of HTTP Botnet
Genetic Algorithm based Layered Detection and Defense of HTTP Botnet
 
Enhancing Data Storage Security in Cloud Computing Through Steganography
Enhancing Data Storage Security in Cloud Computing Through SteganographyEnhancing Data Storage Security in Cloud Computing Through Steganography
Enhancing Data Storage Security in Cloud Computing Through Steganography
 
Low Energy Routing for WSN’s
Low Energy Routing for WSN’sLow Energy Routing for WSN’s
Low Energy Routing for WSN’s
 
Permutation of Pixels within the Shares of Visual Cryptography using KBRP for...
Permutation of Pixels within the Shares of Visual Cryptography using KBRP for...Permutation of Pixels within the Shares of Visual Cryptography using KBRP for...
Permutation of Pixels within the Shares of Visual Cryptography using KBRP for...
 
Rotman Lens Performance Analysis
Rotman Lens Performance AnalysisRotman Lens Performance Analysis
Rotman Lens Performance Analysis
 
Band Clustering for the Lossless Compression of AVIRIS Hyperspectral Images
Band Clustering for the Lossless Compression of AVIRIS Hyperspectral ImagesBand Clustering for the Lossless Compression of AVIRIS Hyperspectral Images
Band Clustering for the Lossless Compression of AVIRIS Hyperspectral Images
 
Microelectronic Circuit Analogous to Hydrogen Bonding Network in Active Site ...
Microelectronic Circuit Analogous to Hydrogen Bonding Network in Active Site ...Microelectronic Circuit Analogous to Hydrogen Bonding Network in Active Site ...
Microelectronic Circuit Analogous to Hydrogen Bonding Network in Active Site ...
 
Texture Unit based Monocular Real-world Scene Classification using SOM and KN...
Texture Unit based Monocular Real-world Scene Classification using SOM and KN...Texture Unit based Monocular Real-world Scene Classification using SOM and KN...
Texture Unit based Monocular Real-world Scene Classification using SOM and KN...
 

Último

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 

Último (20)

TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Food safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdfFood safety_Challenges food safety laboratories_.pdf
Food safety_Challenges food safety laboratories_.pdf
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 

Text-Image Separation in Document Images using Boundary/Perimeter Detection

  • 1. Short Paper ACEEE Int. J. on Signal & Image Processing, Vol. 4, No. 1, Jan 2013 Text-Image Separation in Document Images using Boundary/Perimeter Detection Priti P. Rege1, Chanchal A. Chandrakar2 College of Engineering,, Pune, India. Email: 1 ppr.extc@coep.ac.in 2 chanch.chandra@gmail.com bottom-up Approaches are used for page segmentation and block identification [5], [7]. Yuan, Tan [2] designed method that makes use of edge information to extract textual blocks from gray scale document images. It aims at detecting only textual regions on heavy noise infected newspaper images and separate them from graphical regions. The White Tiles Approach [3] described new approaches to page segmentation and classification. In this method, once the white tiles of each region have been gathered together and their total area is estimated, and regions are classified as text or images. George Nagy, Mukkai Krishnamoorthy [4] have proposed two complementary methods for characterizing the spatial structure of digitized technical documents and labeling various logical components without using optical character recognition. Projection profile method [6], [8] is used for separating the text and images, which is only suitable for Devanagari Documents (Hindi document). The main disadvantage of this method is that the irregular shaped images with nonrectangular shaped text blocks may result in loss of some text. They can be dealt with by adapting algorithms available for Roman script. Kuo-Chin Fan, Chi-Hwa Liu, Yuan-Kai Wang [9] have implemented a feature-based document analysis system which utilizes domain knowledge to segment and classify mixed text/graphics/image documents. This method is only suitable for pure text or image document, i.e. a document which has only text region or image region. This method is good for text-image identification not for extraction. The Constrained Run-Length Algorithm (CRLA) [10] is a wellknown technique for page segmentation. The algorithm is very efficient for partitioning documents with Manhattan layouts but not suited to deal with complex layout pages, e.g. irregular graphics embedded in a text paragraph. Its main drawback is the use of only local information during the smearing stage, which may lead to erroneous linkage of text and graphics. Kuo-Chin Fan, Liang-Shen Wang, Yuan-Kai Wang [11] proposed an intelligent document analysis system to achieve the document segmentation and identification goal. The proposed system consists of two modules: block segmentation and block identification. Two kinds of features, connectivity histogram and multi resolution features are extracted. Abstract—Document analysis plays an important role in office automation, especially in intelligent signal processing. The proposed system consists of two modules: block segmentation and block identification. In this approach, first a document is segmented into several non-overlapping blocks by utilizing a novel recursive segmentation technique, and then extracts the features embedded in each segmented block are extracted. Two kinds of features, connected components and image boundary/perimeter features are extracted. Document with text inside image pose limitations in earlier reported literature. This is taken care of by applying additional pass of the Run Length Smearing on the extracted image that contains text. Proposed scheme is independent of type and language of the document. Index Terms— OCR, Segmantation, Connected components. I. INTRODUCTION Work in the field of document image processing covers many different areas including preprocessing, layout analysis, optical character recognition, graphics analysis, form processing, and writer identification. Text parts are the main information carriers in most applications. For that purpose, it is necessary to locate text objects within the image, recognize them, and extract the hidden information. The documents may contain, besides text, graphics and images that overlap. Since text lines are not always horizontally aligned, finding text parts and locating characters, words, and lines are not trivial tasks. Due to the tremendous reduction in the storage space of the processed results, it is advantageous to reproduce, transmit, and store the document in the processed form. The extracted regions can then be processed by a subsequent step according to their types, e.g. OCR for text regions and compression for graphics and halftone images. Several methods have been proposed to separate text and image or text and graphics from the background. Two broad categories of available methods are Connected Component based (CC) and texture based algorithms. The first category segments the image into a set of CCs and then classifies each CC as either text or non text. CC based algorithms are relatively simple The underlying assumption is that texts in document images can be seen as sets of separating connected components each of which has distinct intensity or color distribution and linked edge contours. Up to now several strategies have been tried to solve the problem of segmentation. Techniques for page segmentation and layout analysis are broadly divided in to three main categories: top-down, bottom-up and hybrid techniques [1]. Many © 2013 ACEEE DOI: 01.IJSIP.4.1.1044 II. METHODOLOGY A. Document Segmentation The main aim of document segmentation is to segment a document into several separate blocks with each block 29
  • 2. Short Paper ACEEE Int. J. on Signal & Image Processing, Vol. 4, No. 1, Jan 2013 representing one type of medium. The method we adopted in segmenting documents is a combination of run-length smearing algorithm and boundary perimeter detection procedure. It first performs the smearing operation in both the horizontal and vertical directions to generate blocks. Due to the selection of an improper threshold in the run-length smearing process, an intact paragraph might be segmented into several consecutive horizontal stripes with each stripe representing one horizontal text line. The stripe merging procedure is thus devised to merge those text stripes that belong to the same paragraph. The merit is that it generates an intact text block instead of several smaller text stripes. By this way, it not only saves storage space but also reduces the burden in performing optical character recognition. Figure 2. Image after horizontal run length smearing B. Pre-processing Preprocessing of document images is the way of using mature image processing techniques to improve the quality of images. Its purpose is to enhance and extract useful information of images for later processing purposes. Two preprocessing tasks, thresholding and noise removal, are performed here. Document image is usually captured as colored image (RGB) and we convert it into Gray level image. image. Hence, a binarization procedure converting a gray-level image into a binary image is necessary. OTSU Document image is usually captured as colored image (RGB) and we convert it into Gray level image. Thresholding is utilized to accomplish the conversion task. Figure 1, shows the original and binarized image. C. Block Extraction With the pre-processing being done, a binarized image is then obtained. In this binarized image, each meaningful block which can be easily recognized by human beings is composed of pixels. RLSA is an operation to connect two nonadjacent runs into one merged run if the distance between these two runs is smaller than a threshold. It is assumed that white pixels are represented by 0 and black pixels by 1. If the number of 0 between l’s is less than or equal to a constant C, then 0 is replaced by 1. In other words, two runs having distance smaller than the threshold C will be merged into one run. Example Before smearing: 1111111000001111111100011 After smearing: 1111111000001111111111111 The result generated by the smearing operation is several l-runs in each horizontal row. Two constants CHT (Constant Horizontal threshold value) and CVT (Constant Vertical threshold value) are needed due to the processing of Run Length Smearing algorithm (RLSA) in two directions. CHT selected as half of CVT. Figure 2 and 3 illustrate the operation of RLSA in two directions i.e. horizontal and vertical directions. The results generated by horizontal and vertical smearing operations are then combined by logical AND operation to produce the meaningful blocks. In other words, a pixel in the resulting image is black if the pixels at the same position of both vertical and horizontal Run-length smearing images are black. Though run length smearing can combine related pixels into meaningful blocks, there are still some small blocks which should be further combined. We repeatedly apply run length smearing with increased runlengths to the output of AND operation to merge these blochs. An image illustrating the result generated by Figure 1. Original color image and Binarized image © 2013 ACEEE DOI: 01.IJSIP.4.1.1044 30
  • 3. Short Paper ACEEE Int. J. on Signal & Image Processing, Vol. 4, No. 1, Jan 2013 RLSA is shown in Figure 4. Result of next step, to detect boundary after reapplying horizontal run length smearing, is shown in Figure 5. D. Boundary/Perimeter Detection Since we have obtained meaningful blocks, the next step is to find the outside boundary of smearing blocks. Outside boundary is defined as the white pixels surrounding the extracted boundary of a block. The traditional edge detection methods are not adequate for detecting this kind of boundary. For example, if traditional edge detection methods are operated on a one-pixel-width line block open contour will be generated. Figure 6. Four possible search Direction We have used 4-connectivity as well as 8-connectivity for boundary extraction. It is observed that 4-connectivity We have used 4-connectivity as well as 8-connectivity for boundary extraction. It is observed that 4-connectivity gives batter results. Figure 6 shows the four possible search directions. A pixel is considered to be a part of the perimeter, if it is nonzero and it is connected to at least one nonzero value. Connectivity can also be defined in a more general way for any dimension by using connectivity for a 3-by-3 matrix of 0’s and 1’s. The 1-valued elements define neighborhood locations relative to the center element of connectivity. Note that connectivity must be symmetric about its center element. Figure 7 shows the image after boundary detection. E. Connected Component and Area Computation Connected components are rectangular boxes bounding together regions of connected black pixels. The objective of CC stage is to form rectangles around distinct components in the page. The algorithm used is a simple iterative procedure with successive scan lines of an image to determine whether black pixels in any pair of scan lines are connected together. Bounding rectangles are extended to enclose any groupings of connected black pixels between successive scan lines. A bounding box stops growing in size only when there are no more black pixels on the current scan line join black Figure 3. Image after vertical run length smearing Figure 4. Final Result of Run length smearing Figure 7. Image after Boundary Detection pixels from previous line. These bounding boxes then form the skeleton for further analysis. After detection of boundary from the image, we find the connected component. Then, with the help of connected component we compute the area of Figure 5. Image after reapplying horizontal run length smearing © 2013 ACEEE DOI: 01.IJSIP.4.1.1044 31
  • 4. Short Paper ACEEE Int. J. on Signal & Image Processing, Vol. 4, No. 1, Jan 2013 closed loop. From Figure 6 it is clear that the text which is converted into uniform stripes has uniform area. When we compare the area of the closed loop (stripes which form by run length smearing) with open loop area then, it fills Zero in smaller area. Figure 7 shows the smaller area which is filled with zero or black pixel. To find the connected component the algorithm uses the following general procedure: 1. Scan all image pixels, assigning preliminary labels to nonzero pixels and recording label equivalences in a union-find table. 2. Resolve the equivalence classes using the unionfind algorithm. 3. Relabel the pixels based on the resolved equivalence classes. Figure 8. Smaller Area filled with Zero smearing F. Image-Text Separation The text areas are thus filled with zero or black pixel, which is shown in Figure 8. By ORing the Original binary image with image so obtained entire text is separated from the original images. This is shown in Figure 9. After extracting text, we XOR this image with the image in Figure 3 to separated image/graphics in the original binary document. The separated image is shown in Figure 10. Results for two more samples are given in Figure 11 and Figure 12. Text and image parts are clearly separated. G. Multi pass smearing For several images like the one shown in Figure 13, the separated image part may contain some text. This is possible mainly if some part of text is isolated and has a font much bigger than the rest of the text. For such cases, we repeat the run length smearing operation on the separated image. Figure 13 (a) is the original image. The text and image separated after application of our scheme are shown in Figure 13 (b) and (c) respectively. The image extracted in (c) shows some text content. When the smearing is applied on this image the text and image part is fully separated. This is shown in Figure 13 (d). Figure 9. Text Extracted from given Document smearing CONCLUSIONS Choice of the segmentation is very important step in the separation of newspaper and magazine document images. This choice is based on the quality of the input image, the required output quality and speed of processing. Run length smearing algorithm using boundary detection techniques are found to have good performance for the images having good separation between the text and the images pixels. Run length smearing algorithm and recursive segmentation technique are adopted in the segmentation stage to extract non-overlapping blocks embedded in the document. The Enhanced CRLA has been chosen to build a number of document processing systems because of its advantages compared with other page segmentation techniques. For example, although the texture- analysis-based approaches are powerful to handle various page layouts, they are commonly time-consuming because of the pixel-level © 2013 ACEEE DOI: 01.IJSIP.4.1.1044 Figure 10. Image Extracted from given Document Image classification. The methods based on connected component analysis may have problems to extract large headline characters and thus they suit to deal with documents of certain character size. However, the original CRLA can only treat Manhattan page format; the present work extends its capability to cover both Manhattan and non-Manhattan layout and thus expands its application domain significantly. But the range of mean length of horizontal black runs (MBRL) and White-black transition count per unit width (MTC) in CRLA is not suitable for all type of document images. To improve the performance of these techniques our method is implemented and it works well in case of all type of document images and works on text in several languages i.e. 32
  • 5. Short Paper ACEEE Int. J. on Signal & Image Processing, Vol. 4, No. 1, Jan 2013 Hindi, English, Bangla, Telugu, Chinese. This method is language independent. If we see Projection profile method [1], [8] it is only applicable for Devanagari Document images. It needs that the layout of newspaper document image is very specific. Otherwise it will not separate the text-image region. Run length smearing using stripe merging method is only suitable when stripes formed during rum length smearing are perfect rectangle. In our text-image separation system using boundary detection, there is no need of skew correction. Without skew correction it can work properly. Also, there is no requirement of labeling different sizes of text and images in the given documents. It can easily separate the text and image from document image. The method can be used recursively if the separated image contains some residue text. This system provides a good platform for implementing segmentation techniques suitable for developing real time processing techniques and processing large databases of newspaper and magazine document images. [11] Kuo-Chin Fan, Liang-Shen Wang, Yuan-Kai Wang , “Page segmentation and identification for intelligent signal processing”, Signal Processing, Elsevier, pp. 329-346, 1995. REFERENCES [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] Swapnil Khedekar Vemulapati Ramanaprasad Srirangaraj Setlur, “Text - Image Separation in Devanagari Documents”, Proceedings of the Seventh International Conference on Document Analysis and Recognition (ICDAR’03), 2003 Q. Yuan, C.L. Tan, “Text Extraction from Gray Scale Document Images Using Edge Information”. Proceedings of the International Conference on Document Analysis and Recognition, ICDAR’01, Seattle, USA, pp. 302-306 September 10- 13, 2001 A. Antonacopoulos and R T Ritchings “Segmentation and Classification of Document Images”, The Institution of Electrical Engineers, U.K. 1995 [G. Nagy, S. Seth, “Hierarchical Representation of Optically Scanned Documents”. Proceedings of 7th Intl. Conf. on Pattern Recognition, Montreal, Canada, pp. 347-349, 1984. Kyong-Ho Lee, Student Yoon-Chul Choy, and Sung-Bae Cho, “ Geometric Structure Analysis of Document Images: A Knowledge-Based Approach”, IEEE transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No. 11, November 2000. Jean Duong, Hubert Emptoz, “Features for Printed Document Image Analysis”, IEEE Transaction on Document Image Analysis Vol. 02, No. 17 November 2002. P. Mitchell, H. Yan, “Newspaper document analysis featuring connected line segmentation”, Proceedings of International Conference on on Document Analysis and Recognition, ICDAR’01, Seattle, USA, 2001. Abdel Wahab Zramdini and Rolf Ingold, “Optical font recognition from projection profiles”, Proceedings of Electronic Publishing. Vol. 6(3), pp.249–260, September, 1993. Kuo-Chin Fan, Chi-Hwa Liu, Yuan-Kai Wang, “Segmentation and classification of mixed text/graphics/image documents”, Pattern Recognition Letters, Elsevier, pp. 1201-1209, December 1994. Hung-Ming Sun, “Enhanced Constrained Run-Length Algorithm for Complex Layout Document Processing”, International Journal of Applied Science and Engineering pp. 297-309, 2006. © 2013 ACEEE DOI: 01.IJSIP.4.1.1044 Figure 11. Text image separated for sample 2 33
  • 6. Short Paper ACEEE Int. J. on Signal & Image Processing, Vol. 4, No. 1, Jan 2013 Figure 12. Text image separated for sample 3 (a) Original Image (c) Image separated after first iteration (with some text) (b) Text separated after first iteration © 2013 ACEEE DOI: 01.IJSIP.4.1.1044 34
  • 7. Short Paper ACEEE Int. J. on Signal & Image Processing, Vol. 4, No. 1, Jan 2013 (d) Text and image separation after second iteration of smearing on image in (c) Figure 13. Text separation requiring two passes © 2013 ACEEE DOI: 01.IJSIP.4.1.1044 35