6. Academic Research
• Natural scene OCR ≠ traditional scanned OCR
– Camera captured
– Illumination variations
– Perspective distortion
– Short text
• Digital-born text
• Natural-scene text
Source: ICDAR Text Locating Competition
7. Product Images - Two Purposes
Text’s role is different:
1. Sales pitches
2. Product list
12. Current methods
1. Texture based (Classifier-based)
2. Region based (Connected components)
3. Hybrids
13. 1. Texture-based method
• Special texture
• Scan
• Classifier (SVM, AdaBoost or Neural network)
Problems:
• Scale/Rotation variant
• High computation
14. 2. Region-based method
• Local features (edges or color clustering)
• Connected component analysis
• Text line and word separation
[Figure: output of the stroke width transform]
Problem:
• False candidates
20. RIT’s Approach
1. Character/word annotation (time-consuming task)
– Text image classifier using image-wise annotation
2. Transparent text (hard to detect)
– Transparent text detection and background recovery
21. 1. Text image classifier using image-wise annotation
• Text image detection (not char/word)
– Image-wise annotation (less time)
– Clustering detected regions (measure text likeliness)
23. Clustering detected regions
[Figure: detected regions plotted in feature space (f1, f2) and grouped into clusters C1–C5; legend: region in text images, region in non-text images, cluster center. E.g., P(C1) = 3/4, P(C4) = 0/3.]
24. Comparison
Better than a typical method
[Chart: accuracy (0–90% scale) of the current vs. proposed method; the proposed method scores substantially higher]
• Rakuten 500 images
• Compared w/a traditional region-based method
25. RIT’s Approach
1. Character/word annotation (time-consuming task)
– Text image classifier using image-wise annotation
2. Transparent text (hard to detect)
– Transparent text detection and background recovery
26. 2. Transparent text detection and
background recovery
• Edge Detection with adaptive threshold
– Image content analysis
• Background recovery
– Text color/opacity estimation
27. Edge detection with adaptive thresholds
• Less noise
• Weak edges are better preserved
28. Texture strength
Measuring image complexity
Image patches:
Direction and energy: eigenvectors and eigenvalues [1]
Texture strength:
[1] Xiang Zhu and Peyman Milanfar, “Automatic parameter selection for denoising algorithms using a no-reference measure of image content,” IEEE Transactions on Image Processing, pp. 3116–3132, 2010.
29. Proposed text detection
1. Texture based (Classifier based)
SVM/Random Forest/AdaBoost
2. Region based (Connected components)
Edge/Color Clustering
3. Hybrids
Region (Edge Stroke Width)
+
Texture (AdaBoost)
33. Transparent Text
I = (1 − γ)O + γT
I: observed pixel value
O: original pixel value
T: text color
γ: opacity
• 2 unknowns (γ and T)
• ≥ 2 equations → least-squares solution
Hello, my name is Naoki Chiba. Today I am going to talk about text detection in product images.
These are examples of product images, which contain sales pitches such as price, store name and shipping information. Applications of text detection include content retrieval/filtering, character recognition and text translation into different languages for international sales.
Here is the outline of today’s talk. After an overview of text detection, I will review current methods, and then I will talk about Rakuten’s approach.
In academia, text detection has been an active area of research for a long time, starting from traditional scanned OCR, which scans documents with a flat-bed scanner. Current text detection is different: because of the popularity of imaging devices such as mobile cameras, images may contain illumination variations and perspective distortion, and the text is shorter than before. Text images can be categorized into two types: digital-born text, which is inserted by an editor, and natural-scene text, which is receiving a lot of attention in academia.
Product images have two different purposes. The first is to show sales pitches. The second is to show a product list representing product variations. Depending on the purpose, the role of text is different.
This is an example of a product list. If the image contains store-specific information such as the merchant’s name, price or shipping information, that might not be good.
Another example is what we call “Now printing” images. We use this type of image when product images are not available even though the product has been released or we are taking pre-orders. These images are updated when the product photo becomes available, but we need to detect them first. The problem is that they are provided by our merchants, not by Rakuten, due to the online marketplace model, so we do not know beforehand which images they are going to use.
In summary, product images can contain both natural-scene text and digital-born text, which are mixed together and difficult to detect.
Next, I am going to show some current methods in academia to detect text in images.
Current methods can be categorized into three types: texture-based, region-based, and hybrids of the two.
The texture-based method looks for the distinctive texture of text by scanning a window across the image, then classifies each window with a classifier such as a Support Vector Machine, AdaBoost or a neural network. But it has two problems: first, it is scale- and rotation-variant; second, its computational cost is high.
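As a rough sketch of the sliding-window idea (the window size, stride and the stand-in classifier below are illustrative placeholders; a real system would plug in a trained SVM, AdaBoost or neural-network model):

```python
import numpy as np

def sliding_window_scores(image, win=16, stride=8, classify=None):
    """Scan a grayscale image with a fixed window and score each patch.

    `classify` stands in for a trained model; the default is a toy
    score (the patch's standard deviation), since text regions tend
    to have high local contrast.
    """
    if classify is None:
        classify = lambda patch: float(patch.std())
    h, w = image.shape
    scores = []
    for y in range(0, h - win + 1, stride):
        for x in range(0, w - win + 1, stride):
            patch = image[y:y + win, x:x + win]
            scores.append(((y, x), classify(patch)))
    return scores

# Toy image: flat background with one high-contrast striped block.
img = np.zeros((32, 32))
img[0:8, 0:8] = np.tile([0.0, 1.0], (8, 4))  # "text-like" stripes
scores = sliding_window_scores(img, win=16, stride=8)
best = max(scores, key=lambda s: s[1])
print(best[0])  # (0, 0): the window covering the striped block
```

This also makes the two drawbacks visible: the scan must be repeated at multiple scales and rotations, and the number of windows (and classifier calls) grows quickly with image size.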
The second method is the region-based method. It examines local features, either edges or color clusters, followed by connected-component analysis, text-line grouping and word separation. The problem is that it may produce a lot of false candidates.
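A minimal sketch of this pipeline (flood-fill component labeling over a binary edge mask, with a simple area filter standing in for full text-line grouping and stroke-width checks):

```python
import numpy as np

def connected_components(mask):
    """Label 4-connected components of a boolean mask via flood fill."""
    h, w = mask.shape
    labels = np.zeros((h, w), dtype=int)
    current = 0
    for sy in range(h):
        for sx in range(w):
            if mask[sy, sx] and labels[sy, sx] == 0:
                current += 1
                labels[sy, sx] = current
                stack = [(sy, sx)]
                while stack:
                    y, x = stack.pop()
                    for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                        if 0 <= ny < h and 0 <= nx < w and mask[ny, nx] and labels[ny, nx] == 0:
                            labels[ny, nx] = current
                            stack.append((ny, nx))
    return labels, current

def candidate_boxes(mask, min_area=4):
    """Bounding boxes of components big enough to be character candidates."""
    labels, n = connected_components(mask)
    boxes = []
    for i in range(1, n + 1):
        ys, xs = np.nonzero(labels == i)
        if ys.size >= min_area:  # drop tiny components (likely noise)
            boxes.append((int(ys.min()), int(xs.min()), int(ys.max()) + 1, int(xs.max()) + 1))
    return boxes

# Two blobs: a 12-pixel "character candidate" and a 1-pixel noise speck.
mask = np.zeros((10, 10), dtype=bool)
mask[2:5, 2:6] = True
mask[8, 8] = True
print(candidate_boxes(mask))  # [(2, 2, 5, 6)] -- the noise pixel is filtered out
```

Even with the area filter, busy backgrounds can produce many surviving components, which is exactly the false-candidate problem the hybrid methods address.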
Therefore, the third type, the hybrid method, is getting a lot of attention these days. It starts from a region-based method, using either edges or color clustering, and then verifies whether the detected regions are text with a classifier trained by machine learning.
Still, there are some problems, and we would like to solve the following two. One is character/word annotation, which is a time-consuming task, especially when we have a lot of data. The other is transparent text, which is hard to detect.
For example, character annotation means drawing rectangles on top of text characters by hand. If an image contains many characters, annotation by a human operator is very time-consuming, especially when we have a lot of images.
Another problem we would like to solve is transparent text, which is difficult to detect because the edges are weak. But once we detect it, there is a possibility of recovering the background behind the text.
So, to solve these problems, I would like to show what RIT, the Rakuten Institute of Technology, is doing.
To avoid character/word annotation, we built a text image classifier by using only image-wise annotation, which is much more efficient. We are also working on transparent text detection and background recovery. I am going to show the details of the two.
Our text image detection is based on image-wise annotation, which takes much less time than character or word annotation. By clustering detected regions with a machine learning technique, we can get a measure of text likeliness.
When each detected region is represented by image features f1 and f2, we cluster the regions by these features. Based on the image-wise annotation, we can compute a probability of being text for each cluster. For example, red dots show regions that appeared in text images and blue dots regions that appeared in non-text images. Since cluster C4 contains only regions that appeared in non-text images, it is unlikely to be text.
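The text-likeliness estimate can be sketched as follows (the two features, the fixed cluster centers and the toy data are illustrative stand-ins; the real system learns the clusters from region features):

```python
import numpy as np

def cluster_text_likeliness(features, from_text_image, centers):
    """Assign each region to its nearest cluster center and estimate,
    per cluster, the fraction of member regions that came from images
    annotated (image-wise) as containing text.

    `centers` would normally come from clustering the region features;
    here they are fixed for illustration.
    """
    d = np.linalg.norm(features[:, None, :] - centers[None, :, :], axis=2)
    assign = d.argmin(axis=1)          # nearest-center assignment
    probs = {}
    for c in range(len(centers)):
        members = from_text_image[assign == c]
        probs[c] = float(members.mean()) if members.size else 0.0
    return assign, probs

# Toy data mirroring the slide: cluster 0 is mostly text-image regions
# (P = 3/4), cluster 1 contains only non-text-image regions (P = 0).
feats = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [0.1, 0.1],
                  [5.0, 5.0], [5.1, 5.0], [5.0, 5.1]])
labels = np.array([1, 1, 1, 0, 0, 0, 0], dtype=float)
centers = np.array([[0.0, 0.0], [5.0, 5.0]])
_, probs = cluster_text_likeliness(feats, labels, centers)
print(probs)  # {0: 0.75, 1: 0.0}
```

A region detected in a new image then inherits the text probability of whichever cluster it falls into, with no character-level labels ever needed.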
We measured the performance against a typical previous method, and ours was significantly better: the accuracy increased by around 20%.
Another problem we are solving is transparent text and background recovery.
We propose adaptive edge detection based on analyzing the image content. To recover the background, we estimate the text color and the opacity, i.e., the transparency of the text.
These are examples of detected edges. Compared with traditional edge detectors such as Sobel or Canny, ours are better.
Let me introduce how we do our detection. We measure image complexity as texture strength by analyzing the image content with eigenspace analysis. Based on the texture strength, we can set the edge-detection thresholds adaptively.
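A simplified sketch of such a texture-strength measure (eigenvalues of the patch's gradient covariance, loosely following the cited Zhu–Milanfar measure; the exact formula in the paper differs):

```python
import numpy as np

def texture_strength(patch):
    """Texture strength from the eigenvalues of a patch's gradient
    covariance (structure tensor): larger means more textured content.
    """
    gy, gx = np.gradient(patch.astype(float))
    G = np.stack([gx.ravel(), gy.ravel()], axis=1)
    C = G.T @ G                                           # 2x2 gradient covariance
    s = np.sqrt(np.maximum(np.linalg.eigvalsh(C), 0.0))  # energies along the two principal directions
    return float(s.sum())

flat = np.ones((16, 16))                 # smooth region: strength ~0
stripes = np.tile([0.0, 1.0], (16, 8))   # busy region: strength large
print(texture_strength(flat) < texture_strength(stripes))  # True
```

An edge-detection threshold can then be scaled with this strength: raised in busy, high-strength regions to suppress noise, and lowered in smooth regions so weak edges, such as those of transparent text, survive.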
To detect text, we use a hybrid method: a region-based edge/stroke-width transform combined with a machine learning technique.
Here is a system flow. After adaptive edge detection, we work on component analysis and detect text. Once we detect text, we can recover background.
Here are examples. Our system was able to detect transparent text.
Transparent text can be represented by this formula. The observed pixel values I are a mixture of the background values O and the text color T, with the mixing ratio determined by the opacity gamma. Assuming that the text color and opacity are uniform within the text, we can solve for these parameters with a least-squares method given two or more data points, because there are two unknowns.
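Under these assumptions (uniform text color T and opacity γ, with the background value O known at two or more pixels, e.g., from a repeating background pattern), the estimation is a small linear least-squares problem. A sketch, with synthetic values standing in for real pixel data:

```python
import numpy as np

def estimate_text_params(O, I):
    """Estimate opacity gamma and text color T from pixel pairs where both
    the original background value O and the observed value I are known,
    using the model I = (1 - gamma) * O + gamma * T.

    Rearranged: I - O = -gamma * O + (gamma * T), which is linear in the
    unknowns (gamma, gamma * T), so we solve by least squares.
    """
    O = np.asarray(O, dtype=float)
    I = np.asarray(I, dtype=float)
    A = np.stack([-O, np.ones_like(O)], axis=1)
    x, *_ = np.linalg.lstsq(A, I - O, rcond=None)
    gamma, gT = x
    return gamma, gT / gamma

def recover_background(I, gamma, T):
    """Invert the blending model to recover the original pixel values."""
    return (np.asarray(I, dtype=float) - gamma * T) / (1.0 - gamma)

# Synthetic check: gamma = 0.6, T = 200, varied background values.
O = np.array([10.0, 80.0, 150.0])
I = (1 - 0.6) * O + 0.6 * 200
gamma, T = estimate_text_params(O, I)
print(round(gamma, 3), round(T, 1))  # 0.6 200.0
```

Once γ and T are estimated from a few such pixels, the same inversion recovers the background under the entire text region.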
This is an example of recovered image.
We also compared with a previous method called inpainting, which tries to fill the text region using the surrounding pixel pattern. While inpainting could not recover the original content, in this case a small hole, ours was able to recover it.
Thank you for your attention. The details will be presented at Asian Conference on Pattern Recognition next month.