SlideShare una empresa de Scribd logo
1 de 61
Tools for Image Retrieval
in Large Multimedia Databases
by Carles Ventura Royo
Directors:
Verónica Vilaplana
Xavier Giró
Tutor:
Ferran Marqués
Barcelona, September 2011
1
Index
 Identifying the problem
 State of art: indexing techniques
 State of art: Hierarchical Cellular Tree (HCT)
 Modifications to the original HCT
 Experimental results
 Implemented tools
 Conclusions and future work lines
2
3
Identifying the problem (I)
“House”
DATABASE
4
Identifying the problem (II)
QUERY
TARGETS
Visual + textual
descriptors
“house”
“garden”
K
Identifying the problem (III)
 K nearest neighbor problem
 Solution: Sequential scan
 Drawback: Computational time for large
databases (10 s for a 200,000 elements)
 Approximate K nearest neighbor problem
 Indexing techniques
5
Requirements of the solution
 Dynamic approach
 Multimedia databases are not static
 Insertions and deletions
 High dimensional feature spaces
 “curse of dimensionality” problem
 MPEG-7 visual descriptors are high-dimensional
feature vectors
6
Index
 Identifying the problem
 State of art: indexing techniques
 State of art: Hierarchical Cellular Tree (HCT)
 Modifications to the original HCT
 Experimental results
 Implemented tools
 Conclusions and future work lines
7
Indexing techniques (I)
 Hierarchical data structures
 Spatial Access Methods (SAMs)
 K-d tree, R-tree, R*-tree, TV-tree, etc.
 Drawbacks:
 Items have to be represented in an N-dimensional feature
space
 Dissimilarity measure based on a Lp metric
 SAMs do not scale up well to high dimensional spaces
8
Indexing techniques (II)
 Hierarchical data structures
 Metric Access Methods (MAMs)
 VP-tree, MVP-tree, GNAT, M-tree, etc.
 More general approach than SAMs
 Assuming only a similarity distance function
 MAMs scale up well to high dimensional spaces
 Drawbacks:
 Static MAMs do not support dynamic changes
 Dependence on pre-fixed parameters
9
Indexing techniques (III)
 Locality Sensitive Hashing
 It uses hash functions
 Nearby data points are hashed into the same
bucket with a high probability
 Points faraway are hashed into the same bucket
with a low probability
 Drawback:
 It does not solves the K nearest neighbor problem, but
the Є-near neighbor problem.
10
Index
 Identifying the problem
 State of art: indexing techniques
 State of art: Hierarchical Cellular Tree (HCT)
 Modifications to the original HCT
 Experimental results
 Implemented tools
 Conclusions and future work lines
11
Solution adopted
 Hierarchical Cellular Tree (HCT)
 MAM-based indexing scheme
 Hierarchical structure
 Self-organized tree
 Incremental construction
in a bottom-up fashion
 Unbalanced tree
 Not dependence on a maximum capacity
 Preemptive cell search algorithm for insertion
 Dynamic approach
[KG07] S. Kiranyaz and M.Gabbouj, Hierarchical Cellular Tree: An efficient
indexing scheme for content-based retrieval on multimedia databases.
12
HCT: Cell Structure (I)
 Basic container structure
 Undirected graph
 Minimum Spanning Tree (MST)
 Cell nucleus
 Covering radius
13
HCT: Cell Structure (II)
 Cell compactness
 Maturity size
 Mitosis operation
14
HCT: Level Structure
 Representatives for each cell from the lower
level
 Responsible for maximizing the compactness
of its cells
 Compactness threshold
15
HCT Operations (I)
 Item insertion
 Find the most
suitable cell
 Most Similar Nucleus
vs Preemptive Cell Search
16
HCT Operations (II)
 Item insertion
 Find the most
suitable cell
 Most Similar Nucleus
vs Preemptive Cell Search
C2: d(O,O2
N) - r(O2
N) < dmin
C3: d(O,O3
N) - r(O3
N) > dmin
17
HCT Operations (III)
 Item insertion
 Find the most suitable cell
 Append the element
 Generic post-processing check
 Mitosis operation
 Nucleus change
18
HCT Operations (IV)
 Item removal
 Cell search algorithm not required
 Remove the element
 Generic post-processing check
 Mitosis operation
 Nucleus change
19
HCT: Retrieval scheme
 Progressive Query
 Periodical subqueries over database subsets
 Query Path formation
 Based on Most Similar Nucleus
 Ranking agreggation
20
Index
 Identifying the problem
 State of art: indexing techniques
 State of art: Hierarchical Cellular Tree (HCT)
 Modifications to the original HCT
 Experimental results
 Implemented tools
 Conclusions and future work lines
21
Modifications to the HCT (I)
 Covering radius
 Original definition gives an approximation by
defect
 Consider all the elements belonging to the
subtree
 High computational cost
 Approximation by excess
22
Modifications to the HCT (II)
 Covering radius
23
Modifications to the HCT (III)
 Covering radius
24
rC (C9) = max
• d(E,B)+rC(C1)
• rC(C2)
• d(E,H)+rC(C3)
Modifications to the HCT (III)
 HCT construction
 Preemptive Cell Search over all the levels
 A method for updating the covering radius
 To reduce the searching time
 It can be performed after the HCT construction or
periodically
25
Modifications to the HCT (IV)
 Searching techniques
 PQ fails in solving the KNN problem efficiently
 New searching techniques
 Most Similar Nucleus
 Preemptive Cell Search
 Hybrid
 Number of cells to be considered
 Minimum number of cells
 Cells hosting 2·K elements
 Cellular structure is not kept
26
Index
 Identifying the problem
 State of art: indexing techniques
 State of art: Hierarchical Cellular Tree (HCT)
 Modifications to the original HCT
 Experimental results
 Implemented tools
 Conclusions and future work lines
27
Experimental results
 CCMA image database of 216,317 elements
 HCT building evaluation
 Construction time
 Retrieval system evaluation
 Retrieval time
 Elements retrieved
28
HCT building evaluation
29
Retrieval system evaluation (I)
30
Retrieval sytem evaluation (V)
31
Retrieval system evaluation (VI)
32
Retrieval system evaluation (II)
33
 Evaluation with respect to exhaustive search
 Mean Competitive Recall
 Elements in common
 Mean Normalized Aggregate Goodness
 Kendall distance
 Number of exchanges needed in a bubble sort
 Query set of 1,082 images
Retrieval system evaluation (III)
34
 Preemptive Cell Search
Retrieval system evaluation (IV)
 Searching techniques comparative
35
Index
 Identifying the problem
 State of art: indexing techniques
 State of art: Hierarchical Cellular Tree (HCT)
 Modifications to the original HCT
 Experimental results
 Implemented tools
 Conclusions and future work lines
36
Implemented tools (I)
 database_indexing tool
 Tool for indexing an image database
 HCT is stored at disk
 hct_query tool
 Tool for carrying out a search over an indexed
database
 HCT is read from disk and load at main memory
37
Implemented tools (II)
 A server/client architecture
 Based on a messaging system: KSC
38
Index
 Identifying the problem
 State of art: indexing techniques
 State of art: Hierarchical Cellular Tree (HCT)
 Modifications to the original HCT
 Experimental results
 Implemented tools
 Conclusions and future work lines
39
Conclusions
 Hierarchical Cellular Tree implementation
 To improve the retrieval times
 Generic implementation for any kind of data
 Modifications proposed
 HCT evaluation
 Measures extracted from literature
 Preemptive Cell Search technique gives the
best performance
 It is essential not to use an underestimated value
for the covering radius 40
Future work lines
 Very large databases
 Not using only main memory
 Region-based CBIR system
 Each image can be represented by a set of
regions
 Browser application based on HCT
 Take advantage of the hierarchical structure
 Alternative way to retrieve elements
41
Thanks for your attention
42
HCT Operations (IV)
 HCT construction
43
HCT Operations (V)
 HCT construction
44
A tool for image DB indexing
 MPEG-7 visual descriptors
 Indexing a database
 Inserting new elements to an indexed
database
45
A tool for image retrieval
 It requires an indexed database
 Image query
 Image file
 XML file with the visual descriptors
 TXT file with several image queries
 Interactive mode
 Performing a search
46
A server/client architecture
 Based on a messaging system: KSC
47
Retrieval system evaluation (IV)
48
 Most Similar Nucleus
Preemptive: 0.8319 seconds (retrieval time)
27.51 (CR) and 99.26% retrieved queries
Retrieval system evaluation (V)
49
 Progressive Query (Most Similar Nucleus)
Preemptive: 0.8319 seconds (retrieval time)
27.51 (CR) and 99.26% retrieved queries
Retrieval system evaluation (VI)
50
 Hybrid
 Preemptive Cell Search over 8 uppermost levels
Preemptive: 0.8319 seconds (retrieval time)
27.51 (CR) and 99.26% retrieved queries
Retrieval system evaluation
51
Retrieval system evaluation
52
Retrieval system evaluation
53
Retrieval system evaluation
54
Retrieval system evaluation
55
Retrieval system evaluation
56
Retrieval system evaluation
57
Retrieval system evaluation
58
Retrieval system evaluation
59
Retrieval system evaluation
60
Retrieval system evaluation
61

Más contenido relacionado

La actualidad más candente

Physics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningPhysics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningKAMAL CHOUDHARY
 
Autonomous experimental phase diagram acquisition
Autonomous experimental phase diagram acquisitionAutonomous experimental phase diagram acquisition
Autonomous experimental phase diagram acquisitionaimsnist
 
Applications of Machine Learning for Materials Discovery at NREL
Applications of Machine Learning for Materials Discovery at NRELApplications of Machine Learning for Materials Discovery at NREL
Applications of Machine Learning for Materials Discovery at NRELaimsnist
 
Graphs, Environments, and Machine Learning for Materials Science
Graphs, Environments, and Machine Learning for Materials ScienceGraphs, Environments, and Machine Learning for Materials Science
Graphs, Environments, and Machine Learning for Materials Scienceaimsnist
 
TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...BrianDeCost
 
Earth Science Platform
Earth Science PlatformEarth Science Platform
Earth Science PlatformTed Habermann
 
A schema generation approach for column oriented no sql data stores
A schema generation approach for column oriented no sql data storesA schema generation approach for column oriented no sql data stores
A schema generation approach for column oriented no sql data storesKIRAN V
 
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs RevolutionRecent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs RevolutionNextMove Software
 
On how to efficiently implement Deep Learning algorithms on PYNQ platform
On how to efficiently implement Deep Learning algorithms on PYNQ platformOn how to efficiently implement Deep Learning algorithms on PYNQ platform
On how to efficiently implement Deep Learning algorithms on PYNQ platformNECST Lab @ Politecnico di Milano
 
Chapter1_C.doc
Chapter1_C.docChapter1_C.doc
Chapter1_C.docbutest
 
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...aimsnist
 
Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...aimsnist
 
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...KAMAL CHOUDHARY
 
Machine learning astronomical structure
Machine learning astronomical structureMachine learning astronomical structure
Machine learning astronomical structurePanditNitesh
 
Intelligent Multimedia Recommendation
Intelligent Multimedia RecommendationIntelligent Multimedia Recommendation
Intelligent Multimedia RecommendationWanjin Yu
 
A Novel Approach for Clustering Big Data based on MapReduce
A Novel Approach for Clustering Big Data based on MapReduce A Novel Approach for Clustering Big Data based on MapReduce
A Novel Approach for Clustering Big Data based on MapReduce IJECEIAES
 

La actualidad más candente (20)

Physics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningPhysics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learning
 
Autonomous experimental phase diagram acquisition
Autonomous experimental phase diagram acquisitionAutonomous experimental phase diagram acquisition
Autonomous experimental phase diagram acquisition
 
Applications of Machine Learning for Materials Discovery at NREL
Applications of Machine Learning for Materials Discovery at NRELApplications of Machine Learning for Materials Discovery at NREL
Applications of Machine Learning for Materials Discovery at NREL
 
Graphs, Environments, and Machine Learning for Materials Science
Graphs, Environments, and Machine Learning for Materials ScienceGraphs, Environments, and Machine Learning for Materials Science
Graphs, Environments, and Machine Learning for Materials Science
 
TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...
 
Earth Science Platform
Earth Science PlatformEarth Science Platform
Earth Science Platform
 
A schema generation approach for column oriented no sql data stores
A schema generation approach for column oriented no sql data storesA schema generation approach for column oriented no sql data stores
A schema generation approach for column oriented no sql data stores
 
Dmdw1
Dmdw1Dmdw1
Dmdw1
 
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs RevolutionRecent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
Recent Advances in Chemical & Biological Search Systems: Evolution vs Revolution
 
Csi
CsiCsi
Csi
 
On how to efficiently implement Deep Learning algorithms on PYNQ platform
On how to efficiently implement Deep Learning algorithms on PYNQ platformOn how to efficiently implement Deep Learning algorithms on PYNQ platform
On how to efficiently implement Deep Learning algorithms on PYNQ platform
 
Chapter1_C.doc
Chapter1_C.docChapter1_C.doc
Chapter1_C.doc
 
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
 
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
“Materials Informatics and Big Data: Realization of 4th Paradigm of Science i...
 
Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...
 
AVL TREE AN EFFICIENT RETRIEVAL ENGINE IN CLASSIFIED FINGERPRINT DATABASE
AVL TREE AN EFFICIENT RETRIEVAL ENGINE IN CLASSIFIED FINGERPRINT DATABASEAVL TREE AN EFFICIENT RETRIEVAL ENGINE IN CLASSIFIED FINGERPRINT DATABASE
AVL TREE AN EFFICIENT RETRIEVAL ENGINE IN CLASSIFIED FINGERPRINT DATABASE
 
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...Accelerated Materials Discovery & Characterization with Classical, Quantum an...
Accelerated Materials Discovery & Characterization with Classical, Quantum an...
 
Machine learning astronomical structure
Machine learning astronomical structureMachine learning astronomical structure
Machine learning astronomical structure
 
Intelligent Multimedia Recommendation
Intelligent Multimedia RecommendationIntelligent Multimedia Recommendation
Intelligent Multimedia Recommendation
 
A Novel Approach for Clustering Big Data based on MapReduce
A Novel Approach for Clustering Big Data based on MapReduce A Novel Approach for Clustering Big Data based on MapReduce
A Novel Approach for Clustering Big Data based on MapReduce
 

Destacado

落合 Wba hackathon2_成果報告_最終版
落合 Wba hackathon2_成果報告_最終版落合 Wba hackathon2_成果報告_最終版
落合 Wba hackathon2_成果報告_最終版koji ochiai
 
Creating new classes of objects with deep generative neural nets
Creating new classes of objects with deep generative neural netsCreating new classes of objects with deep generative neural nets
Creating new classes of objects with deep generative neural netsAkin Osman Kazakci
 
유사 이미지 검색 기술 동향 - Pinterest 사례
유사 이미지 검색 기술 동향 - Pinterest 사례유사 이미지 검색 기술 동향 - Pinterest 사례
유사 이미지 검색 기술 동향 - Pinterest 사례Geunhee Cho
 
Image enhancement
Image enhancementImage enhancement
Image enhancementMANISH T I
 
LIvRE: A Video Extension to the LIRE Content-Based Image Retrieval System
LIvRE: A Video Extension to the LIRE Content-Based Image Retrieval SystemLIvRE: A Video Extension to the LIRE Content-Based Image Retrieval System
LIvRE: A Video Extension to the LIRE Content-Based Image Retrieval SystemUniversitat Politècnica de Catalunya
 
情報幾何勉強会 EMアルゴリズム
情報幾何勉強会 EMアルゴリズム 情報幾何勉強会 EMアルゴリズム
情報幾何勉強会 EMアルゴリズム Shinagawa Seitaro
 
第2回nips読み会・関西資料『unsupervised learning for physical interaction through video ...
第2回nips読み会・関西資料『unsupervised learning for physical interaction through video ...第2回nips読み会・関西資料『unsupervised learning for physical interaction through video ...
第2回nips読み会・関西資料『unsupervised learning for physical interaction through video ...koji ochiai
 
第3回nips読み会・関西『variational inference foundations and modern methods』
第3回nips読み会・関西『variational inference  foundations and modern methods』第3回nips読み会・関西『variational inference  foundations and modern methods』
第3回nips読み会・関西『variational inference foundations and modern methods』koji ochiai
 
YouTube-8M: A Large-Scale Video Classification Benchmark (UPC Reading Group)
YouTube-8M: A Large-Scale Video Classification Benchmark (UPC Reading Group)YouTube-8M: A Large-Scale Video Classification Benchmark (UPC Reading Group)
YouTube-8M: A Large-Scale Video Classification Benchmark (UPC Reading Group)Universitat Politècnica de Catalunya
 
[DL輪読会]Semi supervised qa with generative domain-adaptive nets
[DL輪読会]Semi supervised qa with generative domain-adaptive nets[DL輪読会]Semi supervised qa with generative domain-adaptive nets
[DL輪読会]Semi supervised qa with generative domain-adaptive netsDeep Learning JP
 
[DL輪読会]Unsupervised Cross-Domain Image Generation
[DL輪読会]Unsupervised Cross-Domain Image Generation[DL輪読会]Unsupervised Cross-Domain Image Generation
[DL輪読会]Unsupervised Cross-Domain Image GenerationDeep Learning JP
 
Binarized Neural Networks
Binarized Neural NetworksBinarized Neural Networks
Binarized Neural NetworksShotaro Sano
 
[DL輪読会]StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generat...
[DL輪読会]StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generat...[DL輪読会]StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generat...
[DL輪読会]StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generat...Deep Learning JP
 
深層強化学習 Pydata.Okinawa Meetup #22
深層強化学習 Pydata.Okinawa Meetup #22深層強化学習 Pydata.Okinawa Meetup #22
深層強化学習 Pydata.Okinawa Meetup #22Naoto Yoshida
 
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...Universitat Politècnica de Catalunya
 

Destacado (20)

Convolutional Features for Instance Search
Convolutional Features for Instance SearchConvolutional Features for Instance Search
Convolutional Features for Instance Search
 
Region-oriented Convolutional Networks for Object Retrieval
Region-oriented Convolutional Networks for Object RetrievalRegion-oriented Convolutional Networks for Object Retrieval
Region-oriented Convolutional Networks for Object Retrieval
 
Multi-label Remote Sensing Image Retrieval based on Deep Features
Multi-label Remote Sensing Image Retrieval based on Deep FeaturesMulti-label Remote Sensing Image Retrieval based on Deep Features
Multi-label Remote Sensing Image Retrieval based on Deep Features
 
Relevance feedback for image retrieval with EEG signals
Relevance feedback for image retrieval with EEG signalsRelevance feedback for image retrieval with EEG signals
Relevance feedback for image retrieval with EEG signals
 
落合 Wba hackathon2_成果報告_最終版
落合 Wba hackathon2_成果報告_最終版落合 Wba hackathon2_成果報告_最終版
落合 Wba hackathon2_成果報告_最終版
 
Creating new classes of objects with deep generative neural nets
Creating new classes of objects with deep generative neural netsCreating new classes of objects with deep generative neural nets
Creating new classes of objects with deep generative neural nets
 
유사 이미지 검색 기술 동향 - Pinterest 사례
유사 이미지 검색 기술 동향 - Pinterest 사례유사 이미지 검색 기술 동향 - Pinterest 사례
유사 이미지 검색 기술 동향 - Pinterest 사례
 
Part-based Object Retrieval with Binary Partition Trees
Part-based Object Retrieval with Binary Partition TreesPart-based Object Retrieval with Binary Partition Trees
Part-based Object Retrieval with Binary Partition Trees
 
Image enhancement
Image enhancementImage enhancement
Image enhancement
 
LIvRE: A Video Extension to the LIRE Content-Based Image Retrieval System
LIvRE: A Video Extension to the LIRE Content-Based Image Retrieval SystemLIvRE: A Video Extension to the LIRE Content-Based Image Retrieval System
LIvRE: A Video Extension to the LIRE Content-Based Image Retrieval System
 
情報幾何勉強会 EMアルゴリズム
情報幾何勉強会 EMアルゴリズム 情報幾何勉強会 EMアルゴリズム
情報幾何勉強会 EMアルゴリズム
 
第2回nips読み会・関西資料『unsupervised learning for physical interaction through video ...
第2回nips読み会・関西資料『unsupervised learning for physical interaction through video ...第2回nips読み会・関西資料『unsupervised learning for physical interaction through video ...
第2回nips読み会・関西資料『unsupervised learning for physical interaction through video ...
 
第3回nips読み会・関西『variational inference foundations and modern methods』
第3回nips読み会・関西『variational inference  foundations and modern methods』第3回nips読み会・関西『variational inference  foundations and modern methods』
第3回nips読み会・関西『variational inference foundations and modern methods』
 
YouTube-8M: A Large-Scale Video Classification Benchmark (UPC Reading Group)
YouTube-8M: A Large-Scale Video Classification Benchmark (UPC Reading Group)YouTube-8M: A Large-Scale Video Classification Benchmark (UPC Reading Group)
YouTube-8M: A Large-Scale Video Classification Benchmark (UPC Reading Group)
 
[DL輪読会]Semi supervised qa with generative domain-adaptive nets
[DL輪読会]Semi supervised qa with generative domain-adaptive nets[DL輪読会]Semi supervised qa with generative domain-adaptive nets
[DL輪読会]Semi supervised qa with generative domain-adaptive nets
 
[DL輪読会]Unsupervised Cross-Domain Image Generation
[DL輪読会]Unsupervised Cross-Domain Image Generation[DL輪読会]Unsupervised Cross-Domain Image Generation
[DL輪読会]Unsupervised Cross-Domain Image Generation
 
Binarized Neural Networks
Binarized Neural NetworksBinarized Neural Networks
Binarized Neural Networks
 
[DL輪読会]StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generat...
[DL輪読会]StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generat...[DL輪読会]StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generat...
[DL輪読会]StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generat...
 
深層強化学習 Pydata.Okinawa Meetup #22
深層強化学習 Pydata.Okinawa Meetup #22深層強化学習 Pydata.Okinawa Meetup #22
深層強化学習 Pydata.Okinawa Meetup #22
 
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
Skin Lesion Detection from Dermoscopic Images using Convolutional Neural Netw...
 

Similar a Tools for Image Retrieval in Large Multimedia Databases

Current clustering techniques
Current clustering techniquesCurrent clustering techniques
Current clustering techniquesPoonam Kshirsagar
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...Ian Foster
 
Cs 1004 -_data_warehousing_and_data_mining
Cs 1004 -_data_warehousing_and_data_miningCs 1004 -_data_warehousing_and_data_mining
Cs 1004 -_data_warehousing_and_data_mininghari91
 
Hadoop World 2010 - BAH - Fuzzy Table
Hadoop World 2010 - BAH - Fuzzy TableHadoop World 2010 - BAH - Fuzzy Table
Hadoop World 2010 - BAH - Fuzzy TableCloudera, Inc.
 
Btv thesis defense_v1.02-final
Btv thesis defense_v1.02-finalBtv thesis defense_v1.02-final
Btv thesis defense_v1.02-finalVinh Bui
 
Automatic Image Annotation (AIA)
Automatic Image Annotation (AIA)Automatic Image Annotation (AIA)
Automatic Image Annotation (AIA)Farzaneh Rezaei
 
On the Support of a Similarity-Enabled Relational Database Management System ...
On the Support of a Similarity-Enabled Relational Database Management System ...On the Support of a Similarity-Enabled Relational Database Management System ...
On the Support of a Similarity-Enabled Relational Database Management System ...Universidade de São Paulo
 
How to Scale from Workstation through Cloud to HPC in Cryo-EM Processing
How to Scale from Workstation through Cloud to HPC in Cryo-EM ProcessingHow to Scale from Workstation through Cloud to HPC in Cryo-EM Processing
How to Scale from Workstation through Cloud to HPC in Cryo-EM Processinginside-BigData.com
 
Large Scale Data Clustering: an overview
Large Scale Data Clustering: an overviewLarge Scale Data Clustering: an overview
Large Scale Data Clustering: an overviewVahid Mirjalili
 
Applying ‘best fit’ frameworks to systematic review data extraction
Applying ‘best fit’ frameworks to systematic review data extractionApplying ‘best fit’ frameworks to systematic review data extraction
Applying ‘best fit’ frameworks to systematic review data extractionAndrea Miller-Nesbitt
 
Foundations for the Future of Science
Foundations for the Future of ScienceFoundations for the Future of Science
Foundations for the Future of ScienceGlobus
 
A Framework for Comparison and Evaluation of Nonlinear Intra-Subject Image Re...
A Framework for Comparison and Evaluation of Nonlinear Intra-Subject Image Re...A Framework for Comparison and Evaluation of Nonlinear Intra-Subject Image Re...
A Framework for Comparison and Evaluation of Nonlinear Intra-Subject Image Re...Kitware Kitware
 
Ivan Sahumbaiev "Deep Learning approaches meet 3D data"
Ivan Sahumbaiev "Deep Learning approaches meet 3D data"Ivan Sahumbaiev "Deep Learning approaches meet 3D data"
Ivan Sahumbaiev "Deep Learning approaches meet 3D data"Fwdays
 
Basic use of xcms
Basic use of xcmsBasic use of xcms
Basic use of xcmsXiuxia Du
 
Enabling open and reproducible computer systems research: the good, the bad a...
Enabling open and reproducible computer systems research: the good, the bad a...Enabling open and reproducible computer systems research: the good, the bad a...
Enabling open and reproducible computer systems research: the good, the bad a...Grigori Fursin
 
A Proposed Method to Develop Shared Papers for Researchers at Conference
A Proposed Method to Develop Shared Papers for Researchers at ConferenceA Proposed Method to Develop Shared Papers for Researchers at Conference
A Proposed Method to Develop Shared Papers for Researchers at Conferenceiosrjce
 
Big Data Lessons from the Cloud
Big Data Lessons from the CloudBig Data Lessons from the Cloud
Big Data Lessons from the CloudMapR Technologies
 

Similar a Tools for Image Retrieval in Large Multimedia Databases (20)

Current clustering techniques
Current clustering techniquesCurrent clustering techniques
Current clustering techniques
 
Braintalk cuso nm
Braintalk cuso nmBraintalk cuso nm
Braintalk cuso nm
 
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...Computing Just What You Need: Online Data Analysis and Reduction  at Extreme ...
Computing Just What You Need: Online Data Analysis and Reduction at Extreme ...
 
Cs 1004 -_data_warehousing_and_data_mining
Cs 1004 -_data_warehousing_and_data_miningCs 1004 -_data_warehousing_and_data_mining
Cs 1004 -_data_warehousing_and_data_mining
 
Hadoop World 2010 - BAH - Fuzzy Table
Hadoop World 2010 - BAH - Fuzzy TableHadoop World 2010 - BAH - Fuzzy Table
Hadoop World 2010 - BAH - Fuzzy Table
 
Btv thesis defense_v1.02-final
Btv thesis defense_v1.02-finalBtv thesis defense_v1.02-final
Btv thesis defense_v1.02-final
 
Automatic Image Annotation (AIA)
Automatic Image Annotation (AIA)Automatic Image Annotation (AIA)
Automatic Image Annotation (AIA)
 
On the Support of a Similarity-Enabled Relational Database Management System ...
On the Support of a Similarity-Enabled Relational Database Management System ...On the Support of a Similarity-Enabled Relational Database Management System ...
On the Support of a Similarity-Enabled Relational Database Management System ...
 
How to Scale from Workstation through Cloud to HPC in Cryo-EM Processing
How to Scale from Workstation through Cloud to HPC in Cryo-EM ProcessingHow to Scale from Workstation through Cloud to HPC in Cryo-EM Processing
How to Scale from Workstation through Cloud to HPC in Cryo-EM Processing
 
Large Scale Data Clustering: an overview
Large Scale Data Clustering: an overviewLarge Scale Data Clustering: an overview
Large Scale Data Clustering: an overview
 
Applying ‘best fit’ frameworks to systematic review data extraction
Applying ‘best fit’ frameworks to systematic review data extractionApplying ‘best fit’ frameworks to systematic review data extraction
Applying ‘best fit’ frameworks to systematic review data extraction
 
Foundations for the Future of Science
Foundations for the Future of ScienceFoundations for the Future of Science
Foundations for the Future of Science
 
A Framework for Comparison and Evaluation of Nonlinear Intra-Subject Image Re...
A Framework for Comparison and Evaluation of Nonlinear Intra-Subject Image Re...A Framework for Comparison and Evaluation of Nonlinear Intra-Subject Image Re...
A Framework for Comparison and Evaluation of Nonlinear Intra-Subject Image Re...
 
Deep Learning Initiative @ NECSTLab
Deep Learning Initiative @ NECSTLabDeep Learning Initiative @ NECSTLab
Deep Learning Initiative @ NECSTLab
 
Ivan Sahumbaiev "Deep Learning approaches meet 3D data"
Ivan Sahumbaiev "Deep Learning approaches meet 3D data"Ivan Sahumbaiev "Deep Learning approaches meet 3D data"
Ivan Sahumbaiev "Deep Learning approaches meet 3D data"
 
Basic use of xcms
Basic use of xcmsBasic use of xcms
Basic use of xcms
 
Enabling open and reproducible computer systems research: the good, the bad a...
Enabling open and reproducible computer systems research: the good, the bad a...Enabling open and reproducible computer systems research: the good, the bad a...
Enabling open and reproducible computer systems research: the good, the bad a...
 
A Proposed Method to Develop Shared Papers for Researchers at Conference
A Proposed Method to Develop Shared Papers for Researchers at ConferenceA Proposed Method to Develop Shared Papers for Researchers at Conference
A Proposed Method to Develop Shared Papers for Researchers at Conference
 
N0176195102
N0176195102N0176195102
N0176195102
 
Big Data Lessons from the Cloud
Big Data Lessons from the CloudBig Data Lessons from the Cloud
Big Data Lessons from the Cloud
 

Más de Universitat Politècnica de Catalunya

The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...Universitat Politècnica de Catalunya
 
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
Towards Sign Language Translation & Production | Xavier Giro-i-NietoTowards Sign Language Translation & Production | Xavier Giro-i-Nieto
Towards Sign Language Translation & Production | Xavier Giro-i-NietoUniversitat Politècnica de Catalunya
 
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...Universitat Politècnica de Catalunya
 
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in VideosGeneration of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in VideosUniversitat Politècnica de Catalunya
 
Learn2Sign : Sign language recognition and translation using human keypoint e...
Learn2Sign : Sign language recognition and translation using human keypoint e...Learn2Sign : Sign language recognition and translation using human keypoint e...
Learn2Sign : Sign language recognition and translation using human keypoint e...Universitat Politècnica de Catalunya
 
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020Universitat Politècnica de Catalunya
 
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...Universitat Politècnica de Catalunya
 
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020Universitat Politècnica de Catalunya
 
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...Universitat Politècnica de Catalunya
 
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020Universitat Politècnica de Catalunya
 
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)Universitat Politècnica de Catalunya
 
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...Universitat Politècnica de Catalunya
 

Más de Universitat Politècnica de Catalunya (20)

Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Deep Generative Learning for All
Deep Generative Learning for AllDeep Generative Learning for All
Deep Generative Learning for All
 
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
 
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
Towards Sign Language Translation & Production | Xavier Giro-i-NietoTowards Sign Language Translation & Production | Xavier Giro-i-Nieto
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
 
The Transformer - Xavier Giró - UPC Barcelona 2021
The Transformer - Xavier Giró - UPC Barcelona 2021The Transformer - Xavier Giró - UPC Barcelona 2021
The Transformer - Xavier Giró - UPC Barcelona 2021
 
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
 
Open challenges in sign language translation and production
Open challenges in sign language translation and productionOpen challenges in sign language translation and production
Open challenges in sign language translation and production
 
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in VideosGeneration of Synthetic Referring Expressions for Object Segmentation in Videos
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
 
Discovery and Learning of Navigation Goals from Pixels in Minecraft
Discovery and Learning of Navigation Goals from Pixels in MinecraftDiscovery and Learning of Navigation Goals from Pixels in Minecraft
Discovery and Learning of Navigation Goals from Pixels in Minecraft
 
Learn2Sign : Sign language recognition and translation using human keypoint e...
Learn2Sign : Sign language recognition and translation using human keypoint e...Learn2Sign : Sign language recognition and translation using human keypoint e...
Learn2Sign : Sign language recognition and translation using human keypoint e...
 
Intepretability / Explainable AI for Deep Neural Networks
Intepretability / Explainable AI for Deep Neural NetworksIntepretability / Explainable AI for Deep Neural Networks
Intepretability / Explainable AI for Deep Neural Networks
 
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
 
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
 
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
 
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
 
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
 
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
 
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
 
Curriculum Learning for Recurrent Video Object Segmentation
Curriculum Learning for Recurrent Video Object SegmentationCurriculum Learning for Recurrent Video Object Segmentation
Curriculum Learning for Recurrent Video Object Segmentation
 
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
 

Último

A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 

Último (20)

A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 

Tools for Image Retrieval in Large Multimedia Databases

  • 1. Tools for Image Retrieval in Large Multimedia Databases by Carles Ventura Royo Directors: Verónica Vilaplana Xavier Giró Tutor: Ferran Marqués Barcelona, September 2011 1
  • 2. Index  Identifying the problem  State of art: indexing techniques  State of art: Hierarchical Cellular Tree (HCT)  Modifications to the original HCT  Experimental results  Implemented tools  Conclusions and future work lines 2
  • 3. 3 Identifying the problem (I) “House” DATABASE
  • 4. 4 Identifying the problem (II) QUERY TARGETS Visual + textual descriptors “house” “garden” K
  • 5. Identifying the problem (III)  K nearest neighbor problem  Solution: Sequential scan  Drawback: Computational time for large databases (10 s for a 200,000 elements)  Approximate K nearest neighbor problem  Indexing techniques 5
  • 6. Requirements of the solution  Dynamic approach  Multimedia databases are not static  Insertions and deletions  High dimensional feature spaces  “curse of dimensionality” problem  MPEG-7 visual descriptors are high-dimensional feature vectors 6
  • 7. Index  Identifying the problem  State of art: indexing techniques  State of art: Hierarchical Cellular Tree (HCT)  Modifications to the original HCT  Experimental results  Implemented tools  Conclusions and future work lines 7
  • 8. Indexing techniques (I)  Hierarchical data structures  Spatial Access Methods (SAMs)  K-d tree, R-tree, R*-tree, TV-tree, etc.  Drawbacks:  Items have to be represented in an N-dimensional feature space  Dissimilarity measure based on a Lp metric  SAMs do not scale up well to high dimensional spaces 8
  • 9. Indexing techniques (II)  Hierarchical data structures  Metric Access Methods (MAMs)  VP-tree, MVP-tree, GNAT, M-tree, etc.  More general approach than SAMs  Assuming only a similarity distance function  MAMs scale up well to high dimensional spaces  Drawbacks:  Static MAMs do not support dynamic changes  Dependence on pre-fixed parameters 9
  • 10. Indexing techniques (III)  Locality Sensitive Hashing  It uses hash functions  Nearby data points are hashed into the same bucket with a high probability  Points faraway are hashed into the same bucket with a low probability  Drawback:  It does not solves the K nearest neighbor problem, but the Є-near neighbor problem. 10
  • 11. Index  Identifying the problem  State of art: indexing techniques  State of art: Hierarchical Cellular Tree (HCT)  Modifications to the original HCT  Experimental results  Implemented tools  Conclusions and future work lines 11
  • 12. Solution adopted  Hierarchical Cellular Tree (HCT)  MAM-based indexing scheme  Hierarchical structure  Self-organized tree  Incremental construction in a bottom-up fashion  Unbalanced tree  Not dependence on a maximum capacity  Preemptive cell search algorithm for insertion  Dynamic approach [KG07] S. Kiranyaz and M.Gabbouj, Hierarchical Cellular Tree: An efficient indexing scheme for content-based retrieval on multimedia databases. 12
  • 13. HCT: Cell Structure (I)  Basic container structure  Undirected graph  Minimum Spanning Tree (MST)  Cell nucleus  Covering radius 13
  • 14. HCT: Cell Structure (II)  Cell compactness  Maturity size  Mitosis operation 14
  • 15. HCT: Level Structure  Representatives for each cell from the lower level  Responsible for maximizing the compactness of its cells  Compactness threshold 15
  • 16. HCT Operations (I)  Item insertion  Find the most suitable cell  Most Similar Nucleus vs Preemptive Cell Search 16
  • 17. HCT Operations (II)  Item insertion  Find the most suitable cell  Most Similar Nucleus vs Preemptive Cell Search C2: d(O,O2 N) - r(O2 N) < dmin C3: d(O,O3 N) - r(O3 N) > dmin 17
  • 18. HCT Operations (III)  Item insertion  Find the most suitable cell  Append the element  Generic post-processing check  Mitosis operation  Nucleus change 18
  • 19. HCT Operations (IV)  Item removal  Cell search algorithm not required  Remove the element  Generic post-processing check  Mitosis operation  Nucleus change 19
  • 20. HCT: Retrieval scheme  Progressive Query  Periodical subqueries over database subsets  Query Path formation  Based on Most Similar Nucleus  Ranking agreggation 20
  • 21. Index  Identifying the problem  State of art: indexing techniques  State of art: Hierarchical Cellular Tree (HCT)  Modifications to the original HCT  Experimental results  Implemented tools  Conclusions and future work lines 21
  • 22. Modifications to the HCT (I)  Covering radius  Original definition gives an approximation by defect  Consider all the elements belonging to the subtree  High computational cost  Approximation by excess 22
  • 23. Modifications to the HCT (II)  Covering radius 23
  • 24. Modifications to the HCT (III)  Covering radius 24 rC (C9) = max • d(E,B)+rC(C1) • rC(C2) • d(E,H)+rC(C3)
  • 25. Modifications to the HCT (III)  HCT construction  Preemptive Cell Search over all the levels  A method for updating the covering radius  To reduce the searching time  It can be performed after the HCT construction or periodically 25
  • 26. Modifications to the HCT (IV)  Searching techniques  PQ fails in solving the KNN problem efficiently  New searching techniques  Most Similar Nucleus  Preemptive Cell Search  Hybrid  Number of cells to be considered  Minimum number of cells  Cells hosting 2·K elements  Cellular structure is not kept 26
  • 27. Index  Identifying the problem  State of art: indexing techniques  State of art: Hierarchical Cellular Tree (HCT)  Modifications to the original HCT  Experimental results  Implemented tools  Conclusions and future work lines 27
  • 28. Experimental results  CCMA image database of 216,317 elements  HCT building evaluation  Construction time  Retrieval system evaluation  Retrieval time  Elements retrieved 28
  • 33. Retrieval system evaluation (II) 33  Evaluation with respect to exhaustive search  Mean Competitive Recall  Elements in common  Mean Normalized Aggregate Goodness  Kendall distance  Number of exchanges needed in a bubble sort  Query set of 1,082 images
  • 34. Retrieval system evaluation (III) 34  Preemptive Cell Search
  • 35. Retrieval system evaluation (IV)  Searching techniques comparative 35
  • 36. Index  Identifying the problem  State of art: indexing techniques  State of art: Hierarchical Cellular Tree (HCT)  Modifications to the original HCT  Experimental results  Implemented tools  Conclusions and future work lines 36
  • 37. Implemented tools (I)  database_indexing tool  Tool for indexing an image database  HCT is stored at disk  hct_query tool  Tool for carrying out a search over an indexed database  HCT is read from disk and load at main memory 37
  • 38. Implemented tools (II)  A server/client architecture  Based on a messaging system: KSC 38
  • 39. Index  Identifying the problem  State of art: indexing techniques  State of art: Hierarchical Cellular Tree (HCT)  Modifications to the original HCT  Experimental results  Implemented tools  Conclusions and future work lines 39
  • 40. Conclusions  Hierarchical Cellular Tree implementation  To improve the retrieval times  Generic implementation for any kind of data  Modifications proposed  HCT evaluation  Measures extracted from literature  Preemptive Cell Search technique gives the best performance  It is essential not to use an underestimated value for the covering radius 40
  • 41. Future work lines  Very large databases  Not using only main memory  Region-based CBIR system  Each image can be represented by a set of regions  Browser application based on HCT  Take advantage of the hierarchical structure  Alternative way to retrieve elements 41
  • 42. Thanks for your attention 42
  • 43. HCT Operations (IV)  HCT construction 43
  • 44. HCT Operations (V)  HCT construction 44
  • 45. A tool for image DB indexing  MPEG-7 visual descriptors  Indexing a database  Inserting new elements to an indexed database 45
  • 46. A tool for image retrieval  It requires an indexed database  Image query  Image file  XML file with the visual descriptors  TXT file with several image queries  Interactive mode  Performing a search 46
  • 47. A server/client architecture  Based on a messaging system: KSC 47
  • 48. Retrieval system evaluation (IV) 48  Most Similar Nucleus Preemptive: 0.8319 seconds (retrieval time) 27.51 (CR) and 99.26% retrieved queries
  • 49. Retrieval system evaluation (V) 49  Progressive Query (Most Similar Nucleus) Preemptive: 0.8319 seconds (retrieval time) 27.51 (CR) and 99.26% retrieved queries
  • 50. Retrieval system evaluation (VI) 50  Hybrid  Preemptive Cell Search over 8 uppermost levels Preemptive: 0.8319 seconds (retrieval time) 27.51 (CR) and 99.26% retrieved queries

Notas del editor

  1. Good morning, I’m going to defend my master’s final project dissertation, titled Tools for Image Retrieval in Large Multimedia Databases, which has been directed by Veronica Vilaplana and Xavi Giró and tutored by Ferran Marques.
  2. First of all, we’re going to identify which is the problem to be solved.
  3. When we are looking for images in an image database, there are basically two ways for informing the retrieval system of what we’re looking for: through words or by giving an image, which is called query.
  4. Our retrieval system is based on the latter one. Thus, we give a query image to the system in order to retrieve the K most similar elements from a given database.This problem is known as the K nearest neighbor problem. In order to compute how similar the images are in our retrieval system, the images have been represented by some visual descriptors defined in MPEG-7, which are compared by using some dissimlaritiy functions also defined in this standard.
  5. The way to obtain these most similar elements is performing an exhaustive search over the whole database. However, this sequential scan is not feasible in a large database. To get an idea, an exhaustive search performed on a database of about 200,000 elements lasts 10 seconds, time which will not be acceptable for the user. Therefore, CBIR systems needs incorporating indexing techniques in order to scale up well when they work over large databases in order to achieve retrieval times that would be acceptable for the user. On the other hand, the exact K nearest neighbors may not be retrieved and we may obtain an approximated solution to the KNN problem.
  6. In particular, we need an indexing technique which fulfills the following requirements: A dynamic approach. Since multimedia databases are not static, we want an index structure which allows insertions and deletions of the indexed elements without having to reindex the whole database from scratch. Some indexing techniques suffer from the so-called “curse of dimensionality” problem when they are used in high dimensional feature spaces and, therefore, become more inefficient than even an exhaustive search. Since we work with the MPEG-7 visual descriptors, which are high-dimensional feature vectors, we need an indexing technique which is appropriate for high dimensional feature spaces.
  7. Once it’s clear the problem to be solved, we’re going to analyze which indexing techniques are available in the literature.
  8. The most traditional indexing techniques are formed mostly in a hierarchical tree structure. These indexing techniques can be mainly grouped in two categories: spatial access methods and metric access methods. The applicability of SAMs is limited by the fact that the items have to be represented by points in an N-dimensional feature space and the dissimilarity measure between two points has to be based on a distance function in Lp metric such as Euclidean distance. Furthermore, the main disadvantage of SAMs is that they do not scale up well to high dimensional spaces.
  9. The metric access methods give a more general approach than SAMs and do not suffer from the “curse of dimensionality” problem. Therefore, they scale up well to high dimensional spaces. However, some existing MAMs present several drawbacks. For instance, the static MAMs do not support dynamic changes such as new insertions or deletions. Even though some MAM-based indexing techniques provide a dynamic approach, their dependence on pre-fixed parameters can lead to significantly varying performances.
  10. Another very popular indexing technique is the Locality Sensitive Hashing, which is not based on a hierarchical tree structure. Its main disadvatge is that LSH, by its nature, does not solve the KNN problem but the e-near neighbor problem, which consists in gathering the elements within a predefined e-distance from the query point.
  11. After analyzing several indexing techniques from the literature, we have decided to implement an indexing technique called Hierarchical Cellular Tree, which was designed to bring an effective solution especially for indexing large multimedia databases.
  12. The HCT is a MAM-based indexing technique with a hierarchical structure, which consists of one or more levels and each level in turn holds one or more cells. It is a self-organized tree, which basically means that the operations (item insertion, removal, etc.) are not externally controlled, but they are carried out according to some internal rules. The HCT is built dynamically in a bottom-up fashion and is an unbalanced tree optimized for achieving highly focused cells. The HCT does not depend on a maximum capacity and, therefore, it has no limit for the cell size as long as the cell keeps a definite compactness measure. In order to perform an optimum search for the insertion process, the HCT includes a preemptive cell search algorithm. Finally, the HCT is also characterized for its totally dynamic approach.
  13. Now, we’re going to detail the two main components of the HCT: the cell structure and the level structure. A cell is a basic container structure where similar database elements are stored. The distances between each pair of elements are assigned to each edge of a connected, undirected graph. We also have the Minimum Spanning Tree, which is the subgraph that connects all the vertices together with the minimum cumulative total weight. The cell nucleus is the element which represents the cell on the upper level. It is essential to promote the best item for this representation since these elements are used during the top-down search for item insertion or for query requests. Therefore, it was proposed to choose the element having the maximum number of branches in the MST. Then, the covering radius is defined as the distance from the nucleus to the furthest element in the cell.
  14. Another cell parameter is the cell compactness, which quantifies how focused or compact the clustering for the items within the cell is. It is computed as a function of the following cell parameters: mean and standard deviation of the MST branch weights, covering radius, maximum MST branch weight and the number of elements. The cell compactness plays a key role in deciding whether or not to perform mitosis within the cell at any instant, and so does the maturity size. The maturity size is a prefixed parameter. When a cell holds a number of elements greater than or equal to the maturity size value, we consider it mature. Therefore, a cell can suffer from mitosis operation only if it is mature and it is not compact enough. When mitosis is granted, MST is again used to decide how to split the cell by breaking the branch with largest weight of the MST.
  15. As mentioned before, the HCT has a hierarchical structure, which is formed by one or more levels. The elements belonging to a particular level are the representants for each cell from the lower level except, obviously, the ground level, which contains the entire database. Each level is the responsible for maximizing the compactness of its cell. With this purpose, each level updates its own compactness threshold, which is computed by applying the median operator over the compactness values of its mature cells. This parameter is used to evaluate whether a cell is compact enough.
  16. Now, we’re going to detail the two external operations that can be carried out over a HCT: item insertion and item removal. Whenever an item insertion is performed, the first thing to do is to find the most suitable cell to which the element must be appended in the ground level. In this figure, element C is the closest nucleus from the ground level. In order to perform this operation in an efficient way, a search algorithm called Preemptive Cell Search was proposed instead of the traditional Most Similar Nucleus. The MS-Nucleus assumes that the closest nucleus yields the best subtree during descend. In the figure, the element being inserted is first compared to the nucleus objects from the upper level. Since the nucleus of cell C1 is the closest one, the two other cells would be discarded. Therefore, element C would be wrongly discarded.
  17. On the other hand, the pre-emptive cell search algorithm performs a preemptive analysis on the upper level to find out all possible nucleus objects which might yield the closest objects on the lower level. In this figure, since d(O,ON3)-r(ON3) &amp;gt; dmin, cell C3 can not yield the closest nucleus and, therefore, is discarded. On the other hand, since d(O,ON2)-r(ON2) &amp;lt; dmin, cell C2 is not discarded. Therefore, element C will be considered and the element being inserted will be appended to the most suitable cell.
  18. Once the most suitable cell has been found, the element is appended. Then, the cell becomes subject to a generic post-processing check. First, the cell is examined for a mitosis operation. In case mitosis is not performed, it is necessary to check if the nucleus has changed due to the insertion.
  19. The item removal is an operation that does not require any cell search algorithm. The element to be removed is taken out of the cell and the cell becomes subject to a generic post-processing check as in an item insertion operation.
  20. In the original article, a retrieval scheme called Progressive Query was proposed. This searching technique consists in performing periodical sub-queries over subsets of database items. The order in which the comparisons are done is given by the Query Path, which is formed based on the Most Similar Nucleus searching technique. The rankings obtained over each subset are aggregated after each subquery.
  21. So far, we have presented the Hierarchical Cellular Tree, the solution adopted for indexing large multimedia databases. Therefore, the HCT has been implemented in our development platform. Since we have detected some weak points, now we are going to present some proposed modifications to the original HCT.
  22. The covering radius was originally defined as the distance from the nucleus to the furthest element in the cell. However, we consider that we should take into account not only the elements belonging to the corresponding cell, but also all the elements belonging to its subtree. Therefore, the original HCT is working with a poor approximation by defect. Since the computation of the exact covering radius can have a high computational cost, we have proposed an approximation by excess, which guarantees that no subtree will be wrongly discarded. Therefore, the most suitable cell will be always found.
  23. For instance, in order to compute the covering radius for cell C9, all the elements in the cells C1, C2 and C3 should be considered, instead of using only the elements belonging to that cell. Using the approximation by excess that we have proposed, the covering radius for the cell C9 is computed as follows:
  24. The covering radius for the cell C9 is computed as the maximum among the following values: Covering radius of cell C1 plus the distance from element B (its nucleus) to element E, the cell C9 nucleus Covering radius of cell C2 Covering radius of cell C3 plus the distance from element H (its nucleus) to element E, the cell C9 nucleus
  25. Originally, it was proposed to adopt a hybrid approach to build the HCT, using the Preemptive Cell Search algorithm only in a certain number of the uppermost levels and the Most Similar Nucleus technique in the lowest ones. Since the Most Similar Nucleus is used, the element being inserted may not be appended to the most suitable cell. That’s the reason why we have decided to build the HCT by using the Preemptive Cell Search algorithm over all the levels. Furthermore, we have implemented a method for updating the covering radius to its actual value in order to reduce the searching time of both the insertion operations and query requests. In this way, we will visit only the cells which are strictly necessary. This method can be applied after the HCT construction or periodically.
  26. As we will see in the results, the Progressive Query fails in solving the KNN problem efficiently. That’s the reason why we have decided to implement new searching techniques. Therefore, we have adapted the cell search algorithms for insertion operations to searching techniques for retrieval. In particular, we are interested on the Preemptive Cell Search algorithm since it takes advantage of the covering radius and it guarantees that the most suitable cell is retrieved. Since the cell whose nucleus is the nearest element to the query may not contain the most similar elements, we have proposed considering a minimum number of cells. We have also proposed that the number of cells being considered depends on the number of expected results. Therefore, we have decided that the cells considered must host at least twice the number of elements expected. These elements are sorted individually without keeping the cellular structure.
  27. Once the original HCT and some proposed modifications have been presented, we’re going to proceed to their evaluation.
  28. We have selected a database which consists of approximately 200,000 elements given by the Corporació Catalana de Mitjans Audiovisuals. Although the main objective of these experiments is to evaluate how fast the retrieval system is and how good the obtained results are depending on the searching technique and the HCT used, we also want to analyze the time required for the HCT to be built.
  29. First of all, we are going to analyze the building time. We can see the time required for building the HCT when the original covering radius is used is by far the best one. However, as we will see in the retrieval system evaluation, this approach gives the worst performance. This is because the original covering radius is an approximation by defect and, therefore, a number of cells smaller than the strictly necessary are being visited during the insertion operations. Although the building time required by the proposed covering radius is worse, this approach give a much better performance. Furthermore, this time can be reduced by applying periodically the update method without affecting its performance. Thus, we have reduced the building time in a 23% by applying the update method every 5000 insertions.
  30. Now, we’re going to evaluate the retrieval system. We want to compare the approximate ranking, obtained by using a searching technique over the HCT, with the exact ranking given by the exhaustive search. Here we have an example of a search in which an image of an anchorman has been used as query. All the images illustrated represent the results of an exhaustive search over the whole image database. The images marked with a green rectangle have been retrieved by the Preemptive Cell Search algorithm whereas the ones which are marked with a red rectangle are the missing ones in the approximate ranking.
  31. Here we have an example in which three images have not been retrieved by the Preemptive Cell Search technique.
  32. And another one in which all the images have been retrieved.
  33. In order to compare the approximate ranking, obtained by using a searching technique over the HCT, with the exact ranking given by the exhaustive search, we have selected three different measures from the literature: The Mean Competitive Recall, which is defined as the number of elements in common. The Mean Normalized Aggregate Goodness.It is only 1 when the k nearest elements are retrieved and is only 0 when the k farthest elements are retrieved, the worst case. The Kendall distance, which turns out to be equal to the number of exchanges needed in a bubble sort. These measures have been computed over a query set of 1082 images.
  34. Now, we want to present the results obtained by the different searching techniques when the number of resuts expected by the user is 40. First of all, we’re going to analyze the Preemptive Cell Search algorithm depending on the covering radius used and whether the update method for the covering radius has been applied or not. We can see that the retrieval system behaves by far the worst when the original covering radius is used without the update method since there are only 12.35 out of 40 elements in common between the rankings on average. Futhermore, only 49.26% of the query images have been retrieved. However, the quality of the results for the original covering radius is remarkably improved when the proposed update method for the covering radius is used. Therefore, we can see how important it is not to use an underestimated value for the covering radius. That’s the reason why the performance of the retrieval system is also so good when the proposed covering radius is used regardless whether the update method is applied or not. Thus, in a retrieval time of about 1 second, 99% of the image queries are retrieved and there are about 28 out of 40 elements in common between the rankings on average. These results are also corroborated by the other two quality measures.
  35. Here we have a table which will allow us to compare the implemented searching techniques. We can see that the retrieval time increases jointly with the number of the levels over which the Preemptive Cell Search algorithm is applied. On the other hand, the greater the number of levels over which the Preemptive technique is applied, the better the quality measures. Since we consider that a retrieval time of about 1 second is absolutely acceptable for the user, we can conclude that the Preemptive Cell Search technique gives the best performance. Finally, we want to compare it to the Progressive Query, which is the retrieval scheme originally proposed. If we consider a larger number of elements instead of only 80 elements to obtain the approximate ranking, the Most Similar Nucleus technique becomes equivalent to the first subqueries performed by the Progressive Query. From the results showed in the table, we can see that even when we consider 20,000 elements, which means a retrieval time of 1.37 seconds, the quality of the results is worse than when the Preemptive Cell Search algorithm is used. In such interval of time, there are only 12.33 out of 40 elements in common between the rankings on average and only 31.61% of the query images have been retrieved.
  36. Finally, we’re going to present some tools which have been implemented to make it easier for the user to use the HCT.
  37. We have implemented a tool called database_indexing which is used for indexing an image database.The resulting HCT is stored at hard disk. Then, in order to use this HCT to perform searches over it, the user should use the hct_query tool.
  38. Furthermore, the image retrieval system based on the HCT has been also implemented over a server/client architecture. On the one hand, we have the hct_building tool, a daemon program which loads an indexed database and is constantly running waiting for new query requests. On the other hand, there is the query_request tool, which is the tool used to send the query request. Since we also need a means of communication between these tools, we have selected a messaging system called KSC.
  39. Finally, let’s draw the conclusions and present the future work lines.
  40. As seen, in this project we have implemented an indexing technique called Hierarchical Cellular Tree in order to improve the retrieval times of the CBIR system over large databases. Although the HCT has been chosen to be used in a particular scenario, its implementation allows to index any type of data and, therefore, the HCT is useful for indexing a generic database as long as there is a function to measure the dissimilarity between any pair of elements. Futhermore, some modifications have been proposed which have resulted in an improvement of the quality of the retrieved elements. In order to evaluate its performance, we have used some contrasted measures extracted from the literature. Finally, we conclude that the Preemptive Cell Search gives the best performance when is used for retrieval. Furthermore, it is essential not to use an underestimated value for the covering radius, either building the HCT with the proposed covering radius or applying the update method over the HCT built with the original covering radius.
  41. This project opens the door to new future work lines. The main work consists in adapting the HCT implementation for working with very large databases which do not allow to be indexed by only using the main memory. The implementation of this indexing technique will also allow to improve the retrieval time of any region-based CBIR system. For instance, if each image is represented by a set of 100 regions, an small image database of 1,000 images results in a large database of 100,000 elements. Finally, another future work line from this project is the implementation of a browser for an image database, as it is proposed in the original article, since the hierarchical structure of the HCT is quite appropriate to give an overview to the user about what lies under the current level. Thus, the browser can be also used as an alternative way to retrieve elements from an image database.
  42. Here we have an illustrative example of HCT construction where the elements are represented by colored circles and the similarity between two elements is given by how similar in color are. First, all the elements are inserted in the unique cell of the HCT, which is created in the ground level, until this becomes mature and not compact enough. In this example, this happens when the yellow ball labeled as 1 is inserted.
  43. As a result, the cell is split into two new cells and a new top level is created. The new nuclei of each child are computed (the yellow ball labeled as 1 and the red ball labeled as f) and they are inserted in the new top cell. This process continues until all the elements of the database are inserted.
  44. Although the HCT has been implemented for indexing any element type, we have implemented a tool which is used to index image database elements which are represented by some MPEG-7 visual descriptors and are compared by using the dissimilarity measures also defined in MPEG-7. The user is asked to precompute this descriptors for each image before using this tool. We have to specify which are the elements to be indexed by a TXT file including the path of the XML files containing the visual descriptors and the directory where the HCT will be stored at disk. We also specify the visual descriptor in which we are intrerested and some HCT parameters as the maturity size. Furthermore, this tool also allows the user to insert new elements to a previously indexed database by giving the path of the HCT stored at disk.
  45. Another tool which allows the user to carry out a search on an indexed database has been also implemented. The image query is specified by giving either its filename or the XML file including the visual descriptors. This tool also allows the user to perform several retrievals by giving a TXT file including the different image queries. We have also implemented an interactive mode in which the user is asked if he wants to perform a new retrieval without waiting for the HCT being load again. Here we have an example of use of this tool.
  46. The image retrieval system based on the HCT has been also implemented over a server/client architecture. On the one hand, we have the hct_building tool, a daemon program which loads an indexed database and is constantly running waiting for new query requests. On the other hand, there is the query_request tool, which is the tool used to send the query request. Since we also need a means of communication between these tools, we have selected a messaging system called KSC. The objective of the KSC is to control the registration of the clients, to find out to which message type the clients subscribe and to forward the messages to the subscribed clients. Thus, we have defined a message type named SearchKSCMessage, which defines the parameters of the search, and a message type named SearchCompletedKSCMessage which informs that the searching process has been properly completed. The query_request tool subscribes as a writer of the SearchKSCMessage and as a reader of the SearchCompletedKSCMessage, whereas the hct_building tool subscribes as a reader of the SearchKSCMessage and as a writer of the SearchCompletedKSCMessage.
  47. Now, we are going to analyze the same experiment but using the Most Similar Nucleus technique instead. For this technique, the covering radius is not used during the search so it does not matter whether the update method is used or not. Analyzing the results we come to the conclusion that the Most Similar Nucleus technique gives a bad performance since the query image is retrieved only in the 5% of the query requests and there are only 1.35 out of 40 elements in common between the rankings on average. These results are even worse when the original covering radius is used.
  48. If we consider a larger number of elements, the Most Similar Nucleus technique becomes equivalent to the first subqueries performed by the Progressive Query. From the results showed in the table, we can see that even when we consider 20,000 elements, which means a retrieval time of 1.37 seconds, the quality of the results is worse than whether the Preemptive Cell Search algorithm is used. In such interval of time, there are only 12.33 out of 40 elements in common between the rankings on average.
  49. Finally, we consider the Hybrid technique, which combines the two searching techniques analyzed before. The results showed in the table have been obtained when the Preemptive Cell Search algorithm has been applied over the 8 uppermost levels, and the Most Similar Nucleus technique in the lowest ones. For the Hybrid technique, we have a retrieval time which is always shorter than the Preemptive Cell retrieval time and longer tha the Most Similar Nucleus retrieval time. On the contrary, the quality of the elements retrieved by using the Hybrid technique is always better than applying the Most Similar Nucleus and worse than using the Preemptive Cell Search.
  50. And another one in which
  51. And another one in which
  52. And another one in which
  53. And another one in which
  54. And another one in which
  55. And another one in which
  56. And another one in which
  57. And another one in which
  58. And another one in which
  59. And another one in which