SlideShare una empresa de Scribd logo
1 de 44
Human-enhanced
Multimedia Processing

in CuBRIK with SMILA
Alessandro Bozzon, Ph.d.

Politecnico di Milano
mail: bozzon@elet.polimi.it
twitter: aleboz
Human-enhanced
Multimedia Processing

in CuBRIK with SMILA
Alessandro Bozzon, Ph.d.

Politecnico di Milano
mail: bozzon@elet.polimi.it
twitter: aleboz
The CUbRIK project

       36 month large-scale
        integrating project
       partially funded by the
        European Commission’s
        7th Framework ICT
        Programme for
        Research and
        Technological
        Development
       www.cubrikproject.eu



 5/17/2012               SMILA Themenkonferenz   2
Objectives

    The technical goal of CUbRIK is to build an open
     search platform grounded on four objectives:
        Advance the architecture of multimedia search
        Place humans in the loop
        Open the search box
        Start up a search business ecosystem




 5/17/2012                SMILA Themenkonferenz          3
Objective: Advance the
architecture of multimedia search

    Multimedia search: coordinated result of three
     main processes:
        Content processing: acquisition, analysis, indexing
         and knowledge extraction from multimedia content
        Query processing: derivation of an information need
         from a user and production of a sensible response
        Feedback processing: quality feedback on the
         appropriateness of search results




 5/17/2012               SMILA Themenkonferenz             4
Objective: Advance the
architecture of multimedia search

    Objective:
        Content processing, query processing and feedback
         processing phases will be implemented by means of
         independent components
        Components are organized in pipelines
        Each application defines ad-hoc pipelines that provide
         unique multimedia search capabilities in that scenario




 5/17/2012                 SMILA Themenkonferenz                  5
CUbRIK architecture




 5/17/2012       SMILA Themenkonferenz   6
SMILA is the backbone of CUbRIK
    CUbRIK makes use of SMILA framework as a start-up service
     engine for supporting workflow definition and execution
    Provides architectural extensions to SMILA for enhanced
     services:
         Extensible content, query and feedback processing search workflow
         Multimodality, Orchestration of human and machine computation tasks in all
          search processes
         Time and Space Awareness
         Support for social and human computation
         Persistency and Caching of content and metadata
         Support of federated configurations across a distributed architecture
         Different styles of User Interface for queries and presentation of search
          results
    Includes tools and methods for application design


 6 March 2012                     The CUbRIK Project is ....                       7
Objective: Humans in the loop
    Problem: the uncertainty of analysis algorithms leads to
     low confidence results and conflicting opinions on
     automatically extracted features
    Solution: humans have superior capacity for
     understanding the content of audiovisual material
        State of the art: humans replace automatic feature extraction
         processes (human annotations)




        Our contribution: integration of human judgment and algorithms
                Goal: improve the performance of multimedia content processing




 5/17/2012                           SMILA Themenkonferenz                        88
Example of CUbRIK Human-enhanced
computation: Trademark Logo Detection

    Problem statement: identifying occurrences of
     trademark logos in a video collection through
     keyword-based queries
        Special case of the classic problem of object recognition




    Use case: a professional user wants to retrieve all
     the occurrences of logos in a large collection of video
     clips
    Applications: rating effectiveness of advertising,
     subliminal advertising detection, automatic
     annotation, trademark violation detection

                                                                     99
Human-powered trademark logo
detection demo

    Goal: integrate human and automatic
     computation to increase precision and recall
     w.r.t. fully automatic solutions




 5/17/2012            SMILA Themenkonferenz         10
Trademark Logo Detection: problems in
automatic logo detection
    Problems in automatic logo detection:
        Object recognition is affected by the quality of the
         input set of images




        Uncertain matches, i.e., the ones with low matching
         score, could not contain the searched logo




                                                                11
Trademark Logo Detection:
contribution of human computation
    Contribution in human computation
        Filter the input logos, eliminating the irrelevant ones
        Segment the input logos




        Validate the matching results


                                                                   12
Trademark Logo Detection: pipeline




                                     13


                                     13
The CrowdSearcher framework
for HC task management




                               14

                              14
CrowdSearch framework in the
Logo detection application

           Problem solving
              process
 Process




             Task        Crowd
                          Task
                                 Types of tasks
                                 • Automatic tasks
                                 • Crowd tasks: tasks that are executed by an
                                    open-ended community of performers
            Crowd Task




                                                                                1515
Community of Performers

  Content edges,
  e.g., IS-A, part.of   Content elements
                                               The application is deployed as a
                                               Facebook application

                                               Seed community
                                               Information Technology
                        Performer to content   department of Politecnico di
                        edges, e.g., topical
                        group membership
                                               Milano
       Performers
      edges, e.g.,
        friendship,
          weak ties
                                               Task propagation
                          Performers           Each user in the seed
                                               community can propagate
                                               tasks through the social
                                               networks




                                                                                   16
                                                                                  16
Design of “Validate Logo Images”

             The “LIKE” task variant requires to choose
             relevant logos among a set of not filtered images




Human Task
  Design

             The “ADD”task variant requires to add new
             relevant image URLs
                                           Please add new relevant logos
                                            URL…


                                                              Send




                                                                           17
People to task matching & Task
Assignment

Task Deployment Criteria        Execution criteria
                                Constraints of task execution
    Content Affinity Criteria
                                        Time budget for the experiment
      Execution Criteria
                                Content Affinity criteria
                                Query on a representation of the users’ capacities
                                • Current state: manual selection of users
                  People to     • Future work: Geocultural affinity
                task matching
                                Questions are dispatched to the crowd according to the
                                user experience in answering questions
                                • Expert user: an user that has already answered to
                                  three questions

                      Task      New users answer to “LIKE” questions
                   assignment
                                Expert users answer to “LIKE”+“ADD” questions

                                                                                     18
                                                                                     18
Task propagation

    Propagation over the Facebook graph:
        Platform: CrowdSearcher
                Automatic task generation starting from a set of design
                 criteria (e.g., question type, public/private…)
        Seed community: Information Technology
         department of Politecnico di Milano
                Each user in the seed community can propagate tasks
                 through the social networks
        Work in progress:
                Twitter/LinkedIn tasks
                Task assignment according to expertise, geocultural
                 information, past work history


 5/17/2012                          CUbRIK Pipelines 1                     19
Task execution
   Task
 execution


   “LIKE” task variant   “ADD” task variant




                                               20
                                              20
Output aggregation


                                “LIKE” task variants
                                Top-5 rated logos are
                                selected as relevant logos
    Task                        “ADD” task variants
  execution                     New images are fed back to
                                the LIKE tasks
              Task outputs


                             Task output

                Output
              aggregation



                                                              21
                                                             21
Experimental evaluation

    Three experimental settings:
        No human intervention
        Logo validation performed by two domain experts
        Inclusion of the actual crowd knowledge
    Crowd involvement
        40 people involved
        50 task instances generated
        70 collected answers




                                                           22
Experimental evaluation


           1

          0.9

          0.8
                                     Crowd
          0.7
                                                                              Experts
          0.6
                                                                           Experts
 Recall




                                       Experts
          0.5                                                                                     Aleve
          0.4                    Crowd                                                            Chunky
          0.3
                         No Crowd                                                                 Shout
          0.2                                      Crowd             No Crowd
          0.1

           0        No Crowd
                0      0.1     0.2     0.3   0.4      0.5      0.6   0.7      0.8       0.9   1

                                                   Precision


                                                                                                           23
Experimental evaluation


           1

          0.9

          0.8                                               Precision decreases
                                     Crowd
          0.7
                                                                           Experts
          0.6                                               Reasons for the wrong inclusion
                                                                   Experts
 Recall




                                       Experts              • Geographical location of the users
          0.5                                                                             Aleve
                                                            • Expertise of the involved users
          0.4                    Crowd                                                         Chunky
          0.3
                         No Crowd                                                              Shout
          0.2                                      Crowd             No Crowd
          0.1

           0        No Crowd
                0      0.1     0.2     0.3   0.4      0.5      0.6   0.7   0.8       0.9   1

                                                   Precision


                                                                                                        24
Experimental evaluation


           1
                                                                     Precision decreases
                                                                     • Similarity between two
          0.9
                                                                        logos in the data set
          0.8
                                     Crowd
          0.7
                                                                              Experts
          0.6
                                                                           Experts
 Recall




                                       Experts
          0.5                                                                                     Aleve
          0.4                    Crowd                                                            Chunky
          0.3
                         No Crowd                                                                 Shout
          0.2                                      Crowd             No Crowd
          0.1

           0        No Crowd
                0      0.1     0.2     0.3   0.4      0.5      0.6   0.7      0.8       0.9   1

                                                   Precision


                                                                                                           25
Crowdsourced filtering of logos –
Problem concept



                  Google
                  Images
                                                Filtered logos




                     Filter
                     Tasks


                                                 Added logos
                      Add
                     Tasks




 5/17/2012                 CUbRIK Pipelines 1                    26
Integration in SMILA

    The demo has been integrated into the SMILA
     architecture
    Two main parts:
        Indexing part: made of asynchronous components
         (in a SMILA sense)
                Indexing of videos
                Matching phase
                Interaction with the crowd
        Search part: end users query the system by
         keyword-based queries



 5/17/2012                          CUbRIK Pipelines 1    27
Integration in SMILA




 5/17/2012         CUbRIK Pipelines 1   28
Integration in SMILA: Indexing
part overview




 5/17/2012         CUbRIK Pipelines 1   29
Reusable components

    Crawling
        Google Images + Flickr crawler
    Multimedia processing
        SIFT-based low level feature extraction
        Video segmentation component
        Key-frame extractor
        Robust low level feature matching component
    Data storage
        “Data service” for referencing multimedia resources



 5/17/2012                  CUbRIK Pipelines 1                 30
Integration in SMILA: Indexing
part – Job1, Input images retrieval




 5/17/2012          CUbRIK Pipelines 1   31
Integration in SMILA: Indexing part –
Job2, Logo collection indexing




 5/17/2012            CUbRIK Pipelines 1   32
Integration in SMILA: Indexing part –
Job3, video collection indexing




 5/17/2012            CUbRIK Pipelines 1   33
Integration in SMILA: Indexing part –
Job4, matching phase




 5/17/2012            CUbRIK Pipelines 1   34
Integration in SMILA: Indexing part –
Job5, matches filtering




 5/17/2012            CUbRIK Pipelines 1   35
Demo: Search interface




 5/17/2012        CUbRIK Pipelines 1   36
Demo: Search interface




 5/17/2012        CUbRIK Pipelines 1   37
Demo: Search interface


                                       Indexed logos that
                                       match against the
                                         video collection




 5/17/2012        CUbRIK Pipelines 1                        38
Demo: Search interface




                Video preview




 5/17/2012          CUbRIK Pipelines 1   39
Demo: Search interface




                      High confidence
                          matches




 5/17/2012        CUbRIK Pipelines 1    40
Demo: Search interface




                 Low confidence
                    matches




 5/17/2012         CUbRIK Pipelines 1   41
CUbRIK Showcases


    CUbRIK will showcase its technology with Demonstrators
     of examples of innovation in two domains:
       (Digital Libraries) History of Europe

       (Business Processes) CUbRIK search for SMEs,




    Technical evaluation in real-world conditions including
     users will be based on these Demonstrators




 6 March 2012             The CUbRIK Project is ....           42
Thanks for your attention
               www.cubrikproject.eu




5/17/2012            SMILA Themenkonferenz   43

Más contenido relacionado

Destacado

2012.09.26.CUbRIK at CHORUS + (the business)
2012.09.26.CUbRIK at CHORUS + (the business)2012.09.26.CUbRIK at CHORUS + (the business)
2012.09.26.CUbRIK at CHORUS + (the business)CUbRIK Project
 
Matching Game Mechanics and Human Computation Tasks in Games with a Purpose
Matching Game Mechanics and Human Computation Tasks in Games with a PurposeMatching Game Mechanics and Human Computation Tasks in Games with a Purpose
Matching Game Mechanics and Human Computation Tasks in Games with a PurposeCUbRIK Project
 
Introduction to Sohuman2012
Introduction to Sohuman2012Introduction to Sohuman2012
Introduction to Sohuman2012CUbRIK Project
 
2012.09.26.CUbRIK at CHORUS + (the progress)
2012.09.26.CUbRIK at CHORUS + (the progress)2012.09.26.CUbRIK at CHORUS + (the progress)
2012.09.26.CUbRIK at CHORUS + (the progress)CUbRIK Project
 
WWW 2012 - The CUbRIK Project
WWW 2012 - The CUbRIK ProjectWWW 2012 - The CUbRIK Project
WWW 2012 - The CUbRIK ProjectCUbRIK Project
 
CUbRIK at SoHuman 2012: A draw and-guess game to segment images
CUbRIK at SoHuman 2012: A draw and-guess game to segment imagesCUbRIK at SoHuman 2012: A draw and-guess game to segment images
CUbRIK at SoHuman 2012: A draw and-guess game to segment imagesCUbRIK Project
 
mercato-assicurativo-italiano-opportunita-nella-patria-del-rinascimento-suiss...
mercato-assicurativo-italiano-opportunita-nella-patria-del-rinascimento-suiss...mercato-assicurativo-italiano-opportunita-nella-patria-del-rinascimento-suiss...
mercato-assicurativo-italiano-opportunita-nella-patria-del-rinascimento-suiss...Golinucci Broker
 

Destacado (9)

2012.09.26.CUbRIK at CHORUS + (the business)
2012.09.26.CUbRIK at CHORUS + (the business)2012.09.26.CUbRIK at CHORUS + (the business)
2012.09.26.CUbRIK at CHORUS + (the business)
 
Matching Game Mechanics and Human Computation Tasks in Games with a Purpose
Matching Game Mechanics and Human Computation Tasks in Games with a PurposeMatching Game Mechanics and Human Computation Tasks in Games with a Purpose
Matching Game Mechanics and Human Computation Tasks in Games with a Purpose
 
Introduction to Sohuman2012
Introduction to Sohuman2012Introduction to Sohuman2012
Introduction to Sohuman2012
 
2012.09.26.CUbRIK at CHORUS + (the progress)
2012.09.26.CUbRIK at CHORUS + (the progress)2012.09.26.CUbRIK at CHORUS + (the progress)
2012.09.26.CUbRIK at CHORUS + (the progress)
 
WWW 2012 - The CUbRIK Project
WWW 2012 - The CUbRIK ProjectWWW 2012 - The CUbRIK Project
WWW 2012 - The CUbRIK Project
 
CUbRIK at SoHuman 2012: A draw and-guess game to segment images
CUbRIK at SoHuman 2012: A draw and-guess game to segment imagesCUbRIK at SoHuman 2012: A draw and-guess game to segment images
CUbRIK at SoHuman 2012: A draw and-guess game to segment images
 
CUbRIK is
CUbRIK is CUbRIK is
CUbRIK is
 
mercato-assicurativo-italiano-opportunita-nella-patria-del-rinascimento-suiss...
mercato-assicurativo-italiano-opportunita-nella-patria-del-rinascimento-suiss...mercato-assicurativo-italiano-opportunita-nella-patria-del-rinascimento-suiss...
mercato-assicurativo-italiano-opportunita-nella-patria-del-rinascimento-suiss...
 
Strategia broker assicurativi
Strategia broker assicurativiStrategia broker assicurativi
Strategia broker assicurativi
 

Similar a CUbRIK at SMILA Conference in Berlin

The CrowdSearch framework
The CrowdSearch frameworkThe CrowdSearch framework
The CrowdSearch frameworkEleonora Ciceri
 
The CrowdSearch framework
The CrowdSearch frameworkThe CrowdSearch framework
The CrowdSearch frameworkCUbRIK Project
 
Keynote at-icpc-2020
Keynote at-icpc-2020Keynote at-icpc-2020
Keynote at-icpc-2020Ralf Laemmel
 
Social Enabler for XPages
Social Enabler for XPagesSocial Enabler for XPages
Social Enabler for XPagesNiklas Heidloff
 
Open Design @ Tec Guadalajara - Mexico - 23/08/2011
Open Design @ Tec Guadalajara - Mexico - 23/08/2011Open Design @ Tec Guadalajara - Mexico - 23/08/2011
Open Design @ Tec Guadalajara - Mexico - 23/08/2011Massimo Menichinelli
 
DevOps Gamification Workshop at JTEL Summer School 2015
DevOps Gamification Workshop at JTEL Summer School 2015DevOps Gamification Workshop at JTEL Summer School 2015
DevOps Gamification Workshop at JTEL Summer School 2015IstvanKoren
 
Integrating digital traces into a semantic enriched data
Integrating digital traces into a semantic enriched dataIntegrating digital traces into a semantic enriched data
Integrating digital traces into a semantic enriched dataDhaval Thakker
 
Building Customized Text Mining Tools via Shiny Framework: The Future of Data...
Building Customized Text Mining Tools via Shiny Framework: The Future of Data...Building Customized Text Mining Tools via Shiny Framework: The Future of Data...
Building Customized Text Mining Tools via Shiny Framework: The Future of Data...Olga Scrivner
 
Software Analytics: Towards Software Mining that Matters (2014)
Software Analytics:Towards Software Mining that Matters (2014)Software Analytics:Towards Software Mining that Matters (2014)
Software Analytics: Towards Software Mining that Matters (2014)Tao Xie
 
Using Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIsUsing Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIsRakuten Group, Inc.
 
Extensibility of IBM Connections using XPages and Open Source from OpenNTF
Extensibility of IBM Connections using XPages and Open Source from OpenNTFExtensibility of IBM Connections using XPages and Open Source from OpenNTF
Extensibility of IBM Connections using XPages and Open Source from OpenNTFLetsConnect
 
Social Connections Amsterdam 2012 - Extensibility of IBM Connections
Social Connections Amsterdam 2012 - Extensibility of IBM ConnectionsSocial Connections Amsterdam 2012 - Extensibility of IBM Connections
Social Connections Amsterdam 2012 - Extensibility of IBM ConnectionsNiklas Heidloff
 
Artificial intelligence in android development
Artificial intelligence in android developmentArtificial intelligence in android development
Artificial intelligence in android developmentanikeshkumar11
 
Bridging Current Reality & Future Vision with Reality Maps
Bridging Current Reality & Future Vision with Reality MapsBridging Current Reality & Future Vision with Reality Maps
Bridging Current Reality & Future Vision with Reality MapsMalini Rao
 
ECLAP 2013 tutorial at Porto, April 2013
ECLAP 2013 tutorial at Porto, April 2013ECLAP 2013 tutorial at Porto, April 2013
ECLAP 2013 tutorial at Porto, April 2013Paolo Nesi
 
Pharos Social Map Based Recommendation For Content Centric Social Websites
Pharos Social Map Based Recommendation For Content Centric Social WebsitesPharos Social Map Based Recommendation For Content Centric Social Websites
Pharos Social Map Based Recommendation For Content Centric Social Websitesgu wendong
 
ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...
ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...
ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...Tom Mens
 

Similar a CUbRIK at SMILA Conference in Berlin (20)

The CrowdSearch framework
The CrowdSearch frameworkThe CrowdSearch framework
The CrowdSearch framework
 
The CrowdSearch framework
The CrowdSearch frameworkThe CrowdSearch framework
The CrowdSearch framework
 
Keynote at-icpc-2020
Keynote at-icpc-2020Keynote at-icpc-2020
Keynote at-icpc-2020
 
Social Enabler for XPages
Social Enabler for XPagesSocial Enabler for XPages
Social Enabler for XPages
 
Developing Social Networks
Developing Social NetworksDeveloping Social Networks
Developing Social Networks
 
Open Design @ Tec Guadalajara - Mexico - 23/08/2011
Open Design @ Tec Guadalajara - Mexico - 23/08/2011Open Design @ Tec Guadalajara - Mexico - 23/08/2011
Open Design @ Tec Guadalajara - Mexico - 23/08/2011
 
DevOps Gamification Workshop at JTEL Summer School 2015
DevOps Gamification Workshop at JTEL Summer School 2015DevOps Gamification Workshop at JTEL Summer School 2015
DevOps Gamification Workshop at JTEL Summer School 2015
 
Integrating digital traces into a semantic enriched data
Integrating digital traces into a semantic enriched dataIntegrating digital traces into a semantic enriched data
Integrating digital traces into a semantic enriched data
 
Building Customized Text Mining Tools via Shiny Framework: The Future of Data...
Building Customized Text Mining Tools via Shiny Framework: The Future of Data...Building Customized Text Mining Tools via Shiny Framework: The Future of Data...
Building Customized Text Mining Tools via Shiny Framework: The Future of Data...
 
Software Analytics: Towards Software Mining that Matters (2014)
Software Analytics:Towards Software Mining that Matters (2014)Software Analytics:Towards Software Mining that Matters (2014)
Software Analytics: Towards Software Mining that Matters (2014)
 
Gaurav agarwal
Gaurav agarwalGaurav agarwal
Gaurav agarwal
 
Using Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIsUsing Algorithmia to leverage AI and Machine Learning APIs
Using Algorithmia to leverage AI and Machine Learning APIs
 
Extensibility of IBM Connections using XPages and Open Source from OpenNTF
Extensibility of IBM Connections using XPages and Open Source from OpenNTFExtensibility of IBM Connections using XPages and Open Source from OpenNTF
Extensibility of IBM Connections using XPages and Open Source from OpenNTF
 
Social Connections Amsterdam 2012 - Extensibility of IBM Connections
Social Connections Amsterdam 2012 - Extensibility of IBM ConnectionsSocial Connections Amsterdam 2012 - Extensibility of IBM Connections
Social Connections Amsterdam 2012 - Extensibility of IBM Connections
 
Artificial intelligence in android development
Artificial intelligence in android developmentArtificial intelligence in android development
Artificial intelligence in android development
 
Bridging Current Reality & Future Vision with Reality Maps
Bridging Current Reality & Future Vision with Reality MapsBridging Current Reality & Future Vision with Reality Maps
Bridging Current Reality & Future Vision with Reality Maps
 
ECLAP 2013 tutorial at Porto, April 2013
ECLAP 2013 tutorial at Porto, April 2013ECLAP 2013 tutorial at Porto, April 2013
ECLAP 2013 tutorial at Porto, April 2013
 
Pharos Social Map Based Recommendation For Content Centric Social Websites
Pharos Social Map Based Recommendation For Content Centric Social WebsitesPharos Social Map Based Recommendation For Content Centric Social Websites
Pharos Social Map Based Recommendation For Content Centric Social Websites
 
ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...
ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...
ICSME 2016 keynote: An ecosystemic and socio-technical view on software maint...
 
Winter Projects GDSC IITK
Winter Projects GDSC IITKWinter Projects GDSC IITK
Winter Projects GDSC IITK
 

Más de CUbRIK Project

Humanist machine interaction with histoGraph
Humanist machine interaction with histoGraphHumanist machine interaction with histoGraph
Humanist machine interaction with histoGraphCUbRIK Project
 
histoGraph presented to MMSP 2013
histoGraph presented to MMSP 2013histoGraph presented to MMSP 2013
histoGraph presented to MMSP 2013CUbRIK Project
 
histoGraph for historians
histoGraph for historianshistoGraph for historians
histoGraph for historiansCUbRIK Project
 
histoGraph: a case study in Digital Humanities
histoGraph: a case study in Digital HumanitieshistoGraph: a case study in Digital Humanities
histoGraph: a case study in Digital HumanitiesCUbRIK Project
 
CUbRIK research on social aspects
CUbRIK research on social aspectsCUbRIK research on social aspects
CUbRIK research on social aspectsCUbRIK Project
 
Building a social graph for the history of Europe: the CUbRIK histoGraph
Building a social graph for the history of Europe: the CUbRIK histoGraphBuilding a social graph for the history of Europe: the CUbRIK histoGraph
Building a social graph for the history of Europe: the CUbRIK histoGraphCUbRIK Project
 
The CUbRIK histoGraph Factsheet
The CUbRIK histoGraph FactsheetThe CUbRIK histoGraph Factsheet
The CUbRIK histoGraph FactsheetCUbRIK Project
 
CUbRIK Fashion Trend Analysis: a Business Intelligence Application
CUbRIK Fashion Trend Analysis: a Business Intelligence ApplicationCUbRIK Fashion Trend Analysis: a Business Intelligence Application
CUbRIK Fashion Trend Analysis: a Business Intelligence ApplicationCUbRIK Project
 
CUbRIK Social Graph Visual Interface
CUbRIK Social Graph Visual InterfaceCUbRIK Social Graph Visual Interface
CUbRIK Social Graph Visual InterfaceCUbRIK Project
 
Mining Emotions in Short Films: User Comments or Crowdsourcing?
Mining Emotions in Short Films: User Comments or Crowdsourcing?Mining Emotions in Short Films: User Comments or Crowdsourcing?
Mining Emotions in Short Films: User Comments or Crowdsourcing?CUbRIK Project
 
CUbRIK and gaming experience@Qualinet
CUbRIK and gaming experience@QualinetCUbRIK and gaming experience@Qualinet
CUbRIK and gaming experience@QualinetCUbRIK Project
 
CUbRIK: Open Box. Multimedia and Human Computation approach
CUbRIK: Open Box. Multimedia and Human Computation approachCUbRIK: Open Box. Multimedia and Human Computation approach
CUbRIK: Open Box. Multimedia and Human Computation approachCUbRIK Project
 
ICT 2013: Better Society: empowering Horizon 2020 with trustable social media
ICT 2013: Better Society: empowering Horizon 2020 with trustable social mediaICT 2013: Better Society: empowering Horizon 2020 with trustable social media
ICT 2013: Better Society: empowering Horizon 2020 with trustable social mediaCUbRIK Project
 
How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...
How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...
How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...CUbRIK Project
 
CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...
CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...
CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...CUbRIK Project
 
CUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a Purpose
CUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a PurposeCUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a Purpose
CUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a PurposeCUbRIK Project
 
CUbRIK tutorial at ICWE 2013: part 1 Introduction to Human Computation
CUbRIK tutorial at ICWE 2013: part 1 Introduction to Human ComputationCUbRIK tutorial at ICWE 2013: part 1 Introduction to Human Computation
CUbRIK tutorial at ICWE 2013: part 1 Introduction to Human ComputationCUbRIK Project
 
Semantic schema for geonames
Semantic schema for geonamesSemantic schema for geonames
Semantic schema for geonamesCUbRIK Project
 
Exploiting User Generated Content for Mountain Peak Detection
Exploiting User Generated Content for Mountain Peak DetectionExploiting User Generated Content for Mountain Peak Detection
Exploiting User Generated Content for Mountain Peak DetectionCUbRIK Project
 

Más de CUbRIK Project (20)

Humanist machine interaction with histoGraph
Humanist machine interaction with histoGraphHumanist machine interaction with histoGraph
Humanist machine interaction with histoGraph
 
histoGraph presented to MMSP 2013
histoGraph presented to MMSP 2013histoGraph presented to MMSP 2013
histoGraph presented to MMSP 2013
 
histoGraph for historians
histoGraph for historianshistoGraph for historians
histoGraph for historians
 
histoGraph: a case study in Digital Humanities
histoGraph: a case study in Digital HumanitieshistoGraph: a case study in Digital Humanities
histoGraph: a case study in Digital Humanities
 
SMILA in CUbRIK
SMILA in CUbRIKSMILA in CUbRIK
SMILA in CUbRIK
 
CUbRIK research on social aspects
CUbRIK research on social aspectsCUbRIK research on social aspects
CUbRIK research on social aspects
 
Building a social graph for the history of Europe: the CUbRIK histoGraph
Building a social graph for the history of Europe: the CUbRIK histoGraphBuilding a social graph for the history of Europe: the CUbRIK histoGraph
Building a social graph for the history of Europe: the CUbRIK histoGraph
 
The CUbRIK histoGraph Factsheet
The CUbRIK histoGraph FactsheetThe CUbRIK histoGraph Factsheet
The CUbRIK histoGraph Factsheet
 
CUbRIK Fashion Trend Analysis: a Business Intelligence Application
CUbRIK Fashion Trend Analysis: a Business Intelligence ApplicationCUbRIK Fashion Trend Analysis: a Business Intelligence Application
CUbRIK Fashion Trend Analysis: a Business Intelligence Application
 
CUbRIK Social Graph Visual Interface
CUbRIK Social Graph Visual InterfaceCUbRIK Social Graph Visual Interface
CUbRIK Social Graph Visual Interface
 
Mining Emotions in Short Films: User Comments or Crowdsourcing?
Mining Emotions in Short Films: User Comments or Crowdsourcing?Mining Emotions in Short Films: User Comments or Crowdsourcing?
Mining Emotions in Short Films: User Comments or Crowdsourcing?
 
CUbRIK and gaming experience@Qualinet
CUbRIK and gaming experience@QualinetCUbRIK and gaming experience@Qualinet
CUbRIK and gaming experience@Qualinet
 
CUbRIK: Open Box. Multimedia and Human Computation approach
CUbRIK: Open Box. Multimedia and Human Computation approachCUbRIK: Open Box. Multimedia and Human Computation approach
CUbRIK: Open Box. Multimedia and Human Computation approach
 
ICT 2013: Better Society: empowering Horizon 2020 with trustable social media
ICT 2013: Better Society: empowering Horizon 2020 with trustable social mediaICT 2013: Better Society: empowering Horizon 2020 with trustable social media
ICT 2013: Better Society: empowering Horizon 2020 with trustable social media
 
How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...
How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...
How Do We Deep-Link? Leveraging User-Contributed Time-Links for Non-Linear Vi...
 
CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...
CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...
CUbRIK Research at CIKM 2012: Efficient Jaccard-based Diversity Analysis of L...
 
CUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a Purpose
CUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a PurposeCUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a Purpose
CUbRIK Tutorial at ICWE 2013: part 2 - Introduction to Games with a Purpose
 
CUbRIK tutorial at ICWE 2013: part 1 Introduction to Human Computation
CUbRIK tutorial at ICWE 2013: part 1 Introduction to Human ComputationCUbRIK tutorial at ICWE 2013: part 1 Introduction to Human Computation
CUbRIK tutorial at ICWE 2013: part 1 Introduction to Human Computation
 
Semantic schema for geonames
Semantic schema for geonamesSemantic schema for geonames
Semantic schema for geonames
 
Exploiting User Generated Content for Mountain Peak Detection
Exploiting User Generated Content for Mountain Peak DetectionExploiting User Generated Content for Mountain Peak Detection
Exploiting User Generated Content for Mountain Peak Detection
 

Último

Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 

Último (20)

Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 

CUbRIK at SMILA Conference in Berlin

  • 1. Human-enhanced Multimedia Processing in CuBRIK with SMILA Alessandro Bozzon, Ph.d. Politecnico di Milano mail: bozzon@elet.polimi.it twitter: aleboz
  • 2. Human-enhanced Multimedia Processing in CuBRIK with SMILA Alessandro Bozzon, Ph.d. Politecnico di Milano mail: bozzon@elet.polimi.it twitter: aleboz
  • 3. The CUbRIK project  36 month large-scale integrating project  partially funded by the European Commission’s 7th Framework ICT Programme for Research and Technological Development  www.cubrikproject.eu 5/17/2012 SMILA Themenkonferenz 2
  • 4. Objectives  The technical goal of CUbRIK is to build an open search platform grounded on four objectives:  Advance the architecture of multimedia search  Place humans in the loop  Open the search box  Start up a search business ecosystem 5/17/2012 SMILA Themenkonferenz 3
  • 5. Objective: Advance the architecture of multimedia search  Multimedia search: coordinated result of three main processes:  Content processing: acquisition, analysis, indexing and knowledge extraction from multimedia content  Query processing: derivation of an information need from a user and production of a sensible response  Feedback processing: quality feedback on the appropriateness of search results 5/17/2012 SMILA Themenkonferenz 4
  • 6. Objective: Advance the architecture of multimedia search  Objective:  Content processing, query processing and feedback processing phases will be implemented by means of independent components  Components are organized in pipelines  Each application defines ad-hoc pipelines that provide unique multimedia search capabilities in that scenario 5/17/2012 SMILA Themenkonferenz 5
  • 7. CUbRIK architecture 5/17/2012 SMILA Themenkonferenz 6
  • 8. SMILA is the backbone of CUbRIK  CUbRIK makes use of SMILA framework as a start-up service engine for supporting workflow definition and execution  Provides architectural extensions to SMILA for enhanced services:  Extensible content, query and feedback processing search workflow  Multimodality, Orchestration of human and machine computation tasks in all search processes  Time and Space Awareness  Support for social and human computation  Persistency and Caching of content and metadata  Support of federated configurations across a distributed architecture  Different styles of User Interface for queries and presentation of search results  Includes tools and methods for application design 6 March 2012 The CUbRIK Project is .... 7
  • 9. Objective: Humans in the loop  Problem: the uncertainty of analysis algorithms leads to low confidence results and conflicting opinions on automatically extracted features  Solution: humans have superior capacity for understanding the content of audiovisual material  State of the art: humans replace automatic feature extraction processes (human annotations)  Our contribution: integration of human judgment and algorithms  Goal: improve the performance of multimedia content processing 5/17/2012 SMILA Themenkonferenz 88
  • 10. Example of CUbRIK Human-enhanced computation: Trademark Logo Detection  Problem statement: identifying occurrences of trademark logos in a video collection through keyword-based queries  Special case of the classic problem of object recognition  Use case: a professional user wants to retrieve all the occurrences of logos in a large collection of video clips  Applications: rating effectiveness of advertising, subliminal advertising detection, automatic annotation, trademark violation detection 99
  • 11. Human-powered trademark logo detection demo  Goal: integrate human and automatic computation to increase precision and recall w.r.t. fully automatic solutions 5/17/2012 SMILA Themenkonferenz 10
  • 12. Trademark Logo Detection: problems in automatic logo detection  Problems in automatic logo detection:  Object recognition is affected by the quality of the input set of images  Uncertain matches, i.e., the ones with low matching score, could not contain the searched logo 11
  • 13. Trademark Logo Detection: contribution of human computation  Contribution in human computation  Filter the input logos, eliminating the irrelevant ones  Segment the input logos  Validate the matching results 12
  • 14. Trademark Logo Detection: pipeline 13 13
  • 15. The CrowdSearcher framework for HC task management 14 14
  • 16. CrowdSearch framework in the Logo detection application Problem solving process Process Task Crowd Task Types of tasks • Automatic tasks • Crowd tasks: tasks that are executed by an open-ended community of performers Crowd Task 1515
  • 17. Community of Performers Content edges, e.g., IS-A, part.of Content elements The application is deployed as a Facebook application Seed community Information Technology Performer to content department of Politecnico di edges, e.g., topical group membership Milano Performers edges, e.g., friendship, weak ties Task propagation Performers Each user in the seed community can propagate tasks through the social networks 16 16
  • 18. Design of “Validate Logo Images” The “LIKE” task variant requires to choose relevant logos among a set of not filtered images Human Task Design The “ADD”task variant requires to add new relevant image URLs Please add new relevant logos URL… Send 17
  • 19. People to task matching & Task Assignment Task Deployment Criteria Execution criteria Constraints of task execution Content Affinity Criteria Time budget for the experiment Execution Criteria Content Affinity criteria Query on a representation of the users’ capacities • Current state: manual selection of users People to • Future work: Geocultural affinity task matching Questions are dispatched to the crowd according to the user experience in answering questions • Expert user: an user that has already answered to three questions Task New users answer to “LIKE” questions assignment Expert users answer to “LIKE”+“ADD” questions 18 18
  • 20. Task propagation  Propagation over the Facebook graph:  Platform: CrowdSearcher  Automatic task generation starting from a set of design criteria (e.g., question type, public/private…)  Seed community: Information Technology department of Politecnico di Milano  Each user in the seed community can propagate tasks through the social networks  Work in progress:  Twitter/LinkedIn tasks  Task assignment according to expertise, geocultural information, past work history 5/17/2012 CUbRIK Pipelines 1 19
  • 21. Task execution Task execution “LIKE” task variant “ADD” task variant 20 20
  • 22. Output aggregation “LIKE” task variants Top-5 rated logos are selected as relevant logos Task “ADD” task variants execution New images are fed back to the LIKE tasks Task outputs Task output Output aggregation 21 21
  • 23. Experimental evaluation  Three experimental settings:  No human intervention  Logo validation performed by two domain experts  Inclusion of the actual crowd knowledge  Crowd involvement  40 people involved  50 task instances generated  70 collected answers 22
  • 24. Experimental evaluation 1 0.9 0.8 Crowd 0.7 Experts 0.6 Experts Recall Experts 0.5 Aleve 0.4 Crowd Chunky 0.3 No Crowd Shout 0.2 Crowd No Crowd 0.1 0 No Crowd 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Precision 23
  • 25. Experimental evaluation 1 0.9 0.8 Precision decreases Crowd 0.7 Experts 0.6 Reasons for the wrong inclusion Experts Recall Experts • Geographical location of the users 0.5 Aleve • Expertise of the involved users 0.4 Crowd Chunky 0.3 No Crowd Shout 0.2 Crowd No Crowd 0.1 0 No Crowd 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Precision 24
  • 26. Experimental evaluation 1 Precision decreases • Similarity between two 0.9 logos in the data set 0.8 Crowd 0.7 Experts 0.6 Experts Recall Experts 0.5 Aleve 0.4 Crowd Chunky 0.3 No Crowd Shout 0.2 Crowd No Crowd 0.1 0 No Crowd 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Precision 25
  • 27. Crowdsourced filtering of logos – Problem concept Google Images Filtered logos Filter Tasks Added logos Add Tasks 5/17/2012 CUbRIK Pipelines 1 26
  • 28. Integration in SMILA  The demo has been integrated into the SMILA architecture  Two main parts:  Indexing part: made of asynchronous components (in a SMILA sense)  Indexing of videos  Matching phase  Interaction with the crowd  Search part: end users query the system by keyword-based queries 5/17/2012 CUbRIK Pipelines 1 27
  • 29. Integration in SMILA 5/17/2012 CUbRIK Pipelines 1 28
  • 30. Integration in SMILA: Indexing part overview 5/17/2012 CUbRIK Pipelines 1 29
  • 31. Reusable components  Crawling  Google Images + Flickr crawler  Multimedia processing  SIFT-based low level feature extraction  Video segmentation component  Key-frame extractor  Robust low level feature matching component  Data storage  “Data service” for referencing multimedia resources 5/17/2012 CUbRIK Pipelines 1 30
  • 32. Integration in SMILA: Indexing part – Job1, Input images retrieval 5/17/2012 CUbRIK Pipelines 1 31
  • 33. Integration in SMILA: Indexing part – Job2, Logo collection indexing 5/17/2012 CUbRIK Pipelines 1 32
  • 34. Integration in SMILA: Indexing part – Job3, video collection indexing 5/17/2012 CUbRIK Pipelines 1 33
  • 35. Integration in SMILA: Indexing part – Job4, matching phase 5/17/2012 CUbRIK Pipelines 1 34
  • 36. Integration in SMILA: Indexing part – Job5, matches filtering 5/17/2012 CUbRIK Pipelines 1 35
  • 37. Demo: Search interface 5/17/2012 CUbRIK Pipelines 1 36
  • 38. Demo: Search interface 5/17/2012 CUbRIK Pipelines 1 37
  • 39. Demo: Search interface Indexed logos that match against the video collection 5/17/2012 CUbRIK Pipelines 1 38
  • 40. Demo: Search interface Video preview 5/17/2012 CUbRIK Pipelines 1 39
  • 41. Demo: Search interface High confidence matches 5/17/2012 CUbRIK Pipelines 1 40
  • 42. Demo: Search interface Low confidence matches 5/17/2012 CUbRIK Pipelines 1 41
  • 43. CUbRIK Showcases  CUbRIK will showcase its technology with Demonstrators of examples of innovation in two domains:  (Digital Libraries) History of Europe  (Business Processes) CUbRIK search for SMEs,  Technical evaluation in real-world conditions including users will be based on these Demonstrators 6 March 2012 The CUbRIK Project is .... 42
  • 44. Thanks for your attention www.cubrikproject.eu 5/17/2012 SMILA Themenkonferenz 43