SlideShare a Scribd company logo
1 of 73
Download to read offline
Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6
September 2012
OUTLINE OF THE TALK
 Introduction
 Online Advertising

 A Modern Contextual Advertising System
        Syntactic Textual Analysis
        Semantic Textual Analysis
        Matching
        An Example: ConCA
        Experimental Results
 Conclusions
 References



Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
INTRODUCTION
OUTERNET & INTERNET




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
OUTERNET & INTERNET
   In Atkinson’s view something is missing…




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
OUTERNET & INTERNET
   In Atkinson’s view something is missing…




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
OUTERNET & INTERNET
   In Atkinson’s view something is missing…




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
OUTERNET & INTERNET
   In Atkinson’s view something is missing…




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
OUTERNET & INTERNET
   In Atkinson’s view something is missing…




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
ONLINE ADVERTISING
ONLINE ADVERTISING




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
ONLINE ADVERTISING
   Sponsored Search




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
ONLINE ADVERTISING
   Banner Advertising




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
ONLINE ADVERTISING
   Contextual Advertising




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
CONTEXTUAL ADVERTISING
               Webpage                                              Ad




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
ONLINE ADVERTISING
   Is it always a good thing?




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
ONLINE ADVERTISING
   Is it always a good thing?




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
A MODERN CONTEXTUAL
ADVERTISING SYSTEM
A MODERN CONTEXTUAL ADVERTISING SYSTEM




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SYNTACTIC TEXTUAL ANALYSIS
 Text Summarization
 Bag of Words Representation




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SYNTACTIC TEXTUAL ANALYSIS
   Text summarization




        State of the art techniques
            First and Last Paragraph (FLP)
            Title, First and Last Paragraph (TFLP)

            Snippet (S)

            Title and Snippet (TS)




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SYNTACTIC TEXTUAL ANALYSIS
      First and Last Paragraph (FLP)
           You don’t need to shell out thousands,
           survive various ballots, or swap a family
           member for a ticket to enjoy the 2012
           Summer Olympic Games this year. There's
           all manner of free events and associated
           shenanigans taking place in London and
           across the UK to mark the occasion. Here
           are ten ways to join in without spending any
           money.

           Indulge in a family feast
           Volunteer chefs at 24 Sure Start Centres
           across the UK are preparing to dish up free
           delights throughout the period. Details,
           along with all the other events that make up
           the Cultural Olympiad, are available on the
           site.
http://www.roughguides.com/website/Travel/SpotLight/ViewSpotLight.aspx?spotLightID=575
SYNTACTIC TEXTUAL ANALYSIS
      Title, First and Last Paragraph (TFLP)
           You don’t need to shell out thousands,
           survive various ballots, or swap a family
           member for a ticket to enjoy the 2012
           Summer Olympic Games this year. There's
           all manner of free events and associated
           shenanigans taking place in London and
           across the UK to mark the occasion. Here
           are ten ways to join in without spending any
           money.

           Indulge in a family feast
           Volunteer chefs at 24 Sure Start Centres
           across the UK are preparing to dish up free
           delights throughout the period. Details,
           along with all the other events that make up
           the Cultural Olympiad, are available on the
           site.
http://www.roughguides.com/website/Travel/SpotLight/ViewSpotLight.aspx?spotLightID=575
SYNTACTIC TEXTUAL ANALYSIS
      Title, First and Last Paragraph (TFLP)
           You don’t need to shell out thousands,
           survive various ballots, or swap a family
           member for a ticket to enjoy the 2012
           Summer Olympic Games this year. There's
           all manner of free events and associated
           shenanigans taking place London 2012 – Ten ways to celebrate the Olympics for free
                                       in London and
           across the UK to mark the occasion. Here
           are ten ways to join in without spending any
           money.

           Indulge in a family feast
           Volunteer chefs at 24 Sure Start Centres
           across the UK are preparing to dish up free
           delights throughout the period. Details,
           along with all the other events that make up
           the Cultural Olympiad, are available on the
           site.
http://www.roughguides.com/website/Travel/SpotLight/ViewSpotLight.aspx?spotLightID=575
SYNTACTIC TEXTUAL ANALYSIS
      Snippet (S)




http://www.roughguides.com/website/Travel/SpotLight/ViewSpotLight.aspx?spotLightID=575
SYNTACTIC TEXTUAL ANALYSIS
      Title and Snippet (TS)




http://www.roughguides.com/website/Travel/SpotLight/ViewSpotLight.aspx?spotLightID=575
SYNTACTIC TEXTUAL ANALYSIS
   Bag of Words (BoW) representation




        Dimensionality reduction
            Stop-words removal
            Stemming

        Vector representation
              Set of pairs <word, occurrences>




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SYNTACTIC TEXTUAL ANALYSIS
   Stop-words removal
     You don’t need to shell out thousands,
     survive various ballots, or swap a
     family member for a ticket to enjoy the
     2012 Summer Olympic Games this
     year. There's all manner of free events
     and associated shenanigans taking
     place in London and across the UK to
     mark the occasion. Here are ten ways
     to join in without spending any money.




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SYNTACTIC TEXTUAL ANALYSIS
   Stop-words removal

      X X X X                 X
     You don’t need to shell out thousands,
     survive various ballots, X swap X
                              or      a
     family member forX ticket X enjoy the
                      Xa          to    X
     2012 Summer Olympic Games this   X
               X X
     year. There's all manner X free events
                               of
      X
     and associated shenanigans taking
     placeX London and across the UK X
            in          X            X to
                          X are
     mark the occasion. Here X ten ways
            X
     to joinX without spending any money.
     X in X                       X




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SYNTACTIC TEXTUAL ANALYSIS
   Stop-words removal

      X X X X                 X
     You don’t need to shell out thousands,
     survive various ballots, X swap X
                              or       a
     family member forX ticket X enjoy the
                      Xa          to       X
     2012 Summer Olympic Games this    X
               X X
     year. There's all manner X free events
                               of
      X
     and associated shenanigans taking
                                     Shell thousands, survive various
     placeX London and across the UK X
            in          X            X to swap family member ticket
                                     ballots,
                          X are enjoy 2012 Summer Olympic Games
     mark the occasion. Here X ten ways
            X
     to joinX without spending any money.
     X in X                       X year. Manner free events associated
                                     shenanigans taking place London
                                     across UK mark occasion. ten ways
                                     join spending money.



Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SYNTACTIC TEXTUAL ANALYSIS
   Stemming
     Shell thousands, survive various
     ballots, swap family member ticket
     enjoy 2012 Summer Olympic Games
     year. Manner free events associated
     shenanigans taking place London
     across UK mark occasion. ten ways
     join spending money.




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SYNTACTIC TEXTUAL ANALYSIS
   Stemming
     Shell thousands, survive various
                            X
           X
     ballots, swap family member ticket
                        X
                                X
     enjoy 2012 Summer Olympic Games    X
     year. Manner free events associated
                             X         X
                 X
     shenanigans taking place London
                      X
     across UK mark occasion. ten ways
         X                            X
                X
     join spending money.




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SYNTACTIC TEXTUAL ANALYSIS
   Stemming
     Shell thousands, survive various
                            X
           X
     ballots, swap family member ticket
                        X
                                X
     enjoy 2012 Summer Olympic Games     X
     year. Manner free events associated
                             X          X
                 X
     shenanigans taking place London
                      X
     across UK mark occasion. ten ways
         X                             X
                                   Shell thousand, surviv various ballot,
                X
     join spending money.
                                   swap famil member ticket enjoy 2012
                                   Summer Olymp Game year. Manner
                                   free event associat shenanigan tak
                                   place London across UK mark
                                   occasion. ten way join spend money.




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SYNTACTIC TEXTUAL ANALYSIS
   Vector representation
        TFIDF

            <free0.0116>
          <olymp, 0.0235>
          <event, 0.0012>
           <way, 0.0125>
         <london, 0.0421>
         <celebrat, 0.0005>
           <chef, 0.0127>
                  …




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
IS ENOUGH THE SOLE SYNTACTIC APPROACH?




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
IS ENOUGH THE SOLE SYNTACTIC APPROACH?
   Polysemy…

                                                    “BASS”




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
IS ENOUGH THE SOLE SYNTACTIC APPROACH?
   Synonymity…




                           Vehicle                                          Machine
                                              Car                    Auto
                                                        Automobile




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SEMANTIC TEXTUAL ANALYSIS
 Taxonomy-based Classification
 Word Disambiguation




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SEMANTIC TEXTUAL ANALYSIS
   Taxonomy-based Classification
        Classification Features (CF) representation




        Adopted classifiers
            Rocchio
            SVM




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SEMANTIC TEXTUAL ANALYSIS
   Rocchio
        Each centroid is defined as a sum of TF-IDF values of each
         term, normalized by the number of webpages in the class




        The classification is based on the
         cosine of the angle between the
         webpage and the centroid of each class




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SEMANTIC TEXTUAL ANALYSIS
   SVM




        The score is related to the
         distance of the webpage from a
         separation hyperplane

Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SEMANTIC TEXTUAL ANALYSIS
   Word Disambiguation
        Bag of Concepts (BoC) representation




        Adopted lexical supports
              WordNet
              YAGO
              ConceptNet

Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SEMANTIC TEXTUAL ANALYSIS
   WordNet
        A large lexical database
         of English. Nouns, verbs,
         adjectives and adverbs
         are grouped into sets of
         cognitive synonyms
         (synsets), each
         expressing a distinct
         concept.




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SEMANTIC TEXTUAL ANALYSIS
   YAGO
        A semantic knowledge base, derived from Wikipedia,
         WordNet and GeoNames




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SEMANTIC TEXTUAL ANALYSIS
   ConceptNet
        A network of concepts connected by several semantic
         relations (e.g., “IsA”, “PartOf”)




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
MATCHING
 Similarity calculation
 Ranking




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
MATCHING
   Similarity calculation




        Adopted approaches
            Cosine similarity
            Jaccard index


Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
MATCHING
     o   Cosine similarity




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
MATCHING
     o   Jaccard index
              The Jaccard coefficient measures similarity between sample sets,
              and is defined as the size of the intersection divided by the size of
              the union of the sample sets




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
MATCHING
   Ranking
        Adopted approaches
            Simple ranking according to the calculated scores
            Learning to rank model




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
MATCHING
     o   Learning to rank model
              Pointwise approach
               o   Each query-document pair in the training data has a numerical
                   or ordinal score
               o   Regression problem approach: given a single query-document
                   pair, predict its score
              Pairwise approach
               o   Classification problem approach: learning a binary classifier
                   which can tell which document is better in a given pair of
                   documents
              Listwise approach
               o   Optimization problem approach: try to directly optimize the
                   value of one of the above evaluation measures, averaged over
                   all queries in the training data

Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
AN EXAMPLE: CONCA
CONCEPTS IN CONTEXTUAL ADVERTISING
CONCA




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
CONCA




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
RESULTS
SYNTAX VS SEMANTICS
SYNTACTICAL ANALYSIS
   Text summarization techniques comparison
        FLP vs TFLP vs S vs TS
   Comparison metrics

          | {relevant documents}  {retrieved documents} |     TP
                                                        
                       | {retrieved documents} |             TP  FP
          | {relevant documents}  {retrieved documents} |     TP
                                                        
                        | {relevant documents} |             TP  FN
                    
        F1  2
                    

   Taxonomy
        BankSearch Dataset
Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SYNTACTICAL ANALYSIS
   Results
                  FLP           TFLP              S                 TS
      π         0.745          0.832           0.734            0.806
      ρ         0.719          0.801           0.730            0.804
      F1        0.732          0.816           0.732            0.805
      #t           24             26             12                 14

      Adding information about the title improves the
       performances
      TFLP has the best performance




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SEMANTIC ANALYSIS
   Semantic approaches comparison
        Anagnostopoulos et al. (2007) system vs Armano et al.
         (2011-TIR) vs ConCA
   Matching function
      ( p, a)    simBoC  (1   )  simCF

   Comparison metric
                               N     k

                               TP
                               i 1 j 1
                                              ij

          @k          N    k

                         (TP  FP )
                        i 1 j 1
                                         ij        ij




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SEMANTIC ANALYSIS
   Ad repository
        Built by hand by a domain expert
   Taxonomy
        BankSearch Dataset




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SEMANTIC ANALYSIS
   Results

            k       Anagnostopoulos                  Armano et al.             ConCA
                         et al.
                         π             α               π            α      π           α
            1        0.674             0            0.768           0.2   0.773        0.1
            2        0.653             0           0.750            0.2   0.752        0.1
            3        0.617           0.2           0.729            0.3   0.728        0.1
            4        0.582           0.2            0.701           0.3   0.701        0.1
            5        0.546           0.1           0.663            0.0   0.668        0.1




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SEMANTIC ANALYSIS
   Results




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SEMANTIC ANALYSIS
   Results
        Slight improvement by using concepts
        Low values of α → CF more impact then BoC




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SYNTACTICAL ANALYSIS VS SEMANTIC ANALYSIS
   Contextual Advertising System
        Armano et al. (2011-TIR)
   Matching function
      ( p, a)    simBoW  (1   )  simCF

   Comparisons varying α
        α = 1 → pure syntax
        α = 0 → pure semantics
   Comparison metric
                              N     k

                              TP
                              i 1 j 1
                                             ij

         @k          N    k

                        (TP  FP )
                       i 1 j 1
                                        ij        ij



Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SYNTACTICAL ANALYSIS VS SEMANTIC ANALYSIS
   Ad repository
        Built by hand by a domain expert
   Taxonomy
        BankSearch Dataset




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
SYNTACTICAL ANALYSIS VS SEMANTIC ANALYSIS
   Results
           α            π@1             π@2             π@3         π@4     π@5
           0           0.765           0.746           0.719        0.696   0.663
         0.1           0.767           0.749           0.724        0.698   0.663
         0.2           0.768           0.750          0.729         0.699   0.662
         0.3           0.766           0.749          0.729         0.701   0.661
         0.4           0.756           0.747          0.729         0.698   0.658
         0.5           0.744           0.735           0.721        0.693   0.651
         0.6           0.722           0.717           0.703        0.681   0.640
         0.7           0.685           0.687           0.680        0.658   0.625
         0,8           0.632           0.637           0.635        0.614   0.586
         0.9           0.557           0.552           0.548        0.534   0.512
           1           0.408           0.421           0.372        0.388   0.640
Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
CONCLUSIONS
CONCLUSIONS
   Online advertising
        represents one of the major sources of income for a large
         number of websites
        is aimed at suggesting products and services to the
         population of Internet users
   Modern contextual advertising systems
      put ads within the content of a generic, third party,
       webpage
      adopt both syntactical and semantic textual analyses to
       select the most relevant ads for a given webpage
      an example is ConCA




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
CONCLUSIONS
   Results show that
        the impact of semantics is stronger than that of syntax
        adopting more advanced semantic techniques, such as
         concepts, improves the performances
        the more the suggested ads are, the worse the
         performance is




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
REFERENCES
REFERENCES
   Syntactical Textual Analysis
        Armano G., Giuliani A., & Vargiu E. Experimenting text summarization
         techniques for contextual advertising. 2nd Italian Information Retrieval
         Workshop (IIR’11) , 2011.
        Armano G., Giuliani A. & Vargiu, E. Using snippets in text summarization: a
         comparative study and an application. 3rd Italian Information Retrieval
         Workshop (IIR’12), 2012.
        Kolcz A., Prabakarmurthi V. & Kalita J. Summarization as feature selection for
         text categorization. 10th International Conference on Information and
         Knowledge Management (CIKM’01). ACM, New York, NY, USA, pp. 365–370,
         2001.
        Porter M. An algorithm for suffix stripping. Program 14, 3, 130–137, 1980.
        Salton G., Wong A. & Yang C.S, A vector space model for automatic indexing,
         Communications of the ACM, 18, 11, pp.613-620, 1975.




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
REFERENCES
   Semantic Textual Analysis
        Cortes C. & Vapnik, V.N. Support-Vector Networks, Machine Learning, 20,
         1995.
        Fellbaum C. WordNet: An Electronic Lexical Database. Cambridge, MA: MIT
         Press, 1998.
        Liu H. & Singh P. ConceptNet: A practical commonsense reasoning tool-kit. BT
         Technology Journal 22, pp. 211–226, 2004.
        Miller G.A. WordNet: A Lexical Database for English. Communications of the
         ACM, 38, 11, pp. 39-41, 1995.
        Rocchio J. The SMART Retrieval System: Experiments in Automatic Document
         Processing. PrenticeHall, Chapter: Relevance feedback in information
         retrieval, pp. 313–323, 1971.
        Suchanek F.M., Kasneci G. & Weikum G. Yago - A Core of Semantic
         Knowledge. 16th International World Wide Web conference (WWW 2007),
         2007.




Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
REFERENCES
   Matching
      Liu T.Y. Learning to rank for information retrieval. Found. Trends Inf. Retr. 3, 3,
       pp. 225–331, 2009.
      Radomski P.J. & Goeman, T.J. The homogenizing of Minnesota lake fish
       assemblages. Fisheries, 20, pp. 20–23, 1995.

   Comparison Systems
      Anagnostopoulos A., Broder A. Z., Gabrilovich E., Josifovski V. & Riedel L. Just-
       in-time contextual advertising. 16th ACM Conference on Information and
       Knowledge Management (CIKM’07). ACM, New York, NY, USA, pp. 331–340,
       2007.
      Armano G., Giuliani A. & Vargiu E. Studying the impact of text summarization
       on contextual advertising. 8th International Workshop on Text-based
       Information Retrieval (TIR’11), 2011.
      Armano G., Giuliani A. & Vargiu E. Semantic enrichment of contextual
       advertising by using concepts. International Conference on Knowledge
       Discovery and Information Retrieval, 2011.


Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
Contact: Eloisa Vargiu – evargiu@bdigital.org

More Related Content

Viewers also liked

Interpretation
Interpretation Interpretation
Interpretation Son Pham
 
Interpreting in different type of text by i wayan suryasa
Interpreting in different type of text  by i wayan suryasaInterpreting in different type of text  by i wayan suryasa
Interpreting in different type of text by i wayan suryasaSTMIK STIKOM - Bali
 
Analyze fiction irony
Analyze fiction ironyAnalyze fiction irony
Analyze fiction ironyscarllete_27
 
Digital Winners 2013: Aleksander stensby
Digital Winners 2013: Aleksander stensbyDigital Winners 2013: Aleksander stensby
Digital Winners 2013: Aleksander stensbyTelenor Group
 
Influence mapping Toolbox Presentation London 2015
Influence mapping Toolbox Presentation London 2015Influence mapping Toolbox Presentation London 2015
Influence mapping Toolbox Presentation London 2015Jun Julien Matsushita
 
Data Driven PR: 8 Steps to Building Media Attention with Research
Data Driven PR: 8 Steps to Building Media Attention with ResearchData Driven PR: 8 Steps to Building Media Attention with Research
Data Driven PR: 8 Steps to Building Media Attention with ResearchWalkerSands
 
Analysis and Visualization of Real-Time Twitter Data
Analysis and Visualization of Real-Time Twitter DataAnalysis and Visualization of Real-Time Twitter Data
Analysis and Visualization of Real-Time Twitter DataEducational Technology
 
Tip from IBM Connect 2014: Socialytics = Social Business, Big Social Data and...
Tip from IBM Connect 2014: Socialytics = Social Business, Big Social Data and...Tip from IBM Connect 2014: Socialytics = Social Business, Big Social Data and...
Tip from IBM Connect 2014: Socialytics = Social Business, Big Social Data and...SocialBiz UserGroup
 
Business Models in the Data Economy: A Case Study from the Business Partner D...
Business Models in the Data Economy: A Case Study from the Business Partner D...Business Models in the Data Economy: A Case Study from the Business Partner D...
Business Models in the Data Economy: A Case Study from the Business Partner D...Boris Otto
 
Analyzing social conversation: a guide to data mining and data visualization
Analyzing social conversation: a guide to data mining and data visualization Analyzing social conversation: a guide to data mining and data visualization
Analyzing social conversation: a guide to data mining and data visualization Tempero UK
 
Can Digital Data help predict the results of the US elections?
Can Digital Data help predict the results of the US elections? Can Digital Data help predict the results of the US elections?
Can Digital Data help predict the results of the US elections? Laurence Borel
 
Unleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and InsightUnleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and InsightMatthew Russell
 
Searching lexis nexis in power search mode
Searching lexis nexis in power search modeSearching lexis nexis in power search mode
Searching lexis nexis in power search modeJoyce Johnston
 
What is 1st, 2nd, 3rd party data?
What is 1st, 2nd, 3rd party data?What is 1st, 2nd, 3rd party data?
What is 1st, 2nd, 3rd party data?Sparc Media Poland
 
Text Analytics: Yesterday, Today and Tomorrow
Text Analytics: Yesterday, Today and TomorrowText Analytics: Yesterday, Today and Tomorrow
Text Analytics: Yesterday, Today and TomorrowTony Russell-Rose
 

Viewers also liked (16)

Interpretation
Interpretation Interpretation
Interpretation
 
Interpreting in different type of text by i wayan suryasa
Interpreting in different type of text  by i wayan suryasaInterpreting in different type of text  by i wayan suryasa
Interpreting in different type of text by i wayan suryasa
 
Analyze fiction irony
Analyze fiction ironyAnalyze fiction irony
Analyze fiction irony
 
Digital Winners 2013: Aleksander stensby
Digital Winners 2013: Aleksander stensbyDigital Winners 2013: Aleksander stensby
Digital Winners 2013: Aleksander stensby
 
Influence mapping Toolbox Presentation London 2015
Influence mapping Toolbox Presentation London 2015Influence mapping Toolbox Presentation London 2015
Influence mapping Toolbox Presentation London 2015
 
Data Driven PR: 8 Steps to Building Media Attention with Research
Data Driven PR: 8 Steps to Building Media Attention with ResearchData Driven PR: 8 Steps to Building Media Attention with Research
Data Driven PR: 8 Steps to Building Media Attention with Research
 
Analysis and Visualization of Real-Time Twitter Data
Analysis and Visualization of Real-Time Twitter DataAnalysis and Visualization of Real-Time Twitter Data
Analysis and Visualization of Real-Time Twitter Data
 
Tip from IBM Connect 2014: Socialytics = Social Business, Big Social Data and...
Tip from IBM Connect 2014: Socialytics = Social Business, Big Social Data and...Tip from IBM Connect 2014: Socialytics = Social Business, Big Social Data and...
Tip from IBM Connect 2014: Socialytics = Social Business, Big Social Data and...
 
Business Models in the Data Economy: A Case Study from the Business Partner D...
Business Models in the Data Economy: A Case Study from the Business Partner D...Business Models in the Data Economy: A Case Study from the Business Partner D...
Business Models in the Data Economy: A Case Study from the Business Partner D...
 
Analyzing social conversation: a guide to data mining and data visualization
Analyzing social conversation: a guide to data mining and data visualization Analyzing social conversation: a guide to data mining and data visualization
Analyzing social conversation: a guide to data mining and data visualization
 
Can Digital Data help predict the results of the US elections?
Can Digital Data help predict the results of the US elections? Can Digital Data help predict the results of the US elections?
Can Digital Data help predict the results of the US elections?
 
Unleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and InsightUnleashing Twitter Data for Fun and Insight
Unleashing Twitter Data for Fun and Insight
 
Searching lexis nexis in power search mode
Searching lexis nexis in power search modeSearching lexis nexis in power search mode
Searching lexis nexis in power search mode
 
What is 1st, 2nd, 3rd party data?
What is 1st, 2nd, 3rd party data?What is 1st, 2nd, 3rd party data?
What is 1st, 2nd, 3rd party data?
 
Text Analytics: Yesterday, Today and Tomorrow
Text Analytics: Yesterday, Today and TomorrowText Analytics: Yesterday, Today and Tomorrow
Text Analytics: Yesterday, Today and Tomorrow
 
Market Mix Models: Shining a Light in the Black Box
Market Mix Models: Shining a Light in the Black BoxMarket Mix Models: Shining a Light in the Black Box
Market Mix Models: Shining a Light in the Black Box
 

More from Eloisa Vargiu

Citizen empowerment throughout the 4 pillars of health
Citizen empowerment throughout the 4 pillars of healthCitizen empowerment throughout the 4 pillars of health
Citizen empowerment throughout the 4 pillars of healthEloisa Vargiu
 
Improving Sleeping Habits: Preliminary Experiments in Barcelona and Lleida
Improving Sleeping Habits: Preliminary Experiments in Barcelona and LleidaImproving Sleeping Habits: Preliminary Experiments in Barcelona and Lleida
Improving Sleeping Habits: Preliminary Experiments in Barcelona and LleidaEloisa Vargiu
 
Medical Technology in Sleep
Medical Technology in SleepMedical Technology in Sleep
Medical Technology in SleepEloisa Vargiu
 
Patient Empowerment from an Integrated Care Approach
Patient Empowerment from an Integrated Care ApproachPatient Empowerment from an Integrated Care Approach
Patient Empowerment from an Integrated Care ApproachEloisa Vargiu
 
Connected Care for Complex Chronic Patients in Lleida
Connected Care for Complex Chronic Patients in LleidaConnected Care for Complex Chronic Patients in Lleida
Connected Care for Complex Chronic Patients in LleidaEloisa Vargiu
 
Self-Management of Complex Chronic Patients: Needs and A Proposal
Self-Management of Complex Chronic Patients: Needs and A ProposalSelf-Management of Complex Chronic Patients: Needs and A Proposal
Self-Management of Complex Chronic Patients: Needs and A ProposalEloisa Vargiu
 
Patient Empowerment in CONNECARE
Patient Empowerment in CONNECAREPatient Empowerment in CONNECARE
Patient Empowerment in CONNECAREEloisa Vargiu
 
From Healthy to Happy Ageing: the Power of Self-Management
From Healthy to Happy Ageing: the Power of Self-ManagementFrom Healthy to Happy Ageing: the Power of Self-Management
From Healthy to Happy Ageing: the Power of Self-ManagementEloisa Vargiu
 
The CONNECARE Project
The CONNECARE ProjectThe CONNECARE Project
The CONNECARE ProjectEloisa Vargiu
 
Self-management of complex chronic patients: the CONNECARE experience
Self-management of complex chronic patients: the CONNECARE experienceSelf-management of complex chronic patients: the CONNECARE experience
Self-management of complex chronic patients: the CONNECARE experienceEloisa Vargiu
 
A Hierarchical Approach to Recognize Purposeful Movements Using Inertial Sens...
A Hierarchical Approach to Recognize Purposeful Movements Using Inertial Sens...A Hierarchical Approach to Recognize Purposeful Movements Using Inertial Sens...
A Hierarchical Approach to Recognize Purposeful Movements Using Inertial Sens...Eloisa Vargiu
 
Integrated Care for Complex Chronic Patients
Integrated Care for Complex Chronic PatientsIntegrated Care for Complex Chronic Patients
Integrated Care for Complex Chronic PatientsEloisa Vargiu
 
Automatic Support for Improving Management and Treatment of Patients with Obt...
Automatic Support for Improving Management and Treatment of Patients with Obt...Automatic Support for Improving Management and Treatment of Patients with Obt...
Automatic Support for Improving Management and Treatment of Patients with Obt...Eloisa Vargiu
 
The CONNECARE project at a glance
The CONNECARE project at a glanceThe CONNECARE project at a glance
The CONNECARE project at a glanceEloisa Vargiu
 
Challenge - Choice - Change of CONNECARE
Challenge - Choice - Change of CONNECAREChallenge - Choice - Change of CONNECARE
Challenge - Choice - Change of CONNECAREEloisa Vargiu
 
Third Generation Teleassistance - Intelligent Monitoring Makes the Difference
Third Generation Teleassistance - Intelligent Monitoring Makes the DifferenceThird Generation Teleassistance - Intelligent Monitoring Makes the Difference
Third Generation Teleassistance - Intelligent Monitoring Makes the DifferenceEloisa Vargiu
 
Monitoring Elderly People at Home: Results and Lessons Learned
Monitoring Elderly People at Home: Results and Lessons LearnedMonitoring Elderly People at Home: Results and Lessons Learned
Monitoring Elderly People at Home: Results and Lessons LearnedEloisa Vargiu
 
Monitoring people that need assistance: the BackHome experience
Monitoring people that need assistance: the BackHome experienceMonitoring people that need assistance: the BackHome experience
Monitoring people that need assistance: the BackHome experienceEloisa Vargiu
 
Brain Computer Interfaces on Track to Home: Results and Lessons Learnt
Brain Computer Interfaces on Track to Home: Results and Lessons LearntBrain Computer Interfaces on Track to Home: Results and Lessons Learnt
Brain Computer Interfaces on Track to Home: Results and Lessons LearntEloisa Vargiu
 
Monitoring People that Need Assistance through a Sensor-based System: Evaluat...
Monitoring People that Need Assistance through a Sensor-based System: Evaluat...Monitoring People that Need Assistance through a Sensor-based System: Evaluat...
Monitoring People that Need Assistance through a Sensor-based System: Evaluat...Eloisa Vargiu
 

More from Eloisa Vargiu (20)

Citizen empowerment throughout the 4 pillars of health
Citizen empowerment throughout the 4 pillars of healthCitizen empowerment throughout the 4 pillars of health
Citizen empowerment throughout the 4 pillars of health
 
Improving Sleeping Habits: Preliminary Experiments in Barcelona and Lleida
Improving Sleeping Habits: Preliminary Experiments in Barcelona and LleidaImproving Sleeping Habits: Preliminary Experiments in Barcelona and Lleida
Improving Sleeping Habits: Preliminary Experiments in Barcelona and Lleida
 
Medical Technology in Sleep
Medical Technology in SleepMedical Technology in Sleep
Medical Technology in Sleep
 
Patient Empowerment from an Integrated Care Approach
Patient Empowerment from an Integrated Care ApproachPatient Empowerment from an Integrated Care Approach
Patient Empowerment from an Integrated Care Approach
 
Connected Care for Complex Chronic Patients in Lleida
Connected Care for Complex Chronic Patients in LleidaConnected Care for Complex Chronic Patients in Lleida
Connected Care for Complex Chronic Patients in Lleida
 
Self-Management of Complex Chronic Patients: Needs and A Proposal
Self-Management of Complex Chronic Patients: Needs and A ProposalSelf-Management of Complex Chronic Patients: Needs and A Proposal
Self-Management of Complex Chronic Patients: Needs and A Proposal
 
Patient Empowerment in CONNECARE
Patient Empowerment in CONNECAREPatient Empowerment in CONNECARE
Patient Empowerment in CONNECARE
 
From Healthy to Happy Ageing: the Power of Self-Management
From Healthy to Happy Ageing: the Power of Self-ManagementFrom Healthy to Happy Ageing: the Power of Self-Management
From Healthy to Happy Ageing: the Power of Self-Management
 
The CONNECARE Project
The CONNECARE ProjectThe CONNECARE Project
The CONNECARE Project
 
Self-management of complex chronic patients: the CONNECARE experience
Self-management of complex chronic patients: the CONNECARE experienceSelf-management of complex chronic patients: the CONNECARE experience
Self-management of complex chronic patients: the CONNECARE experience
 
A Hierarchical Approach to Recognize Purposeful Movements Using Inertial Sens...
A Hierarchical Approach to Recognize Purposeful Movements Using Inertial Sens...A Hierarchical Approach to Recognize Purposeful Movements Using Inertial Sens...
A Hierarchical Approach to Recognize Purposeful Movements Using Inertial Sens...
 
Integrated Care for Complex Chronic Patients
Integrated Care for Complex Chronic PatientsIntegrated Care for Complex Chronic Patients
Integrated Care for Complex Chronic Patients
 
Automatic Support for Improving Management and Treatment of Patients with Obt...
Automatic Support for Improving Management and Treatment of Patients with Obt...Automatic Support for Improving Management and Treatment of Patients with Obt...
Automatic Support for Improving Management and Treatment of Patients with Obt...
 
The CONNECARE project at a glance
The CONNECARE project at a glanceThe CONNECARE project at a glance
The CONNECARE project at a glance
 
Challenge - Choice - Change of CONNECARE
Challenge - Choice - Change of CONNECAREChallenge - Choice - Change of CONNECARE
Challenge - Choice - Change of CONNECARE
 
Third Generation Teleassistance - Intelligent Monitoring Makes the Difference
Third Generation Teleassistance - Intelligent Monitoring Makes the DifferenceThird Generation Teleassistance - Intelligent Monitoring Makes the Difference
Third Generation Teleassistance - Intelligent Monitoring Makes the Difference
 
Monitoring Elderly People at Home: Results and Lessons Learned
Monitoring Elderly People at Home: Results and Lessons LearnedMonitoring Elderly People at Home: Results and Lessons Learned
Monitoring Elderly People at Home: Results and Lessons Learned
 
Monitoring people that need assistance: the BackHome experience
Monitoring people that need assistance: the BackHome experienceMonitoring people that need assistance: the BackHome experience
Monitoring people that need assistance: the BackHome experience
 
Brain Computer Interfaces on Track to Home: Results and Lessons Learnt
Brain Computer Interfaces on Track to Home: Results and Lessons LearntBrain Computer Interfaces on Track to Home: Results and Lessons Learnt
Brain Computer Interfaces on Track to Home: Results and Lessons Learnt
 
Monitoring People that Need Assistance through a Sensor-based System: Evaluat...
Monitoring People that Need Assistance through a Sensor-based System: Evaluat...Monitoring People that Need Assistance through a Sensor-based System: Evaluat...
Monitoring People that Need Assistance through a Sensor-based System: Evaluat...
 

Recently uploaded

The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 

Recently uploaded (20)

The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 

Textual Analysis Techniques to Perform Onlibne Advertising

  • 1. Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 2. OUTLINE OF THE TALK  Introduction  Online Advertising  A Modern Contextual Advertising System  Syntactic Textual Analysis  Semantic Textual Analysis  Matching  An Example: ConCA  Experimental Results  Conclusions  References Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 4. OUTERNET & INTERNET Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 5. OUTERNET & INTERNET  In Atkinson’s view something is missing… Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 6. OUTERNET & INTERNET  In Atkinson’s view something is missing… Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 7. OUTERNET & INTERNET  In Atkinson’s view something is missing… Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 8. OUTERNET & INTERNET  In Atkinson’s view something is missing… Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 9. OUTERNET & INTERNET  In Atkinson’s view something is missing… Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 11. ONLINE ADVERTISING Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 12. ONLINE ADVERTISING  Sponsored Search Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 13. ONLINE ADVERTISING  Banner Advertising Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 14. ONLINE ADVERTISING  Contextual Advertising Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 15. CONTEXTUAL ADVERTISING Webpage Ad Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 16. ONLINE ADVERTISING  Is it always a good thing? Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 17. ONLINE ADVERTISING  Is it always a good thing? Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 19. A MODERN CONTEXTUAL ADVERTISING SYSTEM Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 20. SYNTACTIC TEXTUAL ANALYSIS  Text Summarization  Bag of Words Representation Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 21. SYNTACTIC TEXTUAL ANALYSIS  Text summarization  State of the art techniques  First and Last Paragraph (FLP)  Title, First and Last Paragraph (TFLP)  Snippet (S)  Title and Snippet (TS) Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 22. SYNTACTIC TEXTUAL ANALYSIS  First and Last Paragraph (FLP) You don’t need to shell out thousands, survive various ballots, or swap a family member for a ticket to enjoy the 2012 Summer Olympic Games this year. There's all manner of free events and associated shenanigans taking place in London and across the UK to mark the occasion. Here are ten ways to join in without spending any money. Indulge in a family feast Volunteer chefs at 24 Sure Start Centres across the UK are preparing to dish up free delights throughout the period. Details, along with all the other events that make up the Cultural Olympiad, are available on the site. http://www.roughguides.com/website/Travel/SpotLight/ViewSpotLight.aspx?spotLightID=575
  • 23. SYNTACTIC TEXTUAL ANALYSIS  Title, First and Last Paragraph (TFLP) You don’t need to shell out thousands, survive various ballots, or swap a family member for a ticket to enjoy the 2012 Summer Olympic Games this year. There's all manner of free events and associated shenanigans taking place in London and across the UK to mark the occasion. Here are ten ways to join in without spending any money. Indulge in a family feast Volunteer chefs at 24 Sure Start Centres across the UK are preparing to dish up free delights throughout the period. Details, along with all the other events that make up the Cultural Olympiad, are available on the site. http://www.roughguides.com/website/Travel/SpotLight/ViewSpotLight.aspx?spotLightID=575
  • 24. SYNTACTIC TEXTUAL ANALYSIS  Title, First and Last Paragraph (TFLP) You don’t need to shell out thousands, survive various ballots, or swap a family member for a ticket to enjoy the 2012 Summer Olympic Games this year. There's all manner of free events and associated shenanigans taking place London 2012 – Ten ways to celebrate the Olympics for free in London and across the UK to mark the occasion. Here are ten ways to join in without spending any money. Indulge in a family feast Volunteer chefs at 24 Sure Start Centres across the UK are preparing to dish up free delights throughout the period. Details, along with all the other events that make up the Cultural Olympiad, are available on the site. http://www.roughguides.com/website/Travel/SpotLight/ViewSpotLight.aspx?spotLightID=575
  • 25. SYNTACTIC TEXTUAL ANALYSIS  Snippet (S) http://www.roughguides.com/website/Travel/SpotLight/ViewSpotLight.aspx?spotLightID=575
  • 26. SYNTACTIC TEXTUAL ANALYSIS  Title and Snippet (TS) http://www.roughguides.com/website/Travel/SpotLight/ViewSpotLight.aspx?spotLightID=575
  • 27. SYNTACTIC TEXTUAL ANALYSIS  Bag of Words (BoW) representation  Dimensionality reduction  Stop-words removal  Stemming  Vector representation  Set of pairs <word, occurrences> Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 28. SYNTACTIC TEXTUAL ANALYSIS  Stop-words removal You don’t need to shell out thousands, survive various ballots, or swap a family member for a ticket to enjoy the 2012 Summer Olympic Games this year. There's all manner of free events and associated shenanigans taking place in London and across the UK to mark the occasion. Here are ten ways to join in without spending any money. Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 29. SYNTACTIC TEXTUAL ANALYSIS  Stop-words removal X X X X X You don’t need to shell out thousands, survive various ballots, X swap X or a family member forX ticket X enjoy the Xa to X 2012 Summer Olympic Games this X X X year. There's all manner X free events of X and associated shenanigans taking placeX London and across the UK X in X X to X are mark the occasion. Here X ten ways X to joinX without spending any money. X in X X Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 30. SYNTACTIC TEXTUAL ANALYSIS  Stop-words removal X X X X X You don’t need to shell out thousands, survive various ballots, X swap X or a family member forX ticket X enjoy the Xa to X 2012 Summer Olympic Games this X X X year. There's all manner X free events of X and associated shenanigans taking Shell thousands, survive various placeX London and across the UK X in X X to swap family member ticket ballots, X are enjoy 2012 Summer Olympic Games mark the occasion. Here X ten ways X to joinX without spending any money. X in X X year. Manner free events associated shenanigans taking place London across UK mark occasion. ten ways join spending money. Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 31. SYNTACTIC TEXTUAL ANALYSIS  Stemming Shell thousands, survive various ballots, swap family member ticket enjoy 2012 Summer Olympic Games year. Manner free events associated shenanigans taking place London across UK mark occasion. ten ways join spending money. Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 32. SYNTACTIC TEXTUAL ANALYSIS  Stemming Shell thousands, survive various X X ballots, swap family member ticket X X enjoy 2012 Summer Olympic Games X year. Manner free events associated X X X shenanigans taking place London X across UK mark occasion. ten ways X X X join spending money. Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 33. SYNTACTIC TEXTUAL ANALYSIS  Stemming Shell thousands, survive various X X ballots, swap family member ticket X X enjoy 2012 Summer Olympic Games X year. Manner free events associated X X X shenanigans taking place London X across UK mark occasion. ten ways X X Shell thousand, surviv various ballot, X join spending money. swap famil member ticket enjoy 2012 Summer Olymp Game year. Manner free event associat shenanigan tak place London across UK mark occasion. ten way join spend money. Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 34. SYNTACTIC TEXTUAL ANALYSIS  Vector representation  TFIDF <free0.0116> <olymp, 0.0235> <event, 0.0012> <way, 0.0125> <london, 0.0421> <celebrat, 0.0005> <chef, 0.0127> … Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 35. IS ENOUGH THE SOLE SYNTACTIC APPROACH? Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 36. IS ENOUGH THE SOLE SYNTACTIC APPROACH?  Polysemy… “BASS” Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 37. IS ENOUGH THE SOLE SYNTACTIC APPROACH?  Synonymity… Vehicle Machine Car Auto Automobile Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 38. SEMANTIC TEXTUAL ANALYSIS  Taxonomy-based Classification  Word Disambiguation Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 39. SEMANTIC TEXTUAL ANALYSIS  Taxonomy-based Classification  Classification Features (CF) representation  Adopted classifiers  Rocchio  SVM Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 40. SEMANTIC TEXTUAL ANALYSIS  Rocchio  Each centroid is defined as a sum of TF-IDF values of each term, normalized by the number of webpages in the class  The classification is based on the cosine of the angle between the webpage and the centroid of each class Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 41. SEMANTIC TEXTUAL ANALYSIS  SVM  The score is related to the distance of the webpage from a separation hyperplane Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 42. SEMANTIC TEXTUAL ANALYSIS  Word Disambiguation  Bag of Concepts (BoC) representation  Adopted lexical supports  WordNet  YAGO  ConceptNet Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 43. SEMANTIC TEXTUAL ANALYSIS  WordNet  A large lexical database of English. Nouns, verbs, adjectives and adverbs are grouped into sets of cognitive synonyms (synsets), each expressing a distinct concept. Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 44. SEMANTIC TEXTUAL ANALYSIS  YAGO  A semantic knowledge base, derived from Wikipedia, WordNet and GeoNames Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 45. SEMANTIC TEXTUAL ANALYSIS  ConceptNet  A network of concepts connected by several semantic relations (e.g., “IsA”, “PartOf”) Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 46. MATCHING  Similarity calculation  Ranking Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 47. MATCHING  Similarity calculation  Adopted approaches  Cosine similarity  Jaccard index Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 48. MATCHING o Cosine similarity Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 49. MATCHING o Jaccard index The Jaccard coefficient measures similarity between sample sets, and is defined as the size of the intersection divided by the size of the union of the sample sets Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 50. MATCHING  Ranking  Adopted approaches  Simple ranking according to the calculated scores  Learning to rank model Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 51. MATCHING o Learning to rank model Pointwise approach o Each query-document pair in the training data has a numerical or ordinal score o Regression problem approach: given a single query-document pair, predict its score Pairwise approach o Classification problem approach: learning a binary classifier which can tell which document is better in a given pair of documents Listwise approach o Optimization problem approach: try to directly optimize the value of one of the above evaluation measures, averaged over all queries in the training data Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 52. AN EXAMPLE: CONCA CONCEPTS IN CONTEXTUAL ADVERTISING
  • 53. CONCA Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 54. CONCA Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 56. SYNTACTICAL ANALYSIS  Text summarization techniques comparison  FLP vs TFLP vs S vs TS  Comparison metrics | {relevant documents}  {retrieved documents} | TP    | {retrieved documents} | TP  FP | {relevant documents}  {retrieved documents} | TP    | {relevant documents} | TP  FN    F1  2    Taxonomy  BankSearch Dataset Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 57. SYNTACTICAL ANALYSIS  Results FLP TFLP S TS π 0.745 0.832 0.734 0.806 ρ 0.719 0.801 0.730 0.804 F1 0.732 0.816 0.732 0.805 #t 24 26 12 14  Adding information about the title improves the performances  TFLP has the best performance Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 58. SEMANTIC ANALYSIS  Semantic approaches comparison  Anagnostopoulos et al. (2007) system vs Armano et al. (2011-TIR) vs ConCA  Matching function   ( p, a)    simBoC  (1   )  simCF  Comparison metric N k  TP i 1 j 1 ij   @k  N k  (TP  FP ) i 1 j 1 ij ij Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 59. SEMANTIC ANALYSIS  Ad repository  Built by hand by a domain expert  Taxonomy  BankSearch Dataset Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 60. SEMANTIC ANALYSIS  Results k Anagnostopoulos Armano et al. ConCA et al. π α π α π α 1 0.674 0 0.768 0.2 0.773 0.1 2 0.653 0 0.750 0.2 0.752 0.1 3 0.617 0.2 0.729 0.3 0.728 0.1 4 0.582 0.2 0.701 0.3 0.701 0.1 5 0.546 0.1 0.663 0.0 0.668 0.1 Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 61. SEMANTIC ANALYSIS  Results Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 62. SEMANTIC ANALYSIS  Results  Slight improvement by using concepts  Low values of α → CF more impact then BoC Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 63. SYNTACTICAL ANALYSIS VS SEMANTIC ANALYSIS  Contextual Advertising System  Armano et al. (2011-TIR)  Matching function   ( p, a)    simBoW  (1   )  simCF  Comparisons varying α  α = 1 → pure syntax  α = 0 → pure semantics  Comparison metric N k  TP i 1 j 1 ij   @k  N k  (TP  FP ) i 1 j 1 ij ij Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 64. SYNTACTICAL ANALYSIS VS SEMANTIC ANALYSIS  Ad repository  Built by hand by a domain expert  Taxonomy  BankSearch Dataset Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 65. SYNTACTICAL ANALYSIS VS SEMANTIC ANALYSIS  Results α π@1 π@2 π@3 π@4 π@5 0 0.765 0.746 0.719 0.696 0.663 0.1 0.767 0.749 0.724 0.698 0.663 0.2 0.768 0.750 0.729 0.699 0.662 0.3 0.766 0.749 0.729 0.701 0.661 0.4 0.756 0.747 0.729 0.698 0.658 0.5 0.744 0.735 0.721 0.693 0.651 0.6 0.722 0.717 0.703 0.681 0.640 0.7 0.685 0.687 0.680 0.658 0.625 0,8 0.632 0.637 0.635 0.614 0.586 0.9 0.557 0.552 0.548 0.534 0.512 1 0.408 0.421 0.372 0.388 0.640 Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 67. CONCLUSIONS  Online advertising  represents one of the major sources of income for a large number of websites  is aimed at suggesting products and services to the population of Internet users  Modern contextual advertising systems  put ads within the content of a generic, third party, webpage  adopt both syntactical and semantic textual analyses to select the most relevant ads for a given webpage  an example is ConCA Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 68. CONCLUSIONS  Results show that  the impact of semantics is stronger than that of syntax  adopting more advanced semantic techniques, such as concepts, improves the performances  the more the suggested ads are, the worse the performance is Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 70. REFERENCES  Syntactical Textual Analysis  Armano G., Giuliani A., & Vargiu E. Experimenting text summarization techniques for contextual advertising. 2nd Italian Information Retrieval Workshop (IIR’11) , 2011.  Armano G., Giuliani A. & Vargiu, E. Using snippets in text summarization: a comparative study and an application. 3rd Italian Information Retrieval Workshop (IIR’12), 2012.  Kolcz A., Prabakarmurthi V. & Kalita J. Summarization as feature selection for text categorization. 10th International Conference on Information and Knowledge Management (CIKM’01). ACM, New York, NY, USA, pp. 365–370, 2001.  Porter M. An algorithm for suffix stripping. Program 14, 3, 130–137, 1980.  Salton G., Wong A. & Yang C.S, A vector space model for automatic indexing, Communications of the ACM, 18, 11, pp.613-620, 1975. Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 71. REFERENCES  Semantic Textual Analysis  Cortes C. & Vapnik, V.N. Support-Vector Networks, Machine Learning, 20, 1995.  Fellbaum C. WordNet: An Electronic Lexical Database. Cambridge, MA: MIT Press, 1998.  Liu H. & Singh P. ConceptNet: A practical commonsense reasoning tool-kit. BT Technology Journal 22, pp. 211–226, 2004.  Miller G.A. WordNet: A Lexical Database for English. Communications of the ACM, 38, 11, pp. 39-41, 1995.  Rocchio J. The SMART Retrieval System: Experiments in Automatic Document Processing. PrenticeHall, Chapter: Relevance feedback in information retrieval, pp. 313–323, 1971.  Suchanek F.M., Kasneci G. & Weikum G. Yago - A Core of Semantic Knowledge. 16th International World Wide Web conference (WWW 2007), 2007. Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 72. REFERENCES  Matching  Liu T.Y. Learning to rank for information retrieval. Found. Trends Inf. Retr. 3, 3, pp. 225–331, 2009.  Radomski P.J. & Goeman, T.J. The homogenizing of Minnesota lake fish assemblages. Fisheries, 20, pp. 20–23, 1995.  Comparison Systems  Anagnostopoulos A., Broder A. Z., Gabrilovich E., Josifovski V. & Riedel L. Just- in-time contextual advertising. 16th ACM Conference on Information and Knowledge Management (CIKM’07). ACM, New York, NY, USA, pp. 331–340, 2007.  Armano G., Giuliani A. & Vargiu E. Studying the impact of text summarization on contextual advertising. 8th International Workshop on Text-based Information Retrieval (TIR’11), 2011.  Armano G., Giuliani A. & Vargiu E. Semantic enrichment of contextual advertising by using concepts. International Conference on Knowledge Discovery and Information Retrieval, 2011. Eloisa Vargiu (evargiu@bdigital.org) – Cagliari, 6 September 2012
  • 73. Contact: Eloisa Vargiu – evargiu@bdigital.org