SlideShare una empresa de Scribd logo
1 de 39
CSTalks

   Similarity Measures in Music Information
               Retrieval Systems




             Speaker:    Zhonghua Li
             Supervisor:    Ye Wang
           lizhongh@comp.nus.edu.sg
                                              1
Where do you search for music?
 Music is an important part of our way of life.
 Music stores
 Online music




                                  Mufin




                                                   2
3
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Similarity Measure Methods
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      4
Definition
 Music Information Retrieval
   Is the process of searching for, and finding, music objects, or
    part of objects, via a query framed musically and/or in musical
    terms.
     Music objects: Recordings (wav, mp3, etc.), scores, parts, etc.
     Musically framed query: Singing, humming, keyboad, notation-based,
      MIDI files, sound files, etc.
     Music terms: Genre, style, tempo, bibliography, etc.
   Applications
     Music search, recommendation, identification ,etc.




                                                                           5
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Music Similarity Measure
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      6
Applications -- Search
 Text-based Music Search
   Compare a textual query with the metadata
   Is adopted by most existing systems.
   Examples: Last.fm, Musicovery, …




                                                7
Applications -- Search
 Content-based Music Search
   Compare audio query with audio content
   Query-by-humming/singing/recording: midomi




                                                 8
Applications -- Search
 Content-based Music Search
   Compare rhythm tapped with audio content
   Query-by-tapping: SongTapper




                                               9
Applications -- Search
 User’s information need                                            Online     Offline


  (intention) :                                          Query


  Explicit Query: text, audio, etc.    Intention Gap
                                                                   Query
                                                                 Formation


 Similarity measure:                                  Descriptors                         Documents


  Query  Music documents in the                                 Match
                                                                                                   Descriptor
                                                                                                   Extraction
   Database                                                                     Indexing
                                       Semantic Gap      Index                             Descriptors
 Ranking: relevant documents by
  domain specific criterions (no. of                             Ranking


  hits ).                                              Ranked List


                                                                 Presentation


                                                         Results



                                                                                                         10
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Similarity Measure Methods
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      11
Applications – Recommendation
 Collaborative-Filtering-based Recommendation
   Last.fm: what you (and others ) listen to and like,
   Amazon: customers who shopped for … also shopped for …




                                                             12
Applications – Recommendation
 Collaborative-Filtering-based Recommendation
   Last.fm: what you (and others ) listen to and like,
   Amazon: customers who shopped for … also shopped for …
   Example:
     Users: A, B, and C
     Music: 1, 2, …, 8

                          Small similarity               Large similarity
                  C                              A                      B

                                                          Similarity
                  4
                             Similarity      1            Measure
                                                                       1
                             Measure
                                                 2                          2
                      6                              3                          3
              8                                  4                     5

                                                         Recommend to user A
                                                                                    13
Applications -- Recommendation
 Audio Content-based Recommendation
  Recommend songs which have similar audio content to the
   songs that you like.
  Pandora:
                                       Music database              Music Experts
               User
                                                          Listen



                                          Instrument:
       Instrument:        Similarity        Instrument:
                                          Vocal:
       Vocal:                               Vocal:
                                          Structure:
                          Measure
       Structure:                         …Structure:
       …                                    …
                                       400 Attributes/song

                     Recommendations
                                                                                   14
Applications -- Recommendation
 User’s information need                              Online / Offline        Offline


  (intention):                                       User Profile


  Implicit user profiles: ratings,   Intention Gap
                                                                  Profile
                                                                 Capture
    listening history, etc.
                                                     Descriptors                          Documents
 Similarity measure:
                                                                                                  Descriptor
  User profiles Music                                          Match                             Extraction

   documents/other user profiles     Semantic Gap       Index
                                                                               Indexing
                                                                                          Descriptors


 Ranking: relevant documents                                   Ranking

  by some domain specific
                                                     Ranked List
  criterions (no. of hits).
                                                                Presentation


                                                       Results


                                                                                                        15
Similarity Measure
 one of the most fundamental concepts in MIR
                                                   Online / Offline        Offline
 Closely related to
                                                 User Profile
                                                    /query
   What information music                                    Profile/query
                                 Intention Gap
                                                             Capture
    contains.
   How this information is
                                                 Descriptors                          Documents



   represented.                                             Match
                                                                                              Descriptor
                                                                                              Extraction


   How to match between themSemantic Gap
                                                                           Indexing
                                                    Index                             Descriptors


                                                            Ranking


                                                 Ranked List


                                                            Presentation


                                                   Results

                                                                                                    16
Music Information Plane
                              Similarity can be measure
                              from different aspects.

                                         Song1: New favorite -
                                         Alison Krauss
                                         Song2: She is Beautiful -
                                         Andrew W.K.
                                                Song1          Song2
                                                Female         Male
                          Dissimilar            Gentle         Aggressive
                                                Slow           fast

                                                   Guitar
                          Similar                  Tempo: ~162 BPM
                                                   (Beat Per Minute)

                  * O. C. Herrada. Music recommendation and discovery in
       Music      the long tail. PhD thsis. 2008.                           17
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Similarity Measure Methods
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      18
Similarity Measure Methods
 Text-based Method: Okapi BM-25 Ranking
   Given: queryQ, containing keywords q1, …, qn, music documents: bag of words.
   BM25 ranking function can be formulated as:




   f(qi, D) is qi’s term frequency (tf) in document D.
   |D| is the length of document D in words.
   avgdl is the average document length in the collection.
   k1 and b are free parameters, usually set as k1=2.0 and b=0.75.
   IDF(qi) is the inverse document frequency (idf), calculated as:



. The query term appears in this document frequently. f (qi, D)
. And it doesn’t appear in other document. IDF
                                                                                   19
Similarity Measure Methods
 Text-based Method:
   Pros
     Simple & efficient
   Cons
     Affected by noisy/wrong texts
     Songs with no text cannot be retrieved
     Require high-level domain knowledge to create good metadata
     “Text retrieval on audio metadata” not pure music retrieval




                                                                    20
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Similarity Measure Methods
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      21
Similarity Measure Methods
 Audio Feature-based Method




           Audio feature   Distribution     Model
  Music
            extraction      modeling      comparison




                                                       22
Similarity Measure Methods
 Audio Feature-based Method
   Audio Feature extraction -> Distribution modeling -> Model Comparison




                                                                    frame



            Feature
            Vector             …


                                                                            23
Existing Works
 Audio Feature-based Method
   Audio Feature extraction  Distribution modeling  Model Comparison

     Use low-level feature directly
          Pitch, loudness, MFCC (Blum et al.[3], 1999)
          Histogram of MFCC (Foote[4], 1997)
          Spectrum, rhythm, chord changesingleVector (Tzanetakis [5], 2002)

     Low-level features  higher-level features.
        Cluster MFCC=>model comparison (Aucouturier[6], 2002)
        MFCC => Gaussian Mixture Models => model comparison
        MFCC =>“anchor space”, compare probability models (Berenzweig et al.
         [7], 2003)


                                                                                24
Similarity Measure Methods
 Audio Feature-based Method
  Audio Feature extraction  Distribution modeling  Model Comparison

    Euclidean /Cosine distance (uniform-length feature vectors)


    Distance between two probability distributions
         Kullback-Leibler divergence (KL Distance) / relative entropy

    No closed form for Gaussian Distributions
       Centroid distance: Euclidean distance between the overall means;
       Sampling based method: compute the likelihood of one model given points
        sampled from another; very computationally expensive;
       Earth-Mover’s distance


              Berenzweig, A., Logan, B., Ellis, D. P., and Whitman, B. P., A Large-Scale Evaluation of Acoustic and Subjective Music-Similarity
              Measures. Computer Music Journal. 28, 2004,                                                                                         25
Similarity Measure Methods
 Audio Feature-based Method
   Pros
     Can deal with new songs with no or few texts.
     Save human labors from annotating each song manually
   Cons
     Time complexity is relatively high.
     Features ≠ audio piece: Two songs with very similar features may sounds
      very different.
     The average performance is reaching the glass ceiling of around 65% in
      accuracy.




                                                                                26
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Similarity Measure Methods
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      27
Similarity Measure Methods
 Semantic Concept-based Method
   Nature of user queries
     Far beyond of bibliographic text and audio search
     Semantically-rich
     Syntactically- undetermined
     e.g.: “Find me a classical and happy song”, or “Find me a song to relax”
    “Find me some songs for parties/ weddings/ in churches” …
   Collaborative(social) tagging is very popular on Web 2.0.
    Users annotate their feelings or opinions to the music. Tags,
    comments, etc.



                                                                                28
Similarity Measure Methods
 Semantic Concept-based Method
     Tags VS user queries (Last.fm)
Tag Type            Frequency                          Multi-tag search queries
Genre               68%                                51%
Locale              12%                                7%
Mood                5%                                 4%
Opinion             4%                                 2%
Instrumentation     4%                                 5%
Style               3%                                 26%



                    . Paul Lamere. Social tagging and music information
                    retrieval. Journal of New Music Research. 2008.
                    . Klaas Bosteels, Elias Pampalk, and Etienne E. Kerre. Music
                    retrieval based on social tags: a case study. ISMIR, 2008.     29
Similarity Measure Methods
    Vocabulary: classical, jazz, … piano, violin, …, female, male, …


                                         …

              Model         Model          …               Model


                                             …                         Probability vector
Song1                                                                     Similarity
                                             …
Song2



                                                                                            30
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Similarity Measure Methods
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      31
Similarity Measure Methods
 Multimodal Method
  Information keeps growing.
  One of the most important ongoing trends:

             Metadata


        Audio       Semantic
       Content      Concept


    Users are important.



                                               32
Similarity Measure Methods
 Multimodal Method
                                                                             Document Vectors



 Customization




                                                                  Fuzzy Music Semantic
                                                                  Vector
                 B. Zhang, J. Shen, Q. Xiang, and Y. Wang. CompositMap: a novel framework for music similarity measure. ACM Multimedia,
                 2009.                                                                                                                    33
Outline
 Music Information Retrieval (MIR)
   Definition
   Applications – Search
   Applications -- Recommendation
 Similarity Measure Methods
   Text-based Method
   Audio Feature-based Method
   Semantic Concept-based Method
   Multimodal Fusion Method
 Conclusion and Future Directions


                                      34
Conclusion and Future Directions
 What makes MIR (and the similarity measure) so
 tricky?
  Music information is
   Multimodal: audio, metadata, social , …
   Multicultural: e.g., modern art, Indian ragas, …
   Multirepresentational: audio, MIDI, score, …
   Multifaceted: melody, tempo, beat, …
  …
 Similarity can be measured from different aspects.



                                                       35
Conclusions and Future Directions
 What do users really want?
                                                   Intention Gap
   User interactions with the system.
   Learn a good user preference modeling


 What kind of music features can really capture this need?
   Content –Tags                                  Semantic Gap
   Leverage more social data? Comments, ratings, groups, playlist, other
    user created information, …


 How to fuse multiple information effectively?
   Identify the relevant/discriminative information aspects
   Fusion Methods


                                                                            36
37
References
 [2] F[1] O. C. Herrada. Music recommendation and discovery in the long
    tail. PhD thsis. 2008.
   . Pachet. Knowledge management and musical metadata. Encyclopedia of
    Knowledge Management. Idea Group, 2005.
   [3] T. L. Blum, D. F. Keislar, J. A. Wheaton, and E. H. Wold. Method and article
    of manufacture for content-based analysis, storage, retrieval, and
    segmentation of audio information. U.S. Patent 5, 918, 223.
   [4] J. T. Foote. Content-based retrieval of music and audio. SPIE, 1997.
   [5] G. Tzanetakis. Manipulation, analysis, and retrieval system for audio
    signals. PhD thsis, 2002.
   [6] J. J. Aucouturier and F. Pachet. Music similarity measure: What’s the use?
    International Symposium on Music information retrieval. 2002.
   [7] A. Berenzweig, D. P. W. Ellis and S. Lawrence. Anchor space for
    classification and similarity measurement for music. ICME 2003.




                                                                                       38
References
 [8] B. Zhang, J. Shen, Q. Xiang and Y. Wang. CompositeMap: a
    novel framework for music similarity measure. SIGIR, 2009.
   [9] B. Whiteman and S. Lawrence. Inferring descriptions and
    similarity for music from community metadata. International
    computer music conference. 2002.
   [10] M. Schedl, T. Pohle, P. Knees and G. Widmer. Assigning
    and visualizing music genre by web-based co-occurrence
    analysis. ISMIR 2006.
   [11] B. Whitman and Paris Smaragdis. Combining musical and
    cultural features for intelligent style detection. ISMIR 2002.
   [12] L. Chen, P. Wright, and W. Nejdl. Improving music genre
    classification using collaborative tagging data. WSDM, 2009.
   [13] Benedikt Raes. Automatic generation of music metadata.
    ISMIR, 2009.


                                                                     39

Más contenido relacionado

Similar a CSTalks - Music Information Retrieval - 23 Feb

Mehfil : Song Recommendation System Using Sentiment Detected
Mehfil : Song Recommendation System Using Sentiment DetectedMehfil : Song Recommendation System Using Sentiment Detected
Mehfil : Song Recommendation System Using Sentiment DetectedIRJET Journal
 
Music recognition
Music recognition Music recognition
Music recognition aaronloklok
 
Knn a machine learning approach to recognize a musical instrument
Knn  a machine learning approach to recognize a musical instrumentKnn  a machine learning approach to recognize a musical instrument
Knn a machine learning approach to recognize a musical instrumentIJARIIT
 
Music Personalization At Spotify
Music Personalization At SpotifyMusic Personalization At Spotify
Music Personalization At SpotifyVidhya Murali
 
survey on Hybrid recommendation mechanism to get effective ranking results fo...
survey on Hybrid recommendation mechanism to get effective ranking results fo...survey on Hybrid recommendation mechanism to get effective ranking results fo...
survey on Hybrid recommendation mechanism to get effective ranking results fo...Suraj Ligade
 
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...MusicNet
 
Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...
Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...
Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...IRJET Journal
 
1. paul dwyer internet radio emma
1. paul dwyer internet radio emma1. paul dwyer internet radio emma
1. paul dwyer internet radio emmaPaul Dwyer
 
Music discovery on the net
Music discovery on the netMusic discovery on the net
Music discovery on the netguestbf080
 
Multi modal music mood classification
Multi modal music mood classificationMulti modal music mood classification
Multi modal music mood classificationxiaohusmile
 
Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...
Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...
Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...Eoin Brazil
 
Stop Looking and Start Listening
Stop Looking and Start ListeningStop Looking and Start Listening
Stop Looking and Start ListeningBecky Stewart
 
01-00-ACA-Introduction-2-MIR.pdf
01-00-ACA-Introduction-2-MIR.pdf01-00-ACA-Introduction-2-MIR.pdf
01-00-ACA-Introduction-2-MIR.pdfAlexanderLerch4
 
Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...
Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...
Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...I MT
 
Applications of AI and NLP to advance Music Recommendations on Voice Assistants
Applications of AI and NLP to advance Music Recommendations on Voice AssistantsApplications of AI and NLP to advance Music Recommendations on Voice Assistants
Applications of AI and NLP to advance Music Recommendations on Voice AssistantsAI Publications
 
web based music genre classification.pptx
web based music genre classification.pptxweb based music genre classification.pptx
web based music genre classification.pptxUmaMahesh786960
 
Case-based Sequential Ordering of Songs for Playlist Recommendation
Case-based Sequential Ordering of Songs for Playlist RecommendationCase-based Sequential Ordering of Songs for Playlist Recommendation
Case-based Sequential Ordering of Songs for Playlist RecommendationUri Levanon
 
Poster vega north
Poster vega northPoster vega north
Poster vega northAcxelVega
 

Similar a CSTalks - Music Information Retrieval - 23 Feb (20)

Mehfil : Song Recommendation System Using Sentiment Detected
Mehfil : Song Recommendation System Using Sentiment DetectedMehfil : Song Recommendation System Using Sentiment Detected
Mehfil : Song Recommendation System Using Sentiment Detected
 
Music recognition
Music recognition Music recognition
Music recognition
 
Knn a machine learning approach to recognize a musical instrument
Knn  a machine learning approach to recognize a musical instrumentKnn  a machine learning approach to recognize a musical instrument
Knn a machine learning approach to recognize a musical instrument
 
Music data mining
Music  data miningMusic  data mining
Music data mining
 
Music Personalization At Spotify
Music Personalization At SpotifyMusic Personalization At Spotify
Music Personalization At Spotify
 
survey on Hybrid recommendation mechanism to get effective ranking results fo...
survey on Hybrid recommendation mechanism to get effective ranking results fo...survey on Hybrid recommendation mechanism to get effective ranking results fo...
survey on Hybrid recommendation mechanism to get effective ranking results fo...
 
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
J-P. Fauconnier, J. Roumier. Musonto - A Semantic Search Engine Dedicated to ...
 
Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...
Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...
Music Recommendation System using Euclidean, Cosine Similarity, Correlation D...
 
1. paul dwyer internet radio emma
1. paul dwyer internet radio emma1. paul dwyer internet radio emma
1. paul dwyer internet radio emma
 
Music discovery on the net
Music discovery on the netMusic discovery on the net
Music discovery on the net
 
Emofy
Emofy Emofy
Emofy
 
Multi modal music mood classification
Multi modal music mood classificationMulti modal music mood classification
Multi modal music mood classification
 
Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...
Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...
Echoes, Whispers, and Footsteps from the Conflux of Sonic Interaction Design ...
 
Stop Looking and Start Listening
Stop Looking and Start ListeningStop Looking and Start Listening
Stop Looking and Start Listening
 
01-00-ACA-Introduction-2-MIR.pdf
01-00-ACA-Introduction-2-MIR.pdf01-00-ACA-Introduction-2-MIR.pdf
01-00-ACA-Introduction-2-MIR.pdf
 
Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...
Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...
Colloque IMT -04/04/2019- L'IA au cœur des mutations industrielles - "Machine...
 
Applications of AI and NLP to advance Music Recommendations on Voice Assistants
Applications of AI and NLP to advance Music Recommendations on Voice AssistantsApplications of AI and NLP to advance Music Recommendations on Voice Assistants
Applications of AI and NLP to advance Music Recommendations on Voice Assistants
 
web based music genre classification.pptx
web based music genre classification.pptxweb based music genre classification.pptx
web based music genre classification.pptx
 
Case-based Sequential Ordering of Songs for Playlist Recommendation
Case-based Sequential Ordering of Songs for Playlist RecommendationCase-based Sequential Ordering of Songs for Playlist Recommendation
Case-based Sequential Ordering of Songs for Playlist Recommendation
 
Poster vega north
Poster vega northPoster vega north
Poster vega north
 

Más de cstalks

CSTalks-Natural Language Processing-2 Nov
CSTalks-Natural Language Processing-2 NovCSTalks-Natural Language Processing-2 Nov
CSTalks-Natural Language Processing-2 Novcstalks
 
CSTalks-Natural Language Processing-17Aug
CSTalks-Natural Language Processing-17AugCSTalks-Natural Language Processing-17Aug
CSTalks-Natural Language Processing-17Augcstalks
 
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17AugCSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17Augcstalks
 
CSTalks-Visualizing Software Behavior-14Sep
CSTalks-Visualizing Software Behavior-14SepCSTalks-Visualizing Software Behavior-14Sep
CSTalks-Visualizing Software Behavior-14Sepcstalks
 
CSTalks-Polymorphic heterogeneous multicore systems-17Aug
CSTalks-Polymorphic heterogeneous multicore systems-17AugCSTalks-Polymorphic heterogeneous multicore systems-17Aug
CSTalks-Polymorphic heterogeneous multicore systems-17Augcstalks
 
CSTalks-Quaternary Semantics Recomandation System-24 Aug
CSTalks-Quaternary Semantics Recomandation System-24 AugCSTalks-Quaternary Semantics Recomandation System-24 Aug
CSTalks-Quaternary Semantics Recomandation System-24 Augcstalks
 
CSTalks - Object detection and tracking - 25th May
CSTalks - Object detection and tracking - 25th MayCSTalks - Object detection and tracking - 25th May
CSTalks - Object detection and tracking - 25th Maycstalks
 
CSTalks - The Multicore Midlife Crisis - 30 Mar
CSTalks - The Multicore Midlife Crisis - 30 MarCSTalks - The Multicore Midlife Crisis - 30 Mar
CSTalks - The Multicore Midlife Crisis - 30 Marcstalks
 
CSTalks - On machine learning - 2 Mar
CSTalks - On machine learning - 2 MarCSTalks - On machine learning - 2 Mar
CSTalks - On machine learning - 2 Marcstalks
 
CSTalks - Real movie recommendation - 9 Mar
CSTalks - Real movie recommendation - 9 MarCSTalks - Real movie recommendation - 9 Mar
CSTalks - Real movie recommendation - 9 Marcstalks
 
CSTalks-LifeBeyondPhD-16Mar
CSTalks-LifeBeyondPhD-16MarCSTalks-LifeBeyondPhD-16Mar
CSTalks-LifeBeyondPhD-16Marcstalks
 
CSTalks - Peer-to-peer - 16 Feb
CSTalks - Peer-to-peer - 16 FebCSTalks - Peer-to-peer - 16 Feb
CSTalks - Peer-to-peer - 16 Febcstalks
 
CSTalks - Named Data Networks - 9 Feb
CSTalks - Named Data Networks - 9 FebCSTalks - Named Data Networks - 9 Feb
CSTalks - Named Data Networks - 9 Febcstalks
 
CSTalks - Model Checking - 26 Jan
CSTalks - Model Checking - 26 JanCSTalks - Model Checking - 26 Jan
CSTalks - Model Checking - 26 Jancstalks
 
CSTalks - GPGPU - 19 Jan
CSTalks  -  GPGPU - 19 JanCSTalks  -  GPGPU - 19 Jan
CSTalks - GPGPU - 19 Jancstalks
 

Más de cstalks (15)

CSTalks-Natural Language Processing-2 Nov
CSTalks-Natural Language Processing-2 NovCSTalks-Natural Language Processing-2 Nov
CSTalks-Natural Language Processing-2 Nov
 
CSTalks-Natural Language Processing-17Aug
CSTalks-Natural Language Processing-17AugCSTalks-Natural Language Processing-17Aug
CSTalks-Natural Language Processing-17Aug
 
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17AugCSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
CSTalks-Sensor-Rich Mobile Video Indexing and Search-17Aug
 
CSTalks-Visualizing Software Behavior-14Sep
CSTalks-Visualizing Software Behavior-14SepCSTalks-Visualizing Software Behavior-14Sep
CSTalks-Visualizing Software Behavior-14Sep
 
CSTalks-Polymorphic heterogeneous multicore systems-17Aug
CSTalks-Polymorphic heterogeneous multicore systems-17AugCSTalks-Polymorphic heterogeneous multicore systems-17Aug
CSTalks-Polymorphic heterogeneous multicore systems-17Aug
 
CSTalks-Quaternary Semantics Recomandation System-24 Aug
CSTalks-Quaternary Semantics Recomandation System-24 AugCSTalks-Quaternary Semantics Recomandation System-24 Aug
CSTalks-Quaternary Semantics Recomandation System-24 Aug
 
CSTalks - Object detection and tracking - 25th May
CSTalks - Object detection and tracking - 25th MayCSTalks - Object detection and tracking - 25th May
CSTalks - Object detection and tracking - 25th May
 
CSTalks - The Multicore Midlife Crisis - 30 Mar
CSTalks - The Multicore Midlife Crisis - 30 MarCSTalks - The Multicore Midlife Crisis - 30 Mar
CSTalks - The Multicore Midlife Crisis - 30 Mar
 
CSTalks - On machine learning - 2 Mar
CSTalks - On machine learning - 2 MarCSTalks - On machine learning - 2 Mar
CSTalks - On machine learning - 2 Mar
 
CSTalks - Real movie recommendation - 9 Mar
CSTalks - Real movie recommendation - 9 MarCSTalks - Real movie recommendation - 9 Mar
CSTalks - Real movie recommendation - 9 Mar
 
CSTalks-LifeBeyondPhD-16Mar
CSTalks-LifeBeyondPhD-16MarCSTalks-LifeBeyondPhD-16Mar
CSTalks-LifeBeyondPhD-16Mar
 
CSTalks - Peer-to-peer - 16 Feb
CSTalks - Peer-to-peer - 16 FebCSTalks - Peer-to-peer - 16 Feb
CSTalks - Peer-to-peer - 16 Feb
 
CSTalks - Named Data Networks - 9 Feb
CSTalks - Named Data Networks - 9 FebCSTalks - Named Data Networks - 9 Feb
CSTalks - Named Data Networks - 9 Feb
 
CSTalks - Model Checking - 26 Jan
CSTalks - Model Checking - 26 JanCSTalks - Model Checking - 26 Jan
CSTalks - Model Checking - 26 Jan
 
CSTalks - GPGPU - 19 Jan
CSTalks  -  GPGPU - 19 JanCSTalks  -  GPGPU - 19 Jan
CSTalks - GPGPU - 19 Jan
 

Último

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 

Último (20)

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 

CSTalks - Music Information Retrieval - 23 Feb

  • 1. CSTalks Similarity Measures in Music Information Retrieval Systems Speaker: Zhonghua Li Supervisor: Ye Wang lizhongh@comp.nus.edu.sg 1
  • 2. Where do you search for music?  Music is an important part of our way of life.  Music stores  Online music Mufin 2
  • 3. 3
  • 4. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Similarity Measure Methods  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 4
  • 5. Definition  Music Information Retrieval  Is the process of searching for, and finding, music objects, or part of objects, via a query framed musically and/or in musical terms.  Music objects: Recordings (wav, mp3, etc.), scores, parts, etc.  Musically framed query: Singing, humming, keyboad, notation-based, MIDI files, sound files, etc.  Music terms: Genre, style, tempo, bibliography, etc.  Applications  Music search, recommendation, identification ,etc. 5
  • 6. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Music Similarity Measure  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 6
  • 7. Applications -- Search  Text-based Music Search  Compare a textual query with the metadata  Is adopted by most existing systems.  Examples: Last.fm, Musicovery, … 7
  • 8. Applications -- Search  Content-based Music Search  Compare audio query with audio content  Query-by-humming/singing/recording: midomi 8
  • 9. Applications -- Search  Content-based Music Search  Compare rhythm tapped with audio content  Query-by-tapping: SongTapper 9
  • 10. Applications -- Search  User’s information need Online Offline (intention) : Query Explicit Query: text, audio, etc. Intention Gap Query Formation  Similarity measure: Descriptors Documents Query  Music documents in the Match Descriptor Extraction Database Indexing Semantic Gap Index Descriptors  Ranking: relevant documents by domain specific criterions (no. of Ranking hits ). Ranked List Presentation Results 10
  • 11. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Similarity Measure Methods  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 11
  • 12. Applications – Recommendation  Collaborative-Filtering-based Recommendation  Last.fm: what you (and others ) listen to and like,  Amazon: customers who shopped for … also shopped for … 12
  • 13. Applications – Recommendation  Collaborative-Filtering-based Recommendation  Last.fm: what you (and others ) listen to and like,  Amazon: customers who shopped for … also shopped for …  Example:  Users: A, B, and C  Music: 1, 2, …, 8 Small similarity Large similarity C A B Similarity 4 Similarity 1 Measure 1 Measure 2 2 6 3 3 8 4 5 Recommend to user A 13
  • 14. Applications -- Recommendation  Audio Content-based Recommendation  Recommend songs which have similar audio content to the songs that you like.  Pandora: Music database Music Experts User Listen Instrument: Instrument: Similarity Instrument: Vocal: Vocal: Vocal: Structure: Measure Structure: …Structure: … … 400 Attributes/song Recommendations 14
  • 15. Applications -- Recommendation  User’s information need Online / Offline Offline (intention): User Profile Implicit user profiles: ratings, Intention Gap Profile Capture listening history, etc. Descriptors Documents  Similarity measure: Descriptor User profiles Music Match Extraction documents/other user profiles Semantic Gap Index Indexing Descriptors  Ranking: relevant documents Ranking by some domain specific Ranked List criterions (no. of hits). Presentation Results 15
  • 16. Similarity Measure  one of the most fundamental concepts in MIR Online / Offline Offline  Closely related to User Profile /query  What information music Profile/query Intention Gap Capture contains.  How this information is Descriptors Documents represented. Match Descriptor Extraction  How to match between themSemantic Gap Indexing Index Descriptors Ranking Ranked List Presentation Results 16
  • 17. Music Information Plane Similarity can be measure from different aspects. Song1: New favorite - Alison Krauss Song2: She is Beautiful - Andrew W.K. Song1 Song2 Female Male Dissimilar Gentle Aggressive Slow fast Guitar Similar Tempo: ~162 BPM (Beat Per Minute) * O. C. Herrada. Music recommendation and discovery in Music the long tail. PhD thsis. 2008. 17
  • 18. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Similarity Measure Methods  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 18
  • 19. Similarity Measure Methods  Text-based Method: Okapi BM-25 Ranking  Given: queryQ, containing keywords q1, …, qn, music documents: bag of words.  BM25 ranking function can be formulated as:  f(qi, D) is qi’s term frequency (tf) in document D.  |D| is the length of document D in words.  avgdl is the average document length in the collection.  k1 and b are free parameters, usually set as k1=2.0 and b=0.75.  IDF(qi) is the inverse document frequency (idf), calculated as: . The query term appears in this document frequently. f (qi, D) . And it doesn’t appear in other document. IDF 19
  • 20. Similarity Measure Methods  Text-based Method:  Pros  Simple & efficient  Cons  Affected by noisy/wrong texts  Songs with no text cannot be retrieved  Require high-level domain knowledge to create good metadata  “Text retrieval on audio metadata” not pure music retrieval 20
  • 21. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Similarity Measure Methods  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 21
  • 22. Similarity Measure Methods  Audio Feature-based Method Audio feature Distribution Model Music extraction modeling comparison 22
  • 23. Similarity Measure Methods  Audio Feature-based Method  Audio Feature extraction -> Distribution modeling -> Model Comparison frame Feature Vector … 23
  • 24. Existing Works  Audio Feature-based Method  Audio Feature extraction  Distribution modeling  Model Comparison  Use low-level feature directly  Pitch, loudness, MFCC (Blum et al.[3], 1999)  Histogram of MFCC (Foote[4], 1997)  Spectrum, rhythm, chord changesingleVector (Tzanetakis [5], 2002)  Low-level features  higher-level features.  Cluster MFCC=>model comparison (Aucouturier[6], 2002)  MFCC => Gaussian Mixture Models => model comparison  MFCC =>“anchor space”, compare probability models (Berenzweig et al. [7], 2003) 24
  • 25. Similarity Measure Methods  Audio Feature-based Method  Audio Feature extraction  Distribution modeling  Model Comparison  Euclidean /Cosine distance (uniform-length feature vectors)  Distance between two probability distributions  Kullback-Leibler divergence (KL Distance) / relative entropy  No closed form for Gaussian Distributions  Centroid distance: Euclidean distance between the overall means;  Sampling based method: compute the likelihood of one model given points sampled from another; very computationally expensive;  Earth-Mover’s distance Berenzweig, A., Logan, B., Ellis, D. P., and Whitman, B. P., A Large-Scale Evaluation of Acoustic and Subjective Music-Similarity Measures. Computer Music Journal. 28, 2004, 25
  • 26. Similarity Measure Methods  Audio Feature-based Method  Pros  Can deal with new songs with no or few texts.  Save human labors from annotating each song manually  Cons  Time complexity is relatively high.  Features ≠ audio piece: Two songs with very similar features may sounds very different.  The average performance is reaching the glass ceiling of around 65% in accuracy. 26
  • 27. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Similarity Measure Methods  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 27
  • 28. Similarity Measure Methods  Semantic Concept-based Method  Nature of user queries  Far beyond of bibliographic text and audio search  Semantically-rich  Syntactically- undetermined e.g.: “Find me a classical and happy song”, or “Find me a song to relax” “Find me some songs for parties/ weddings/ in churches” …  Collaborative(social) tagging is very popular on Web 2.0. Users annotate their feelings or opinions to the music. Tags, comments, etc. 28
  • 29. Similarity Measure Methods  Semantic Concept-based Method  Tags VS user queries (Last.fm) Tag Type Frequency Multi-tag search queries Genre 68% 51% Locale 12% 7% Mood 5% 4% Opinion 4% 2% Instrumentation 4% 5% Style 3% 26% . Paul Lamere. Social tagging and music information retrieval. Journal of New Music Research. 2008. . Klaas Bosteels, Elias Pampalk, and Etienne E. Kerre. Music retrieval based on social tags: a case study. ISMIR, 2008. 29
  • 30. Similarity Measure Methods Vocabulary: classical, jazz, … piano, violin, …, female, male, … … Model Model … Model … Probability vector Song1 Similarity … Song2 30
  • 31. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Similarity Measure Methods  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 31
  • 32. Similarity Measure Methods  Multimodal Method  Information keeps growing.  One of the most important ongoing trends: Metadata Audio Semantic Content Concept  Users are important. 32
  • 33. Similarity Measure Methods  Multimodal Method Document Vectors Customization Fuzzy Music Semantic Vector B. Zhang, J. Shen, Q. Xiang, and Y. Wang. CompositMap: a novel framework for music similarity measure. ACM Multimedia, 2009. 33
  • 34. Outline  Music Information Retrieval (MIR)  Definition  Applications – Search  Applications -- Recommendation  Similarity Measure Methods  Text-based Method  Audio Feature-based Method  Semantic Concept-based Method  Multimodal Fusion Method  Conclusion and Future Directions 34
  • 35. Conclusion and Future Directions  What makes MIR (and the similarity measure) so tricky? Music information is  Multimodal: audio, metadata, social , …  Multicultural: e.g., modern art, Indian ragas, …  Multirepresentational: audio, MIDI, score, …  Multifaceted: melody, tempo, beat, … …  Similarity can be measured from different aspects. 35
  • 36. Conclusions and Future Directions  What do users really want? Intention Gap  User interactions with the system.  Learn a good user preference modeling  What kind of music features can really capture this need?  Content –Tags Semantic Gap  Leverage more social data? Comments, ratings, groups, playlist, other user created information, …  How to fuse multiple information effectively?  Identify the relevant/discriminative information aspects  Fusion Methods 36
  • 37. 37
  • 38. References  [2] F[1] O. C. Herrada. Music recommendation and discovery in the long tail. PhD thsis. 2008.  . Pachet. Knowledge management and musical metadata. Encyclopedia of Knowledge Management. Idea Group, 2005.  [3] T. L. Blum, D. F. Keislar, J. A. Wheaton, and E. H. Wold. Method and article of manufacture for content-based analysis, storage, retrieval, and segmentation of audio information. U.S. Patent 5, 918, 223.  [4] J. T. Foote. Content-based retrieval of music and audio. SPIE, 1997.  [5] G. Tzanetakis. Manipulation, analysis, and retrieval system for audio signals. PhD thsis, 2002.  [6] J. J. Aucouturier and F. Pachet. Music similarity measure: What’s the use? International Symposium on Music information retrieval. 2002.  [7] A. Berenzweig, D. P. W. Ellis and S. Lawrence. Anchor space for classification and similarity measurement for music. ICME 2003. 38
  • 39. References  [8] B. Zhang, J. Shen, Q. Xiang and Y. Wang. CompositeMap: a novel framework for music similarity measure. SIGIR, 2009.  [9] B. Whiteman and S. Lawrence. Inferring descriptions and similarity for music from community metadata. International computer music conference. 2002.  [10] M. Schedl, T. Pohle, P. Knees and G. Widmer. Assigning and visualizing music genre by web-based co-occurrence analysis. ISMIR 2006.  [11] B. Whitman and Paris Smaragdis. Combining musical and cultural features for intelligent style detection. ISMIR 2002.  [12] L. Chen, P. Wright, and W. Nejdl. Improving music genre classification using collaborative tagging data. WSDM, 2009.  [13] Benedikt Raes. Automatic generation of music metadata. ISMIR, 2009. 39