SlideShare a Scribd company logo
1 of 45
Download to read offline
MMM, Search!
Daniel Tunkelang
dtunkelang@gmail.com

Presented to Wikimedia Foundation on April 27, 2020
What is search?
Search is a process.
ā€¢ Searchers
ā€¢ start with information-seeking goals.

ā€¢ express and elaborate those goals as queries.

ā€¢ Search Engines
ā€¢ translate queries into representations of intent.

ā€¢ retrieve results relevant to that intent and rank them.

Communication isnā€™t perfect, so the process is iterative.
Search is many things.
ā€¢ Known-Item search vs. exploratory search.

ā€¢ Seeking speciļ¬c item vs. knowing when you see it.

ā€¢ Search is a means to an end, not the end itself.

ā€¢ Getting information, shopping, communication, etc.

ā€¢ It takes a lot of hard work to make search feel eļ¬€ortless.

ā€¢ Indexing, query understanding, matching, ranking.
Metrics, Models, Methods
The most important decisions for a search engine are:

ā€¢ Metrics: what we measure and optimize for.

ā€¢ Models: how we model the search experience.

ā€¢ Methods: how we help searchers achieve success.
Metrics
Metrics:
What do we need to know?
ā€¢ Binary Relevance

ā€¢ Are searchers ļ¬nding relevant results?

ā€¢ Session Success

ā€¢ How often are search sessions successful?

ā€¢ Search Eļ¬ƒciency

ā€¢ How much eļ¬€ort are searchers making?
Binary Relevance
Relevance is a measure of
information conveyed by a
document relative to a query.ā€Ø
ā€Ø
Relationship between document
and query, though necessary, is not
suļ¬ƒcient to determine relevance.

William Goļ¬€man, 1964
Relevance has shades of gray, but
non-relevance is black and white.
Example: Email
ā€¢ Can Google decide which of my emails are important?

ā€¢ ĀÆ_(惄)_/ĀÆ

ā€¢ Can Google decide which of my emails are spam?

ā€¢ Deļ¬nitely!
Measure Binary Relevance!
ā€¢ Build a (query, document) binary relevance model.

ā€¢ (weā€™ll get back to that in a moment)

ā€¢ Embrace positional bias: measure at top ranks.

ā€¢ Can use top k results or weighted sample.

ā€¢ Stratify for meaningful query and user segments.

ā€¢ Leverage query classiļ¬cation and user data.
Search is a journey.
Searcher
Search isnā€™t always one-shot.
Search canā€™t be always one-shot.
Measure Session Success!
ā€¢ Measure session conversion, not just query conversion.

ā€¢ Much better proxy for userā€™s success!

ā€¢ Compute metrics based on ļ¬rst query of session.

ā€¢ Distribution of journeys for common intent.

ā€¢ Segment sessions into tasks? Maybe, but optional. 

ā€¢ Multi-task sessions uncommon; treat as noise.
Search Efļ¬ciency
Searching is not fun.
Having found is fun.
ā€¢ If search is too hard or takes too long, searchers give up.

ā€¢ Compare successful and unsuccessful sessions.

ā€¢ Measure how much time searchers spend in sessions.

ā€¢ Especially time on search rather than results.

ā€¢ Measure searcher eļ¬€ort.

ā€¢ Pagination, reformulation, reļ¬nement, etc.
Metrics: Summary
ā€¢ Binary Relevance

ā€¢ Are searchers ļ¬nding relevant results?

ā€¢ Session Success

ā€¢ How often are search sessions successful?

ā€¢ Search Eļ¬ƒciency

ā€¢ How much eļ¬€ort are searchers making?
Models
Models:
What do we model and how?
ā€¢ Query Categorization

ā€¢ What is the primary domain for a query?

ā€¢ Query Similarity

ā€¢ Do two queries express similar / identical intent?

ā€¢ Binary Relevance

ā€¢ How to estimate relevance of results to queries?
Query Categorization
Search starts with query understanding.
Query understanding starts with categorization.
ā€¢ Map query to a primary content taxonomy.

ā€¢ Subject, product type, domain, etc.

ā€¢ Identify high-level intent, independent of content interest.

ā€¢ Title, category, brand, site help, etc.

ā€¢ Categories should be coherent, distinctive, and useful.

ā€¢ Good categorization requires good categories.
How to Train Your
Query Categorization Model
ā€¢ Label your most frequent head queries manually.

ā€¢ Top 1000 queries are probably worth it.

ā€¢ For torso queries, infer categories from engagement.

ā€¢ Looking for overwhelmingly dominant category.

ā€¢ Now train a model using labeled head and torso queries.

ā€¢ This training data is biased, but manageably so.

ā€¢ No need to use fancy deep learning / AI. Try fastText.
Query Similarity
Query ambiguity is rare.
Query similarity is common.
ā€¢ Some queries do not express a clear intent, but most do.

ā€¢ Most ā€œambiguousā€ queries turn out to be broad.

ā€¢ Bigger opportunity: multiple queries express same intent.

ā€¢ Or at least the same distribution of intents.

ā€¢ Recognizing similar / identical queries is huge opportunity.

ā€¢ Query rewriting, aggregating signals, etc.
How to Model
Query Similarity
ā€¢ Start with the simple stuļ¬€: shallow query canonicalization.

ā€¢ Character normalization, stemming, word order.

ā€¢ Look at edit distance, especially for spelling errors.

ā€¢ Tail queries at edit distance 1 from head queries.

ā€¢ Compare embeddings of queries and results.

ā€¢ Especially to keep the other methods honest.
Binary Relevance
Focus on simplest question.
ā€¢ Worry whether a result is relevant or non-relevant.

ā€¢ Relevant vs. more relevant is often subjective.

ā€¢ Assume that query understanding has done its job ļ¬rst.

ā€¢ Result relevance depends on query understanding.

ā€¢ Assume that relevance is objective and universal.

ā€¢ Personalization: a nice-to-have, not a must-have.
How to Train Your
Binary Relevance Model
ā€¢ Collect human binary relevance judgments. Lots of them. 

ā€¢ Quantity is more important than quality.

ā€¢ Pay attention to query distribution and stratify sample. 

ā€¢ Collect judgements that teach you something.

ā€¢ Come to terms with presentation and position bias.

ā€¢ Users mostly interact with top-ranked results.
Models: Summary
ā€¢ Query Categorization

ā€¢ Simple model to map query to primary intent.

ā€¢ Query Similarity

ā€¢ Recognize queries with same or similar intent.

ā€¢ Binary Relevance

ā€¢ Use human judgments to train relevance model.
Methods
Methods:
What are some useful tricks?
ā€¢ Optimize for Query Performance

ā€¢ Help searchers make better queries.

ā€¢ Map Tail Queries to Head Intents

ā€¢ Searchers arenā€™t as unique as you think!

ā€¢ Learn from Successful Sessions

ā€¢ Help others discovers successful paths.
Optimize for
Query Performance
ā€¢ Expected searcher success for query.

ā€¢ Function of query, not of any particular result.

ā€¢ Can use any measure of searcher success.

ā€¢ But consider focusing on session success.

ā€¢ Can incorporate sorting, reļ¬nement, or other factors.

ā€¢ But keep it simple. Query is probably enough.
What is query performance?
Best way to predict query performance?
Historical query performance.
Stuck in the tail? No data?
These methods can help.
Predict query performance.
Then optimize for it.
ā€¢ Consider every surface where you suggest queries.

ā€¢ Autocomplete, guides, related searches, etc.

ā€¢ Oļ¬€er suggestions with high predicted performance.

ā€¢ Or at least nudge users wherever possible.

ā€¢ Use query rewriting to improve query performance.

ā€¢ Rewrite to similar, high-performing queries.
Pull Your Tail From Your Head
Many tail queries
express head intents.
ā€¢ Misspelled queries are often misspellings of head queries.

ā€¢ Common misspellings are uncommon.

ā€¢ Many queries have a dominant singular or plural form.

ā€¢ Often, though not always, the same intent.

ā€¢ Also word order or other grammatical transformations.

ā€¢ Such removal of low-information / noise words.
Rewrite tail queries!
ā€¢ Prioritize correcting misspellings of head queries.

ā€¢ Be more aggressive, skip tokenization, etc.

ā€¢ Look for head queries equivalent to tail queries.

ā€¢ Stemming, reordering terms, dropping noise words.

ā€¢ But check to make sure intent is actually preserved!

ā€¢ Remember earlier discussion of query similarity.
Learn From Success
Successful searchers
can help everyone else.
ā€¢ Some queries lead to great performance for everyone.

ā€¢ e.g., known-item searches by name or title.

ā€¢ But for some queries, performance is user-dependent.

ā€¢ Some users are more sophisticated or persistent.

ā€¢ Successful users discovers successful paths.

ā€¢ Use trails of successful users to build shortcuts!
Optimize complex journeys.
ā€¢ Detect the searches for which searchers need help.

ā€¢ Queries for which successful sessions are long.

ā€¢ Find the actions that successful searchers take.

ā€¢ Category / facet reļ¬nements, reformulations.

ā€¢ Promote those actions in the search experience.

ā€¢ Create shortcuts in the navigational landscape.
Methods: Summary
ā€¢ Optimize for Query Performance

ā€¢ Suggest better queries and rewrite others.

ā€¢ Map Tail Queries to Head Intents

ā€¢ Rewrite tail queries as similar head queries.

ā€¢ Learn from Successful Sessions

ā€¢ Create shortcuts based on successful paths.
Putting It All Together
ā€¢ Metrics, models, and methods ā€” they all matter.

ā€¢ Query understanding ļ¬rst, then result relevance.

ā€¢ Binary result relevance ļ¬rst, then result ranking.

ā€¢ Session performance, not just query performance.

ā€¢ Get as much leverage as possible from head queries.
Thank You!
ā€¢ More Resources

ā€¢ Query Understandingā€Ø
https://queryunderstanding.com/

ā€¢ My Medium (not just about search)ā€Ø
https://medium.com/@dtunkelang

ā€¢ Contact me directly!ā€Ø
dtunkelang@gmail.com

More Related Content

What's hot

į„€į…„į†·į„‰į…¢į†Øį„‹į…¦į†«į„Œį…µį†«į„‹į…µ į„ƒį…¦į„‹į…µį„į…„į„…į…³į†Æ į„ƒį…”į„…į…®į„‚į…³į†« į„‡į…„į†ø į„€į…µį†·į„Œį…©į†¼į„†į…µį†«
į„€į…„į†·į„‰į…¢į†Øį„‹į…¦į†«į„Œį…µį†«į„‹į…µ į„ƒį…¦į„‹į…µį„į…„į„…į…³į†Æ į„ƒį…”į„…į…®į„‚į…³į†« į„‡į…„į†ø į„€į…µį†·į„Œį…©į†¼į„†į…µį†«į„€į…„į†·į„‰į…¢į†Øį„‹į…¦į†«į„Œį…µį†«į„‹į…µ į„ƒį…¦į„‹į…µį„į…„į„…į…³į†Æ į„ƒį…”į„…į…®į„‚į…³į†« į„‡į…„į†ø į„€į…µį†·į„Œį…©į†¼į„†į…µį†«
į„€į…„į†·į„‰į…¢į†Øį„‹į…¦į†«į„Œį…µį†«į„‹į…µ į„ƒį…¦į„‹į…µį„į…„į„…į…³į†Æ į„ƒį…”į„…į…®į„‚į…³į†« į„‡į…„į†ø į„€į…µį†·į„Œį…©į†¼į„†į…µį†«ģ¢…ėƼ ź¹€
Ā 
Introduction to Data Mining / Bioinformatics
Introduction to Data Mining / BioinformaticsIntroduction to Data Mining / Bioinformatics
Introduction to Data Mining / BioinformaticsGerald Lushington
Ā 
ėÆøė“±ė”ė‹Øģ–“ ė¬øģ œ ķ•“ź²°ģ„ ģœ„ķ•œ ė¹„ģ§€ė„ķ•™ģŠµ źø°ė°˜ ķ•œźµ­ģ–“ģžģ—°ģ–“ģ²˜ė¦¬ ė°©ė²•ė”  ė° ģ‘ģš©
ėÆøė“±ė”ė‹Øģ–“ ė¬øģ œ ķ•“ź²°ģ„ ģœ„ķ•œ ė¹„ģ§€ė„ķ•™ģŠµ źø°ė°˜ ķ•œźµ­ģ–“ģžģ—°ģ–“ģ²˜ė¦¬ ė°©ė²•ė”  ė° ģ‘ģš©ėÆøė“±ė”ė‹Øģ–“ ė¬øģ œ ķ•“ź²°ģ„ ģœ„ķ•œ ė¹„ģ§€ė„ķ•™ģŠµ źø°ė°˜ ķ•œźµ­ģ–“ģžģ—°ģ–“ģ²˜ė¦¬ ė°©ė²•ė”  ė° ģ‘ģš©
ėÆøė“±ė”ė‹Øģ–“ ė¬øģ œ ķ•“ź²°ģ„ ģœ„ķ•œ ė¹„ģ§€ė„ķ•™ģŠµ źø°ė°˜ ķ•œźµ­ģ–“ģžģ—°ģ–“ģ²˜ė¦¬ ė°©ė²•ė”  ė° ģ‘ģš©NAVER Engineering
Ā 
Query Understanding: A Manifesto
Query Understanding: A ManifestoQuery Understanding: A Manifesto
Query Understanding: A ManifestoDaniel Tunkelang
Ā 
ģ°Øź³”ģ°Øź³” ģ‰½ź²Œ ģ•Œģ•„ź°€ėŠ” Elasticsearchģ™€ Node.js
ģ°Øź³”ģ°Øź³” ģ‰½ź²Œ ģ•Œģ•„ź°€ėŠ” Elasticsearchģ™€ Node.jsģ°Øź³”ģ°Øź³” ģ‰½ź²Œ ģ•Œģ•„ź°€ėŠ” Elasticsearchģ™€ Node.js
ģ°Øź³”ģ°Øź³” ģ‰½ź²Œ ģ•Œģ•„ź°€ėŠ” Elasticsearchģ™€ Node.jsHeeJung Hwang
Ā 
[211] į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Øį„€į…Ŗ į„ƒį…¦į„‹į…µį„į…„į„†į…”į„‹į…µį„‚į…µį†¼
[211] į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Øį„€į…Ŗ į„ƒį…¦į„‹į…µį„į…„į„†į…”į„‹į…µį„‚į…µį†¼[211] į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Øį„€į…Ŗ į„ƒį…¦į„‹į…µį„į…„į„†į…”į„‹į…µį„‚į…µį†¼
[211] į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Øį„€į…Ŗ į„ƒį…¦į„‹į…µį„į…„į„†į…”į„‹į…µį„‚į…µį†¼NAVER D2
Ā 
Dependency Parser, ģ˜ģ”“ źµ¬ģ”° ė¶„ģ„źø°
Dependency Parser, ģ˜ģ”“ źµ¬ģ”° ė¶„ģ„źø°Dependency Parser, ģ˜ģ”“ źµ¬ģ”° ė¶„ģ„źø°
Dependency Parser, ģ˜ģ”“ źµ¬ģ”° ė¶„ģ„źø°ģ°¬ķ¬ ģ“
Ā 
NLP techniques for log analysis
NLP techniques for log analysisNLP techniques for log analysis
NLP techniques for log analysisJacob Perkins
Ā 
Applying Hidden Markov Models to Bioinformatics
Applying Hidden Markov Models to BioinformaticsApplying Hidden Markov Models to Bioinformatics
Applying Hidden Markov Models to Bioinformaticsbutest
Ā 
Elasticsearchė„¼ ķ™œģš©ķ•œ GIS ź²€ģƒ‰
Elasticsearchė„¼ ķ™œģš©ķ•œ GIS ź²€ģƒ‰Elasticsearchė„¼ ķ™œģš©ķ•œ GIS ź²€ģƒ‰
Elasticsearchė„¼ ķ™œģš©ķ•œ GIS ź²€ģƒ‰ksdc2019
Ā 
į„‹į…µį†«į„€į…©į†¼į„Œį…µį„‚į…³į†¼į„Žį…®į„Žį…„į†«į„‰į…µį„‰į…³į„į…¦į†· airsį„€į…¢į„‡į…”į†Æį„€į…µ_į„†į…©į„ƒį…¦į†Æį„…į…µį†¼į„€į…Ŗį„‰į…µį„‰į…³į„į…¦į†·
į„‹į…µį†«į„€į…©į†¼į„Œį…µį„‚į…³į†¼į„Žį…®į„Žį…„į†«į„‰į…µį„‰į…³į„į…¦į†· airsį„€į…¢į„‡į…”į†Æį„€į…µ_į„†į…©į„ƒį…¦į†Æį„…į…µį†¼į„€į…Ŗį„‰į…µį„‰į…³į„į…¦į†·į„‹į…µį†«į„€į…©į†¼į„Œį…µį„‚į…³į†¼į„Žį…®į„Žį…„į†«į„‰į…µį„‰į…³į„į…¦į†· airsį„€į…¢į„‡į…”į†Æį„€į…µ_į„†į…©į„ƒį…¦į†Æį„…į…µį†¼į„€į…Ŗį„‰į…µį„‰į…³į„į…¦į†·
į„‹į…µį†«į„€į…©į†¼į„Œį…µį„‚į…³į†¼į„Žį…®į„Žį…„į†«į„‰į…µį„‰į…³į„į…¦į†· airsį„€į…¢į„‡į…”į†Æį„€į…µ_į„†į…©į„ƒį…¦į†Æį„…į…µį†¼į„€į…Ŗį„‰į…µį„‰į…³į„į…¦į†·NAVER D2
Ā 
[216]į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Ø į„‰į…”į„‹į…­į†¼į„Œį…”į„…į…³į†Æ į„†į…”į†«į„Œį…©į†Øį„‰į…µį„į…§į„…į…”! į„‹į…“į„ƒį…©į„‘į…”į„‹į…”į†Øį„€į…Ŗ į„‹į…“į„†į…µį„€į…„į†·į„‰į…¢į†Ø
[216]į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Ø į„‰į…”į„‹į…­į†¼į„Œį…”į„…į…³į†Æ į„†į…”į†«į„Œį…©į†Øį„‰į…µį„į…§į„…į…”!   į„‹į…“į„ƒį…©į„‘į…”į„‹į…”į†Øį„€į…Ŗ į„‹į…“į„†į…µį„€į…„į†·į„‰į…¢į†Ø[216]į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Ø į„‰į…”į„‹į…­į†¼į„Œį…”į„…į…³į†Æ į„†į…”į†«į„Œį…©į†Øį„‰į…µį„į…§į„…į…”!   į„‹į…“į„ƒį…©į„‘į…”į„‹į…”į†Øį„€į…Ŗ į„‹į…“į„†į…µį„€į…„į†·į„‰į…¢į†Ø
[216]į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Ø į„‰į…”į„‹į…­į†¼į„Œį…”į„…į…³į†Æ į„†į…”į†«į„Œį…©į†Øį„‰į…µį„į…§į„…į…”! į„‹į…“į„ƒį…©į„‘į…”į„‹į…”į†Øį„€į…Ŗ į„‹į…“į„†į…µį„€į…„į†·į„‰į…¢į†ØNAVER D2
Ā 
Learning to rank
Learning to rankLearning to rank
Learning to rankBruce Kuo
Ā 
Learning to Rank Presentation (v2) at LexisNexis Search Guild
Learning to Rank Presentation (v2) at LexisNexis Search GuildLearning to Rank Presentation (v2) at LexisNexis Search Guild
Learning to Rank Presentation (v2) at LexisNexis Search GuildSujit Pal
Ā 
"La intenciĆ³n es lo que cuenta" en SEO en 2021
"La intenciĆ³n es lo que cuenta" en SEO en 2021"La intenciĆ³n es lo que cuenta" en SEO en 2021
"La intenciĆ³n es lo que cuenta" en SEO en 2021MJ CachĆ³n YƔƱez
Ā 
NGS data formats and analyses
NGS data formats and analysesNGS data formats and analyses
NGS data formats and analysesrjorton
Ā 
[LondonSEO 2020] BigQuery & SQL for SEOs
[LondonSEO 2020] BigQuery & SQL for SEOs[LondonSEO 2020] BigQuery & SQL for SEOs
[LondonSEO 2020] BigQuery & SQL for SEOsAreej AbuAli
Ā 
SEO Başarı Hikayesi - Hangikredi.com 12 Mart'tan 24 EylĆ¼l Google Core GĆ¼ncell...
SEO Başarı Hikayesi - Hangikredi.com 12 Mart'tan 24 EylĆ¼l Google Core GĆ¼ncell...SEO Başarı Hikayesi - Hangikredi.com 12 Mart'tan 24 EylĆ¼l Google Core GĆ¼ncell...
SEO Başarı Hikayesi - Hangikredi.com 12 Mart'tan 24 EylĆ¼l Google Core GĆ¼ncell...Koray Tugberk GUBUR
Ā 
Thought Vectors and Knowledge Graphs in AI-powered Search
Thought Vectors and Knowledge Graphs in AI-powered SearchThought Vectors and Knowledge Graphs in AI-powered Search
Thought Vectors and Knowledge Graphs in AI-powered SearchTrey Grainger
Ā 

What's hot (20)

į„€į…„į†·į„‰į…¢į†Øį„‹į…¦į†«į„Œį…µį†«į„‹į…µ į„ƒį…¦į„‹į…µį„į…„į„…į…³į†Æ į„ƒį…”į„…į…®į„‚į…³į†« į„‡į…„į†ø į„€į…µį†·į„Œį…©į†¼į„†į…µį†«
į„€į…„į†·į„‰į…¢į†Øį„‹į…¦į†«į„Œį…µį†«į„‹į…µ į„ƒį…¦į„‹į…µį„į…„į„…į…³į†Æ į„ƒį…”į„…į…®į„‚į…³į†« į„‡į…„į†ø į„€į…µį†·į„Œį…©į†¼į„†į…µį†«į„€į…„į†·į„‰į…¢į†Øį„‹į…¦į†«į„Œį…µį†«į„‹į…µ į„ƒį…¦į„‹į…µį„į…„į„…į…³į†Æ į„ƒį…”į„…į…®į„‚į…³į†« į„‡į…„į†ø į„€į…µį†·į„Œį…©į†¼į„†į…µį†«
į„€į…„į†·į„‰į…¢į†Øį„‹į…¦į†«į„Œį…µį†«į„‹į…µ į„ƒį…¦į„‹į…µį„į…„į„…į…³į†Æ į„ƒį…”į„…į…®į„‚į…³į†« į„‡į…„į†ø į„€į…µį†·į„Œį…©į†¼į„†į…µį†«
Ā 
Introduction to Data Mining / Bioinformatics
Introduction to Data Mining / BioinformaticsIntroduction to Data Mining / Bioinformatics
Introduction to Data Mining / Bioinformatics
Ā 
ėÆøė“±ė”ė‹Øģ–“ ė¬øģ œ ķ•“ź²°ģ„ ģœ„ķ•œ ė¹„ģ§€ė„ķ•™ģŠµ źø°ė°˜ ķ•œźµ­ģ–“ģžģ—°ģ–“ģ²˜ė¦¬ ė°©ė²•ė”  ė° ģ‘ģš©
ėÆøė“±ė”ė‹Øģ–“ ė¬øģ œ ķ•“ź²°ģ„ ģœ„ķ•œ ė¹„ģ§€ė„ķ•™ģŠµ źø°ė°˜ ķ•œźµ­ģ–“ģžģ—°ģ–“ģ²˜ė¦¬ ė°©ė²•ė”  ė° ģ‘ģš©ėÆøė“±ė”ė‹Øģ–“ ė¬øģ œ ķ•“ź²°ģ„ ģœ„ķ•œ ė¹„ģ§€ė„ķ•™ģŠµ źø°ė°˜ ķ•œźµ­ģ–“ģžģ—°ģ–“ģ²˜ė¦¬ ė°©ė²•ė”  ė° ģ‘ģš©
ėÆøė“±ė”ė‹Øģ–“ ė¬øģ œ ķ•“ź²°ģ„ ģœ„ķ•œ ė¹„ģ§€ė„ķ•™ģŠµ źø°ė°˜ ķ•œźµ­ģ–“ģžģ—°ģ–“ģ²˜ė¦¬ ė°©ė²•ė”  ė° ģ‘ģš©
Ā 
Query Understanding: A Manifesto
Query Understanding: A ManifestoQuery Understanding: A Manifesto
Query Understanding: A Manifesto
Ā 
ģ°Øź³”ģ°Øź³” ģ‰½ź²Œ ģ•Œģ•„ź°€ėŠ” Elasticsearchģ™€ Node.js
ģ°Øź³”ģ°Øź³” ģ‰½ź²Œ ģ•Œģ•„ź°€ėŠ” Elasticsearchģ™€ Node.jsģ°Øź³”ģ°Øź³” ģ‰½ź²Œ ģ•Œģ•„ź°€ėŠ” Elasticsearchģ™€ Node.js
ģ°Øź³”ģ°Øź³” ģ‰½ź²Œ ģ•Œģ•„ź°€ėŠ” Elasticsearchģ™€ Node.js
Ā 
[211] į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Øį„€į…Ŗ į„ƒį…¦į„‹į…µį„į…„į„†į…”į„‹į…µį„‚į…µį†¼
[211] į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Øį„€į…Ŗ į„ƒį…¦į„‹į…µį„į…„į„†į…”į„‹į…µį„‚į…µį†¼[211] į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Øį„€į…Ŗ į„ƒį…¦į„‹į…µį„į…„į„†į…”į„‹į…µį„‚į…µį†¼
[211] į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Øį„€į…Ŗ į„ƒį…¦į„‹į…µį„į…„į„†į…”į„‹į…µį„‚į…µį†¼
Ā 
Dependency Parser, ģ˜ģ”“ źµ¬ģ”° ė¶„ģ„źø°
Dependency Parser, ģ˜ģ”“ źµ¬ģ”° ė¶„ģ„źø°Dependency Parser, ģ˜ģ”“ źµ¬ģ”° ė¶„ģ„źø°
Dependency Parser, ģ˜ģ”“ źµ¬ģ”° ė¶„ģ„źø°
Ā 
NLP techniques for log analysis
NLP techniques for log analysisNLP techniques for log analysis
NLP techniques for log analysis
Ā 
Applying Hidden Markov Models to Bioinformatics
Applying Hidden Markov Models to BioinformaticsApplying Hidden Markov Models to Bioinformatics
Applying Hidden Markov Models to Bioinformatics
Ā 
Elasticsearchė„¼ ķ™œģš©ķ•œ GIS ź²€ģƒ‰
Elasticsearchė„¼ ķ™œģš©ķ•œ GIS ź²€ģƒ‰Elasticsearchė„¼ ķ™œģš©ķ•œ GIS ź²€ģƒ‰
Elasticsearchė„¼ ķ™œģš©ķ•œ GIS ź²€ģƒ‰
Ā 
į„‹į…µį†«į„€į…©į†¼į„Œį…µį„‚į…³į†¼į„Žį…®į„Žį…„į†«į„‰į…µį„‰į…³į„į…¦į†· airsį„€į…¢į„‡į…”į†Æį„€į…µ_į„†į…©į„ƒį…¦į†Æį„…į…µį†¼į„€į…Ŗį„‰į…µį„‰į…³į„į…¦į†·
į„‹į…µį†«į„€į…©į†¼į„Œį…µį„‚į…³į†¼į„Žį…®į„Žį…„į†«į„‰į…µį„‰į…³į„į…¦į†· airsį„€į…¢į„‡į…”į†Æį„€į…µ_į„†į…©į„ƒį…¦į†Æį„…į…µį†¼į„€į…Ŗį„‰į…µį„‰į…³į„į…¦į†·į„‹į…µį†«į„€į…©į†¼į„Œį…µį„‚į…³į†¼į„Žį…®į„Žį…„į†«į„‰į…µį„‰į…³į„į…¦į†· airsį„€į…¢į„‡į…”į†Æį„€į…µ_į„†į…©į„ƒį…¦į†Æį„…į…µį†¼į„€į…Ŗį„‰į…µį„‰į…³į„į…¦į†·
į„‹į…µį†«į„€į…©į†¼į„Œį…µį„‚į…³į†¼į„Žį…®į„Žį…„į†«į„‰į…µį„‰į…³į„į…¦į†· airsį„€į…¢į„‡į…”į†Æį„€į…µ_į„†į…©į„ƒį…¦į†Æį„…į…µį†¼į„€į…Ŗį„‰į…µį„‰į…³į„į…¦į†·
Ā 
[216]į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Ø į„‰į…”į„‹į…­į†¼į„Œį…”į„…į…³į†Æ į„†į…”į†«į„Œį…©į†Øį„‰į…µį„į…§į„…į…”! į„‹į…“į„ƒį…©į„‘į…”į„‹į…”į†Øį„€į…Ŗ į„‹į…“į„†į…µį„€į…„į†·į„‰į…¢į†Ø
[216]į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Ø į„‰į…”į„‹į…­į†¼į„Œį…”į„…į…³į†Æ į„†į…”į†«į„Œį…©į†Øį„‰į…µį„į…§į„…į…”!   į„‹į…“į„ƒį…©į„‘į…”į„‹į…”į†Øį„€į…Ŗ į„‹į…“į„†į…µį„€į…„į†·į„‰į…¢į†Ø[216]į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Ø į„‰į…”į„‹į…­į†¼į„Œį…”į„…į…³į†Æ į„†į…”į†«į„Œį…©į†Øį„‰į…µį„į…§į„…į…”!   į„‹į…“į„ƒį…©į„‘į…”į„‹į…”į†Øį„€į…Ŗ į„‹į…“į„†į…µį„€į…„į†·į„‰į…¢į†Ø
[216]į„‚į…¦į„‹į…µį„‡į…„ į„€į…„į†·į„‰į…¢į†Ø į„‰į…”į„‹į…­į†¼į„Œį…”į„…į…³į†Æ į„†į…”į†«į„Œį…©į†Øį„‰į…µį„į…§į„…į…”! į„‹į…“į„ƒį…©į„‘į…”į„‹į…”į†Øį„€į…Ŗ į„‹į…“į„†į…µį„€į…„į†·į„‰į…¢į†Ø
Ā 
Learning to rank
Learning to rankLearning to rank
Learning to rank
Ā 
Learning to Rank Presentation (v2) at LexisNexis Search Guild
Learning to Rank Presentation (v2) at LexisNexis Search GuildLearning to Rank Presentation (v2) at LexisNexis Search Guild
Learning to Rank Presentation (v2) at LexisNexis Search Guild
Ā 
"La intenciĆ³n es lo que cuenta" en SEO en 2021
"La intenciĆ³n es lo que cuenta" en SEO en 2021"La intenciĆ³n es lo que cuenta" en SEO en 2021
"La intenciĆ³n es lo que cuenta" en SEO en 2021
Ā 
NGS data formats and analyses
NGS data formats and analysesNGS data formats and analyses
NGS data formats and analyses
Ā 
[LondonSEO 2020] BigQuery & SQL for SEOs
[LondonSEO 2020] BigQuery & SQL for SEOs[LondonSEO 2020] BigQuery & SQL for SEOs
[LondonSEO 2020] BigQuery & SQL for SEOs
Ā 
SEO Başarı Hikayesi - Hangikredi.com 12 Mart'tan 24 EylĆ¼l Google Core GĆ¼ncell...
SEO Başarı Hikayesi - Hangikredi.com 12 Mart'tan 24 EylĆ¼l Google Core GĆ¼ncell...SEO Başarı Hikayesi - Hangikredi.com 12 Mart'tan 24 EylĆ¼l Google Core GĆ¼ncell...
SEO Başarı Hikayesi - Hangikredi.com 12 Mart'tan 24 EylĆ¼l Google Core GĆ¼ncell...
Ā 
Genome assembly
Genome assemblyGenome assembly
Genome assembly
Ā 
Thought Vectors and Knowledge Graphs in AI-powered Search
Thought Vectors and Knowledge Graphs in AI-powered SearchThought Vectors and Knowledge Graphs in AI-powered Search
Thought Vectors and Knowledge Graphs in AI-powered Search
Ā 

Similar to MMM, Search!

Statistics for MBA.pptx
Statistics for MBA.pptxStatistics for MBA.pptx
Statistics for MBA.pptxPradeep513562
Ā 
Keyword research - Digital Marketing - SEO
Keyword research - Digital Marketing - SEOKeyword research - Digital Marketing - SEO
Keyword research - Digital Marketing - SEONeeraj Reddy
Ā 
Business research Questionnaire Design
Business research Questionnaire DesignBusiness research Questionnaire Design
Business research Questionnaire DesignNishant Pahad
Ā 
Using Surveys to Improve Your Library: Part 1 (Sept. 2018)
Using Surveys to Improve Your Library: Part 1 (Sept. 2018)Using Surveys to Improve Your Library: Part 1 (Sept. 2018)
Using Surveys to Improve Your Library: Part 1 (Sept. 2018)ALATechSource
Ā 
CBI Workshop -.pptx
CBI Workshop -.pptxCBI Workshop -.pptx
CBI Workshop -.pptxAmira Samy
Ā 
Questionnaire design for beginners (Bart Rienties)
Questionnaire design for beginners (Bart Rienties)Questionnaire design for beginners (Bart Rienties)
Questionnaire design for beginners (Bart Rienties)Bart Rienties
Ā 
Effective interviewing skills
Effective interviewing skillsEffective interviewing skills
Effective interviewing skillsJamshaid Iqbal
Ā 
business research method chp 7]
business research method  chp 7]business research method  chp 7]
business research method chp 7]fizza tanvir
Ā 
HCI_Lecture04.pptx
HCI_Lecture04.pptxHCI_Lecture04.pptx
HCI_Lecture04.pptxHARISSheikh31
Ā 
Requirements elicitation
Requirements elicitationRequirements elicitation
Requirements elicitationSyed Zaid Irshad
Ā 
Hci evaluationa frame work lec 14
Hci evaluationa frame work lec 14Hci evaluationa frame work lec 14
Hci evaluationa frame work lec 14Anwal Mirza
Ā 
Guidelines for search features development a comparison of general users and ...
Guidelines for search features development a comparison of general users and ...Guidelines for search features development a comparison of general users and ...
Guidelines for search features development a comparison of general users and ...Ferli Castillo
Ā 
Behavioural Interviewing Skills, 2012-2013
Behavioural Interviewing Skills, 2012-2013Behavioural Interviewing Skills, 2012-2013
Behavioural Interviewing Skills, 2012-2013ankiit aggarwal
Ā 
Simple User Research Methods: the First Step to Improving Your Website
Simple User Research Methods: the First Step to Improving Your WebsiteSimple User Research Methods: the First Step to Improving Your Website
Simple User Research Methods: the First Step to Improving Your WebsiteRebecca Blakiston
Ā 
Unit 2: Research.
Unit 2: Research.Unit 2: Research.
Unit 2: Research.Louis Dowson
Ā 
Research method - How to interview?
Research method - How to interview?Research method - How to interview?
Research method - How to interview?Hafizah Hajimia
Ā 
Getting it Right with Keyword Research - Stukent Expert Session
Getting it Right with Keyword Research - Stukent Expert SessionGetting it Right with Keyword Research - Stukent Expert Session
Getting it Right with Keyword Research - Stukent Expert SessionStukent Inc.
Ā 
Training Program Evaluation
Training Program EvaluationTraining Program Evaluation
Training Program EvaluationLaura Pasquini
Ā 
Free sample 25% Professional in Business Analysis PMI-PBA
Free sample 25%  Professional in Business Analysis PMI-PBAFree sample 25%  Professional in Business Analysis PMI-PBA
Free sample 25% Professional in Business Analysis PMI-PBAEVOLVE for Instructors Materials
Ā 

Similar to MMM, Search! (20)

Search Analytics - Comperio
Search Analytics - ComperioSearch Analytics - Comperio
Search Analytics - Comperio
Ā 
Statistics for MBA.pptx
Statistics for MBA.pptxStatistics for MBA.pptx
Statistics for MBA.pptx
Ā 
Keyword research - Digital Marketing - SEO
Keyword research - Digital Marketing - SEOKeyword research - Digital Marketing - SEO
Keyword research - Digital Marketing - SEO
Ā 
Business research Questionnaire Design
Business research Questionnaire DesignBusiness research Questionnaire Design
Business research Questionnaire Design
Ā 
Using Surveys to Improve Your Library: Part 1 (Sept. 2018)
Using Surveys to Improve Your Library: Part 1 (Sept. 2018)Using Surveys to Improve Your Library: Part 1 (Sept. 2018)
Using Surveys to Improve Your Library: Part 1 (Sept. 2018)
Ā 
CBI Workshop -.pptx
CBI Workshop -.pptxCBI Workshop -.pptx
CBI Workshop -.pptx
Ā 
Questionnaire design for beginners (Bart Rienties)
Questionnaire design for beginners (Bart Rienties)Questionnaire design for beginners (Bart Rienties)
Questionnaire design for beginners (Bart Rienties)
Ā 
Effective interviewing skills
Effective interviewing skillsEffective interviewing skills
Effective interviewing skills
Ā 
business research method chp 7]
business research method  chp 7]business research method  chp 7]
business research method chp 7]
Ā 
HCI_Lecture04.pptx
HCI_Lecture04.pptxHCI_Lecture04.pptx
HCI_Lecture04.pptx
Ā 
Requirements elicitation
Requirements elicitationRequirements elicitation
Requirements elicitation
Ā 
Hci evaluationa frame work lec 14
Hci evaluationa frame work lec 14Hci evaluationa frame work lec 14
Hci evaluationa frame work lec 14
Ā 
Guidelines for search features development a comparison of general users and ...
Guidelines for search features development a comparison of general users and ...Guidelines for search features development a comparison of general users and ...
Guidelines for search features development a comparison of general users and ...
Ā 
Behavioural Interviewing Skills, 2012-2013
Behavioural Interviewing Skills, 2012-2013Behavioural Interviewing Skills, 2012-2013
Behavioural Interviewing Skills, 2012-2013
Ā 
Simple User Research Methods: the First Step to Improving Your Website
Simple User Research Methods: the First Step to Improving Your WebsiteSimple User Research Methods: the First Step to Improving Your Website
Simple User Research Methods: the First Step to Improving Your Website
Ā 
Unit 2: Research.
Unit 2: Research.Unit 2: Research.
Unit 2: Research.
Ā 
Research method - How to interview?
Research method - How to interview?Research method - How to interview?
Research method - How to interview?
Ā 
Getting it Right with Keyword Research - Stukent Expert Session
Getting it Right with Keyword Research - Stukent Expert SessionGetting it Right with Keyword Research - Stukent Expert Session
Getting it Right with Keyword Research - Stukent Expert Session
Ā 
Training Program Evaluation
Training Program EvaluationTraining Program Evaluation
Training Program Evaluation
Ā 
Free sample 25% Professional in Business Analysis PMI-PBA
Free sample 25%  Professional in Business Analysis PMI-PBAFree sample 25%  Professional in Business Analysis PMI-PBA
Free sample 25% Professional in Business Analysis PMI-PBA
Ā 

More from Daniel Tunkelang

Query Understanding and Ecommerce
Query Understanding and EcommerceQuery Understanding and Ecommerce
Query Understanding and EcommerceDaniel Tunkelang
Ā 
Semantic Equivalence of e-Commerce Queries
Semantic Equivalence of e-Commerce QueriesSemantic Equivalence of e-Commerce Queries
Semantic Equivalence of e-Commerce QueriesDaniel Tunkelang
Ā 
Helping Searchers Satisfice through Query Understanding
Helping Searchers Satisfice through Query UnderstandingHelping Searchers Satisfice through Query Understanding
Helping Searchers Satisfice through Query UnderstandingDaniel Tunkelang
Ā 
Enterprise Intelligence
Enterprise IntelligenceEnterprise Intelligence
Enterprise IntelligenceDaniel Tunkelang
Ā 
Where should you put your data scientists?
Where should you put your data scientists?Where should you put your data scientists?
Where should you put your data scientists?Daniel Tunkelang
Ā 
Data Science: A Mindset for Productivity
Data Science: A Mindset for ProductivityData Science: A Mindset for Productivity
Data Science: A Mindset for ProductivityDaniel Tunkelang
Ā 
My Three Exā€™s: A Data Science Approach for Applied Machine Learning
My Three Exā€™s: A Data Science Approach for Applied Machine LearningMy Three Exā€™s: A Data Science Approach for Applied Machine Learning
My Three Exā€™s: A Data Science Approach for Applied Machine LearningDaniel Tunkelang
Ā 
Web science - How is it different?
Web science - How is it different?Web science - How is it different?
Web science - How is it different?Daniel Tunkelang
Ā 
Social Search in a Professional Context
Social Search in a Professional ContextSocial Search in a Professional Context
Social Search in a Professional ContextDaniel Tunkelang
Ā 
Search as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal JourneySearch as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal JourneyDaniel Tunkelang
Ā 
Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?Daniel Tunkelang
Ā 
Big Data, We Have a Communication Problem
Big Data, We Have a Communication Problem Big Data, We Have a Communication Problem
Big Data, We Have a Communication Problem Daniel Tunkelang
Ā 
How to Interview a Data Scientist
How to Interview a Data ScientistHow to Interview a Data Scientist
How to Interview a Data ScientistDaniel Tunkelang
Ā 
Information, Attention, and Trust: A Hierarchy of Needs
Information, Attention, and Trust: A Hierarchy of NeedsInformation, Attention, and Trust: A Hierarchy of Needs
Information, Attention, and Trust: A Hierarchy of NeedsDaniel Tunkelang
Ā 
Data By The People, For The People
Data By The People, For The PeopleData By The People, For The People
Data By The People, For The PeopleDaniel Tunkelang
Ā 
Content, Connections, and Context
Content, Connections, and ContextContent, Connections, and Context
Content, Connections, and ContextDaniel Tunkelang
Ā 
Scale, Structure, and Semantics
Scale, Structure, and SemanticsScale, Structure, and Semantics
Scale, Structure, and SemanticsDaniel Tunkelang
Ā 
Strata 2012: Humans, Machines, and the Dimensions of Microwork
Strata 2012: Humans, Machines, and the Dimensions of MicroworkStrata 2012: Humans, Machines, and the Dimensions of Microwork
Strata 2012: Humans, Machines, and the Dimensions of MicroworkDaniel Tunkelang
Ā 
Recommendations as a Conversation with the User
Recommendations as a Conversation with the UserRecommendations as a Conversation with the User
Recommendations as a Conversation with the UserDaniel Tunkelang
Ā 
Keeping It Professional: Relevance, Recommendations, and Reputation at LinkedIn
Keeping It Professional: Relevance, Recommendations, and Reputation at LinkedInKeeping It Professional: Relevance, Recommendations, and Reputation at LinkedIn
Keeping It Professional: Relevance, Recommendations, and Reputation at LinkedInDaniel Tunkelang
Ā 

More from Daniel Tunkelang (20)

Query Understanding and Ecommerce
Query Understanding and EcommerceQuery Understanding and Ecommerce
Query Understanding and Ecommerce
Ā 
Semantic Equivalence of e-Commerce Queries
Semantic Equivalence of e-Commerce QueriesSemantic Equivalence of e-Commerce Queries
Semantic Equivalence of e-Commerce Queries
Ā 
Helping Searchers Satisfice through Query Understanding
Helping Searchers Satisfice through Query UnderstandingHelping Searchers Satisfice through Query Understanding
Helping Searchers Satisfice through Query Understanding
Ā 
Enterprise Intelligence
Enterprise IntelligenceEnterprise Intelligence
Enterprise Intelligence
Ā 
Where should you put your data scientists?
Where should you put your data scientists?Where should you put your data scientists?
Where should you put your data scientists?
Ā 
Data Science: A Mindset for Productivity
Data Science: A Mindset for ProductivityData Science: A Mindset for Productivity
Data Science: A Mindset for Productivity
Ā 
My Three Exā€™s: A Data Science Approach for Applied Machine Learning
My Three Exā€™s: A Data Science Approach for Applied Machine LearningMy Three Exā€™s: A Data Science Approach for Applied Machine Learning
My Three Exā€™s: A Data Science Approach for Applied Machine Learning
Ā 
Web science - How is it different?
Web science - How is it different?Web science - How is it different?
Web science - How is it different?
Ā 
Social Search in a Professional Context
Social Search in a Professional ContextSocial Search in a Professional Context
Social Search in a Professional Context
Ā 
Search as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal JourneySearch as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal Journey
Ā 
Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?
Ā 
Big Data, We Have a Communication Problem
Big Data, We Have a Communication Problem Big Data, We Have a Communication Problem
Big Data, We Have a Communication Problem
Ā 
How to Interview a Data Scientist
How to Interview a Data ScientistHow to Interview a Data Scientist
How to Interview a Data Scientist
Ā 
Information, Attention, and Trust: A Hierarchy of Needs
Information, Attention, and Trust: A Hierarchy of NeedsInformation, Attention, and Trust: A Hierarchy of Needs
Information, Attention, and Trust: A Hierarchy of Needs
Ā 
Data By The People, For The People
Data By The People, For The PeopleData By The People, For The People
Data By The People, For The People
Ā 
Content, Connections, and Context
Content, Connections, and ContextContent, Connections, and Context
Content, Connections, and Context
Ā 
Scale, Structure, and Semantics
Scale, Structure, and SemanticsScale, Structure, and Semantics
Scale, Structure, and Semantics
Ā 
Strata 2012: Humans, Machines, and the Dimensions of Microwork
Strata 2012: Humans, Machines, and the Dimensions of MicroworkStrata 2012: Humans, Machines, and the Dimensions of Microwork
Strata 2012: Humans, Machines, and the Dimensions of Microwork
Ā 
Recommendations as a Conversation with the User
Recommendations as a Conversation with the UserRecommendations as a Conversation with the User
Recommendations as a Conversation with the User
Ā 
Keeping It Professional: Relevance, Recommendations, and Reputation at LinkedIn
Keeping It Professional: Relevance, Recommendations, and Reputation at LinkedInKeeping It Professional: Relevance, Recommendations, and Reputation at LinkedIn
Keeping It Professional: Relevance, Recommendations, and Reputation at LinkedIn
Ā 

Recently uploaded

%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrandmasabamasaba
Ā 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2
Ā 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech studentsHimanshiGarg82
Ā 
Artyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptxArtyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptxAnnaArtyushina1
Ā 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...masabamasaba
Ā 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...Shane Coughlan
Ā 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2
Ā 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyviewmasabamasaba
Ā 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisamasabamasaba
Ā 
%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benoni%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benonimasabamasaba
Ā 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...masabamasaba
Ā 
What Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationWhat Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationJuha-Pekka Tolvanen
Ā 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park masabamasaba
Ā 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...masabamasaba
Ā 
%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Hararemasabamasaba
Ā 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplatePresentation.STUDIO
Ā 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...SelfMade bd
Ā 
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburgmasabamasaba
Ā 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024VictoriaMetrics
Ā 

Recently uploaded (20)

%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand%in Midrand+277-882-255-28 abortion pills for sale in midrand
%in Midrand+277-882-255-28 abortion pills for sale in midrand
Ā 
Abortion Pill Prices Tembisa [(+27832195400*)] šŸ„ Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] šŸ„ Women's Abortion Clinic in T...Abortion Pill Prices Tembisa [(+27832195400*)] šŸ„ Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] šŸ„ Women's Abortion Clinic in T...
Ā 
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital TransformationWSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
WSO2Con2024 - WSO2's IAM Vision: Identity-Led Digital Transformation
Ā 
8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students8257 interfacing 2 in microprocessor for btech students
8257 interfacing 2 in microprocessor for btech students
Ā 
Artyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptxArtyushina_Guest lecture_YorkU CS May 2024.pptx
Artyushina_Guest lecture_YorkU CS May 2024.pptx
Ā 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
Ā 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
Ā 
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
WSO2CON 2024 - Cloud Native Middleware: Domain-Driven Design, Cell-Based Arch...
Ā 
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
Ā 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
Ā 
%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benoni%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benoni
Ā 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
Ā 
What Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the SituationWhat Goes Wrong with Language Definitions and How to Improve the Situation
What Goes Wrong with Language Definitions and How to Improve the Situation
Ā 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
Ā 
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
%+27788225528 love spells in new york Psychic Readings, Attraction spells,Bri...
Ā 
%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare
Ā 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
Ā 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Ā 
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
%in Rustenburg+277-882-255-28 abortion pills for sale in Rustenburg
Ā 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Ā 

MMM, Search!

  • 1. MMM, Search! Daniel Tunkelang dtunkelang@gmail.com Presented to Wikimedia Foundation on April 27, 2020
  • 3. Search is a process. ā€¢ Searchers ā€¢ start with information-seeking goals. ā€¢ express and elaborate those goals as queries. ā€¢ Search Engines ā€¢ translate queries into representations of intent. ā€¢ retrieve results relevant to that intent and rank them. Communication isnā€™t perfect, so the process is iterative.
  • 4. Search is many things. ā€¢ Known-Item search vs. exploratory search. ā€¢ Seeking speciļ¬c item vs. knowing when you see it. ā€¢ Search is a means to an end, not the end itself. ā€¢ Getting information, shopping, communication, etc. ā€¢ It takes a lot of hard work to make search feel eļ¬€ortless. ā€¢ Indexing, query understanding, matching, ranking.
  • 5. Metrics, Models, Methods The most important decisions for a search engine are: ā€¢ Metrics: what we measure and optimize for. ā€¢ Models: how we model the search experience. ā€¢ Methods: how we help searchers achieve success.
  • 7. Metrics: What do we need to know? ā€¢ Binary Relevance ā€¢ Are searchers ļ¬nding relevant results? ā€¢ Session Success ā€¢ How often are search sessions successful? ā€¢ Search Eļ¬ƒciency ā€¢ How much eļ¬€ort are searchers making?
  • 8. Binary Relevance Relevance is a measure of information conveyed by a document relative to a query.ā€Ø ā€Ø Relationship between document and query, though necessary, is not suļ¬ƒcient to determine relevance. William Goļ¬€man, 1964
  • 9. Relevance has shades of gray, but non-relevance is black and white.
  • 10. Example: Email ā€¢ Can Google decide which of my emails are important? ā€¢ ĀÆ_(惄)_/ĀÆ ā€¢ Can Google decide which of my emails are spam? ā€¢ Deļ¬nitely!
  • 11. Measure Binary Relevance! ā€¢ Build a (query, document) binary relevance model. ā€¢ (weā€™ll get back to that in a moment) ā€¢ Embrace positional bias: measure at top ranks. ā€¢ Can use top k results or weighted sample. ā€¢ Stratify for meaningful query and user segments. ā€¢ Leverage query classiļ¬cation and user data.
  • 12. Search is a journey. Searcher
  • 13. Search isnā€™t always one-shot. Search canā€™t be always one-shot.
  • 14. Measure Session Success! ā€¢ Measure session conversion, not just query conversion. ā€¢ Much better proxy for userā€™s success! ā€¢ Compute metrics based on ļ¬rst query of session. ā€¢ Distribution of journeys for common intent. ā€¢ Segment sessions into tasks? Maybe, but optional. ā€¢ Multi-task sessions uncommon; treat as noise.
  • 16. Searching is not fun. Having found is fun. ā€¢ If search is too hard or takes too long, searchers give up. ā€¢ Compare successful and unsuccessful sessions. ā€¢ Measure how much time searchers spend in sessions. ā€¢ Especially time on search rather than results. ā€¢ Measure searcher eļ¬€ort. ā€¢ Pagination, reformulation, reļ¬nement, etc.
  • 17. Metrics: Summary ā€¢ Binary Relevance ā€¢ Are searchers ļ¬nding relevant results? ā€¢ Session Success ā€¢ How often are search sessions successful? ā€¢ Search Eļ¬ƒciency ā€¢ How much eļ¬€ort are searchers making?
  • 19. Models: What do we model and how? ā€¢ Query Categorization ā€¢ What is the primary domain for a query? ā€¢ Query Similarity ā€¢ Do two queries express similar / identical intent? ā€¢ Binary Relevance ā€¢ How to estimate relevance of results to queries?
  • 21. Search starts with query understanding. Query understanding starts with categorization. ā€¢ Map query to a primary content taxonomy. ā€¢ Subject, product type, domain, etc. ā€¢ Identify high-level intent, independent of content interest. ā€¢ Title, category, brand, site help, etc. ā€¢ Categories should be coherent, distinctive, and useful. ā€¢ Good categorization requires good categories.
  • 22. How to Train Your Query Categorization Model ā€¢ Label your most frequent head queries manually. ā€¢ Top 1000 queries are probably worth it. ā€¢ For torso queries, infer categories from engagement. ā€¢ Looking for overwhelmingly dominant category. ā€¢ Now train a model using labeled head and torso queries. ā€¢ This training data is biased, but manageably so. ā€¢ No need to use fancy deep learning / AI. Try fastText.
  • 24. Query ambiguity is rare. Query similarity is common. ā€¢ Some queries do not express a clear intent, but most do. ā€¢ Most ā€œambiguousā€ queries turn out to be broad. ā€¢ Bigger opportunity: multiple queries express same intent. ā€¢ Or at least the same distribution of intents. ā€¢ Recognizing similar / identical queries is huge opportunity. ā€¢ Query rewriting, aggregating signals, etc.
  • 25. How to Model Query Similarity ā€¢ Start with the simple stuļ¬€: shallow query canonicalization. ā€¢ Character normalization, stemming, word order. ā€¢ Look at edit distance, especially for spelling errors. ā€¢ Tail queries at edit distance 1 from head queries. ā€¢ Compare embeddings of queries and results. ā€¢ Especially to keep the other methods honest.
  • 27. Focus on simplest question. ā€¢ Worry whether a result is relevant or non-relevant. ā€¢ Relevant vs. more relevant is often subjective. ā€¢ Assume that query understanding has done its job ļ¬rst. ā€¢ Result relevance depends on query understanding. ā€¢ Assume that relevance is objective and universal. ā€¢ Personalization: a nice-to-have, not a must-have.
  • 28. How to Train Your Binary Relevance Model ā€¢ Collect human binary relevance judgments. Lots of them. ā€¢ Quantity is more important than quality. ā€¢ Pay attention to query distribution and stratify sample. ā€¢ Collect judgements that teach you something. ā€¢ Come to terms with presentation and position bias. ā€¢ Users mostly interact with top-ranked results.
  • 29. Models: Summary ā€¢ Query Categorization ā€¢ Simple model to map query to primary intent. ā€¢ Query Similarity ā€¢ Recognize queries with same or similar intent. ā€¢ Binary Relevance ā€¢ Use human judgments to train relevance model.
  • 31. Methods: What are some useful tricks? ā€¢ Optimize for Query Performance ā€¢ Help searchers make better queries. ā€¢ Map Tail Queries to Head Intents ā€¢ Searchers arenā€™t as unique as you think! ā€¢ Learn from Successful Sessions ā€¢ Help others discovers successful paths.
  • 33. ā€¢ Expected searcher success for query. ā€¢ Function of query, not of any particular result. ā€¢ Can use any measure of searcher success. ā€¢ But consider focusing on session success. ā€¢ Can incorporate sorting, reļ¬nement, or other factors. ā€¢ But keep it simple. Query is probably enough. What is query performance?
  • 34. Best way to predict query performance? Historical query performance.
  • 35. Stuck in the tail? No data? These methods can help.
  • 36. Predict query performance. Then optimize for it. ā€¢ Consider every surface where you suggest queries. ā€¢ Autocomplete, guides, related searches, etc. ā€¢ Oļ¬€er suggestions with high predicted performance. ā€¢ Or at least nudge users wherever possible. ā€¢ Use query rewriting to improve query performance. ā€¢ Rewrite to similar, high-performing queries.
  • 37. Pull Your Tail From Your Head
  • 38. Many tail queries express head intents. ā€¢ Misspelled queries are often misspellings of head queries. ā€¢ Common misspellings are uncommon. ā€¢ Many queries have a dominant singular or plural form. ā€¢ Often, though not always, the same intent. ā€¢ Also word order or other grammatical transformations. ā€¢ Such removal of low-information / noise words.
  • 39. Rewrite tail queries! ā€¢ Prioritize correcting misspellings of head queries. ā€¢ Be more aggressive, skip tokenization, etc. ā€¢ Look for head queries equivalent to tail queries. ā€¢ Stemming, reordering terms, dropping noise words. ā€¢ But check to make sure intent is actually preserved! ā€¢ Remember earlier discussion of query similarity.
  • 41. Successful searchers can help everyone else. ā€¢ Some queries lead to great performance for everyone. ā€¢ e.g., known-item searches by name or title. ā€¢ But for some queries, performance is user-dependent. ā€¢ Some users are more sophisticated or persistent. ā€¢ Successful users discovers successful paths. ā€¢ Use trails of successful users to build shortcuts!
  • 42. Optimize complex journeys. ā€¢ Detect the searches for which searchers need help. ā€¢ Queries for which successful sessions are long. ā€¢ Find the actions that successful searchers take. ā€¢ Category / facet reļ¬nements, reformulations. ā€¢ Promote those actions in the search experience. ā€¢ Create shortcuts in the navigational landscape.
  • 43. Methods: Summary ā€¢ Optimize for Query Performance ā€¢ Suggest better queries and rewrite others. ā€¢ Map Tail Queries to Head Intents ā€¢ Rewrite tail queries as similar head queries. ā€¢ Learn from Successful Sessions ā€¢ Create shortcuts based on successful paths.
  • 44. Putting It All Together ā€¢ Metrics, models, and methods ā€” they all matter. ā€¢ Query understanding ļ¬rst, then result relevance. ā€¢ Binary result relevance ļ¬rst, then result ranking. ā€¢ Session performance, not just query performance. ā€¢ Get as much leverage as possible from head queries.
  • 45. Thank You! ā€¢ More Resources ā€¢ Query Understandingā€Ø https://queryunderstanding.com/ ā€¢ My Medium (not just about search)ā€Ø https://medium.com/@dtunkelang ā€¢ Contact me directly!ā€Ø dtunkelang@gmail.com