SlideShare una empresa de Scribd logo
1 de 30
Software Industry in India  and  Keyword Search Over Dynamic Categorized Information Manish Bhide [email_address]
My Background ,[object Object],[object Object],[object Object],[object Object]
Types of Software Companies (type of work) ,[object Object],[object Object],[object Object],[object Object]
Services Companies ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Services Companies ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Product Development ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Research and Development ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
How it all fits together ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Take-away for you… ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
How to find a Job in good companies ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Keyword Search Over Dynamic Categorized Information Joint work with: Venkatesan Chakravarthy,  Krithi Ramamritham and Prasan Roy
Motivating Example ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Motivating Example (contd..) ,[object Object],[object Object],[object Object],[object Object],[object Object]
Problem Statement ,[object Object],Q(t 1 ,t 2 ..,t l ) Keyword Query Top-K Categories d i  = Blog Posts A(d i ) = Attributes in user profile T(d i ) = Text of blog Blog posts about educational issues p c  = Text classifier “ PP Manifesto” ,[object Object],[object Object],C 1 : p c Categories C 2 : p c C 5 : p c C 6 : p c C 3 : p c C 4 : p c C N : p c d 1 : A(d 1 ), T(d 1 ) d 2 : A(d 2 ), T(d 2 )  d 3 : A(d 3 ), T(d 3 ) . . Information Repository
Scoring Function ,[object Object],[object Object],[object Object],[object Object]
Computing Top-K Categories ,[object Object],[object Object],Q(t 1 ,t 2 ..,t l ) Keyword Query Top-K Categories Meta-Data d N : A(d N ), T(d N ) C 1 : p c Categories C 2 : p c C 5 : p c C 6 : p c C 3 : p c C 4 : p c C N : p c d 1 : A(d 1 ), T(d 1 ) d 2 : A(d 2 ), T(d 2 )  d 3 : A(d 3 ), T(d 3 ) . . Information Repository
Naïve Approach: Update-all Strategy ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Need for an intelligent selective update strategy!
CS* Approach: Selective update of categories with selective data ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Overview ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Statistics Maintained by CS* C i : p c ,[object Object],[object Object],[object Object],Refresh Refresh ,[object Object],[object Object],rt(C i ) = s 6 tf s 6 (C i ,t) will be available Time-step d 1 d 2 d 3 d 4 d 5 d 6 d 7 d 8 d 9 s 1 s 2 s 3 s 4 s 5 s 6 s 7 s 8 s 9 Data-Items
Estimating approximate tf ,[object Object],[object Object],[object Object],[object Object],C i : p c Refresh rt(C i ) = s 6 tf s 6 (C i ,t) will be available Current Time ,[object Object],Time-step d 1 d 2 d 3 d 4 d 5 d 6 d 7 d 8 d* s 1 s 2 s 3 s 4 s 5 s 6 s 7 s 8 s* Data-Items
Overview ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Determining Important Categories ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Importance is a measure of the likelihood of the category being  used to answer a query in the future
Range Selection Problem ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Overview ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Query Answering Module ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Query Answering Module ,[object Object],[object Object],[object Object],[object Object],TA Score est s* (*,Q) Categories sorted based on tf est s* (*,t 1 ) x idf est s* (t 1 ) Categories sorted based on tf est s* (*,t l ) x idf est s* (t 1 ) tf est s* (*,t 1 ) x idf est s* (t 1 ) C 3 C 1 C 9 C 2 tf est s* (*,t 2 ) x idf est s* (t 2 ) C 5 C 2 C 6 C 1 tf est s* (*,t l ) x idf est s* (t l ) C 6 C 3 C 1 C 8 C 4 C 6 C 1 C 8 C 7
Query Answering Module ,[object Object],[object Object],[object Object],[object Object],[object Object]
Conclusion ,[object Object],[object Object],[object Object],[object Object],[object Object]
Thank You & Questions!

Más contenido relacionado

La actualidad más candente

Simulation of Compiler Phases
Simulation of Compiler PhasesSimulation of Compiler Phases
Simulation of Compiler Phases
Associate Professor in VSB Coimbatore
 

La actualidad más candente (17)

Document Classification Using Expectation Maximization with Semi Supervised L...
Document Classification Using Expectation Maximization with Semi Supervised L...Document Classification Using Expectation Maximization with Semi Supervised L...
Document Classification Using Expectation Maximization with Semi Supervised L...
 
Profile Analysis of Users in Data Analytics Domain
Profile Analysis of   Users in Data Analytics DomainProfile Analysis of   Users in Data Analytics Domain
Profile Analysis of Users in Data Analytics Domain
 
IRJET- Review on Information Retrieval for Desktop Search Engine
IRJET-  	  Review on Information Retrieval for Desktop Search EngineIRJET-  	  Review on Information Retrieval for Desktop Search Engine
IRJET- Review on Information Retrieval for Desktop Search Engine
 
Cross Domain Recommender System using Machine Learning and Transferable Knowl...
Cross Domain Recommender System using Machine Learning and Transferable Knowl...Cross Domain Recommender System using Machine Learning and Transferable Knowl...
Cross Domain Recommender System using Machine Learning and Transferable Knowl...
 
IRJET- Text Document Clustering using K-Means Algorithm
IRJET-  	  Text Document Clustering using K-Means Algorithm IRJET-  	  Text Document Clustering using K-Means Algorithm
IRJET- Text Document Clustering using K-Means Algorithm
 
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATAEFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
 
IRJET- Comparative Study of Classification Algorithms for Sentiment Analy...
IRJET-  	  Comparative Study of Classification Algorithms for Sentiment Analy...IRJET-  	  Comparative Study of Classification Algorithms for Sentiment Analy...
IRJET- Comparative Study of Classification Algorithms for Sentiment Analy...
 
Efficient Refining Of Why-Not Questions on Top-K Queries
Efficient Refining Of Why-Not Questions on Top-K QueriesEfficient Refining Of Why-Not Questions on Top-K Queries
Efficient Refining Of Why-Not Questions on Top-K Queries
 
Big-data analytics: challenges and opportunities
Big-data analytics: challenges and opportunitiesBig-data analytics: challenges and opportunities
Big-data analytics: challenges and opportunities
 
Active learning
Active learningActive learning
Active learning
 
A Scalable Approach for Efficiently Generating Structured Dataset Topic Profiles
A Scalable Approach for Efficiently Generating Structured Dataset Topic ProfilesA Scalable Approach for Efficiently Generating Structured Dataset Topic Profiles
A Scalable Approach for Efficiently Generating Structured Dataset Topic Profiles
 
Simulation of Compiler Phases
Simulation of Compiler PhasesSimulation of Compiler Phases
Simulation of Compiler Phases
 
Learning for Big Data-林軒田
Learning for Big Data-林軒田Learning for Big Data-林軒田
Learning for Big Data-林軒田
 
Active learning
Active learningActive learning
Active learning
 
Feature Importance Analysis with XGBoost in Tax audit
Feature Importance Analysis with XGBoost in Tax auditFeature Importance Analysis with XGBoost in Tax audit
Feature Importance Analysis with XGBoost in Tax audit
 
Introduction into machine learning
Introduction into machine learningIntroduction into machine learning
Introduction into machine learning
 
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
 

Destacado

Diversity
DiversityDiversity
Diversity
Telnet
 
Effective XML Keyword Search with Relevance Oriented Ranking
Effective XML Keyword Search with Relevance Oriented RankingEffective XML Keyword Search with Relevance Oriented Ranking
Effective XML Keyword Search with Relevance Oriented Ranking
Fudan University
 

Destacado (7)

Diversity
DiversityDiversity
Diversity
 
Effective XML Keyword Search with Relevance Oriented Ranking
Effective XML Keyword Search with Relevance Oriented RankingEffective XML Keyword Search with Relevance Oriented Ranking
Effective XML Keyword Search with Relevance Oriented Ranking
 
Xml toc 1 day
Xml toc   1 dayXml toc   1 day
Xml toc 1 day
 
Algorithms for query result diversification
Algorithms for query result diversificationAlgorithms for query result diversification
Algorithms for query result diversification
 
Working With XML in IDS Applications
Working With XML in IDS ApplicationsWorking With XML in IDS Applications
Working With XML in IDS Applications
 
Keyword Search over RDF Graphs
Keyword Search over RDF GraphsKeyword Search over RDF Graphs
Keyword Search over RDF Graphs
 
Clustering
ClusteringClustering
Clustering
 

Similar a Research Opportunities in India & Keyword Search Over Dynamic Categorized Information

Module Overview Careers in Analytics In this module, we .docx
Module Overview  Careers in Analytics In this module, we .docxModule Overview  Careers in Analytics In this module, we .docx
Module Overview Careers in Analytics In this module, we .docx
audeleypearl
 
Module Overview Careers in Analytics In this module, we .docx
Module Overview  Careers in Analytics In this module, we .docxModule Overview  Careers in Analytics In this module, we .docx
Module Overview Careers in Analytics In this module, we .docx
roushhsiu
 
Assignment Select a company for which you have a working knowledg.docx
Assignment Select a company for which you have a working knowledg.docxAssignment Select a company for which you have a working knowledg.docx
Assignment Select a company for which you have a working knowledg.docx
ssuser562afc1
 
Feasibility Analysis.ppt
Feasibility Analysis.pptFeasibility Analysis.ppt
Feasibility Analysis.ppt
BetshaTizazu2
 
Lesson 2 data preprocessing
Lesson 2   data preprocessingLesson 2   data preprocessing
Lesson 2 data preprocessing
AbdurRazzaqe1
 
Guide for a Data Scientist
Guide for a Data ScientistGuide for a Data Scientist
Guide for a Data Scientist
Rohit Dubey
 
IT 510 Final Project Guidelines and Rubric Overview .docx
IT 510 Final Project Guidelines and Rubric  Overview .docxIT 510 Final Project Guidelines and Rubric  Overview .docx
IT 510 Final Project Guidelines and Rubric Overview .docx
priestmanmable
 

Similar a Research Opportunities in India & Keyword Search Over Dynamic Categorized Information (20)

Module Overview Careers in Analytics In this module, we .docx
Module Overview  Careers in Analytics In this module, we .docxModule Overview  Careers in Analytics In this module, we .docx
Module Overview Careers in Analytics In this module, we .docx
 
Module Overview Careers in Analytics In this module, we .docx
Module Overview  Careers in Analytics In this module, we .docxModule Overview  Careers in Analytics In this module, we .docx
Module Overview Careers in Analytics In this module, we .docx
 
Assignment Select a company for which you have a working knowledg.docx
Assignment Select a company for which you have a working knowledg.docxAssignment Select a company for which you have a working knowledg.docx
Assignment Select a company for which you have a working knowledg.docx
 
Data Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdfData Analytics Course In Surat.pdf
Data Analytics Course In Surat.pdf
 
Predicting the relevance of search results for e-commerce systems
Predicting the relevance of search results for e-commerce systemsPredicting the relevance of search results for e-commerce systems
Predicting the relevance of search results for e-commerce systems
 
EE-232-LEC-01 Data_structures.pptx
EE-232-LEC-01 Data_structures.pptxEE-232-LEC-01 Data_structures.pptx
EE-232-LEC-01 Data_structures.pptx
 
ML Framework for auto-responding to customer support queries
ML Framework for auto-responding to customer support queriesML Framework for auto-responding to customer support queries
ML Framework for auto-responding to customer support queries
 
Feasibility Analysis.ppt
Feasibility Analysis.pptFeasibility Analysis.ppt
Feasibility Analysis.ppt
 
Self Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docxSelf Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docx
 
applications and advantages of python
applications and advantages of pythonapplications and advantages of python
applications and advantages of python
 
Cis 515 Effective Communication-snaptutorial.com
Cis 515 Effective Communication-snaptutorial.comCis 515 Effective Communication-snaptutorial.com
Cis 515 Effective Communication-snaptutorial.com
 
Advances in Exploratory Data Analysis, Visualisation and Quality for Data Cen...
Advances in Exploratory Data Analysis, Visualisation and Quality for Data Cen...Advances in Exploratory Data Analysis, Visualisation and Quality for Data Cen...
Advances in Exploratory Data Analysis, Visualisation and Quality for Data Cen...
 
Understanding Knowledge Services
Understanding Knowledge ServicesUnderstanding Knowledge Services
Understanding Knowledge Services
 
Lesson 2 data preprocessing
Lesson 2   data preprocessingLesson 2   data preprocessing
Lesson 2 data preprocessing
 
Lec1
Lec1Lec1
Lec1
 
Lec1
Lec1Lec1
Lec1
 
Guide for a Data Scientist
Guide for a Data ScientistGuide for a Data Scientist
Guide for a Data Scientist
 
CIS 515 Enhance teaching / snaptutorial.com
CIS 515 Enhance teaching / snaptutorial.com CIS 515 Enhance teaching / snaptutorial.com
CIS 515 Enhance teaching / snaptutorial.com
 
CIS 515 Education Organization / snaptutorial.com
CIS 515 Education Organization / snaptutorial.comCIS 515 Education Organization / snaptutorial.com
CIS 515 Education Organization / snaptutorial.com
 
IT 510 Final Project Guidelines and Rubric Overview .docx
IT 510 Final Project Guidelines and Rubric  Overview .docxIT 510 Final Project Guidelines and Rubric  Overview .docx
IT 510 Final Project Guidelines and Rubric Overview .docx
 

Más de VNIT-ACM Student Chapter

Más de VNIT-ACM Student Chapter (12)

An approach to Programming Contests with C++
An approach to Programming Contests with C++An approach to Programming Contests with C++
An approach to Programming Contests with C++
 
An introduction to Reverse Engineering
An introduction to Reverse EngineeringAn introduction to Reverse Engineering
An introduction to Reverse Engineering
 
Introduction to the OSI 7 layer model and Data Link Layer
Introduction to the OSI 7 layer model and Data Link LayerIntroduction to the OSI 7 layer model and Data Link Layer
Introduction to the OSI 7 layer model and Data Link Layer
 
Research Opportunities in the United States
Research Opportunities in the United StatesResearch Opportunities in the United States
Research Opportunities in the United States
 
How web searching engines work
How web searching engines workHow web searching engines work
How web searching engines work
 
Hadoop Map Reduce
Hadoop Map ReduceHadoop Map Reduce
Hadoop Map Reduce
 
Web Designing
Web DesigningWeb Designing
Web Designing
 
Inaugural Session
Inaugural SessionInaugural Session
Inaugural Session
 
Hacking - Web based attacks
Hacking - Web based attacksHacking - Web based attacks
Hacking - Web based attacks
 
Computers and Algorithms - What can they do and what can they not?
Computers and Algorithms - What can they do and what can they not?Computers and Algorithms - What can they do and what can they not?
Computers and Algorithms - What can they do and what can they not?
 
Foundations of Programming Part II
Foundations of Programming Part IIFoundations of Programming Part II
Foundations of Programming Part II
 
Foundations of Programming Part I
Foundations of Programming Part IFoundations of Programming Part I
Foundations of Programming Part I
 

Último

Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Último (20)

This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 

Research Opportunities in India & Keyword Search Over Dynamic Categorized Information

  • 1. Software Industry in India and Keyword Search Over Dynamic Categorized Information Manish Bhide [email_address]
  • 2.
  • 3.
  • 4.
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11. Keyword Search Over Dynamic Categorized Information Joint work with: Venkatesan Chakravarthy, Krithi Ramamritham and Prasan Roy
  • 12.
  • 13.
  • 14.
  • 15.
  • 16.
  • 17.
  • 18.
  • 19.
  • 20.
  • 21.
  • 22.
  • 23.
  • 24.
  • 25.
  • 26.
  • 27.
  • 28.
  • 29.
  • 30. Thank You & Questions!