SlideShare una empresa de Scribd logo
1 de 24
Descargar para leer sin conexión
Big Data made Small



What enterprises do with Big Data- Part 1



                                                                  1
           © PromptCloud Technologies 2013, All rights reserved
Notes
a) This is a list of simplified requirement
   statements from ongoing/past projects (not
   verbatim and in no specific order).

b) You can easily assume that data delivered was in a
   structured format.




                                                                      2
               © PromptCloud Technologies 2013, All rights reserved
1. Collect data from Twitter every 5 minutes from
   specific geographies and categories based on a set
   of keywords. Our USP is social listening and this
   data is a crucial part of our business.




                                                                      3
               © PromptCloud Technologies 2013, All rights reserved
2. We talk social travel and so are building an
   intelligent social engine for finding and booking
   hotels. In order to get this running, we'd like you
   to provide reviews, the reviewer profiles, hotel
   and restaurant addresses from these popular
   travel sites and forums. Data could be in the range
   of tens of millions.




                                                                      4
               © PromptCloud Technologies 2013, All rights reserved
3. We're in the brand monitoring zone. So we'd like
   you to first collect all reviews belonging to say
   Nike, and then index it for us so that we could see
   what people are saying. If you could give us query
   formats using which say, I could see how many
   people said “bad” and how many said “good”, etc.
   that would be ideal.




                                                                      5
               © PromptCloud Technologies 2013, All rights reserved
4. Give me all the stories about my interest list of
   celebrities based on a list of keywords that I
   provide (like, eat, drink, travel....) from about 400
   sources and Twitter. I'm launching a celebrity
   gossip website in multiple countries.




                                                                       6
                © PromptCloud Technologies 2013, All rights reserved
5. Get me all products with all its fields (name,
   descriptions, price, specs) present on these
   supermarket stores that are all AJAX. And sorry!
   They are in Hebrew.




                                                                      7
               © PromptCloud Technologies 2013, All rights reserved
6. I'm so obsessed with near-real time data (sorry I
   belong to media) that I need news of any deals,
   acquisitions, mergers, or any other news based on
   the phrases I provide to you and expect the feed
   within minutes of the same being published
   somewhere.




                                                                      8
               © PromptCloud Technologies 2013, All rights reserved
7. We are in the used car inventory space. There are
   few platforms that automobile sites use for a set
   of cars. Please get me data from such places on a
   daily basis at these particular times in the day. I
   need both English and French data and I'm
   interested in both XML and CSV formats.




                                                                       9
                © PromptCloud Technologies 2013, All rights reserved
8. I need data from all
   the tech support
   discussion forums
   from all of these sites
   in this particular
   format. I expect
   about 2 million
   records from here.




                                                                       10
                © PromptCloud Technologies 2013, All rights reserved
9. We are looking for laptop reviews that have these
   operating systems. This is our initial list of sources
   and would be great if you could come up with
   others for us. We desire weekly updates of these
   reviews.




                                                                       11
                © PromptCloud Technologies 2013, All rights reserved
10. You get me all high resolution images from these
    200 web stores so that whenever a user
    bookmarks a product on my website, my
    algorithm can show them those related products
    to compare prices and eventually make a buying
    decision.




                                                                      12
               © PromptCloud Technologies 2013, All rights reserved
11. We offer great
    discounts on tickets for
    events and games.
    Please crawl these
    ticketing sites for us so
    we have an inventory
    of all events with their
    seating-level prices
    which we can use to
    offer discounts and
    run our analyses.

                                                                        13
                 © PromptCloud Technologies 2013, All rights reserved
12. We acquire new and updated
    ‘Pending Legislation’ (as it is being debated in
    Govt. prior to becoming a Bill/Law) documents
    and extract associated metadata from legislature
    websites. The document metadata may be
    available on the web page where we download it
    from, or within the document – which may be in
    HTML or binary formats – e.g. Word, PDF. We
    need to extract all of this data for our clients.




                                                                      14
               © PromptCloud Technologies 2013, All rights reserved
13. We're in the video-gaming industry where we
    perform research on video games and provide
    consulting. We need to extract data daily from all
    popular gaming sites and gather news, articles and
    their popularity.




                                                                      15
               © PromptCloud Technologies 2013, All rights reserved
14. Get me product feeds from the Indian E-
    commerce market with all product-level details
    and specifications. I need this to build an analytics
    engine.




                                                                       16
                © PromptCloud Technologies 2013, All rights reserved
15. We'd be interested in
    these job portal sites and
    would like to receive
    updates on a daily basis on
    the jobs posted in our
    country. We're developing
    solutions for the digital
    classifieds markets.




                                                                       17
                © PromptCloud Technologies 2013, All rights reserved
16. We are a social strategy and analytics firm and
    need lot of data to do some social data mining.
    We have about 200 sites which are a mix of blogs,
    news, forums, articles, travel sites and others,
    many of which are non-English. Please extract
    data as you find relevant to the domain. We need
    updates on a daily basis.




                                                                      18
               © PromptCloud Technologies 2013, All rights reserved
17. Our clients (who are large-scale manufacturing
    companies) would like to see how their high-value
    products are doing in the market. So we'll need
    reviews of this list of products including review
    date, author, content, review helpful,
    recommends and other such details from these
    set of sites.




                                                                      19
               © PromptCloud Technologies 2013, All rights reserved
18. I'm developing a comparison shopping engine that
    I can feed in data from other sources and have my
    users compare and shop and see some price
    trends. Please facilitate this data.




                                                                      20
               © PromptCloud Technologies 2013, All rights reserved
19. I am in the healthcare industry looking to create
    an inventory of all healthcare-related products
    from these stores. I need to go to the last level of
    detail and capture everything possible.




                                                                       21
                © PromptCloud Technologies 2013, All rights reserved
20. We're interested in creating a database of all
    companies in India that are less than x years old,
    greater than y in revenue, belong to these
    industries and provide these specific services.




                                                                       22
                © PromptCloud Technologies 2013, All rights reserved
We like to remind “Why Us”?
          Making big data small to alleviate tech-aches


•Low ETA’s                                                                 •Highly Scalable
                              •Flexible Pricing
•Precision                                                                 •Access to real-
                              based on size and
Extraction                                                                 time data
                              frequency of
•Exhaustive data
                              crawls
available as feed




Performance                         Price                                    Technology       23
                    © PromptCloud Technologies 2013, All rights reserved
Watch out for the next batch…

         For details, contact
   Email: info@promptcloud.com
    Phone: +91-96 86 56 70 70




                                                          24
   © PromptCloud Technologies 2013, All rights reserved

Más contenido relacionado

Similar a What enterprises do with big data- Part 1

Conf2013 bchristensen thebig_t
Conf2013 bchristensen thebig_tConf2013 bchristensen thebig_t
Conf2013 bchristensen thebig_tBeau Christensen
 
The LCG Digital Transformation Maturity Model
The LCG Digital Transformation Maturity ModelThe LCG Digital Transformation Maturity Model
The LCG Digital Transformation Maturity ModelLima Consulting Group
 
Identity Live Sydney: Intelligent Authentication
Identity Live Sydney: Intelligent Authentication Identity Live Sydney: Intelligent Authentication
Identity Live Sydney: Intelligent Authentication ForgeRock
 
Intelligent Authentication (Identity Live Berlin 2018)
Intelligent Authentication  (Identity Live Berlin 2018)Intelligent Authentication  (Identity Live Berlin 2018)
Intelligent Authentication (Identity Live Berlin 2018)ForgeRock
 
Big data meetup_10_9_2013
Big data meetup_10_9_2013Big data meetup_10_9_2013
Big data meetup_10_9_2013Tanya Cashorali
 
The Automotive Journey Into the Cloud
The Automotive Journey Into the CloudThe Automotive Journey Into the Cloud
The Automotive Journey Into the CloudEmtec Inc.
 
The Automotive Journey Into the Cloud
The Automotive Journey Into the CloudThe Automotive Journey Into the Cloud
The Automotive Journey Into the CloudKim Pike
 
Accelerating breakthrough business technologies in atlanta, tag featured spea...
Accelerating breakthrough business technologies in atlanta, tag featured spea...Accelerating breakthrough business technologies in atlanta, tag featured spea...
Accelerating breakthrough business technologies in atlanta, tag featured spea...Melanie Brandt
 
AI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPT
AI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPTAI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPT
AI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPTCprime
 
Johannes Zijlstra - Sitecore 9 and GDPR
Johannes Zijlstra - Sitecore 9 and GDPRJohannes Zijlstra - Sitecore 9 and GDPR
Johannes Zijlstra - Sitecore 9 and GDPRSagittarius
 
Bracing for the Big One: B2B Systems and Processes that Think for Themselves!
Bracing for the Big One: B2B Systems and Processes that Think for Themselves!Bracing for the Big One: B2B Systems and Processes that Think for Themselves!
Bracing for the Big One: B2B Systems and Processes that Think for Themselves!Jack Shaw
 
CASE STUDY - Ironclad Messaging & Secure App Dev for Regulated Industries
CASE STUDY - Ironclad Messaging & Secure App Dev for Regulated IndustriesCASE STUDY - Ironclad Messaging & Secure App Dev for Regulated Industries
CASE STUDY - Ironclad Messaging & Secure App Dev for Regulated IndustriesNowSecure
 
Theresa Regli Content Management Strategies for a multi-platform world
Theresa Regli Content Management Strategies for a multi-platform worldTheresa Regli Content Management Strategies for a multi-platform world
Theresa Regli Content Management Strategies for a multi-platform worldIncisive_Events
 
Top 10 tech trends 2014
Top 10 tech trends 2014Top 10 tech trends 2014
Top 10 tech trends 2014Irene Ventayol
 
Cisco IoT World Forum 2014: Airwatch Breakout Session
Cisco IoT World Forum 2014: Airwatch Breakout SessionCisco IoT World Forum 2014: Airwatch Breakout Session
Cisco IoT World Forum 2014: Airwatch Breakout SessionBasil Hashem
 
Databases, CAP, ACID, BASE, NoSQL... oh my!
Databases, CAP, ACID, BASE, NoSQL... oh my!Databases, CAP, ACID, BASE, NoSQL... oh my!
Databases, CAP, ACID, BASE, NoSQL... oh my!DATAVERSITY
 
Validus investor pitch deck 03212014 rjc updates
Validus investor pitch deck 03212014 rjc updatesValidus investor pitch deck 03212014 rjc updates
Validus investor pitch deck 03212014 rjc updatesRick Catalano
 
Arvizio MR Studio Overview
Arvizio MR Studio OverviewArvizio MR Studio Overview
Arvizio MR Studio OverviewJonathan Reeves
 
Demystifying the Mobile Container - PART I
Demystifying the Mobile Container - PART IDemystifying the Mobile Container - PART I
Demystifying the Mobile Container - PART IRelayware
 

Similar a What enterprises do with big data- Part 1 (20)

Conf2013 bchristensen thebig_t
Conf2013 bchristensen thebig_tConf2013 bchristensen thebig_t
Conf2013 bchristensen thebig_t
 
The LCG Digital Transformation Maturity Model
The LCG Digital Transformation Maturity ModelThe LCG Digital Transformation Maturity Model
The LCG Digital Transformation Maturity Model
 
Identity Live Sydney: Intelligent Authentication
Identity Live Sydney: Intelligent Authentication Identity Live Sydney: Intelligent Authentication
Identity Live Sydney: Intelligent Authentication
 
Intelligent Authentication (Identity Live Berlin 2018)
Intelligent Authentication  (Identity Live Berlin 2018)Intelligent Authentication  (Identity Live Berlin 2018)
Intelligent Authentication (Identity Live Berlin 2018)
 
Big data meetup_10_9_2013
Big data meetup_10_9_2013Big data meetup_10_9_2013
Big data meetup_10_9_2013
 
The Automotive Journey Into the Cloud
The Automotive Journey Into the CloudThe Automotive Journey Into the Cloud
The Automotive Journey Into the Cloud
 
The Automotive Journey Into the Cloud
The Automotive Journey Into the CloudThe Automotive Journey Into the Cloud
The Automotive Journey Into the Cloud
 
Accelerating breakthrough business technologies in atlanta, tag featured spea...
Accelerating breakthrough business technologies in atlanta, tag featured spea...Accelerating breakthrough business technologies in atlanta, tag featured spea...
Accelerating breakthrough business technologies in atlanta, tag featured spea...
 
AI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPT
AI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPTAI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPT
AI for Everyone: Demystifying Large Language Models (LLMs) Like ChatGPT
 
Johannes Zijlstra - Sitecore 9 and GDPR
Johannes Zijlstra - Sitecore 9 and GDPRJohannes Zijlstra - Sitecore 9 and GDPR
Johannes Zijlstra - Sitecore 9 and GDPR
 
Bracing for the Big One: B2B Systems and Processes that Think for Themselves!
Bracing for the Big One: B2B Systems and Processes that Think for Themselves!Bracing for the Big One: B2B Systems and Processes that Think for Themselves!
Bracing for the Big One: B2B Systems and Processes that Think for Themselves!
 
CASE STUDY - Ironclad Messaging & Secure App Dev for Regulated Industries
CASE STUDY - Ironclad Messaging & Secure App Dev for Regulated IndustriesCASE STUDY - Ironclad Messaging & Secure App Dev for Regulated Industries
CASE STUDY - Ironclad Messaging & Secure App Dev for Regulated Industries
 
Theresa Regli Content Management Strategies for a multi-platform world
Theresa Regli Content Management Strategies for a multi-platform worldTheresa Regli Content Management Strategies for a multi-platform world
Theresa Regli Content Management Strategies for a multi-platform world
 
Top 10 tech trends 2014
Top 10 tech trends 2014Top 10 tech trends 2014
Top 10 tech trends 2014
 
Cisco IoT World Forum 2014: Airwatch Breakout Session
Cisco IoT World Forum 2014: Airwatch Breakout SessionCisco IoT World Forum 2014: Airwatch Breakout Session
Cisco IoT World Forum 2014: Airwatch Breakout Session
 
Databases, CAP, ACID, BASE, NoSQL... oh my!
Databases, CAP, ACID, BASE, NoSQL... oh my!Databases, CAP, ACID, BASE, NoSQL... oh my!
Databases, CAP, ACID, BASE, NoSQL... oh my!
 
Validus investor pitch deck 03212014 rjc updates
Validus investor pitch deck 03212014 rjc updatesValidus investor pitch deck 03212014 rjc updates
Validus investor pitch deck 03212014 rjc updates
 
Adstuck United States
Adstuck United States  Adstuck United States
Adstuck United States
 
Arvizio MR Studio Overview
Arvizio MR Studio OverviewArvizio MR Studio Overview
Arvizio MR Studio Overview
 
Demystifying the Mobile Container - PART I
Demystifying the Mobile Container - PART IDemystifying the Mobile Container - PART I
Demystifying the Mobile Container - PART I
 

Más de PromptCloud

Big Data’s Potential for the Real Estate Industry: 2021
Big Data’s Potential for the Real Estate Industry: 2021Big Data’s Potential for the Real Estate Industry: 2021
Big Data’s Potential for the Real Estate Industry: 2021PromptCloud
 
All You Need to Know About Web Crawling.pdf
All You Need to Know About Web Crawling.pdfAll You Need to Know About Web Crawling.pdf
All You Need to Know About Web Crawling.pdfPromptCloud
 
Web Scraping Myths vs. Facts
Web Scraping Myths vs. FactsWeb Scraping Myths vs. Facts
Web Scraping Myths vs. FactsPromptCloud
 
Octoparse competitors.pdf
Octoparse competitors.pdfOctoparse competitors.pdf
Octoparse competitors.pdfPromptCloud
 
Parsehub and competitior ppt.pptx
Parsehub and competitior ppt.pptxParsehub and competitior ppt.pptx
Parsehub and competitior ppt.pptxPromptCloud
 
Product Visibility- What Is Seen First, Will ppt.pptx
Product Visibility- What Is Seen First, Will ppt.pptxProduct Visibility- What Is Seen First, Will ppt.pptx
Product Visibility- What Is Seen First, Will ppt.pptxPromptCloud
 
Data Trends in Fashion Industry
Data Trends in Fashion IndustryData Trends in Fashion Industry
Data Trends in Fashion IndustryPromptCloud
 
Data Standardization with Web Data Integration
Data Standardization with Web Data Integration Data Standardization with Web Data Integration
Data Standardization with Web Data Integration PromptCloud
 
Visualizing Marvel Cinematic Universe Movies
Visualizing Marvel Cinematic Universe MoviesVisualizing Marvel Cinematic Universe Movies
Visualizing Marvel Cinematic Universe MoviesPromptCloud
 
15 Key Metrics Every E-commerce Business Should Track
15 Key Metrics Every E-commerce Business Should Track15 Key Metrics Every E-commerce Business Should Track
15 Key Metrics Every E-commerce Business Should TrackPromptCloud
 
Top Amazon Services for Ecommerce Players
Top Amazon Services for Ecommerce PlayersTop Amazon Services for Ecommerce Players
Top Amazon Services for Ecommerce PlayersPromptCloud
 
The Birth of a Web Crawling Bot
The Birth of a Web Crawling BotThe Birth of a Web Crawling Bot
The Birth of a Web Crawling BotPromptCloud
 
Upcoming Applications of Artificial intelligence in 2019
Upcoming Applications of Artificial intelligence in 2019Upcoming Applications of Artificial intelligence in 2019
Upcoming Applications of Artificial intelligence in 2019PromptCloud
 
Zipcode based price benchmarking for retailers
Zipcode based price benchmarking for retailersZipcode based price benchmarking for retailers
Zipcode based price benchmarking for retailersPromptCloud
 
Analyzing Positiveness in 160+ Holiday Songs
Analyzing Positiveness in 160+ Holiday SongsAnalyzing Positiveness in 160+ Holiday Songs
Analyzing Positiveness in 160+ Holiday SongsPromptCloud
 
PromptCloud's Year in Review - 2019
PromptCloud's Year in Review - 2019PromptCloud's Year in Review - 2019
PromptCloud's Year in Review - 2019PromptCloud
 
Top Data Analytics Trends for 2019
Top Data Analytics Trends for 2019Top Data Analytics Trends for 2019
Top Data Analytics Trends for 2019PromptCloud
 
10 Mobile App Ideas that can be Fueled by Web Scraping
10 Mobile App Ideas that can be Fueled by Web Scraping10 Mobile App Ideas that can be Fueled by Web Scraping
10 Mobile App Ideas that can be Fueled by Web ScrapingPromptCloud
 
How Web Scraping Can Help Affiliate Marketers
How Web Scraping Can Help Affiliate MarketersHow Web Scraping Can Help Affiliate Marketers
How Web Scraping Can Help Affiliate MarketersPromptCloud
 
Hotel Review Data Analysis
Hotel Review Data AnalysisHotel Review Data Analysis
Hotel Review Data AnalysisPromptCloud
 

Más de PromptCloud (20)

Big Data’s Potential for the Real Estate Industry: 2021
Big Data’s Potential for the Real Estate Industry: 2021Big Data’s Potential for the Real Estate Industry: 2021
Big Data’s Potential for the Real Estate Industry: 2021
 
All You Need to Know About Web Crawling.pdf
All You Need to Know About Web Crawling.pdfAll You Need to Know About Web Crawling.pdf
All You Need to Know About Web Crawling.pdf
 
Web Scraping Myths vs. Facts
Web Scraping Myths vs. FactsWeb Scraping Myths vs. Facts
Web Scraping Myths vs. Facts
 
Octoparse competitors.pdf
Octoparse competitors.pdfOctoparse competitors.pdf
Octoparse competitors.pdf
 
Parsehub and competitior ppt.pptx
Parsehub and competitior ppt.pptxParsehub and competitior ppt.pptx
Parsehub and competitior ppt.pptx
 
Product Visibility- What Is Seen First, Will ppt.pptx
Product Visibility- What Is Seen First, Will ppt.pptxProduct Visibility- What Is Seen First, Will ppt.pptx
Product Visibility- What Is Seen First, Will ppt.pptx
 
Data Trends in Fashion Industry
Data Trends in Fashion IndustryData Trends in Fashion Industry
Data Trends in Fashion Industry
 
Data Standardization with Web Data Integration
Data Standardization with Web Data Integration Data Standardization with Web Data Integration
Data Standardization with Web Data Integration
 
Visualizing Marvel Cinematic Universe Movies
Visualizing Marvel Cinematic Universe MoviesVisualizing Marvel Cinematic Universe Movies
Visualizing Marvel Cinematic Universe Movies
 
15 Key Metrics Every E-commerce Business Should Track
15 Key Metrics Every E-commerce Business Should Track15 Key Metrics Every E-commerce Business Should Track
15 Key Metrics Every E-commerce Business Should Track
 
Top Amazon Services for Ecommerce Players
Top Amazon Services for Ecommerce PlayersTop Amazon Services for Ecommerce Players
Top Amazon Services for Ecommerce Players
 
The Birth of a Web Crawling Bot
The Birth of a Web Crawling BotThe Birth of a Web Crawling Bot
The Birth of a Web Crawling Bot
 
Upcoming Applications of Artificial intelligence in 2019
Upcoming Applications of Artificial intelligence in 2019Upcoming Applications of Artificial intelligence in 2019
Upcoming Applications of Artificial intelligence in 2019
 
Zipcode based price benchmarking for retailers
Zipcode based price benchmarking for retailersZipcode based price benchmarking for retailers
Zipcode based price benchmarking for retailers
 
Analyzing Positiveness in 160+ Holiday Songs
Analyzing Positiveness in 160+ Holiday SongsAnalyzing Positiveness in 160+ Holiday Songs
Analyzing Positiveness in 160+ Holiday Songs
 
PromptCloud's Year in Review - 2019
PromptCloud's Year in Review - 2019PromptCloud's Year in Review - 2019
PromptCloud's Year in Review - 2019
 
Top Data Analytics Trends for 2019
Top Data Analytics Trends for 2019Top Data Analytics Trends for 2019
Top Data Analytics Trends for 2019
 
10 Mobile App Ideas that can be Fueled by Web Scraping
10 Mobile App Ideas that can be Fueled by Web Scraping10 Mobile App Ideas that can be Fueled by Web Scraping
10 Mobile App Ideas that can be Fueled by Web Scraping
 
How Web Scraping Can Help Affiliate Marketers
How Web Scraping Can Help Affiliate MarketersHow Web Scraping Can Help Affiliate Marketers
How Web Scraping Can Help Affiliate Marketers
 
Hotel Review Data Analysis
Hotel Review Data AnalysisHotel Review Data Analysis
Hotel Review Data Analysis
 

Último

Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 

Último (20)

Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 

What enterprises do with big data- Part 1

  • 1. Big Data made Small What enterprises do with Big Data- Part 1 1 © PromptCloud Technologies 2013, All rights reserved
  • 2. Notes a) This is a list of simplified requirement statements from ongoing/past projects (not verbatim and in no specific order). b) You can easily assume that data delivered was in a structured format. 2 © PromptCloud Technologies 2013, All rights reserved
  • 3. 1. Collect data from Twitter every 5 minutes from specific geographies and categories based on a set of keywords. Our USP is social listening and this data is a crucial part of our business. 3 © PromptCloud Technologies 2013, All rights reserved
  • 4. 2. We talk social travel and so are building an intelligent social engine for finding and booking hotels. In order to get this running, we'd like you to provide reviews, the reviewer profiles, hotel and restaurant addresses from these popular travel sites and forums. Data could be in the range of tens of millions. 4 © PromptCloud Technologies 2013, All rights reserved
  • 5. 3. We're in the brand monitoring zone. So we'd like you to first collect all reviews belonging to say Nike, and then index it for us so that we could see what people are saying. If you could give us query formats using which say, I could see how many people said “bad” and how many said “good”, etc. that would be ideal. 5 © PromptCloud Technologies 2013, All rights reserved
  • 6. 4. Give me all the stories about my interest list of celebrities based on a list of keywords that I provide (like, eat, drink, travel....) from about 400 sources and Twitter. I'm launching a celebrity gossip website in multiple countries. 6 © PromptCloud Technologies 2013, All rights reserved
  • 7. 5. Get me all products with all its fields (name, descriptions, price, specs) present on these supermarket stores that are all AJAX. And sorry! They are in Hebrew. 7 © PromptCloud Technologies 2013, All rights reserved
  • 8. 6. I'm so obsessed with near-real time data (sorry I belong to media) that I need news of any deals, acquisitions, mergers, or any other news based on the phrases I provide to you and expect the feed within minutes of the same being published somewhere. 8 © PromptCloud Technologies 2013, All rights reserved
  • 9. 7. We are in the used car inventory space. There are few platforms that automobile sites use for a set of cars. Please get me data from such places on a daily basis at these particular times in the day. I need both English and French data and I'm interested in both XML and CSV formats. 9 © PromptCloud Technologies 2013, All rights reserved
  • 10. 8. I need data from all the tech support discussion forums from all of these sites in this particular format. I expect about 2 million records from here. 10 © PromptCloud Technologies 2013, All rights reserved
  • 11. 9. We are looking for laptop reviews that have these operating systems. This is our initial list of sources and would be great if you could come up with others for us. We desire weekly updates of these reviews. 11 © PromptCloud Technologies 2013, All rights reserved
  • 12. 10. You get me all high resolution images from these 200 web stores so that whenever a user bookmarks a product on my website, my algorithm can show them those related products to compare prices and eventually make a buying decision. 12 © PromptCloud Technologies 2013, All rights reserved
  • 13. 11. We offer great discounts on tickets for events and games. Please crawl these ticketing sites for us so we have an inventory of all events with their seating-level prices which we can use to offer discounts and run our analyses. 13 © PromptCloud Technologies 2013, All rights reserved
  • 14. 12. We acquire new and updated ‘Pending Legislation’ (as it is being debated in Govt. prior to becoming a Bill/Law) documents and extract associated metadata from legislature websites. The document metadata may be available on the web page where we download it from, or within the document – which may be in HTML or binary formats – e.g. Word, PDF. We need to extract all of this data for our clients. 14 © PromptCloud Technologies 2013, All rights reserved
  • 15. 13. We're in the video-gaming industry where we perform research on video games and provide consulting. We need to extract data daily from all popular gaming sites and gather news, articles and their popularity. 15 © PromptCloud Technologies 2013, All rights reserved
  • 16. 14. Get me product feeds from the Indian E- commerce market with all product-level details and specifications. I need this to build an analytics engine. 16 © PromptCloud Technologies 2013, All rights reserved
  • 17. 15. We'd be interested in these job portal sites and would like to receive updates on a daily basis on the jobs posted in our country. We're developing solutions for the digital classifieds markets. 17 © PromptCloud Technologies 2013, All rights reserved
  • 18. 16. We are a social strategy and analytics firm and need lot of data to do some social data mining. We have about 200 sites which are a mix of blogs, news, forums, articles, travel sites and others, many of which are non-English. Please extract data as you find relevant to the domain. We need updates on a daily basis. 18 © PromptCloud Technologies 2013, All rights reserved
  • 19. 17. Our clients (who are large-scale manufacturing companies) would like to see how their high-value products are doing in the market. So we'll need reviews of this list of products including review date, author, content, review helpful, recommends and other such details from these set of sites. 19 © PromptCloud Technologies 2013, All rights reserved
  • 20. 18. I'm developing a comparison shopping engine that I can feed in data from other sources and have my users compare and shop and see some price trends. Please facilitate this data. 20 © PromptCloud Technologies 2013, All rights reserved
  • 21. 19. I am in the healthcare industry looking to create an inventory of all healthcare-related products from these stores. I need to go to the last level of detail and capture everything possible. 21 © PromptCloud Technologies 2013, All rights reserved
  • 22. 20. We're interested in creating a database of all companies in India that are less than x years old, greater than y in revenue, belong to these industries and provide these specific services. 22 © PromptCloud Technologies 2013, All rights reserved
  • 23. We like to remind “Why Us”? Making big data small to alleviate tech-aches •Low ETA’s •Highly Scalable •Flexible Pricing •Precision •Access to real- based on size and Extraction time data frequency of •Exhaustive data crawls available as feed Performance Price Technology 23 © PromptCloud Technologies 2013, All rights reserved
  • 24. Watch out for the next batch… For details, contact Email: info@promptcloud.com Phone: +91-96 86 56 70 70 24 © PromptCloud Technologies 2013, All rights reserved