SlideShare una empresa de Scribd logo
1 de 14
Descargar para leer sin conexión
Elevator Pitch
An Innovative Big-Data Web Scraping Tech Compnay
Innovative Big-Data Web Scraping Tech Compnay 2
HIGHLIGHTS
v What is WebRobot?
v The Problem
v How We Can Solve It
v Team
v Track Record
v Business Model
v Trends & Opportunities
v Main Competitors
v Target
v SWOT Analysis
v Some Numbers (sales, profit, clients)
v Investment Plan
3
1. THE PROJECT
Description
 WebRobot ltd is a London-based company that operates in the web scraping and web
mining industry in which it aims to become the leader.
 In WebRobot we are building a super scalable infrastructure for data acquisition that
customers can use as a web service. It exploits cloud computing and big-data technologies,
as well as data-extraction and information-extraction algorithms.
 WebRobot will be a great ally to every company that needs to acquire this heterogeneous
network of information and wants to reduce its internal management costs. WebRobot’s
services will represent a strategic resource essential to its business success.
Innovative Big-Data Web Scraping Tech Compnay
4
1. THE PROJECT
The problem
Every company wishing to achieve, keep and improve its business success needs information (data)
on both the market, customers, and competitors, but this is challenging.
It must get good, reliable, and well-organized data. In addition, it needs to manage them properly.
The World Wide Web is made up of a huge amount of semi-structured and unstructured data.
Furthermore, it constantly changes its structure.
The cost to collect all of these data is often very expensive.
For all these reasons, we need robust and scalable algorithms that can reduce this onerous
maintenance activity.
Innovative Big-Data Web Scraping Tech Compnay
5
1. THE PROJECT
How We Can Solve the Problem
We can guarantee algorithmic and structural scalability with automatic extraction features.
We offer a powerful solution in the form of a web service.
We integrate cloud computing with big-data technologies applied in the more general web mining
context.
We use visual support tools and SDK to connect to our stack.
WebRobot’s goal is to become a complete ETL service involving data extraction,
web mining, machine learning, and big-data analytics.
Innovative Big-Data Web Scraping Tech Compnay
6
2. THE TEAM
CEO, CTO
Roger Giuffrè
71% of Equity
Mediterraneo
Capital Ltd
25% of Equity
CCO, CMO
Denis Giuffrè
4% of Equity
CFO
Antonio
Censabella
Roger Giuffrè
Denis Giuffrè Antonio Bensabella
MEDITERRANEO CAPITAL LTD
Innovative Big-Data Web Scraping Tech Compnay
7
3. TRACK RECORD
We are finalizing the first version of the web service which will include the serverless version on the
Lambda technology and Amazon EMR.
We need to integrate the wrapper induction algorithms directly into the spark context. This will help us
refine them with the latest academic findings.
API implementation is fundamentally finished. We have to complete the usability studies of the current
interface.
We need to complete the dashboard that will be released under an open-source license.
We have to design visual tools to support the ETL that has to be generated.
We have a new grammar to set up for the query.
Innovative Big-Data Web Scraping Tech Compnay
8
4. THE BUSINESS MODEL
The Strategy
We will release the service on the Amazon marketplace, available in three commercial packages:
Entry-Level, Professional, and Enterprise.
Our average selling price could be around 0.0008 Euro per page scraped, but we will make a
distinction between static and dynamic pages that need complex algorithms.
We have verified that the execution costs on a serverless environment and on an EMR cluster can
guarantee us a margin of at least 50%. This margin represents a cost constraint in our pricing policy.
In the future, we will integrate a web agents marketplace and adopt a B2B2C paradigm to fill the gap
with the end users, as well as with the actual use cases.
Innovative Big-Data Web Scraping Tech Compnay
9
5. THE MARKET AND COMPETITORS
Trends and opportunities
 Markets: Web Scraping, Web Mining, Data Analytics.
 Dimension: $2 billion of estimated value in 2020 alone (in just one single year).
 Growth: based on the market researches, we expect further growth in the
next years induced by (1) an ever-greater centrality of data in the entire
business process, and (2) the predisposition of the companies to outsource,
more and more often, the above-mentioned activities.
Innovative Big-Data Web Scraping Tech Compnay
10
Main Competitors
 Diffbot: an API for data extraction that uses machine learning heuristic and features to crawl the
pages. Unfortunately, the results are not 100% precise.
 Scrapyhub: a cloud service focused on the Scrapy framework. It offers every single service
separately plus automatic extraction functions that are still in beta version. Anyway, the results are
not always compliant.
 ImportIO: visual tools that customers can use to configure the extractors. However, it is particularly
expensive.
5. THE MARKET AND COMPETITORS
Innovative Big-Data Web Scraping Tech Compnay
11
6. TARGET
E-commerce companies that require algorithmic pricing and competition monitoring.
Big companies that produce press reviews, carry out social media analysis, opinion mining, and
sentiment analysis activities.
Hedge funds and financial institutions for which information such as financial data and sentiment
indicators are extremely important.
Marketing agencies that need web scraping for SEO and web marketing automation purposes.
Established and startup companies that run or are developing any kind of vertical search engine.
Startups and small businesses that can benefit from building dedicated applications on our stack.
Innovative Big-Data Web Scraping Tech Compnay
12
7. SWOT ANALYSIS
STRENGTHS WEAKNESSES
Scalability.
Self-service fast big-data extraction
solution.
We need PhD resources to reinforce the
algorithmic extraction.
Very specialized high-tech service that
requires an effort to make it user-friendly
(for non-technical users).
OPPORTUNITIES RISKS
Global market with big expansion
opportunities.
Profitable niche with low competition.
Restrictive regulations on the use of
personal data (in Europe), on data
collection (in Asia), on data referring to
minors (worldwide).
Innovative Big-Data Web Scraping Tech Compnay
13
8. THE NUMBERS
We are considering a medium / large customer that requires at least 1 million pages per day
at a price of €800.00 (there is a global potential request of 100 billion pages per day).
EUR (in thousands) Year 2021 Year 2022 Year 2023 Year 2024
Sales 2,880 7,200 13,248 20,160
Gross margin 1,440 3,600 6,624 10,080
Net margin 1,440 3,600 6,624 10,080
Num. Customers 10 25 46 70
Innovative Big-Data Web Scraping Tech Compnay
14
9. INVESTMENT PLAN
The investment strategy
First round: 9% in equity for €300k with a pre-money evaluation of €3 million.
Second round: 9% in equity for €2 million.
Third round: 9% in equity for €10 million.
We plan to eventually go public on the stock exchange.
Innovative Big-Data Web Scraping Tech Compnay

Más contenido relacionado

La actualidad más candente

What is the API economy?
What is the API economy?What is the API economy?
What is the API economy?IBM Integration
 
The Cloudification of Capital Markets
The Cloudification of Capital MarketsThe Cloudification of Capital Markets
The Cloudification of Capital MarketsStephane Dubois
 
PaaS: Open For Business
PaaS: Open For Business PaaS: Open For Business
PaaS: Open For Business VMware Tanzu
 
Monitor your car from the cloud! DIY Telematics and the Internet of Things
Monitor your car from the cloud! DIY Telematics and the Internet of ThingsMonitor your car from the cloud! DIY Telematics and the Internet of Things
Monitor your car from the cloud! DIY Telematics and the Internet of ThingsTom Gersic
 
Machine Learning in Retail: ML in the Retail Sector
Machine Learning in Retail: ML in the Retail SectorMachine Learning in Retail: ML in the Retail Sector
Machine Learning in Retail: ML in the Retail SectorBigML, Inc
 
Bmc joe goldberg
Bmc joe goldbergBmc joe goldberg
Bmc joe goldbergBigDataExpo
 
Data Integration: Huntflow and PowerBI | Case Study | Software Development Co...
Data Integration: Huntflow and PowerBI | Case Study | Software Development Co...Data Integration: Huntflow and PowerBI | Case Study | Software Development Co...
Data Integration: Huntflow and PowerBI | Case Study | Software Development Co...*instinctools
 
Building the Cognitive Era : Big Data Strategies
Building the Cognitive Era : Big Data StrategiesBuilding the Cognitive Era : Big Data Strategies
Building the Cognitive Era : Big Data StrategiesKevin Sigliano
 
Distil Network Sponsor Presentation at the Property Portal Watch Conference -...
Distil Network Sponsor Presentation at the Property Portal Watch Conference -...Distil Network Sponsor Presentation at the Property Portal Watch Conference -...
Distil Network Sponsor Presentation at the Property Portal Watch Conference -...Property Portal Watch
 
Outside in thinking - How APIs can help open up your business
Outside in thinking - How APIs can help open up your business Outside in thinking - How APIs can help open up your business
Outside in thinking - How APIs can help open up your business Jeremy Brown
 
Digital marketing pharma - google event
Digital marketing   pharma - google eventDigital marketing   pharma - google event
Digital marketing pharma - google eventDaniel Viveiros
 
SugarCON 2013: World Class Analytics for SugarCRM with IBM
SugarCON 2013: World Class Analytics for SugarCRM with IBMSugarCON 2013: World Class Analytics for SugarCRM with IBM
SugarCON 2013: World Class Analytics for SugarCRM with IBMPalmtreeConsulting
 
apidays LIVE Hong Kong 2021 - Unleash the Power of Big Data with API Collabor...
apidays LIVE Hong Kong 2021 - Unleash the Power of Big Data with API Collabor...apidays LIVE Hong Kong 2021 - Unleash the Power of Big Data with API Collabor...
apidays LIVE Hong Kong 2021 - Unleash the Power of Big Data with API Collabor...apidays
 
Keynote GraphTour Europe 2019, Emil Eifrem, CEO & Co-Founder Neo4j
Keynote GraphTour Europe 2019, Emil Eifrem, CEO & Co-Founder Neo4jKeynote GraphTour Europe 2019, Emil Eifrem, CEO & Co-Founder Neo4j
Keynote GraphTour Europe 2019, Emil Eifrem, CEO & Co-Founder Neo4jNeo4j
 
ANTS - The Future of Digital Marketing - FPT TECH DAY 2016
ANTS - The Future of Digital Marketing - FPT TECH DAY 2016ANTS - The Future of Digital Marketing - FPT TECH DAY 2016
ANTS - The Future of Digital Marketing - FPT TECH DAY 2016ANTS
 

La actualidad más candente (18)

What is the API economy?
What is the API economy?What is the API economy?
What is the API economy?
 
The Cloudification of Capital Markets
The Cloudification of Capital MarketsThe Cloudification of Capital Markets
The Cloudification of Capital Markets
 
PaaS: Open For Business
PaaS: Open For Business PaaS: Open For Business
PaaS: Open For Business
 
Monitor your car from the cloud! DIY Telematics and the Internet of Things
Monitor your car from the cloud! DIY Telematics and the Internet of ThingsMonitor your car from the cloud! DIY Telematics and the Internet of Things
Monitor your car from the cloud! DIY Telematics and the Internet of Things
 
Machine Learning in Retail: ML in the Retail Sector
Machine Learning in Retail: ML in the Retail SectorMachine Learning in Retail: ML in the Retail Sector
Machine Learning in Retail: ML in the Retail Sector
 
Bmc joe goldberg
Bmc joe goldbergBmc joe goldberg
Bmc joe goldberg
 
Data Integration: Huntflow and PowerBI | Case Study | Software Development Co...
Data Integration: Huntflow and PowerBI | Case Study | Software Development Co...Data Integration: Huntflow and PowerBI | Case Study | Software Development Co...
Data Integration: Huntflow and PowerBI | Case Study | Software Development Co...
 
Building the Cognitive Era : Big Data Strategies
Building the Cognitive Era : Big Data StrategiesBuilding the Cognitive Era : Big Data Strategies
Building the Cognitive Era : Big Data Strategies
 
Distil Network Sponsor Presentation at the Property Portal Watch Conference -...
Distil Network Sponsor Presentation at the Property Portal Watch Conference -...Distil Network Sponsor Presentation at the Property Portal Watch Conference -...
Distil Network Sponsor Presentation at the Property Portal Watch Conference -...
 
Urbanclap clone
Urbanclap cloneUrbanclap clone
Urbanclap clone
 
Outside in thinking - How APIs can help open up your business
Outside in thinking - How APIs can help open up your business Outside in thinking - How APIs can help open up your business
Outside in thinking - How APIs can help open up your business
 
Big Data
Big DataBig Data
Big Data
 
Digital marketing pharma - google event
Digital marketing   pharma - google eventDigital marketing   pharma - google event
Digital marketing pharma - google event
 
SugarCON 2013: World Class Analytics for SugarCRM with IBM
SugarCON 2013: World Class Analytics for SugarCRM with IBMSugarCON 2013: World Class Analytics for SugarCRM with IBM
SugarCON 2013: World Class Analytics for SugarCRM with IBM
 
apidays LIVE Hong Kong 2021 - Unleash the Power of Big Data with API Collabor...
apidays LIVE Hong Kong 2021 - Unleash the Power of Big Data with API Collabor...apidays LIVE Hong Kong 2021 - Unleash the Power of Big Data with API Collabor...
apidays LIVE Hong Kong 2021 - Unleash the Power of Big Data with API Collabor...
 
The API Economy
The API EconomyThe API Economy
The API Economy
 
Keynote GraphTour Europe 2019, Emil Eifrem, CEO & Co-Founder Neo4j
Keynote GraphTour Europe 2019, Emil Eifrem, CEO & Co-Founder Neo4jKeynote GraphTour Europe 2019, Emil Eifrem, CEO & Co-Founder Neo4j
Keynote GraphTour Europe 2019, Emil Eifrem, CEO & Co-Founder Neo4j
 
ANTS - The Future of Digital Marketing - FPT TECH DAY 2016
ANTS - The Future of Digital Marketing - FPT TECH DAY 2016ANTS - The Future of Digital Marketing - FPT TECH DAY 2016
ANTS - The Future of Digital Marketing - FPT TECH DAY 2016
 

Similar a An Innovative Big-Data Web Scraping Tech Company

P01executive Summary Yy2009mm03dd16
P01executive Summary Yy2009mm03dd16P01executive Summary Yy2009mm03dd16
P01executive Summary Yy2009mm03dd16guest558440c
 
What is Web-Scale IT ?
What is Web-Scale IT ?What is Web-Scale IT ?
What is Web-Scale IT ?Ahmed Banafa
 
Cubitic: Predictive Analytics
Cubitic: Predictive AnalyticsCubitic: Predictive Analytics
Cubitic: Predictive Analyticshuguk
 
QOTEQ Investment Deck
QOTEQ Investment DeckQOTEQ Investment Deck
QOTEQ Investment DeckUcluster
 
Bitrock manufacturing
Bitrock manufacturing Bitrock manufacturing
Bitrock manufacturing cosma_r
 
Di in the age of digital disruptions v1.0
Di in the age of digital disruptions v1.0Di in the age of digital disruptions v1.0
Di in the age of digital disruptions v1.0Amar Roy
 
IRJET- Multi Design - Pattern React Application with Django Backend
IRJET- Multi Design - Pattern React Application with Django BackendIRJET- Multi Design - Pattern React Application with Django Backend
IRJET- Multi Design - Pattern React Application with Django BackendIRJET Journal
 
IBM InterConnect 2013 Cloud General Session: Robert LeBlanc
IBM InterConnect 2013 Cloud General Session: Robert LeBlancIBM InterConnect 2013 Cloud General Session: Robert LeBlanc
IBM InterConnect 2013 Cloud General Session: Robert LeBlancIBM Events
 
Presentation cloud as a growth engine for a smarter enterprise
Presentation   cloud as a growth engine for a smarter enterprisePresentation   cloud as a growth engine for a smarter enterprise
Presentation cloud as a growth engine for a smarter enterprisexKinAnx
 
Society Overview - 2015
Society Overview - 2015Society Overview - 2015
Society Overview - 2015Dan Glavin
 
T-Bytes Agile & AI Operation
T-Bytes Agile & AI Operation T-Bytes Agile & AI Operation
T-Bytes Agile & AI Operation EGBG Services
 
Systematix_Credential_Presentation_latest (1)
Systematix_Credential_Presentation_latest (1)Systematix_Credential_Presentation_latest (1)
Systematix_Credential_Presentation_latest (1)Alex Hunt
 
BIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in FinanceBIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in FinanceSkillspeed
 
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-indiaArtificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-indiaCapgemini
 
Stepping into the Digital Future with IoT
Stepping into the Digital Future with IoTStepping into the Digital Future with IoT
Stepping into the Digital Future with IoTCognizant
 
Integration of Big Data Analytics with IoT and OT Systems to Turn Insights in...
Integration of Big Data Analytics with IoT and OT Systems to Turn Insights in...Integration of Big Data Analytics with IoT and OT Systems to Turn Insights in...
Integration of Big Data Analytics with IoT and OT Systems to Turn Insights in...Alaa Mahjoub
 
Tech. 2017 predictions presentation for meetups
Tech. 2017 predictions presentation for meetupsTech. 2017 predictions presentation for meetups
Tech. 2017 predictions presentation for meetupsSumant Parimal
 
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demands
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demandsMongoDB .local Chicago 2019: MongoDB – Powering the new age data demands
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demandsMongoDB
 

Similar a An Innovative Big-Data Web Scraping Tech Company (20)

P01executive Summary Yy2009mm03dd16
P01executive Summary Yy2009mm03dd16P01executive Summary Yy2009mm03dd16
P01executive Summary Yy2009mm03dd16
 
What is Web-Scale IT ?
What is Web-Scale IT ?What is Web-Scale IT ?
What is Web-Scale IT ?
 
Cubitic: Predictive Analytics
Cubitic: Predictive AnalyticsCubitic: Predictive Analytics
Cubitic: Predictive Analytics
 
QOTEQ Investment Deck
QOTEQ Investment DeckQOTEQ Investment Deck
QOTEQ Investment Deck
 
Cloud9
Cloud9Cloud9
Cloud9
 
Bitrock manufacturing
Bitrock manufacturing Bitrock manufacturing
Bitrock manufacturing
 
Di in the age of digital disruptions v1.0
Di in the age of digital disruptions v1.0Di in the age of digital disruptions v1.0
Di in the age of digital disruptions v1.0
 
IRJET- Multi Design - Pattern React Application with Django Backend
IRJET- Multi Design - Pattern React Application with Django BackendIRJET- Multi Design - Pattern React Application with Django Backend
IRJET- Multi Design - Pattern React Application with Django Backend
 
IBM InterConnect 2013 Cloud General Session: Robert LeBlanc
IBM InterConnect 2013 Cloud General Session: Robert LeBlancIBM InterConnect 2013 Cloud General Session: Robert LeBlanc
IBM InterConnect 2013 Cloud General Session: Robert LeBlanc
 
Presentation cloud as a growth engine for a smarter enterprise
Presentation   cloud as a growth engine for a smarter enterprisePresentation   cloud as a growth engine for a smarter enterprise
Presentation cloud as a growth engine for a smarter enterprise
 
Society Overview - 2015
Society Overview - 2015Society Overview - 2015
Society Overview - 2015
 
T-Bytes Agile & AI Operation
T-Bytes Agile & AI Operation T-Bytes Agile & AI Operation
T-Bytes Agile & AI Operation
 
Systematix_Credential_Presentation_latest (1)
Systematix_Credential_Presentation_latest (1)Systematix_Credential_Presentation_latest (1)
Systematix_Credential_Presentation_latest (1)
 
BIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in FinanceBIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in Finance
 
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-indiaArtificial intelligence capabilities overview yashowardhan sowale cwin18-india
Artificial intelligence capabilities overview yashowardhan sowale cwin18-india
 
Stepping into the Digital Future with IoT
Stepping into the Digital Future with IoTStepping into the Digital Future with IoT
Stepping into the Digital Future with IoT
 
Integration of Big Data Analytics with IoT and OT Systems to Turn Insights in...
Integration of Big Data Analytics with IoT and OT Systems to Turn Insights in...Integration of Big Data Analytics with IoT and OT Systems to Turn Insights in...
Integration of Big Data Analytics with IoT and OT Systems to Turn Insights in...
 
About smartData
About smartDataAbout smartData
About smartData
 
Tech. 2017 predictions presentation for meetups
Tech. 2017 predictions presentation for meetupsTech. 2017 predictions presentation for meetups
Tech. 2017 predictions presentation for meetups
 
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demands
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demandsMongoDB .local Chicago 2019: MongoDB – Powering the new age data demands
MongoDB .local Chicago 2019: MongoDB – Powering the new age data demands
 

Último

RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
Business Analytics using Microsoft Excel
Business Analytics using Microsoft ExcelBusiness Analytics using Microsoft Excel
Business Analytics using Microsoft Excelysmaelreyes
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhYasamin16
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 

Último (20)

RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
Business Analytics using Microsoft Excel
Business Analytics using Microsoft ExcelBusiness Analytics using Microsoft Excel
Business Analytics using Microsoft Excel
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 

An Innovative Big-Data Web Scraping Tech Company

  • 1. Elevator Pitch An Innovative Big-Data Web Scraping Tech Compnay
  • 2. Innovative Big-Data Web Scraping Tech Compnay 2 HIGHLIGHTS v What is WebRobot? v The Problem v How We Can Solve It v Team v Track Record v Business Model v Trends & Opportunities v Main Competitors v Target v SWOT Analysis v Some Numbers (sales, profit, clients) v Investment Plan
  • 3. 3 1. THE PROJECT Description  WebRobot ltd is a London-based company that operates in the web scraping and web mining industry in which it aims to become the leader.  In WebRobot we are building a super scalable infrastructure for data acquisition that customers can use as a web service. It exploits cloud computing and big-data technologies, as well as data-extraction and information-extraction algorithms.  WebRobot will be a great ally to every company that needs to acquire this heterogeneous network of information and wants to reduce its internal management costs. WebRobot’s services will represent a strategic resource essential to its business success. Innovative Big-Data Web Scraping Tech Compnay
  • 4. 4 1. THE PROJECT The problem Every company wishing to achieve, keep and improve its business success needs information (data) on both the market, customers, and competitors, but this is challenging. It must get good, reliable, and well-organized data. In addition, it needs to manage them properly. The World Wide Web is made up of a huge amount of semi-structured and unstructured data. Furthermore, it constantly changes its structure. The cost to collect all of these data is often very expensive. For all these reasons, we need robust and scalable algorithms that can reduce this onerous maintenance activity. Innovative Big-Data Web Scraping Tech Compnay
  • 5. 5 1. THE PROJECT How We Can Solve the Problem We can guarantee algorithmic and structural scalability with automatic extraction features. We offer a powerful solution in the form of a web service. We integrate cloud computing with big-data technologies applied in the more general web mining context. We use visual support tools and SDK to connect to our stack. WebRobot’s goal is to become a complete ETL service involving data extraction, web mining, machine learning, and big-data analytics. Innovative Big-Data Web Scraping Tech Compnay
  • 6. 6 2. THE TEAM CEO, CTO Roger Giuffrè 71% of Equity Mediterraneo Capital Ltd 25% of Equity CCO, CMO Denis Giuffrè 4% of Equity CFO Antonio Censabella Roger Giuffrè Denis Giuffrè Antonio Bensabella MEDITERRANEO CAPITAL LTD Innovative Big-Data Web Scraping Tech Compnay
  • 7. 7 3. TRACK RECORD We are finalizing the first version of the web service which will include the serverless version on the Lambda technology and Amazon EMR. We need to integrate the wrapper induction algorithms directly into the spark context. This will help us refine them with the latest academic findings. API implementation is fundamentally finished. We have to complete the usability studies of the current interface. We need to complete the dashboard that will be released under an open-source license. We have to design visual tools to support the ETL that has to be generated. We have a new grammar to set up for the query. Innovative Big-Data Web Scraping Tech Compnay
  • 8. 8 4. THE BUSINESS MODEL The Strategy We will release the service on the Amazon marketplace, available in three commercial packages: Entry-Level, Professional, and Enterprise. Our average selling price could be around 0.0008 Euro per page scraped, but we will make a distinction between static and dynamic pages that need complex algorithms. We have verified that the execution costs on a serverless environment and on an EMR cluster can guarantee us a margin of at least 50%. This margin represents a cost constraint in our pricing policy. In the future, we will integrate a web agents marketplace and adopt a B2B2C paradigm to fill the gap with the end users, as well as with the actual use cases. Innovative Big-Data Web Scraping Tech Compnay
  • 9. 9 5. THE MARKET AND COMPETITORS Trends and opportunities  Markets: Web Scraping, Web Mining, Data Analytics.  Dimension: $2 billion of estimated value in 2020 alone (in just one single year).  Growth: based on the market researches, we expect further growth in the next years induced by (1) an ever-greater centrality of data in the entire business process, and (2) the predisposition of the companies to outsource, more and more often, the above-mentioned activities. Innovative Big-Data Web Scraping Tech Compnay
  • 10. 10 Main Competitors  Diffbot: an API for data extraction that uses machine learning heuristic and features to crawl the pages. Unfortunately, the results are not 100% precise.  Scrapyhub: a cloud service focused on the Scrapy framework. It offers every single service separately plus automatic extraction functions that are still in beta version. Anyway, the results are not always compliant.  ImportIO: visual tools that customers can use to configure the extractors. However, it is particularly expensive. 5. THE MARKET AND COMPETITORS Innovative Big-Data Web Scraping Tech Compnay
  • 11. 11 6. TARGET E-commerce companies that require algorithmic pricing and competition monitoring. Big companies that produce press reviews, carry out social media analysis, opinion mining, and sentiment analysis activities. Hedge funds and financial institutions for which information such as financial data and sentiment indicators are extremely important. Marketing agencies that need web scraping for SEO and web marketing automation purposes. Established and startup companies that run or are developing any kind of vertical search engine. Startups and small businesses that can benefit from building dedicated applications on our stack. Innovative Big-Data Web Scraping Tech Compnay
  • 12. 12 7. SWOT ANALYSIS STRENGTHS WEAKNESSES Scalability. Self-service fast big-data extraction solution. We need PhD resources to reinforce the algorithmic extraction. Very specialized high-tech service that requires an effort to make it user-friendly (for non-technical users). OPPORTUNITIES RISKS Global market with big expansion opportunities. Profitable niche with low competition. Restrictive regulations on the use of personal data (in Europe), on data collection (in Asia), on data referring to minors (worldwide). Innovative Big-Data Web Scraping Tech Compnay
  • 13. 13 8. THE NUMBERS We are considering a medium / large customer that requires at least 1 million pages per day at a price of €800.00 (there is a global potential request of 100 billion pages per day). EUR (in thousands) Year 2021 Year 2022 Year 2023 Year 2024 Sales 2,880 7,200 13,248 20,160 Gross margin 1,440 3,600 6,624 10,080 Net margin 1,440 3,600 6,624 10,080 Num. Customers 10 25 46 70 Innovative Big-Data Web Scraping Tech Compnay
  • 14. 14 9. INVESTMENT PLAN The investment strategy First round: 9% in equity for €300k with a pre-money evaluation of €3 million. Second round: 9% in equity for €2 million. Third round: 9% in equity for €10 million. We plan to eventually go public on the stock exchange. Innovative Big-Data Web Scraping Tech Compnay