SlideShare una empresa de Scribd logo
1 de 17
Descargar para leer sin conexión
DATA SCIENCE
Data Science
Data science is the process of deriving valuable knowledge from "Big Data" consisting
of structured, unstructured or semi-structured data that large enterprises produce.
Big Data
Big data is a set of techniques and technologies which operates wits data sizes
beyond the ability of commonly used software tools to capture and manage within a
tolerable elapsed time.
Data Mining
Data mining is a process that analyzes a large amount of data to find new
and hidden information that improves business efficiency. Various industries
have been adopted data mining to their mission-critical business processes
to gain competitive advantages and help business to grow.
Machine Learning
Machine Learning is a process that gives computers the ability to learn without being
explicitly programmed.
Examples: spam filtering, recommendation systems, sales predictions.
Business domains
Any kind of data analyses is based on two major components:
technical tools and domain expertise. Deteo has significant practical
experience in the following industries proven by long term
cooperation with appropriate customers from:
• Banking sector
• Insurance
• Human resource management
• IT and Telecom
• Accounting
• Retail
Business challenges we can address
New possibility for growth depends on the ability to analyze, predict and make
decision based on existed data related to customers and market:
Retail
• Market basket analysis to provide information on what products or services
combinations were purchased or consumed together. This allows to promote and
optimize products and maximize profit.
• Analyze customer retention and locality based on recent purchases activities.
• Data mining helps detect fraudulent behavior with credit card or online
transactions
• Clustering/Segmentation for targeted marketing
Business challenges we can address
Bank and Insurance
• Detect risky behavior of customers
• Claim prediction based on information available from previous events
• Fraud detection
eCommerce
• Collaborative filtering and recommendation systems that make automatic
prediction about the interests of users by collecting preferences and tastes
information from many similar users of such systems.
• Mining social networks could be applied both to target marketing and sentiment
analysis
• Intranet search to provide capabilities to find and answer the questions based on
information available within corporation or organization networks
• Analysis on streaming/online data to prepare information for further processing
Deteo Service Offerings
Approach
In scope of Data Science service offering we are able to complete the following
scope of activities:
• Comprehensive review of customers’ current business, plans and systems
• Recommendations on connecting Data science tools and approaches to
customers’ existing Business and IT infrastructure
• Perform Data Analysis
• Data Visualization and Advanced Reporting
• Support and Maintenance or Solution Hand Over
Initiation
•Project initiation
•Team setup
•Define business
needs
Analysis
•Define business goals in
technical metrics
•Analyze current
infrastructure
•Analyze existing data
•Analyze level of data
sensitivity
•Develop required
algorithms
•Validate algorithms on
small portion of data
Data Mining
•Prepare required
infrastructure
•Perform data
masking of sensitive
data
•Run data mining
algorithms
Results
Analysis
•Root-cause
analysis
•Risks assessment
•Recommenda-
tions to fix
Reporting
•Transform mined
data into graphics,
charts and tables
understandable
for stakeholders
•Plan meeting
where prepared
reports are
presented
Hand Over
•Prepare
knowledge
transfer plan
•Prepare technical
and business
documentation
•Provide training
for customers
experts
•Handover
developed
solution to
customer
Iteration cycle: 3-6 weeks
Regular status meetings
Deteo Expertise
Case study: Car insurance
Business challenge
We received historical data about car accidents from insurance company for the last 5
years. Data was anonymized, so contained no personal information. Customer asked us
to analyze this data. There was an assumption that insurance risk was not equal for
different groups of cars.
Our solution
Using Microsoft cloud stack of technologies for data analysis we run several
experiments and have defined groups of cars with equal risk probability. Based on this
information Customer was able to adjust his insurance fee card, so for two car groups
insurance fee was decreased for 10% and customer proposition became more valuable
on the market.
Business challenge
We received unstructured logs from server farm that represented
servers and services activities. Idea was to analyze it and to find the
most problematic servers and try to analyze the reasons.
Our solution
Using Hadoop Apache technology stack we loaded and processed
about 500 GB of text files. As a result, we identified servers that failed
the most often and defined the most probable preconditions of the
fault.
Next step is to implement online logs processing and analysis in order
to predict server or service fault.
Case study: Logs analysis
• Recommendation systems
• Machine learning
• Visualization
• Data Mining
Stream processing
NoSQL databases Hadoop based infrastructure
• Microsoft HD Insight
• Oracle BigData appliance
• IBM InfoSphere BigInsights
Tools
• Hadoop, Spark, Hive, Pig
• Azure
• R, Python, Java
Vendors
• Oracle, Microsoft, IBM
• Apache
• QlikView, Tableau
Stream processing
• IBM InfoSphere Streams
• Oracle Real-Time Decisions
• Apache Storm in MS Azure
Data science
• Recommendation systems
• Machine learning
• Visualization
• Data Mining
• MongoDB
• Cassandra
• Neo4j
When the data becomes a real problem of its size and variety – it’s time for Big Data solutions
Trainings and certifications
Deteo’s data science team has passed following trainings and certifications
Coursera
• Machine Learning
• Mining Massive Datasets
• Computing for Data Analysis
• R Programming
Online Stanford University
• Statistical Learning
Other
• Hadoop: Map Reduce and Big Data
• MongoDB for Developers
• MongoDB for DBAs
Interested to know more about our abilities?
Please ping us at contact@deteo.info

Más contenido relacionado

La actualidad más candente

Gse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-sharedGse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-sharedcedrinemadera
 
Webinar - Big Data: Power to the User
Webinar - Big Data: Power to the User Webinar - Big Data: Power to the User
Webinar - Big Data: Power to the User Datameer
 
How Startups can leverage big data?
How Startups can leverage big data?How Startups can leverage big data?
How Startups can leverage big data?Rackspace
 
Importance of Big data for your Business
Importance of Big data for your BusinessImportance of Big data for your Business
Importance of Big data for your Businessazuyo.com
 
Big Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer StoriesBig Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer StoriesYellowfin
 
Overview of Business Intelligence
Overview of Business IntelligenceOverview of Business Intelligence
Overview of Business IntelligenceParthiv Dixit
 
Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBala Iyer
 
Overview of analytics and big data in practice
Overview of analytics and big data in practiceOverview of analytics and big data in practice
Overview of analytics and big data in practiceVivek Murugesan
 
From Business Intelligence to Big Data - hack/reduce Dec 2014
From Business Intelligence to Big Data - hack/reduce Dec 2014From Business Intelligence to Big Data - hack/reduce Dec 2014
From Business Intelligence to Big Data - hack/reduce Dec 2014Adam Ferrari
 
How different between Big Data, Business Intelligence and Analytics ?
How different between Big Data, Business Intelligence and Analytics ?How different between Big Data, Business Intelligence and Analytics ?
How different between Big Data, Business Intelligence and Analytics ?Thanakrit Lersmethasakul
 
Importance of data analytics for business
Importance of data analytics for businessImportance of data analytics for business
Importance of data analytics for businessBranliticSocial
 
Business case for Big Data Analytics
Business case for Big Data AnalyticsBusiness case for Big Data Analytics
Business case for Big Data AnalyticsVijay Rao
 
Succeeding with Analytics: Mastering People, Process, and Technology
Succeeding with Analytics: Mastering People, Process, and TechnologySucceeding with Analytics: Mastering People, Process, and Technology
Succeeding with Analytics: Mastering People, Process, and Technologyibi
 
Fraud Detection and Compliance with Graph Learning
Fraud Detection and Compliance with Graph LearningFraud Detection and Compliance with Graph Learning
Fraud Detection and Compliance with Graph LearningTigerGraph
 
Location decisions Center of Gravity
Location decisions Center of GravityLocation decisions Center of Gravity
Location decisions Center of GravityMaarten Van Oost
 
Real-time Data is Changing the Face of the Insurance Industry
Real-time Data is Changing the Face of the Insurance IndustryReal-time Data is Changing the Face of the Insurance Industry
Real-time Data is Changing the Face of the Insurance IndustryDataWorks Summit
 
Big Data, Business Intelligence and Data Analytics
Big Data, Business Intelligence and Data AnalyticsBig Data, Business Intelligence and Data Analytics
Big Data, Business Intelligence and Data AnalyticsSystems Limited
 
Real-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BIReal-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BIibi
 

La actualidad más candente (20)

Gse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-sharedGse uk-cedrinemadera-2018-shared
Gse uk-cedrinemadera-2018-shared
 
Webinar - Big Data: Power to the User
Webinar - Big Data: Power to the User Webinar - Big Data: Power to the User
Webinar - Big Data: Power to the User
 
How Startups can leverage big data?
How Startups can leverage big data?How Startups can leverage big data?
How Startups can leverage big data?
 
Importance of Big data for your Business
Importance of Big data for your BusinessImportance of Big data for your Business
Importance of Big data for your Business
 
Into the Big Data Future with Watson Analytics
Into the Big Data Future with Watson AnalyticsInto the Big Data Future with Watson Analytics
Into the Big Data Future with Watson Analytics
 
Big Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer StoriesBig Data Analytic with Hadoop: Customer Stories
Big Data Analytic with Hadoop: Customer Stories
 
Overview of Business Intelligence
Overview of Business IntelligenceOverview of Business Intelligence
Overview of Business Intelligence
 
Big Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the MarketspaceBig Data & Business Analytics: Understanding the Marketspace
Big Data & Business Analytics: Understanding the Marketspace
 
Overview of analytics and big data in practice
Overview of analytics and big data in practiceOverview of analytics and big data in practice
Overview of analytics and big data in practice
 
From Business Intelligence to Big Data - hack/reduce Dec 2014
From Business Intelligence to Big Data - hack/reduce Dec 2014From Business Intelligence to Big Data - hack/reduce Dec 2014
From Business Intelligence to Big Data - hack/reduce Dec 2014
 
How different between Big Data, Business Intelligence and Analytics ?
How different between Big Data, Business Intelligence and Analytics ?How different between Big Data, Business Intelligence and Analytics ?
How different between Big Data, Business Intelligence and Analytics ?
 
Importance of data analytics for business
Importance of data analytics for businessImportance of data analytics for business
Importance of data analytics for business
 
Business case for Big Data Analytics
Business case for Big Data AnalyticsBusiness case for Big Data Analytics
Business case for Big Data Analytics
 
Succeeding with Analytics: Mastering People, Process, and Technology
Succeeding with Analytics: Mastering People, Process, and TechnologySucceeding with Analytics: Mastering People, Process, and Technology
Succeeding with Analytics: Mastering People, Process, and Technology
 
Big data
Big dataBig data
Big data
 
Fraud Detection and Compliance with Graph Learning
Fraud Detection and Compliance with Graph LearningFraud Detection and Compliance with Graph Learning
Fraud Detection and Compliance with Graph Learning
 
Location decisions Center of Gravity
Location decisions Center of GravityLocation decisions Center of Gravity
Location decisions Center of Gravity
 
Real-time Data is Changing the Face of the Insurance Industry
Real-time Data is Changing the Face of the Insurance IndustryReal-time Data is Changing the Face of the Insurance Industry
Real-time Data is Changing the Face of the Insurance Industry
 
Big Data, Business Intelligence and Data Analytics
Big Data, Business Intelligence and Data AnalyticsBig Data, Business Intelligence and Data Analytics
Big Data, Business Intelligence and Data Analytics
 
Real-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BIReal-Time Data Integration for Modern BI
Real-Time Data Integration for Modern BI
 

Destacado

Jupyter Ascending: a practical hand guide to galactic scale, reproducible dat...
Jupyter Ascending: a practical hand guide to galactic scale, reproducible dat...Jupyter Ascending: a practical hand guide to galactic scale, reproducible dat...
Jupyter Ascending: a practical hand guide to galactic scale, reproducible dat...John Fonner
 
Data Driven Innovation: New Business Models, Products and Services
Data Driven Innovation: New Business Models, Products and ServicesData Driven Innovation: New Business Models, Products and Services
Data Driven Innovation: New Business Models, Products and ServicesAnja Hoffmann
 
Renouveau et Futures Performances du Décisionnel
Renouveau et Futures Performances du DécisionnelRenouveau et Futures Performances du Décisionnel
Renouveau et Futures Performances du DécisionnelLaetitia Le Chatton
 
INT2016 Keynote - Emil Pawlowski & Vesna Gordon (Gemius) - Data Science Revol...
INT2016 Keynote - Emil Pawlowski & Vesna Gordon (Gemius) - Data Science Revol...INT2016 Keynote - Emil Pawlowski & Vesna Gordon (Gemius) - Data Science Revol...
INT2016 Keynote - Emil Pawlowski & Vesna Gordon (Gemius) - Data Science Revol...IAB Europe
 
SQL PASS BA London 2014 - Data Culture & Future of Analytics
SQL PASS BA London 2014 - Data Culture & Future of AnalyticsSQL PASS BA London 2014 - Data Culture & Future of Analytics
SQL PASS BA London 2014 - Data Culture & Future of AnalyticsJonathan Woodward
 
SplunkLive! London 2016 Splunk for Devops
SplunkLive! London 2016 Splunk for DevopsSplunkLive! London 2016 Splunk for Devops
SplunkLive! London 2016 Splunk for DevopsSplunk
 
Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...butest
 
Building a distributed data-platform - A perspective on current trends in co...
Building a distributed data-platform  - A perspective on current trends in co...Building a distributed data-platform  - A perspective on current trends in co...
Building a distributed data-platform - A perspective on current trends in co...Charles Care
 
"Where's the data?" The role of metadata in enabling the transformation to a ...
"Where's the data?" The role of metadata in enabling the transformation to a ..."Where's the data?" The role of metadata in enabling the transformation to a ...
"Where's the data?" The role of metadata in enabling the transformation to a ...Roland Bullivant
 
Hadoop enhancements using next gen IA technologies
Hadoop enhancements using next gen IA technologiesHadoop enhancements using next gen IA technologies
Hadoop enhancements using next gen IA technologiesBigdata Meetup Kochi
 
Ellie Mirman - Creating an Agile, Data-Driven Marketing Team
Ellie Mirman - Creating an Agile, Data-Driven Marketing TeamEllie Mirman - Creating an Agile, Data-Driven Marketing Team
Ellie Mirman - Creating an Agile, Data-Driven Marketing TeamINBOUND
 
Resilient Predictive Data Pipelines (GOTO Chicago 2016)
Resilient Predictive Data Pipelines (GOTO Chicago 2016)Resilient Predictive Data Pipelines (GOTO Chicago 2016)
Resilient Predictive Data Pipelines (GOTO Chicago 2016)Sid Anand
 
Yhat - Applied Data Science - Feb 2016
Yhat - Applied Data Science - Feb 2016Yhat - Applied Data Science - Feb 2016
Yhat - Applied Data Science - Feb 2016Austin Ogilvie
 
Open Data Science Conference Agile Data
Open Data Science Conference Agile DataOpen Data Science Conference Agile Data
Open Data Science Conference Agile DataDataKitchen
 
XebiConFr 15 - Voyages-sncf.com - Les apports de la Data Science à la connais...
XebiConFr 15 - Voyages-sncf.com - Les apports de la Data Science à la connais...XebiConFr 15 - Voyages-sncf.com - Les apports de la Data Science à la connais...
XebiConFr 15 - Voyages-sncf.com - Les apports de la Data Science à la connais...Publicis Sapient Engineering
 
software architecture cant fight lean startup
software architecture cant fight lean startupsoftware architecture cant fight lean startup
software architecture cant fight lean startupIvo Nascimento
 
The Nitty Gritty of Advanced Analytics Using Apache Spark in Python
The Nitty Gritty of Advanced Analytics Using Apache Spark in PythonThe Nitty Gritty of Advanced Analytics Using Apache Spark in Python
The Nitty Gritty of Advanced Analytics Using Apache Spark in PythonMiklos Christine
 

Destacado (20)

Jupyter Ascending: a practical hand guide to galactic scale, reproducible dat...
Jupyter Ascending: a practical hand guide to galactic scale, reproducible dat...Jupyter Ascending: a practical hand guide to galactic scale, reproducible dat...
Jupyter Ascending: a practical hand guide to galactic scale, reproducible dat...
 
Data Driven Innovation: New Business Models, Products and Services
Data Driven Innovation: New Business Models, Products and ServicesData Driven Innovation: New Business Models, Products and Services
Data Driven Innovation: New Business Models, Products and Services
 
Renouveau et Futures Performances du Décisionnel
Renouveau et Futures Performances du DécisionnelRenouveau et Futures Performances du Décisionnel
Renouveau et Futures Performances du Décisionnel
 
Dev ops meetup
Dev ops meetupDev ops meetup
Dev ops meetup
 
INT2016 Keynote - Emil Pawlowski & Vesna Gordon (Gemius) - Data Science Revol...
INT2016 Keynote - Emil Pawlowski & Vesna Gordon (Gemius) - Data Science Revol...INT2016 Keynote - Emil Pawlowski & Vesna Gordon (Gemius) - Data Science Revol...
INT2016 Keynote - Emil Pawlowski & Vesna Gordon (Gemius) - Data Science Revol...
 
SQL PASS BA London 2014 - Data Culture & Future of Analytics
SQL PASS BA London 2014 - Data Culture & Future of AnalyticsSQL PASS BA London 2014 - Data Culture & Future of Analytics
SQL PASS BA London 2014 - Data Culture & Future of Analytics
 
SplunkLive! London 2016 Splunk for Devops
SplunkLive! London 2016 Splunk for DevopsSplunkLive! London 2016 Splunk for Devops
SplunkLive! London 2016 Splunk for Devops
 
Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...Competitive advantage from Data Mining: some lessons learnt ...
Competitive advantage from Data Mining: some lessons learnt ...
 
Building a distributed data-platform - A perspective on current trends in co...
Building a distributed data-platform  - A perspective on current trends in co...Building a distributed data-platform  - A perspective on current trends in co...
Building a distributed data-platform - A perspective on current trends in co...
 
"Where's the data?" The role of metadata in enabling the transformation to a ...
"Where's the data?" The role of metadata in enabling the transformation to a ..."Where's the data?" The role of metadata in enabling the transformation to a ...
"Where's the data?" The role of metadata in enabling the transformation to a ...
 
Hadoop enhancements using next gen IA technologies
Hadoop enhancements using next gen IA technologiesHadoop enhancements using next gen IA technologies
Hadoop enhancements using next gen IA technologies
 
Ellie Mirman - Creating an Agile, Data-Driven Marketing Team
Ellie Mirman - Creating an Agile, Data-Driven Marketing TeamEllie Mirman - Creating an Agile, Data-Driven Marketing Team
Ellie Mirman - Creating an Agile, Data-Driven Marketing Team
 
Du craft chez les OPS
Du craft chez les OPSDu craft chez les OPS
Du craft chez les OPS
 
Resilient Predictive Data Pipelines (GOTO Chicago 2016)
Resilient Predictive Data Pipelines (GOTO Chicago 2016)Resilient Predictive Data Pipelines (GOTO Chicago 2016)
Resilient Predictive Data Pipelines (GOTO Chicago 2016)
 
Yhat - Applied Data Science - Feb 2016
Yhat - Applied Data Science - Feb 2016Yhat - Applied Data Science - Feb 2016
Yhat - Applied Data Science - Feb 2016
 
Open Data Science Conference Agile Data
Open Data Science Conference Agile DataOpen Data Science Conference Agile Data
Open Data Science Conference Agile Data
 
XebiConFr 15 - Voyages-sncf.com - Les apports de la Data Science à la connais...
XebiConFr 15 - Voyages-sncf.com - Les apports de la Data Science à la connais...XebiConFr 15 - Voyages-sncf.com - Les apports de la Data Science à la connais...
XebiConFr 15 - Voyages-sncf.com - Les apports de la Data Science à la connais...
 
software architecture cant fight lean startup
software architecture cant fight lean startupsoftware architecture cant fight lean startup
software architecture cant fight lean startup
 
The Nitty Gritty of Advanced Analytics Using Apache Spark in Python
The Nitty Gritty of Advanced Analytics Using Apache Spark in PythonThe Nitty Gritty of Advanced Analytics Using Apache Spark in Python
The Nitty Gritty of Advanced Analytics Using Apache Spark in Python
 
Data-driven Innovation - Wood
Data-driven Innovation - WoodData-driven Innovation - Wood
Data-driven Innovation - Wood
 

Similar a Data Science Solutions

Data Mining Services in various types
Data Mining Services in various typesData Mining Services in various types
Data Mining Services in various typesloginworks software
 
Entry Points – How to Get Rolling with Big Data Analytics
Entry Points – How to Get Rolling with Big Data AnalyticsEntry Points – How to Get Rolling with Big Data Analytics
Entry Points – How to Get Rolling with Big Data AnalyticsInside Analysis
 
Modern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyModern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyNeo4j
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Roger Barga
 
final oracle presentation
final oracle presentationfinal oracle presentation
final oracle presentationPriyesh Patel
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data PlatformVikas Manoria
 
Platforming the Major Analytic Use Cases for Modern Engineering
Platforming the Major Analytic Use Cases for Modern EngineeringPlatforming the Major Analytic Use Cases for Modern Engineering
Platforming the Major Analytic Use Cases for Modern EngineeringDATAVERSITY
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Denodo
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4Janani Eshwaran
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4Janani Eshwaran
 
Business Analytics Paradigm Change
Business Analytics Paradigm ChangeBusiness Analytics Paradigm Change
Business Analytics Paradigm ChangeDmitry Anoshin
 
Customer value analysis of big data products
Customer value analysis of big data productsCustomer value analysis of big data products
Customer value analysis of big data productsVikas Sardana
 
Disrupting Insurance with Advanced Analytics The Next Generation Carrier
Disrupting Insurance with Advanced Analytics The Next Generation CarrierDisrupting Insurance with Advanced Analytics The Next Generation Carrier
Disrupting Insurance with Advanced Analytics The Next Generation CarrierDataWorks Summit/Hadoop Summit
 
Data Sciences & Analytics Discover the unknown power of the known
Data Sciences & Analytics Discover the unknown power of the knownData Sciences & Analytics Discover the unknown power of the known
Data Sciences & Analytics Discover the unknown power of the knownYASH Technologies
 
Data Sciences & Analytics Discover the unknown power of the known
Data Sciences & Analytics Discover the unknown power of the knownData Sciences & Analytics Discover the unknown power of the known
Data Sciences & Analytics Discover the unknown power of the knownYASH Technologies
 

Similar a Data Science Solutions (20)

Data Mining Services in various types
Data Mining Services in various typesData Mining Services in various types
Data Mining Services in various types
 
Entry Points – How to Get Rolling with Big Data Analytics
Entry Points – How to Get Rolling with Big Data AnalyticsEntry Points – How to Get Rolling with Big Data Analytics
Entry Points – How to Get Rolling with Big Data Analytics
 
Machine Data Analytics
Machine Data AnalyticsMachine Data Analytics
Machine Data Analytics
 
Modern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph TechnologyModern Data Challenges require Modern Graph Technology
Modern Data Challenges require Modern Graph Technology
 
Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015
 
AI in the Enterprise at Scale
AI in the Enterprise at ScaleAI in the Enterprise at Scale
AI in the Enterprise at Scale
 
final oracle presentation
final oracle presentationfinal oracle presentation
final oracle presentation
 
Big data Introduction by Mohan
Big data Introduction by MohanBig data Introduction by Mohan
Big data Introduction by Mohan
 
Big data Analytics
Big data AnalyticsBig data Analytics
Big data Analytics
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data Platform
 
Trends in data analytics
Trends in data analyticsTrends in data analytics
Trends in data analytics
 
Platforming the Major Analytic Use Cases for Modern Engineering
Platforming the Major Analytic Use Cases for Modern EngineeringPlatforming the Major Analytic Use Cases for Modern Engineering
Platforming the Major Analytic Use Cases for Modern Engineering
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4
 
Business Analytics Paradigm Change
Business Analytics Paradigm ChangeBusiness Analytics Paradigm Change
Business Analytics Paradigm Change
 
Customer value analysis of big data products
Customer value analysis of big data productsCustomer value analysis of big data products
Customer value analysis of big data products
 
Disrupting Insurance with Advanced Analytics The Next Generation Carrier
Disrupting Insurance with Advanced Analytics The Next Generation CarrierDisrupting Insurance with Advanced Analytics The Next Generation Carrier
Disrupting Insurance with Advanced Analytics The Next Generation Carrier
 
Data Sciences & Analytics Discover the unknown power of the known
Data Sciences & Analytics Discover the unknown power of the knownData Sciences & Analytics Discover the unknown power of the known
Data Sciences & Analytics Discover the unknown power of the known
 
Data Sciences & Analytics Discover the unknown power of the known
Data Sciences & Analytics Discover the unknown power of the knownData Sciences & Analytics Discover the unknown power of the known
Data Sciences & Analytics Discover the unknown power of the known
 

Data Science Solutions

  • 2. Data Science Data science is the process of deriving valuable knowledge from "Big Data" consisting of structured, unstructured or semi-structured data that large enterprises produce.
  • 3. Big Data Big data is a set of techniques and technologies which operates wits data sizes beyond the ability of commonly used software tools to capture and manage within a tolerable elapsed time.
  • 4. Data Mining Data mining is a process that analyzes a large amount of data to find new and hidden information that improves business efficiency. Various industries have been adopted data mining to their mission-critical business processes to gain competitive advantages and help business to grow.
  • 5. Machine Learning Machine Learning is a process that gives computers the ability to learn without being explicitly programmed. Examples: spam filtering, recommendation systems, sales predictions.
  • 6. Business domains Any kind of data analyses is based on two major components: technical tools and domain expertise. Deteo has significant practical experience in the following industries proven by long term cooperation with appropriate customers from: • Banking sector • Insurance • Human resource management • IT and Telecom • Accounting • Retail
  • 7. Business challenges we can address New possibility for growth depends on the ability to analyze, predict and make decision based on existed data related to customers and market: Retail • Market basket analysis to provide information on what products or services combinations were purchased or consumed together. This allows to promote and optimize products and maximize profit. • Analyze customer retention and locality based on recent purchases activities. • Data mining helps detect fraudulent behavior with credit card or online transactions • Clustering/Segmentation for targeted marketing
  • 8. Business challenges we can address Bank and Insurance • Detect risky behavior of customers • Claim prediction based on information available from previous events • Fraud detection eCommerce • Collaborative filtering and recommendation systems that make automatic prediction about the interests of users by collecting preferences and tastes information from many similar users of such systems. • Mining social networks could be applied both to target marketing and sentiment analysis • Intranet search to provide capabilities to find and answer the questions based on information available within corporation or organization networks • Analysis on streaming/online data to prepare information for further processing
  • 10. Approach In scope of Data Science service offering we are able to complete the following scope of activities: • Comprehensive review of customers’ current business, plans and systems • Recommendations on connecting Data science tools and approaches to customers’ existing Business and IT infrastructure • Perform Data Analysis • Data Visualization and Advanced Reporting • Support and Maintenance or Solution Hand Over
  • 11. Initiation •Project initiation •Team setup •Define business needs Analysis •Define business goals in technical metrics •Analyze current infrastructure •Analyze existing data •Analyze level of data sensitivity •Develop required algorithms •Validate algorithms on small portion of data Data Mining •Prepare required infrastructure •Perform data masking of sensitive data •Run data mining algorithms Results Analysis •Root-cause analysis •Risks assessment •Recommenda- tions to fix Reporting •Transform mined data into graphics, charts and tables understandable for stakeholders •Plan meeting where prepared reports are presented Hand Over •Prepare knowledge transfer plan •Prepare technical and business documentation •Provide training for customers experts •Handover developed solution to customer Iteration cycle: 3-6 weeks Regular status meetings
  • 13. Case study: Car insurance Business challenge We received historical data about car accidents from insurance company for the last 5 years. Data was anonymized, so contained no personal information. Customer asked us to analyze this data. There was an assumption that insurance risk was not equal for different groups of cars. Our solution Using Microsoft cloud stack of technologies for data analysis we run several experiments and have defined groups of cars with equal risk probability. Based on this information Customer was able to adjust his insurance fee card, so for two car groups insurance fee was decreased for 10% and customer proposition became more valuable on the market.
  • 14. Business challenge We received unstructured logs from server farm that represented servers and services activities. Idea was to analyze it and to find the most problematic servers and try to analyze the reasons. Our solution Using Hadoop Apache technology stack we loaded and processed about 500 GB of text files. As a result, we identified servers that failed the most often and defined the most probable preconditions of the fault. Next step is to implement online logs processing and analysis in order to predict server or service fault. Case study: Logs analysis
  • 15. • Recommendation systems • Machine learning • Visualization • Data Mining Stream processing NoSQL databases Hadoop based infrastructure • Microsoft HD Insight • Oracle BigData appliance • IBM InfoSphere BigInsights Tools • Hadoop, Spark, Hive, Pig • Azure • R, Python, Java Vendors • Oracle, Microsoft, IBM • Apache • QlikView, Tableau Stream processing • IBM InfoSphere Streams • Oracle Real-Time Decisions • Apache Storm in MS Azure Data science • Recommendation systems • Machine learning • Visualization • Data Mining • MongoDB • Cassandra • Neo4j When the data becomes a real problem of its size and variety – it’s time for Big Data solutions
  • 16. Trainings and certifications Deteo’s data science team has passed following trainings and certifications Coursera • Machine Learning • Mining Massive Datasets • Computing for Data Analysis • R Programming Online Stanford University • Statistical Learning Other • Hadoop: Map Reduce and Big Data • MongoDB for Developers • MongoDB for DBAs
  • 17. Interested to know more about our abilities? Please ping us at contact@deteo.info