Providing Interactive Analytics on Excel with Billions of Rows
2. © Kyligence Inc. 2019, Confidential.
Excel-Superhero: The Unsung Hero of Modern Analytics
The Excel-Superhero
Fulfilling business analytics
Generating complex reports and analysis
Building complex models
AND … keeping everyone happy, moving business forward
VS.
The Big Data Super Villain
Large datasets, often billions of rows, sitting in a data lake, cloud storage, or distributed databases.
Hard to do number crunching and analysis, often crashing the Excel-Superhero.
Slow processing times lead to inefficient decision making.
Complex calculations are often difficult to perform with billions of rows of data.
Petabyte-scale datasets combined with many concurrent users often become too challenging for many organizations.
3. © Kyligence Inc. 2019, Confidential.
Challenges in Excel with Big Data
• Limited Number of Dimensions
• Limited Scalability
• Slow Response Time
• Difficult to Access Data
The Big Data Villain: "I will finish Excel and all Excel users!"
4. © Kyligence Inc. 2019, Confidential.
• Excel has been a tool of choice for analysts across the board and throughout the industry.
• Analysts have been creating complicated models in Excel to build reports using PivotTable, PivotChart, macros, etc.
• Excel is easy to use, time-tested, readily available, and a reliable application for analysts to consume, crunch, and analyze data.
Excel – Analytics for Every Organization
Fun Fact
5. © Kyligence Inc. 2019, Confidential.
Apache Kylin
Top-Level Apache Project
The only open-source OLAP on big data platform
Best Open-Source Big Data Tool
InfoWorld’s Bossies (Best of Open Source Software Awards) in 2015 & 2016
Sub-Second Interactive Query
Large scale, high concurrency, multi-dimensional, sub-second query latency
1,000+ Organizations
Adopted by thousands of organizations globally
6. © Kyligence Inc. 2019, Confidential.
Kyligence = Kylin + Intelligence
• Founded in 2016 by the creators of Apache Kylin
• Built around Kylin with augmented AI, enhanced to deliver unprecedented enterprise analytic performance
• CRN Top-10 big data startups in 2018
• Global Presence: San Jose, Seattle, New York, Shanghai, Beijing
• VCs: Fidelity International, Shunwei Capital, Broadband Capital, Redpoint, Cisco, Coatue
Accelerate Critical Business Decisions with AI-Augmented Data Management and Analytics
2016: Founded, Pre-A (Redpoint, Cisco)
2017: Series A (CBC, Shunwei)
2018: Series B (8Roads)
2019: Series C (Coatue)
7. © Kyligence Inc. 2019, Confidential.
Excel-Superhero: The Unsung Hero of Modern Analytics
The Excel-Superhero
9. © Kyligence Inc. 2019, Confidential.
Kyligence MDX
OLAP
A Case for OLAP for Analytics
• More data sources
• Lower TCO
• Scalable
• High performance
• Ad hoc queries
• Flexible analysis
• Enterprise security
[Architecture diagram. Labels: Business Analysis; Data Lake; DW / DM; Logs; Orders; CRM; POS; Kyligence Enterprise; Semantic Layer; Data Service Layer; MDX; SQL; MOLAP; Complex Modeling; Semantic Translation; Measure Groups; Query Engine; Caching; Multi-Level Accesses; Multi-Tenants; Distributed Arch.; Scale Up; Queries; Data]
10. © Kyligence Inc. 2019, Confidential.
Automatic Model Creation
The AI-augmented engine automatically designs an optimal model based on past user behaviors and query patterns. This reduces the need for manual modeling and maintenance.
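One way to picture pattern-driven modeling is to count which dimension combinations appear in past queries and pre-aggregate the most frequent ones. The sketch below only illustrates that idea; it is not Kyligence's actual algorithm, and the `query_log` format and the `suggest_aggregates` helper are hypothetical names introduced here.

```python
from collections import Counter


def suggest_aggregates(query_log, top_n=2):
    """Return the top_n most frequently queried dimension combinations.

    query_log is a hypothetical list of dicts, each with a "group_by" list.
    """
    # Treat ["region", "year"] and ["year", "region"] as the same combination.
    counts = Counter(frozenset(q["group_by"]) for q in query_log)
    # Sort by frequency (descending), then by name so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], sorted(kv[0])))
    return [sorted(dims) for dims, _ in ranked[:top_n]]


query_log = [
    {"group_by": ["region", "year"]},
    {"group_by": ["year", "region"]},
    {"group_by": ["product"]},
    {"group_by": ["region", "year"]},
    {"group_by": ["product"]},
    {"group_by": ["customer"]},
]

suggestions = suggest_aggregates(query_log)
print(suggestions)
```

In this toy log the (region, year) combination is queried most often, so it would be the first candidate for pre-aggregation.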
Adaptive Schema Evolution
As analytical requests change, the model needs to reflect those changes. Our model automatically adapts to any schema changes. The model evolves along with your analytical needs.
Automatic Query Optimization
The model continuously evolves and self-optimizes as it obtains new usage behavior. This guarantees sub-second performance, no matter the data volume or concurrency.
Kyligence Solution
11. © Kyligence Inc. 2019, Confidential.
Traditional OLAP vs. Kyligence
Traditional OLAP
• Rigid schema, dependent on data warehouse
• Single node solution
• End-user analytics is limited by the OLAP cube. If the measures and dimensions do not already exist, the query cannot be answered.
Kyligence
• Adaptive schema
• Distributed multi-node solution
• OLAP cube provides sub-second responses
• Smart pushdown capabilities, guaranteed query responses
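The contrast above can be sketched as a routing decision: answer from the cube when it already covers the requested dimensions and measures, and otherwise push the query down to the underlying source instead of failing. This is a minimal illustration under that assumption, not the product's routing logic; `route_query` and the `cube` dictionary are hypothetical.

```python
def route_query(dimensions, measures, cube):
    """Return "cube" if the cube covers the query, else "pushdown"."""
    covered = (
        set(dimensions) <= cube["dimensions"]
        and set(measures) <= cube["measures"]
    )
    return "cube" if covered else "pushdown"


# Hypothetical cube definition: the dimensions and measures it pre-aggregates.
cube = {
    "dimensions": {"region", "year", "product"},
    "measures": {"sum_sales", "count_orders"},
}

covered_route = route_query(["region", "year"], ["sum_sales"], cube)
uncovered_route = route_query(["customer_id"], ["sum_sales"], cube)
print(covered_route, uncovered_route)
```

A traditional OLAP engine, as described on this slide, would reject the second query because `customer_id` is not in the cube; with pushdown it is still answered, only more slowly.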
12. © Kyligence Inc. 2019, Confidential.
AI-Powered Data Management For Most Valuable Data
ANSI SQL
MDX
REST
Semantic Layer
Finance
Marketing
Sales
Index
AI-Augmented Engine
13. © Kyligence Inc. 2019, Confidential.
Kyligence Architecture
Data Source: Database, Events, Files, Logs, IoT
Storage: Data Lake, Azure Blob Storage, AWS S3, Hadoop, Google Cloud Storage, Azure Synapse, Snowflake
Data Service: Query Engine, Semantic Layer, SQL Query Engine, Smart Modeling
Management: Scaling, Maintenance, Monitor, Enterprise-Level Security, TCO
Analytics: Business Insights, Multidimensional Analysis, 3rd-Party Applications, Machine Learning, Visualization, Self-Service, Collaboration, 3rd-Party BI Tools
14. © Kyligence Inc. 2019, Confidential.
Value Proposition
• Sub-Second Query: fast response time; supports aggregation, detail, and ad hoc queries
• Multi-Level Security: project-, table-, row-, and column-level access control
• Semantic Layer: supports complex business logic
• Seamless Integration with Excel/BI: supports core Excel analytics functionality; model synchronization into BI tools
• Cube Building: full or incremental build; query while building
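The incremental build idea can be sketched as follows, assuming additive measures: each new batch of rows is pre-aggregated into its own segment, and queries sum across whatever segments exist at that moment, so earlier data stays queryable while later batches are built. This is an illustration only, not Kyligence's build process; `build_segment` and `query_total` are hypothetical helpers.

```python
from collections import defaultdict


def build_segment(rows):
    """Pre-aggregate one batch of (day, region, amount) rows by region."""
    totals = defaultdict(float)
    for _day, region, amount in rows:
        totals[region] += amount
    return dict(totals)


def query_total(segments, region):
    """Answer a query by summing the pre-aggregated value from each segment."""
    return sum(segment.get(region, 0.0) for segment in segments)


# First build covers day one.
segments = [build_segment([("2019-01-01", "US", 10.0), ("2019-01-01", "EU", 5.0)])]
before = query_total(segments, "US")

# Incremental build: only the new day's rows are processed and appended.
segments.append(build_segment([("2019-01-02", "US", 7.5)]))
after = query_total(segments, "US")
print(before, after)
```

A full build would instead reprocess every row into a single segment; the incremental path touches only the new batch.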
16. © Kyligence Inc. 2019, Confidential.
Customers: eBay; Global Top 3 Bank; Top Global Insurance Company; World’s Largest Credit Card Processor
• Originally built to replace Teradata
• 3 trillion rows of detail
• 100,000 concurrent users on hand-held devices
• With millisecond responses
• Replaced Teradata
• IBM Cognos replacement
• From 1,200+ cubes down to 2 cubes
• Complete replacement of Greenplum
17. © Kyligence Inc. 2019, Confidential.
Excel Trivia
How many actions can you undo in Excel?
100
What is the earliest date allowed in Excel calculations?
Jan 01, 1900
What is the result of =1*(0.5-0.4-0.1) in Excel?
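The last question turns on binary floating point. Excel stores numbers as IEEE 754 double-precision values, as does Python's `float`, so the same arithmetic can be reproduced in Python. The sketch shows only the underlying arithmetic; how Excel formats the value in a cell is a separate matter.

```python
# 0.4 and 0.1 have no exact binary representation, so the subtraction
# leaves a tiny residue instead of an exact zero.
result = 1 * (0.5 - 0.4 - 0.1)

print(result)              # a very small negative number
print(result == 0)         # False
print(result == -2**-55)   # True: the residue is exactly -2^-55
```

The residue is about -2.78 × 10^-17, which is why comparisons against zero on computed values are usually done with a tolerance or after rounding.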
18. © Kyligence Inc. 2019, Confidential.
Thank You!
#kyligence @kyligence
Connect with us on LinkedIn, Twitter & Facebook
Try Kyligence @ https://kyligence.io/download-free-trial/
www.kyligence.io