3. Operating System Infrastructure: Virtualized OpenStack environment
Hardware Infrastructure: high performance, redundant, networking
Health & Performance Monitoring
Cloud Control Center (C3)
Data Sources
Cloud Sources
Shared Storage
Cloud Computing Fabric
BI Automation SDK
Significant GoodData IP
Open Source or Commercial Software
Managed & Operated by GoodData
Web Server & Proxy
Data Backup
ADS: Agile Data Warehouse Service (Clustered Columnar Database)
BDS
Data Mart: MICCache, Columnar Database, LDM, Metadata
DIS: Data Integration Services
Visualization Export Services
XAE: Extensible Analytics Engine
APIs
Visualize
Analytic Designer
Browser
Cloud Connect
SQL Client
Automation SDK
SQL, JDBC, WebDav
4. 4
GoodData Confidential. 2014 GoodData Corporation. All rights
Operating System Infrastructure: Virtualized OpenStack environment
Hardware Infrastructure: high performance, redundant, networking
Health & Performance Monitoring: PIPE, Splunk
Cloud Control Center (C3)
• Projects, users, services, their associations, and access
• Spans multiple clusters
• Stored in MySQL Database
Shared Storage
• GlusterFS/NetApp
• Passes data between stateless workers
• Data input via WebDav/FTP
• Garbage collection
Cloud Computing Fabric
• Asynchronous task distribution: workers can be added (scale) at any time
• Workflow orchestration & dependencies: the UI waits for a report task to run
• Load distribution: workers are spread across any number of computers
Cloud Computing Fabric
Coordinate and do any type of work quickly and at scale
TECHNICAL DIFFERENTIATION
Scale & Flexibility
HOT-HOT-HOT Redundancy
BI Automation SDK: reusable framework for DIS, ADS, XAE
Examples: query, transformation execution, data exporter, etc.
Interoperable across different programming languages
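The asynchronous task distribution and dependency handling described for the Cloud Computing Fabric can be illustrated with a minimal sketch. This is not GoodData's implementation: the function `run_workflow`, the task names, and the use of a local thread pool in place of distributed workers are all assumptions made for illustration.

```python
from concurrent.futures import FIRST_COMPLETED, ThreadPoolExecutor, wait
from graphlib import TopologicalSorter


def run_workflow(tasks, deps, max_workers=4):
    """Run the callables in `tasks`, honoring `deps` (task name -> prerequisite names).

    Each task is called with a dict holding the results of its prerequisites.
    A thread pool stands in for the pool of stateless workers (an assumption).
    """
    sorter = TopologicalSorter({name: deps.get(name, set()) for name in tasks})
    sorter.prepare()  # raises graphlib.CycleError on circular dependencies
    results = {}
    running = {}  # future -> task name
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        while sorter.is_active():
            # Submit every task whose prerequisites have all finished.
            for name in sorter.get_ready():
                inputs = {d: results[d] for d in deps.get(name, set())}
                running[pool.submit(tasks[name], inputs)] = name
            # Block until at least one running task completes.
            done, _ = wait(running, return_when=FIRST_COMPLETED)
            for future in done:
                name = running.pop(future)
                results[name] = future.result()
                sorter.done(name)
    return results


# Hypothetical three-step workflow: load data, aggregate it, export a report.
example_tasks = {
    "load": lambda inputs: [3, 1, 2],
    "aggregate": lambda inputs: sum(inputs["load"]),
    "export": lambda inputs: f"total={inputs['aggregate']}",
}
example_deps = {"aggregate": {"load"}, "export": {"aggregate"}}
example_results = run_workflow(example_tasks, example_deps)
```

Raising `max_workers` is the local analogue of adding workers; independent tasks then run in parallel, while dependent tasks still wait for their inputs.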
5.
API & Proxy Framework
Deliver a fast, customizable, and secure experience
TECHNICAL DIFFERENTIATION
100% of APIs are Public!
REST = Easy Web Integration
Synchronous & Asynchronous
Responsive UI no matter how long queries take
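One common way a REST API supports both synchronous and asynchronous calls is to return a task handle immediately and let the client poll for the result, so the UI never blocks on a long query. The sketch below illustrates that pattern only; `FakeReportService`, its methods, and the status codes it returns are hypothetical stand-ins, not GoodData endpoints.

```python
import itertools
import time


class FakeReportService:
    """Hypothetical in-memory stand-in for an asynchronous REST endpoint."""

    def __init__(self, polls_until_ready=3):
        self._ids = itertools.count(1)
        self._tasks = {}  # task id -> (polls remaining, report name)
        self._polls_until_ready = polls_until_ready

    def submit(self, report):
        """Mimic a POST that returns at once with a task handle."""
        task_id = next(self._ids)
        self._tasks[task_id] = (self._polls_until_ready, report)
        return task_id

    def status(self, task_id):
        """Mimic a GET: (202, None) while running, (200, result) when finished."""
        remaining, report = self._tasks[task_id]
        if remaining > 0:
            self._tasks[task_id] = (remaining - 1, report)
            return 202, None
        return 200, f"result of {report}"


def wait_for_result(service, task_id, interval=0.01, timeout=5.0):
    """Poll until the task finishes or the timeout expires."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        code, body = service.status(task_id)
        if code == 200:
            return body
        time.sleep(interval)  # the caller stays free to do other work between polls
    raise TimeoutError(f"task {task_id} did not finish within {timeout}s")
```

A client would call `submit`, keep the returned handle, and call `wait_for_result` (or poll on its own schedule) while the interface remains responsive.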
Internal Services
Exporters, Connectors
Authenticate
Web Server & Proxy
REST APIs
Browser
• Login
• Project
• User
• Data Model
• Data Upload
• Notifications
• Report
• Dashboard
• Email
• Metadata
• Mandatory User Filters
Automation SDK
Customer Applications
Cloud Connect
C3
Proxy
Firewall
Load Balancers
HTTP Accelerator
Web App Servers
6. Operating System Infrastructure
Hardware Infrastructure
Automation SDK
Cloud Connect
C3: User Verified via API
Shared Storage: Raw Data Input and Staging Area from External Sources
Cloud Computing Fabric
APIs
ADS
Data Sources
Cloud Sources
WebDav
Data Integration Services (DIS)
My data lives everywhere – bring it together
Security: all custom code can only access external APIs
User’s Script
Secure Container
Extract
Load
Data Normalization
Transform
DIFFERENTIATION
Drag and Drop GUI
Can embed custom code in the cloud
Can source data locally or from the cloud
Data Normalization
Garbage Collection
Data Marts: Columnar Database, LDM, MICCache
7.
Data Mart & Analytics Engine (XAE, MAQL)
Give me data access (queries) better than I could write myself
TECHNICAL DIFFERENTIATION
Benefits of OLAP Cube but w/ High Performance
MAQL is robust to PDM changes
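The idea that a logical query can survive physical model changes can be sketched as follows. This is plain Python, not MAQL syntax, and every name in it (`PhysicalMapping`, `generate_sql`, the table and column names) is hypothetical: the query is written against logical names, and a separate mapping resolves them to physical tables and columns when SQL is generated.

```python
from dataclasses import dataclass


@dataclass
class PhysicalMapping:
    """Hypothetical mapping from logical names to a physical table and its columns."""
    fact_table: str
    columns: dict  # logical name -> physical column name


def generate_sql(metric, aggregation, group_by, mapping):
    """Build SQL for `aggregation(metric)` grouped by one logical attribute.

    Names come from a trusted mapping here; a real generator would also
    have to quote identifiers and handle joins, which this sketch omits.
    """
    fact_col = mapping.columns[metric]
    attr_col = mapping.columns[group_by]
    return (
        f"SELECT {attr_col}, {aggregation}({fact_col}) "
        f"FROM {mapping.fact_table} GROUP BY {attr_col}"
    )


# The same logical request against two versions of the physical model.
pdm_v1 = PhysicalMapping("f_sales", {"Amount": "amt", "Region": "region_id"})
pdm_v2 = PhysicalMapping("fact_sales_2014", {"Amount": "amount_usd", "Region": "region_code"})
sql_v1 = generate_sql("Amount", "SUM", "Region", pdm_v1)
sql_v2 = generate_sql("Amount", "SUM", "Region", pdm_v2)
```

The logical request ("sum of Amount by Region") is identical in both calls; only the mapping changes, which is the property the slide attributes to MAQL.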
Database: Vertica or Postgres
Physical Data Model (PDM) + Stored Data
Auto-generate SQL
SELECT * FROM WHERE
Optimize Query
w + h = a
Algebraically Prove Query
2w + 2h = 2a
Simplified Query Language
No Knowledge of LDM needed - MAQL
What & How?
Metadata
Report Definitions, Dashboards, etc.
Cloud Connect
Analytic Designer
No Runaway Queries
Simplified Analyst Data Access
Logical Data Model (LDM)
Multi-level Intelligent Computing Cache
• Fast access to both full & partial previously used reports, metrics, sub-metrics, cross tabs, etc.
• No need to re-query the slow DB to recalculate
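The reuse of full and partial prior results can be illustrated with a small two-level cache. This is a sketch under assumptions, not the MICCache design: the class `MetricCache`, its two levels, and the `query_db` callable are invented for illustration.

```python
class MetricCache:
    """Hypothetical two-level cache: whole reports, and the metrics inside them."""

    def __init__(self, query_db):
        self._query_db = query_db  # slow function: metric name -> value
        self._reports = {}         # level 1: full report results
        self._metrics = {}         # level 2: partial results (single metrics)
        self.db_calls = 0          # counts how often the slow database is hit

    def metric(self, name):
        """Return one metric, querying the database only on a cache miss."""
        if name not in self._metrics:
            self.db_calls += 1
            self._metrics[name] = self._query_db(name)
        return self._metrics[name]

    def report(self, metric_names):
        """Return a report, reusing the full result or any cached metrics."""
        key = tuple(metric_names)
        if key not in self._reports:
            self._reports[key] = {m: self.metric(m) for m in metric_names}
        return self._reports[key]
```

A second report that shares a metric with an earlier one only queries the database for the metrics it has not seen, and a repeated report is served without any database call.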
8. Operating System Infrastructure
Hardware Infrastructure
Agile Data Warehouse Service (ADS)
Store Everything. Create a Data Transformation Architecture
Automation SDK
Cloud Connect
SQL Client
C3: User Verified for both JDBC & API
Shared Storage: Raw storage, import, & Vertica backup
Cloud Computing Fabric
BI Automation SDK
Clustered Columnar DB #1
Instance A
Instance B
Instance C
Clustered Columnar DB #2
Instance D
Customer X
Customer Y
Customer Z
TECHNICAL DIFFERENTIATION
Only Multi-Tenant Vertica in market
Provisioning via REST API
SQL & JDBC w/ SSL cert. auth
Isolation/security via SQL Validator
Can Scale to Petabytes
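Tenant isolation through a SQL validator can be sketched as a check that runs before a statement reaches the shared database. The rules below are assumptions chosen for illustration (an allowlist of statement types, one statement per request, and schema-qualified names restricted to the tenant's own schema); the slides do not describe how GoodData's validator actually works.

```python
import re

# Statement types a tenant may run in this sketch (an assumption).
ALLOWED_STATEMENTS = {"SELECT", "INSERT", "UPDATE", "DELETE", "CREATE", "DROP"}

# Matches schema.table style references such as customer_x.orders.
QUALIFIED_NAME = re.compile(r"\b([A-Za-z_][A-Za-z0-9_]*)\.([A-Za-z_][A-Za-z0-9_]*)\b")


def validate_sql(statement, tenant_schema):
    """Return (ok, reason) for one statement.

    Deliberately naive: it ignores quoting, comments, and table aliases,
    all of which a production validator would handle with a real SQL parser.
    """
    words = statement.strip().split(None, 1)
    if not words or words[0].upper() not in ALLOWED_STATEMENTS:
        return False, "statement type not allowed"
    # A semicolon anywhere except the very end means more than one statement.
    if ";" in statement.strip().rstrip(";"):
        return False, "multiple statements not allowed"
    for schema, _table in QUALIFIED_NAME.findall(statement):
        if schema.lower() != tenant_schema.lower():
            return False, f"schema '{schema}' is outside the tenant"
    return True, "ok"
```

For example, `validate_sql("SELECT * FROM customer_y.orders", "customer_x")` is rejected because the statement reaches into another tenant's schema.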
APIs
SQL
JDBC
Output Data
Direct SQL Commands & Queries for Data Transformation
Data Marts
9.
Shared Storage
NetApp/GlusterFS
Rackspace: Elk Grove, IL, USA
ISO 27001, SOC2
Rackspace: Europe (Roadmap)
Data Location & Redundancy
Where does my Data go and… Don’t Lose it!
TECHNICAL DIFFERENTIATION
Fully Automated & Monitored
Computing Fabric is HOT-HOT-HOT
AWS Simple Storage Service (S3)
ISO 27001, SOC2
S3 US East Zone
Multiple Locations
Encrypted
Data Sources
Cloud Sources
WebDav
Browser
ADS DBs
SQL Client
Data Marts
10.
Security
Be more secure than I ever could be
TECHNICAL DIFFERENTIATION
Single Entry/Exit point for all Customers
Easier to Secure/Control vs. a whole Enterprise
All data in/out through encrypted channels
SSO for Platform, Customer’s Info
Private Cloud
Stateless Workers = no exe code
Shared Storage
Self Encrypting Disks
Rackspace: ISO 27001, SOC2
Rackspace: Europe (Roadmap)
Data Marts
Data Sources
Cloud Sources
WebDav
Browser
SQL Client
ADS DBs
JDBC, SSL, HTTPS
Encrypted File Systems
Cloud & Physical Security
• 24/7/365 security, physical access control
• Proactively monitors all components
• Alerting chain all the way to the CEO
Data Security
• Security zones, protocol enforcement, intrusion detection, penetration testing
• Data isolated and encrypted in transit and at rest
Operational Security
• Access is under strict control
• Admins are allowed to invoke platform-specific functions but not permitted to interact with the platform components
• Admin employee background checks similar to US Top Secret
11.
Infrastructure Health & Performance Monitoring
GoodData Architecture – Scalable Multi-Tenancy
GoodData HTML5 Client
Cloud Connect Modeler
Resource Manager
HTTPS, WebDav
Data Connector Framework
ETL Server Processes
Hadoop Data Expansion
Clustered Columnar DB
Data Integration Services
Columnar DB
JDBC to ADS
BI Automation SDK
REST APIs
Common Metadata: Sources, ADS, LDM, Marts
Data Warehouse Management
Data Integration Service Console
Extensible Analytics Engine
Visualization Export Services: Chart & PDF
Data Mart: Metadata, LDM & cache
Data Orchestration Services
Cloud Control Center: Real-time, multi-tenant resource manager including Tenants, Projects and Users across a distributed environment
Shared Storage: Redundant staging & archiving of raw data & files
Operating System Infrastructure: Virtualized OpenStack environment
Hardware Infrastructure: Custom-tailored high performance, redundant environment and networking
Sources
Significant GoodData IP; Integration Framework; Open Source & Commercial Software
GoodData Computing Fabric
Business User Experience
Analyst Discovery Experience
IT Control Experience
Cloud Sources