SlideShare una empresa de Scribd logo
1 de 27
1 © Hortonworks Inc. 2011–2018. All rights reserved
Fortifying Multi-Cluster Hybrid Cloud Data
Lakes using Apache Knox
Sandeep Moré – Sr. Software Engineer
Kiran Matty – Sr. Product Manager
06/19/18
2 © Hortonworks Inc. 2011–2018. All rights reserved
Agenda
• Multi-Cluster Hybrid Cloud Data lakes
• Apache Knox
• Demo
• Q&A
3 © Hortonworks Inc. 2011–2018. All rights reserved
Who are We?
• Apache Knox PMC member
• Sr. Software Engineer@Hortonworks
• Software Engineer / Security Gateway –
Intel
3
• PM@Hortonworks – Apache
Knox, HDP Search/Solr, and
Platform Security
• Big Data Analytics and Security
@ startup, HPE, and Cisco
4 © Hortonworks Inc. 2011–2018. All rights reserved
Multi-Cluster Hybrid
Cloud Data Lakes
5 © Hortonworks Inc. 2011–2018. All rights reserved
Why Hybrid Cloud?
Unified Security &
Governance
Model
Cluster 2
(Unstructured)
Cluster 1
(Structured)
Cluster 3
(Structured)
Cluster 4
(Unstructured)
Data Lake 1, San Jose
Cluster 1
(Unstructured)
Cluster 2
(Structured)
Workloads (typical)
On-prem Cloud
Compliance Sensitive Non-sensitive
Flexibility Production Test/Demo
Cost
Optimization
Fixed Variable
Data Lake 2, UK
Best Practice: Run your analytics workloads where data
is stored
6 © Hortonworks Inc. 2011–2018. All rights reserved
Need to augment existing security controls offered by Cloud
Providers for Hadoop Workloads
Security Control AWS Azure GCP
Network Isolation Virtual Private
Cloud (VPC)
Microsoft Azure Virtual
Network (VNet)
Virtual Private Cloud
(VPC) network
Network security Security Groups Network Access Control
List (NACL) and Network
Security Groups (NSGs)
Firewall rules
Identity
Management
Identity and Access
management (IAM)
Azure Active Directory
(AAD)
Google Cloud Identity
and Access
Management (Cloud
IAM)
7 © Hortonworks Inc. 2011–2018. All rights reserved
A Few Issues across the Hybrid Cloud Data Lakes
How To:
 authenticate cloud users without moving your on-prem LDAP to the cloud?
 keep unauthorized users from accessing your customer data i.e. Insider attack?
 protect your clusters from stolen credentials i.e. Account Highjacking?
8 © Hortonworks Inc. 2011–2018. All rights reserved
AuthN Challenges: Connecting to on-prem Active Directory Options
Replication
Corporate DC Cloud
AD ADVPN
App AppDomain join to on-prem AD over
VPN
1
2
3
9 © Hortonworks Inc. 2011–2018. All rights reserved
Apache Knox
10 © Hortonworks Inc. 2011–2018. All rights reserved
• an extensible reverse proxy framework
• that can be deployed in the cloud or on-prem
• for securely exposing REST APIs, HTTP, and WebSockets based services
• and out of the box it provides:
• Proxying of HTTP services - REST, UIs, Websockets
• Authentication services - pluggable authentication and federation providers and token,
SSO services
• Client services - KnoxShell for consuming cluster services through Knox
• And many other features…
Apache Knox Gateway is…
11 © Hortonworks Inc. 2011–2018. All rights reserved
• a Firewall
• a Load balancer
• a Kerberos replacement
Apache Knox Gateway is NOT…
12 © Hortonworks Inc. 2011–2018. All rights reserved
Why Knox?
Simplified Access
• Kerberos encapsulation
• Extends API reach
• Single access point
• Multi-cluster support
• Single SSL certificate
Centralized Control
• Auditing
• Service-level authorization
• Knox Admin UI
• Service Discovery and Topology Generation
Framework
Enterprise Integration
• LDAP/AD integration
• Support for SAMLv2
• SSO integration
Enhanced Security
• Proxy to abstract network details
• TLS Termination for non-SSL services
13 © Hortonworks Inc. 2011–2018. All rights reserved
Apache Knox Community Snapshot
Mar 2013
Entered
Incubator
Oct 2013
0.1.0 - 0.3.0
Incubator
Releases
Feb 2014
Graduates
to
Apache TLP
Apr 2014
0.4.0
TLP
Release
Nov 2014
0.5.0
May 2015
0.6.0
Apr/Aug 2016
0.9.0/0.9.1
Feb 2016
0.8.0
Dec 2015
0.7.0
Nov 2016
0.10.0
Dec 2016
0.11.0
Mar 2017
0.12.0
Feb 2018
1.0
• Committers: 20
• Contributors from:
• Hortonworks, IBM, CGI,
Uber, Oracle, Blue Talon,
Microsoft, Talend
Apache Knox 0.14.0
@apache_knox
Aug 2017
0.13.0
Apache Knox 1.0.0
• Ambari Service Discovery Support
for HA-Enabled Services
• Update hadoop dependencies to
Hadoop 3
Dec 2017
0.14.0
• Service Discovery and Topology
Generation Framework
• Add support for proxying NiFi and
Livy (Spark Rest Service)
• High Availability Support For
Apache SOLR, HBase & Kafka
14 © Hortonworks Inc. 2011–2018. All rights reserved
Demo
15 © Hortonworks Inc. 2011–2018. All rights reserved
Demo Coverage
How To:
 authenticate users without moving your on-prem LDAP to the cloud?
• Knox Federation
 keep unauthorized users from accessing your customer data i.e. Insider attack?
• Knox AuthZ
 protect your clusters from stolen credentials i.e. Account Highjacking?
• MFA* on Knox
*no out of box support
16 © Hortonworks Inc. 2011–2018. All rights reserved
Knox Providers - Primer
• Providers add new features to the gateway
• These features can be used by all services
• Example providers used for federation:
• Auth Provider - Knox Federation
Header Based Pre Auth
<provider>
<role>federation</role>
<name>HeaderPreAuth</name>
<enabled>true</enabled>
<param>
<name>preauth.custom.header</name>
<value>aws_user</value>
</param>
</provider>
• Authorization Provider - Knox AuthZ
AclsAuthz
<provider>
<role>authorization</role>
<name>AclsAuthz</name>
<enabled>true</enabled>
<param>
<name>hive.acl</name>
<value>*;sales;*</value>
</param>
</provider>
17 © Hortonworks Inc. 2011–2018. All rights reserved
Knox Cloud Federation
• Part of KIP – 11 : Cloud use cases
• KNOX-1339 – Support for cloud federation
• Leverages Knox Header Based Pre Auth provider
• JDBC / Beeline / REST
• JDBC + Knoxshell for demo
• Federation Dispatch –
<dispatch classname="org.apache.knox.gateway.dispatch.HeaderPreAuthFederationDispatch" use-two-way-ssl="true" />
18 © Hortonworks Inc. 2011–2018. All rights reserved
Demo Personas
Kate
LDAP Group: DevOps
Cluster Access: Prod and Demo
AWS IAM user
Michelle
LDAP Group: Sales
Cluster Access: Demo
Not AWS IAM user
Malicious Insider
Maximus
Hacker
19 © Hortonworks Inc. 2011–2018. All rights reserved
Demo Architecture
Ambari
HDFS
Hive
Knox
LDAP
Ambari
HDFS
Hive
Knox
2-way
Inbound:8443
JDBC Client
Knoxline
Inbound: 8443
Prod (on-prem) Demo (cloud)
20 © Hortonworks Inc. 2011–2018. All rights reserved
Scenario 1: Access on-prem cluster
Ambari
HDFS
Hive
Knox
LDAP
Ambari
HDFS
Hive
Knox
1. Hive(JDBC)
2. Authenticate Kate
3. Access HDFS
4. HDFS
Response
5. Response
21 © Hortonworks Inc. 2011–2018. All rights reserved
Scenario 2: Access cloud cluster by AuthN w/ on-prem LDAP
Ambari
HDFS
Hive
Knox
LDAP
Ambari
HDFS
Hive
Knox
1. GET Webhdfs
2. Authenticate
Michelle
8. Response
3. Dispatch request to
Cloud Knox
4. Header based pre auth
5. Access HDFS
6. HDFS Response
7. Response
Knox Federation
22 © Hortonworks Inc. 2011–2018. All rights reserved
Scenario 3: Blocking Michelle’s Unauthorized access
AuthN
Run a Hive
query
against the
customer
DB to get
names and
phone
numbers
Load into
CSV file
Exfilterate
via USB
drive
23 © Hortonworks Inc. 2011–2018. All rights reserved
Scenario 3: Blocking Michelle’s Unauthorized access
Ambari
HDFS
Hive
Knox
LDAP
Ambari
HDFS
Hive
Knox
1. Hive (JDBC)
2. Authorization failure
3. 403 Forbidden
Knox AuthZ
24 © Hortonworks Inc. 2011–2018. All rights reserved
Scenario 4: Thwarting Maximus’s Kill Chain
Harvest
Kate’s
credentials
from GitHub
via social
engineering
Create an
exploit to
scan and
identify
sensitive
tables, and
exfilterate
to EC2
server
AuthN
using Kate’s
stolen
credentials
Install the
exploit to
scan
sensitive
tables
Chunk data
and send
to C2
server
Request
for
Ransom
25 © Hortonworks Inc. 2011–2018. All rights reserved
Scenario 4: Thwarting Maximus’s Kill Chain
* No out of the box support for MFA
katec@newcor.com
MFA* on Knox
26 © Hortonworks Inc. 2011–2018. All rights reserved
Questions?
27 © Hortonworks Inc. 2011–2018. All rights reserved
Thank you

Más contenido relacionado

La actualidad más candente

Managing enterprise users in Hadoop ecosystem
Managing enterprise users in Hadoop ecosystemManaging enterprise users in Hadoop ecosystem
Managing enterprise users in Hadoop ecosystemDataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments Using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments Using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments Using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments Using Apache RangerDataWorks Summit
 
Running Enterprise Workloads in the Cloud
Running Enterprise Workloads in the CloudRunning Enterprise Workloads in the Cloud
Running Enterprise Workloads in the CloudDataWorks Summit
 
Breathing New Life into Apache Oozie with Apache Ambari Workflow Manager
Breathing New Life into Apache Oozie with Apache Ambari Workflow ManagerBreathing New Life into Apache Oozie with Apache Ambari Workflow Manager
Breathing New Life into Apache Oozie with Apache Ambari Workflow ManagerDataWorks Summit
 
A First-Hand Look at What's New in HDP 2.3
A First-Hand Look at What's New in HDP 2.3 A First-Hand Look at What's New in HDP 2.3
A First-Hand Look at What's New in HDP 2.3 DataWorks Summit
 
Mission to NARs with Apache NiFi
Mission to NARs with Apache NiFiMission to NARs with Apache NiFi
Mission to NARs with Apache NiFiHortonworks
 
What is new in Apache Hive 3.0?
What is new in Apache Hive 3.0?What is new in Apache Hive 3.0?
What is new in Apache Hive 3.0?DataWorks Summit
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash CourseDataWorks Summit
 
TriHUG October: Apache Ranger
TriHUG October: Apache RangerTriHUG October: Apache Ranger
TriHUG October: Apache Rangertrihug
 
Running Enterprise Workloads in the Cloud
Running Enterprise Workloads in the CloudRunning Enterprise Workloads in the Cloud
Running Enterprise Workloads in the CloudDataWorks Summit
 
Successes, Challenges, and Pitfalls Migrating a SAAS business to Hadoop
Successes, Challenges, and Pitfalls Migrating a SAAS business to HadoopSuccesses, Challenges, and Pitfalls Migrating a SAAS business to Hadoop
Successes, Challenges, and Pitfalls Migrating a SAAS business to HadoopDataWorks Summit/Hadoop Summit
 
Curb your insecurity with HDP - Tips for a Secure Cluster
Curb your insecurity with HDP - Tips for a Secure ClusterCurb your insecurity with HDP - Tips for a Secure Cluster
Curb your insecurity with HDP - Tips for a Secure Clusterahortonworks
 
Manage Add-On Services with Apache Ambari
Manage Add-On Services with Apache AmbariManage Add-On Services with Apache Ambari
Manage Add-On Services with Apache AmbariDataWorks Summit
 
Apache Hadoop Security - Ranger
Apache Hadoop Security - RangerApache Hadoop Security - Ranger
Apache Hadoop Security - RangerIsheeta Sanghi
 
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...DataWorks Summit
 
The Future of Apache Ambari
The Future of Apache AmbariThe Future of Apache Ambari
The Future of Apache AmbariDataWorks Summit
 
HDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical WorkshopHDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical WorkshopHortonworks
 
High throughput data replication over RAFT
High throughput data replication over RAFTHigh throughput data replication over RAFT
High throughput data replication over RAFTDataWorks Summit
 

La actualidad más candente (20)

Managing enterprise users in Hadoop ecosystem
Managing enterprise users in Hadoop ecosystemManaging enterprise users in Hadoop ecosystem
Managing enterprise users in Hadoop ecosystem
 
Securing Data in Hybrid on-premise and Cloud Environments Using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments Using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments Using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments Using Apache Ranger
 
Running Enterprise Workloads in the Cloud
Running Enterprise Workloads in the CloudRunning Enterprise Workloads in the Cloud
Running Enterprise Workloads in the Cloud
 
Breathing New Life into Apache Oozie with Apache Ambari Workflow Manager
Breathing New Life into Apache Oozie with Apache Ambari Workflow ManagerBreathing New Life into Apache Oozie with Apache Ambari Workflow Manager
Breathing New Life into Apache Oozie with Apache Ambari Workflow Manager
 
A First-Hand Look at What's New in HDP 2.3
A First-Hand Look at What's New in HDP 2.3 A First-Hand Look at What's New in HDP 2.3
A First-Hand Look at What's New in HDP 2.3
 
Mission to NARs with Apache NiFi
Mission to NARs with Apache NiFiMission to NARs with Apache NiFi
Mission to NARs with Apache NiFi
 
What is new in Apache Hive 3.0?
What is new in Apache Hive 3.0?What is new in Apache Hive 3.0?
What is new in Apache Hive 3.0?
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash Course
 
TriHUG October: Apache Ranger
TriHUG October: Apache RangerTriHUG October: Apache Ranger
TriHUG October: Apache Ranger
 
Effective Spark on Multi-Tenant Clusters
Effective Spark on Multi-Tenant ClustersEffective Spark on Multi-Tenant Clusters
Effective Spark on Multi-Tenant Clusters
 
Running Enterprise Workloads in the Cloud
Running Enterprise Workloads in the CloudRunning Enterprise Workloads in the Cloud
Running Enterprise Workloads in the Cloud
 
Successes, Challenges, and Pitfalls Migrating a SAAS business to Hadoop
Successes, Challenges, and Pitfalls Migrating a SAAS business to HadoopSuccesses, Challenges, and Pitfalls Migrating a SAAS business to Hadoop
Successes, Challenges, and Pitfalls Migrating a SAAS business to Hadoop
 
Curb your insecurity with HDP - Tips for a Secure Cluster
Curb your insecurity with HDP - Tips for a Secure ClusterCurb your insecurity with HDP - Tips for a Secure Cluster
Curb your insecurity with HDP - Tips for a Secure Cluster
 
Manage Add-On Services with Apache Ambari
Manage Add-On Services with Apache AmbariManage Add-On Services with Apache Ambari
Manage Add-On Services with Apache Ambari
 
The Avant-garde of Apache NiFi
The Avant-garde of Apache NiFiThe Avant-garde of Apache NiFi
The Avant-garde of Apache NiFi
 
Apache Hadoop Security - Ranger
Apache Hadoop Security - RangerApache Hadoop Security - Ranger
Apache Hadoop Security - Ranger
 
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
Its Finally Here! Building Complex Streaming Analytics Apps in under 10 min w...
 
The Future of Apache Ambari
The Future of Apache AmbariThe Future of Apache Ambari
The Future of Apache Ambari
 
HDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical WorkshopHDF: Hortonworks DataFlow: Technical Workshop
HDF: Hortonworks DataFlow: Technical Workshop
 
High throughput data replication over RAFT
High throughput data replication over RAFTHigh throughput data replication over RAFT
High throughput data replication over RAFT
 

Similar a Fortifying Multi-Cluster Hybrid Cloud Data Lakes using Apache Knox

Hadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and FutureHadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and FutureDataWorks Summit
 
Saving the elephant—now, not later
Saving the elephant—now, not laterSaving the elephant—now, not later
Saving the elephant—now, not laterDataWorks Summit
 
Hadoop Operations – Past, Present, and Future
Hadoop Operations – Past, Present, and FutureHadoop Operations – Past, Present, and Future
Hadoop Operations – Past, Present, and FutureDataWorks Summit
 
Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...
Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...
Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...DataWorks Summit
 
One Click Hadoop Clusters - Anywhere (Using Docker)
One Click Hadoop Clusters - Anywhere (Using Docker)One Click Hadoop Clusters - Anywhere (Using Docker)
One Click Hadoop Clusters - Anywhere (Using Docker)DataWorks Summit
 
Built-In Security for the Cloud
Built-In Security for the CloudBuilt-In Security for the Cloud
Built-In Security for the CloudDataWorks Summit
 
Micro services vs hadoop
Micro services vs hadoopMicro services vs hadoop
Micro services vs hadoopGergely Devenyi
 
Lessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARNLessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARNBillie Rinaldi
 
Lessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARNLessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARNDataWorks Summit
 
Hdp security overview
Hdp security overview Hdp security overview
Hdp security overview Hortonworks
 
MiniFi and Apache NiFi : IoT in Berlin Germany 2018
MiniFi and Apache NiFi : IoT in Berlin Germany 2018MiniFi and Apache NiFi : IoT in Berlin Germany 2018
MiniFi and Apache NiFi : IoT in Berlin Germany 2018Timothy Spann
 
IoT with Apache MXNet and Apache NiFi and MiniFi
IoT with Apache MXNet and Apache NiFi and MiniFiIoT with Apache MXNet and Apache NiFi and MiniFi
IoT with Apache MXNet and Apache NiFi and MiniFiDataWorks Summit
 
Improvements in Hadoop Security
Improvements in Hadoop SecurityImprovements in Hadoop Security
Improvements in Hadoop SecurityDataWorks Summit
 
Apache MXNet for IoT with Apache NiFi
Apache MXNet for IoT with Apache NiFiApache MXNet for IoT with Apache NiFi
Apache MXNet for IoT with Apache NiFiTimothy Spann
 
Docker based Hadoop provisioning - anywhere
Docker based Hadoop provisioning - anywhere Docker based Hadoop provisioning - anywhere
Docker based Hadoop provisioning - anywhere Janos Matyas
 
Running Cloudbreak on Kubernetes
Running Cloudbreak on KubernetesRunning Cloudbreak on Kubernetes
Running Cloudbreak on KubernetesKrisztián Horváth
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash CourseDataWorks Summit
 
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...huguk
 

Similar a Fortifying Multi-Cluster Hybrid Cloud Data Lakes using Apache Knox (20)

Hadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and FutureHadoop Operations - Past, Present, and Future
Hadoop Operations - Past, Present, and Future
 
Saving the elephant—now, not later
Saving the elephant—now, not laterSaving the elephant—now, not later
Saving the elephant—now, not later
 
Hadoop Operations – Past, Present, and Future
Hadoop Operations – Past, Present, and FutureHadoop Operations – Past, Present, and Future
Hadoop Operations – Past, Present, and Future
 
Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...
Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...
Bridle your Flying Islands and Castles in the Sky: Built-in Governance and Se...
 
Hadoop security
Hadoop securityHadoop security
Hadoop security
 
One Click Hadoop Clusters - Anywhere (Using Docker)
One Click Hadoop Clusters - Anywhere (Using Docker)One Click Hadoop Clusters - Anywhere (Using Docker)
One Click Hadoop Clusters - Anywhere (Using Docker)
 
Built-In Security for the Cloud
Built-In Security for the CloudBuilt-In Security for the Cloud
Built-In Security for the Cloud
 
Micro services vs hadoop
Micro services vs hadoopMicro services vs hadoop
Micro services vs hadoop
 
Lessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARNLessons Learned Running a Container Cloud on Apache Hadoop YARN
Lessons Learned Running a Container Cloud on Apache Hadoop YARN
 
Lessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARNLessons learned running a container cloud on YARN
Lessons learned running a container cloud on YARN
 
Hdp security overview
Hdp security overview Hdp security overview
Hdp security overview
 
MiniFi and Apache NiFi : IoT in Berlin Germany 2018
MiniFi and Apache NiFi : IoT in Berlin Germany 2018MiniFi and Apache NiFi : IoT in Berlin Germany 2018
MiniFi and Apache NiFi : IoT in Berlin Germany 2018
 
IoT with Apache MXNet and Apache NiFi and MiniFi
IoT with Apache MXNet and Apache NiFi and MiniFiIoT with Apache MXNet and Apache NiFi and MiniFi
IoT with Apache MXNet and Apache NiFi and MiniFi
 
Improvements in Hadoop Security
Improvements in Hadoop SecurityImprovements in Hadoop Security
Improvements in Hadoop Security
 
Apache MXNet for IoT with Apache NiFi
Apache MXNet for IoT with Apache NiFiApache MXNet for IoT with Apache NiFi
Apache MXNet for IoT with Apache NiFi
 
Docker based Hadoop provisioning - anywhere
Docker based Hadoop provisioning - anywhere Docker based Hadoop provisioning - anywhere
Docker based Hadoop provisioning - anywhere
 
Running Cloudbreak on Kubernetes
Running Cloudbreak on KubernetesRunning Cloudbreak on Kubernetes
Running Cloudbreak on Kubernetes
 
Running Cloudbreak on Kubernetes
Running Cloudbreak on KubernetesRunning Cloudbreak on Kubernetes
Running Cloudbreak on Kubernetes
 
Data in the Cloud Crash Course
Data in the Cloud Crash CourseData in the Cloud Crash Course
Data in the Cloud Crash Course
 
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...
 

Más de DataWorks Summit

Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisDataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiDataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal SystemDataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExampleDataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberDataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixDataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiDataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsDataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureDataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EngineDataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiDataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerDataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouDataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkDataWorks Summit
 

Más de DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Último

The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 

Último (20)

The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 

Fortifying Multi-Cluster Hybrid Cloud Data Lakes using Apache Knox

  • 1. 1 © Hortonworks Inc. 2011–2018. All rights reserved Fortifying Multi-Cluster Hybrid Cloud Data Lakes using Apache Knox Sandeep Moré – Sr. Software Engineer Kiran Matty – Sr. Product Manager 06/19/18
  • 2. 2 © Hortonworks Inc. 2011–2018. All rights reserved Agenda • Multi-Cluster Hybrid Cloud Data lakes • Apache Knox • Demo • Q&A
  • 3. 3 © Hortonworks Inc. 2011–2018. All rights reserved Who are We? • Apache Knox PMC member • Sr. Software Engineer@Hortonworks • Software Engineer / Security Gateway – Intel 3 • PM@Hortonworks – Apache Knox, HDP Search/Solr, and Platform Security • Big Data Analytics and Security @ startup, HPE, and Cisco
  • 4. 4 © Hortonworks Inc. 2011–2018. All rights reserved Multi-Cluster Hybrid Cloud Data Lakes
  • 5. 5 © Hortonworks Inc. 2011–2018. All rights reserved Why Hybrid Cloud? Unified Security & Governance Model Cluster 2 (Unstructured) Cluster 1 (Structured) Cluster 3 (Structured) Cluster 4 (Unstructured) Data Lake 1, San Jose Cluster 1 (Unstructured) Cluster 2 (Structured) Workloads (typical) On-prem Cloud Compliance Sensitive Non-sensitive Flexibility Production Test/Demo Cost Optimization Fixed Variable Data Lake 2, UK Best Practice: Run your analytics workloads where data is stored
  • 6. 6 © Hortonworks Inc. 2011–2018. All rights reserved Need to augment existing security controls offered by Cloud Providers for Hadoop Workloads Security Control AWS Azure GCP Network Isolation Virtual Private Cloud (VPC) Microsoft Azure Virtual Network (VNet) Virtual Private Cloud (VPC) network Network security Security Groups Network Access Control List (NACL) and Network Security Groups (NSGs) Firewall rules Identity Management Identity and Access management (IAM) Azure Active Directory (AAD) Google Cloud Identity and Access Management (Cloud IAM)
  • 7. 7 © Hortonworks Inc. 2011–2018. All rights reserved A Few Issues across the Hybrid Cloud Data Lakes How To:  authenticate cloud users without moving your on-prem LDAP to the cloud?  keep unauthorized users from accessing your customer data i.e. Insider attack?  protect your clusters from stolen credentials i.e. Account Highjacking?
  • 8. 8 © Hortonworks Inc. 2011–2018. All rights reserved AuthN Challenges: Connecting to on-prem Active Directory Options Replication Corporate DC Cloud AD ADVPN App AppDomain join to on-prem AD over VPN 1 2 3
  • 9. 9 © Hortonworks Inc. 2011–2018. All rights reserved Apache Knox
  • 10. 10 © Hortonworks Inc. 2011–2018. All rights reserved • an extensible reverse proxy framework • that can be deployed in the cloud or on-prem • for securely exposing REST APIs, HTTP, and WebSockets based services • and out of the box it provides: • Proxying of HTTP services - REST, UIs, Websockets • Authentication services - pluggable authentication and federation providers and token, SSO services • Client services - KnoxShell for consuming cluster services through Knox • And many other features… Apache Knox Gateway is…
  • 11. 11 © Hortonworks Inc. 2011–2018. All rights reserved • a Firewall • a Load balancer • a Kerberos replacement Apache Knox Gateway is NOT…
  • 12. 12 © Hortonworks Inc. 2011–2018. All rights reserved Why Knox? Simplified Access • Kerberos encapsulation • Extends API reach • Single access point • Multi-cluster support • Single SSL certificate Centralized Control • Auditing • Service-level authorization • Knox Admin UI • Service Discovery and Topology Generation Framework Enterprise Integration • LDAP/AD integration • Support for SAMLv2 • SSO integration Enhanced Security • Proxy to abstract network details • TLS Termination for non-SSL services
  • 13. 13 © Hortonworks Inc. 2011–2018. All rights reserved Apache Knox Community Snapshot Mar 2013 Entered Incubator Oct 2013 0.1.0 - 0.3.0 Incubator Releases Feb 2014 Graduates to Apache TLP Apr 2014 0.4.0 TLP Release Nov 2014 0.5.0 May 2015 0.6.0 Apr/Aug 2016 0.9.0/0.9.1 Feb 2016 0.8.0 Dec 2015 0.7.0 Nov 2016 0.10.0 Dec 2016 0.11.0 Mar 2017 0.12.0 Feb 2018 1.0 • Committers: 20 • Contributors from: • Hortonworks, IBM, CGI, Uber, Oracle, Blue Talon, Microsoft, Talend Apache Knox 0.14.0 @apache_knox Aug 2017 0.13.0 Apache Knox 1.0.0 • Ambari Service Discovery Support for HA-Enabled Services • Update hadoop dependencies to Hadoop 3 Dec 2017 0.14.0 • Service Discovery and Topology Generation Framework • Add support for proxying NiFi and Livy (Spark Rest Service) • High Availability Support For Apache SOLR, HBase & Kafka
  • 14. 14 © Hortonworks Inc. 2011–2018. All rights reserved Demo
  • 15. 15 © Hortonworks Inc. 2011–2018. All rights reserved Demo Coverage How To:  authenticate users without moving your on-prem LDAP to the cloud? • Knox Federation  keep unauthorized users from accessing your customer data i.e. Insider attack? • Knox AuthZ  protect your clusters from stolen credentials i.e. Account Highjacking? • MFA* on Knox *no out of box support
  • 16. 16 © Hortonworks Inc. 2011–2018. All rights reserved Knox Providers - Primer • Providers add new features to the gateway • These features can be used by all services • Example providers used for federation: • Auth Provider - Knox Federation Header Based Pre Auth <provider> <role>federation</role> <name>HeaderPreAuth</name> <enabled>true</enabled> <param> <name>preauth.custom.header</name> <value>aws_user</value> </param> </provider> • Authorization Provider - Knox AuthZ AclsAuthz <provider> <role>authorization</role> <name>AclsAuthz</name> <enabled>true</enabled> <param> <name>hive.acl</name> <value>*;sales;*</value> </param> </provider>
  • 17. 17 © Hortonworks Inc. 2011–2018. All rights reserved Knox Cloud Federation • Part of KIP – 11 : Cloud use cases • KNOX-1339 – Support for cloud federation • Leverages Knox Header Based Pre Auth provider • JDBC / Beeline / REST • JDBC + Knoxshell for demo • Federation Dispatch – <dispatch classname="org.apache.knox.gateway.dispatch.HeaderPreAuthFederationDispatch" use-two-way-ssl="true" />
  • 18. 18 © Hortonworks Inc. 2011–2018. All rights reserved Demo Personas Kate LDAP Group: DevOps Cluster Access: Prod and Demo AWS IAM user Michelle LDAP Group: Sales Cluster Access: Demo Not AWS IAM user Malicious Insider Maximus Hacker
  • 19. 19 © Hortonworks Inc. 2011–2018. All rights reserved Demo Architecture Ambari HDFS Hive Knox LDAP Ambari HDFS Hive Knox 2-way Inbound:8443 JDBC Client Knoxline Inbound: 8443 Prod (on-prem) Demo (cloud)
  • 20. 20 © Hortonworks Inc. 2011–2018. All rights reserved Scenario 1: Access on-prem cluster Ambari HDFS Hive Knox LDAP Ambari HDFS Hive Knox 1. Hive(JDBC) 2. Authenticate Kate 3. Access HDFS 4. HDFS Response 5. Response
  • 21. 21 © Hortonworks Inc. 2011–2018. All rights reserved Scenario 2: Access cloud cluster by AuthN w/ on-prem LDAP Ambari HDFS Hive Knox LDAP Ambari HDFS Hive Knox 1. GET Webhdfs 2. Authenticate Michelle 8. Response 3. Dispatch request to Cloud Knox 4. Header based pre auth 5. Access HDFS 6. HDFS Response 7. Response Knox Federation
  • 22. 22 © Hortonworks Inc. 2011–2018. All rights reserved Scenario 3: Blocking Michelle’s Unauthorized access AuthN Run a Hive query against the customer DB to get names and phone numbers Load into CSV file Exfilterate via USB drive
  • 23. 23 © Hortonworks Inc. 2011–2018. All rights reserved Scenario 3: Blocking Michelle’s Unauthorized access Ambari HDFS Hive Knox LDAP Ambari HDFS Hive Knox 1. Hive (JDBC) 2. Authorization failure 3. 403 Forbidden Knox AuthZ
  • 24. 24 © Hortonworks Inc. 2011–2018. All rights reserved Scenario 4: Thwarting Maximus’s Kill Chain Harvest Kate’s credentials from GitHub via social engineering Create an exploit to scan and identify sensitive tables, and exfilterate to EC2 server AuthN using Kate’s stolen credentials Install the exploit to scan sensitive tables Chunk data and send to C2 server Request for Ransom
  • 25. 25 © Hortonworks Inc. 2011–2018. All rights reserved Scenario 4: Thwarting Maximus’s Kill Chain * No out of the box support for MFA katec@newcor.com MFA* on Knox
  • 26. 26 © Hortonworks Inc. 2011–2018. All rights reserved Questions?
  • 27. 27 © Hortonworks Inc. 2011–2018. All rights reserved Thank you