This presentation provides an overview of Cloudera and how a modern platform for Machine Learning and Analytics better enables a data-driven enterprise.
We began with a simple premise courtesy of Simon Sinek and his wildly popular Golden Circle framework.
https://www.ted.com/talks/simon_sinek_how_great_leaders_inspire_action
Great companies start with “why” versus with what. Sounds simple right? But it isn’t. It requires deep introspection and honesty. In our case, we came to the conclusion that what we really believe is that data has the power to make what is impossible today, possible tomorrow. And that isn’t simply a slogan or a tagline. We believe it in our core and we are making this belief more real everyday. In fact, many of you are probably wondering what the graphic is on the right. Well it comes to us courtesy of our friends at NASA, who are using Cloudera technology to help make the first man-mission to Mars a really. They plan to collect, analyze and act upon “petabytes of data on a daily basis” to ensure the safe delivery and return of the brave astronauts taking on this bold mission. So as you can see, we really mean it when we say making what’s impossible today, possible tomorrow.
Ok, now that you understand why we do what we do at Cloudera and what drives us each and every day, you probably want to better understand “How” we do it. Well simply stated, we help people, like all of you in attendance here today” to transform the endless amount of data you are undoubtedly collecting into clear and actionable insights. In other words, we help turn data into action. We are automating the decision making process. Specifically, we do this in 3 ways.
Protect your business. At the most base level, sort of the lowest level of the organizational Maslow’s hierarchy, is protecting your business, your customers and your employees. As we have all seen, this is becoming increasingly difficult as a wide range of new threats arise each and every day…such as the recent WannaCry worm clearly illustrated. At Cloudera, we take the security of your data, all of your data, very seriously and we’ve built specific solutions to ensure that it remains well protected.
Connect products and services. The next thing we empower people to do is to connect all of the data generated by their products and services and transform it into actionable insights. This is critical for an increasingly connected world – an Internet of Things, or IoT, world – where everything and everybody is connected. So whether you are connecting vehicle sensors, kitchen appliances, or health care monitors – Cloudera can help.
Drive customer insights. To continue with the Maslow’s hierarchy metaphor, one of the toughest things for any organization to do is to grow – quickly and predictably – it is the equivalent of self-actualization for an individual. After all, cutting costs is actually pretty easy, perhaps not fun, but easy to execute. Growing a business on the other hand will require that you make full use of your most valuable resource – your data. You must collect and analyze all customer and prospective customer data if you want a leg up on the competition. Cloudera helps hundreds of customers grow their businesses by analyzing, predicting and acting on insights gleaned from their customer data.
OK. Now that you know why we do what we do – imagining new possibilities through the power of data. And, you know how we do it – by helping businesses become more secure, better connected and higher growing. I’m sure you’re interested in what we do. Well, simply stated, we deliver the modern data platform for machine learning and advanced analytics. The core technologies for collecting, storing and analyzing data that were built decades ago, simply won’t deliver the speed, scale, agility and security needed for a world suddenly awash in massive quantities of new, important data. Companies like Google and Yahoo discovered this first, and the race to Big Data was born. Our founders at Cloudera were among the first to see this opportunity. Today, we are proud that what was once a dream is now a reality. Today, Cloudera provides a modern platform that literally runs anywhere – on your premises, in the public cloud, or any combination you can imagine. That type of flexibility enables our customers to run in the most agile and cost-efficient environment they need for their unique business needs. It also provides them with the enterprise-grade capabilities they need to run a secure, high-performance and well governed data infrastructure. Perhaps most importantly, this sound foundation provides the ultimate platform for the advanced machine learning and analytic workloads that are changing the way we work – enabling everything from predictive maintenance to predictive medicine and beyond.
The evolving datascape
Cloudera delivers an integrated suite of capabilities for data management, machine learning and advanced analytics, affording customers an agile, scalable and cost effective solution for transforming their businesses.
Cloudera unites the best of both worlds for massive enterprise scale
Data Science & Data Engineering
Advanced Analytics
Operational Processing
The SCP Support Standard provides clear guidelines that enable organizations to:
Increase customer satisfaction and loyalty by improving operational effectiveness and staff productivity
Implement a continuous improvement program to achieve and maintain world-class levels of performance
Benchmark technical support operations against best in class organizations and best practices to further enhance performance
Leveraging SCP Standards helps to improve the capability and performance of service operations, while letting customers know that the company is committed to excellence and willing to adhere to global standards.
From a Support perspective, you could say we are fighting above our weight class with our Support capability. This is witnessed by other SCP certified companies including EMC, NetApp, McKesson and Juniper Networks - all considered to have mature Support operations that service large enterprise customers. Having said that most customers would be more interested in the maturity and usability of our products and maturing our product quality practices to gauge us as enterprise.
Cloudera Navigator is the only integrated data management and governance platform for Hadoop.
It is a critical part of Cloudera Enterprise and is trusted in production by hundreds of our customers across multiple industries (regulated and not). With over two years of development, Cloudera was the first Hadoop vendor to introduce a data management and governance solution. Cloudera Navigator is a mature tool that going well beyond auditing and metadata collection.
Cloudera Navigator and data governance is a key part of passing compliance audits. Cloudera is the only Hadoop distribution to pass a compliance audit (PCI-DSS with Mastercard) and Navigator plays a huge part in that
Cloudera Navigator also features interoperability with the broad partner ecosystem. It integrates with the leading tools for data lineage, policies, audits, quality, and more so you can manage data both within the Hadoop platform and beyond.
Alex Gutow
Invent or distribute variety of useful and diverse workloads
Create architecture to ingest, store, and share data across parallel workloads
Imbue numerous enterprise qualities into those workloads
Make it work reliably and cost effectively in multi-tenant, multiple environments
Self-service for knowledge workers with varying needs and control access
Optimize performance for customer’s production environment
Open source innovation
Multi-cloud – No vendor lock in. Work in the environment of your choice. Better pricing leverage
Managed TCO – Multiple pricing and deployment options
Integrated – Integrated components with shared metadata, security and operations
Secure - Protect sensitive data from unauthorized access – encryption, key management
Compliance – Full auditing and visibility
Governance – Ensure data veracity
Charles: why can you only do this with all data in one place. What would happen? More cumbersome? Expensive? Not at all possible? What do we unlock?
Terry Kline CIO at Navistar: "We have a number of different applications running after our data every day from truck drivers to dealers to parents to students riding the school busses, and Cloudera SDX is key to making that happen at Navistar. SDX is foundational on how we track and govern our data and protect the data of the owner of the truck."
--
This multi-function approach helps freight businesses prevent vehicle downtime. They do this by ingesting a variety of telematics data in real time from the fleet of trucks, using machine learning to predict the likelihood that a certain part will fail at a given time, and then running analytics to determine the best way to pull the truck off the road and service it in a manner that minimizes downtime.
Third, experience has also shown that a scalable and consistent security and governance model is a prerequisite for businesses to enable a diverse set of data practitioners to interact with a shared set of sensitive or regulated data.
----
Pharmaceutical businesses are working to accelerate drug research programs by providing a self-service analytics experience on a shared pool of data to their entire research team. However, since much of this data is regulated by HIPAA, this more efficient method of drug research would not be possible if the data management team was not able to first ensure that a consistent security and governance model had been applied consistently throughout.
With this in mind, it is clear that the preferred choice for any business should be a platform that provides a reliable implementation of each of these core functions and simultaneously provides a shared data experience to all of the data practitioners operating on that platform. This unified model for enterprise data management is indeed the most cost effective, the fastest to deploy, and the easiest to secure and govern. SDX makes this unified model possible. SDX creates this shared data experience for Cloudera’s customers.