Lessons learned building a big data analytics engine, from proprietary to open source by Álvaro Santamaria & Joel Brunger
After spending four years building a proprietary all-in-one streaming analytics engine for financial services, it became clear that open-source was starting to pull ahead. Alvaro will talk about the challenges of creating an IT operations solution for financial services; what to build, what not to build, and how to use open source tools to get past the infrastructure and focus on the business problems that matter.
Lessons learned building a big data analytics engine, from proprietary to open source
1. Álvaro Santamaría
Data Scientist – ITRS
@dofideas
Joel Brunger
System Engineer - MapR
@joelbrunger
Lessons Learned with Visualisation
and Machine Learning for Big Data
5. Jay Krepps – 2013
“The Log: What every software engineer should know about real-time data's unifying abstraction”
The uber-system Lego-like, OS-based
13. 3. Information extraction
GROUP BY country,
timestamp window of 10 minutes
SELECT count(),
average(temperature),
median(temperature),
max(temperature),
...
tdigest(temperature)
30. Why did ITRS choose MapR for ‘Gateway Hub’
MapR is the industry’s leading data platform for AI and Analytics.
Ø Simplicity (integrated platform)
Ø Real-time
Ø Processing must be performed in the cluster
Ø Enterprise features
31. MapR enables ITRS ‘Gateway Hub’ to provide the following benefits
MapR is the industry’s leading data platform for AI and Analytics.
Ø Smarter monitoring
Ø Additional features, application and Services
Ø Global Data Fabric
Ø Support ML in real-time
33. Álvaro Santamaría
Data Scientist – ITRS
@dofideas
Joel Brunger
System Engineer - MapR
@joelbrunger
Lessons Learned with Visualisation
and Machine Learning for Big Data