In this talk from the Cloud World Forum Big Data event in June this year, I discuss the benefits of using the AWS Cloud for large scale computation and data processing workloads.
45. Real-time response to content
in semi-structured data streams
Relatively simple computations
on data (aggregates, filters,
sliding window, etc.)
46. Hourly server logs: how your
systems went wrong an hour ago
Weekly / Monthly Bill: What you
spent this past billing cycle
Daily customer report from your
website: tells you what deal or ad
to try next time
Daily fraud reports: tells you if there
was fraud yesterday
Daily business reports: tells me
how customers used AWS services
yesterday
Real-time metrics: what just went
wrong now
Real-time spending alerts/caps:
guaranteeing you can’t overspend
Real-time analysis: what to offer
the current customer now
Real-time detection: blocks
fraudulent use now
Fast ETL into Amazon Redshift:
how are customers using services
now
56. Further References
Atomic Fiction Case-Study Video!
https://www.youtube.com/watch?v=ljHo1_5sWxo!
Slideshare with full details on the Schrodinger Materials Science case-study!
http://www.slideshare.net/insideHPC/cycle-computing-recordbreaking-peta-scale-hpc-run!
Real-time Streaming and Querying with Amazon Kinesis and Amazon EMR Video!
https://www.youtube.com/watch?v=NIa33ZwFa8E!
!
!