24. HADOOP SUMMIT 2013
Hadoop data load (Camus)
Open sourced:
– https://github.com/linkedin/camus
One job loads all events
~10 minute ETA on average from producer to HDFS
Hive registration done automatically
Schema evolution handled transparently