2. Sqoop : RDB 與 Hadoop 的橋樑
• Apache Sqoop is a “tool” designed to transfer
data between hadoop and structured datastores.
• 從..拿資料
– RDBMS
– Data warehources
– NoSQL
• 寫資料到..
– Hive
– Hbase
• 使用 mapreduce framework to transfer data in
parallel
2
figure Source : http://bigdataanalyticsnews.com/data-transfer-mysql-cassandra-using-sqoop/