9. group
group profile
group group segment segment
profile detail 3500w
profile 12GB
$ ls | grep group
profile group 0.seg 0
profile group 1.seg 0
profile group 2.seg 0
henshao Kingso Profile
10. segment
group segment (1<<20) doc
segment
$ ls | grep seg
profile group 0.seg 0
profile group 0.seg 1
profile group 0.seg 2
henshao Kingso Profile
11. encode
provcity
mlr feature prop vid
group 6GB
$ ls | grep encode
cat id path.encode idx
cat id path.encode cnt
henshao Kingso Profile
22. mapred.map.tasks.speculative.execution=false
data node
tar
index tar
rm
hadoop fs -cat index.tar | tar xf - -C output
tar -c index | hadoop fs -put - index.tar
get/put index profile
detail
detail index profile
job
henshao Kingso Profile