24. Developers
Summit
Pigで分散を計算
register path/to/udfs.jar
set job.priority very_low;
set job.name 'CalcVariance';
define VAR com.gsd.pig.udf.Variance();
A = load 'mydata' as (data:double);
B = group A all;
C = foreach B generate VAR(A.data);
store C into 'path/to/hdfs/rawdata';
Developers Summit 2013 Action ! 24
Friday, February 15, 13