More Related Content
Similar to Apache Mahout Algorithms (20)
Apache Mahout Algorithms
- 4. What Apache Mahout is
- Java, Hadoop
- Collaborative Filtering
- Mahout In Action
- user@mahout.apache.org
- 0.9 (1-Feb-2014)
- 10. Need to know ML?
hadoop.jar mahout-core-0.8-job.jar
org.apache.mahout.cf.taste.hadoop.item.
RecommenderJob
-Dmapred.input.dir=input/input.txt
-Dmapred.output.dir=output
--usersFile input/users.txt --booleanData
- 17. Item Based - Predict
- Weighted Sum:
r^(3,1) = 2 * 0.91 + ...
- 19. Item Based.. Why in Mahout
- Generic recommender like User Based
- User Based similarity matrix is heavier
- 25. m * n → m * k + n * k
10M → 100K + 10K
Lets say; m=10K
n = 1K
k=10
Singular Value Decomposition (SVD)
- 29. SVD.. Why in Mahout
- Won Netflix Prize
- Parallelizable by row, column
- 36. Map / Reduce ItemBased
hadoop.jar mahout-core-0.8-job.jar
org.apache.mahout.cf.taste.hadoop.item.
RecommenderJob
-Dmapred.input.dir=input/input.txt
-Dmapred.output.dir=output
--usersFile input/users.txt --booleanData