Personal Information
Organización/Lugar de trabajo
San Francisco Bay Area, QC United States
Ocupación
Data scientist at Stitch Fix
Sector
Retail
Acerca de
Data paranoid, failed entrepreneur, ex stock trader, father, Canadian in US, Shanghainese.
Programming since 13 (QBasic in DOS on a 386 PC with a 5' floppy disk). Once studied Physics then went to Canada to learn more on business. Built a company then got hit by financial crisis. Got married and moved to US. Moved to Silicon Valley with wife as she got a job there.
Love freedom and enjoy all the randomness in life.
Highest Kaggle rank: 1076th / 300k https://www.kaggle.com/piggybox
http://stackoverflow.com/users/2102764/piggybox
https://github.com/piggybox
Etiquetas
database
time-series
functional programming
inventory
spark redshift data-engineering spark-summit
spark
redshift
data quality
data cleansing
machine learning
etl
data munging
data wrangling
Ver más
Presentaciones
(4)Recomendaciones
(7)Kubernetes on AWS at Zalando: Failures & Learnings - DevOps NRW
Henning Jacobs
•
Hace 6 años
Top 5 Mistakes to Avoid When Writing Apache Spark Applications
Cloudera, Inc.
•
Hace 8 años
(BDT303) Running Spark and Presto on the Netflix Big Data Platform
Amazon Web Services
•
Hace 8 años
Spark shuffle introduction
colorant
•
Hace 9 años
Streaming SQL
Julian Hyde
•
Hace 8 años
Choosing an HDFS data storage format- Avro vs. Parquet and more - StampedeCon 2015
StampedeCon
•
Hace 8 años
Effective testing for spark programs Strata NY 2015
Holden Karau
•
Hace 8 años
Personal Information
Organización/Lugar de trabajo
San Francisco Bay Area, QC United States
Ocupación
Data scientist at Stitch Fix
Sector
Retail
Acerca de
Data paranoid, failed entrepreneur, ex stock trader, father, Canadian in US, Shanghainese.
Programming since 13 (QBasic in DOS on a 386 PC with a 5' floppy disk). Once studied Physics then went to Canada to learn more on business. Built a company then got hit by financial crisis. Got married and moved to US. Moved to Silicon Valley with wife as she got a job there.
Love freedom and enjoy all the randomness in life.
Highest Kaggle rank: 1076th / 300k https://www.kaggle.com/piggybox
http://stackoverflow.com/users/2102764/piggybox
https://github.com/piggybox
Etiquetas
database
time-series
functional programming
inventory
spark redshift data-engineering spark-summit
spark
redshift
data quality
data cleansing
machine learning
etl
data munging
data wrangling
Ver más