Personal Information
Organization/Workplace
London, United Kingdom
Occupation
Data Science and Big Data
Industry
Technology / Software / Internet
About
Problem solver. Python/Hadoop coder. I have done end-to-end work in Big Data spanning development, administration, and data science.
I have set up Hadoop clusters, built ETL pipelines in MapReduce and Spark, and worked on data science problems. I have used a variety of technologies, including Spark, Hive, Pig, HBase, and R.
I work with Big Data every day, using Hadoop MapReduce to solve large-scale data problems and extract useful information. I have worked extensively on search quality, analyzing the millions of queries users submit every day.
Here are some of the data science problems I have worked on so far:
1) Understand the relationships between users wh...
Tags
newbie
pycon
python
programming
pycon2010
Presentations (2)
Documents (1)
Recommendations (24)
Netezza Architecture and Administration
Braja Krishna Das • 7 years ago
Netezza Deep Dives
Rush Shah • 7 years ago
Notes from Coursera Deep Learning courses by Andrew Ng
Tess Ferrandez • 6 years ago
Strata NYC 2015: Sketching Big Data with Spark: randomized algorithms for large-scale data analytics
Databricks • 8 years ago
Developing Real-Time Data Pipelines with Apache Kafka
Joe Stein • 8 years ago
Scala - The Simple Parts, SFScala presentation
Martin Odersky • 9 years ago
Pragmatic Real-World Scala (short version)
Jonas Bonér • 15 years ago
Scala Data Pipelines @ Spotify
Neville Li • 8 years ago
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Xavier Amatriain • 9 years ago
Hive tuning
Michael Zhang • 10 years ago
Spark SQL Deep Dive @ Melbourne Spark Meetup
Databricks • 8 years ago
Spark Summit East 2015 Advanced Devops Student Slides
Databricks • 9 years ago
DTCC '14 Spark Runtime Internals
Cheng Lian • 10 years ago
Tuning and Debugging in Apache Spark
Databricks • 9 years ago
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San Jose 2015
Databricks • 9 years ago
Why Scala Is Taking Over the Big Data World
Dean Wampler • 9 years ago
storm at twitter
Krishna Gade • 10 years ago
Collaborative Filtering with Spark
Chris Johnson • 9 years ago
DataFu @ ApacheCon 2014
William Vaughan • 10 years ago
Tiny Batches, in the wine: Shiny New Bits in Spark Streaming
Paco Nathan • 9 years ago
Hadoop World 2011: Advanced HBase Schema Design - Lars George, Cloudera
Cloudera, Inc. • 12 years ago
HBase schema design Big Data TechCon Boston
amansk • 11 years ago
HBaseCon 2012 | HBase Schema Design - Ian Varley, Salesforce
Cloudera, Inc. • 11 years ago
The 21 Coolest Internet Of Things Gadgets
Bernard Marr • 9 years ago