Personal Information
Organización/Lugar de trabajo
San Francisco Bay Area, CA United States
Ocupación
Data Expert with System Architecture Insight
Sector
Technology / Software / Internet
Sitio web
goldenorbit.wordpress.com
Acerca de
With the thorough understandings of data, application & network architecture, Eric has developed & proven a set of approaches to improve the performance & ROI by 50%~200% based on the company's existing DW/BI infrastructure.
His 1st philosophy is to make the best use of the tools and to create better tools, as he has witnessed many poor project results simply because everyone expects the out-of-box features to satisfy all the requirements, yet few are willing to to deep dive into the tool and explore its full potential.
We often debates about which tool is the best, yet Eric believes that it is crucial to provide the valuable consulting and eduction to enable more team members and clien...
Etiquetas
hadoop
incremental
upsert
time travel
data warehouse
hive
hudi
delta
iceberg
data lake
big data
json
etl
nosql
sql
elt
jdbc
fastload
mapreduce
tdch
teradata
Ver más
Presentaciones
(4)Recomendaciones
(67)Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Tristan Baker
•
Hace 2 años
Spark SQL Bucketing at Facebook
Databricks
•
Hace 4 años
Modernizing Big Data Workload Using Amazon EMR & AWS Glue
Noritaka Sekiyama
•
Hace 4 años
How to test infrastructure code: automated testing for Terraform, Kubernetes, Docker, Packer and more
Yevgeniy Brikman
•
Hace 4 años
Presto Strata London 2019: Cost-Based Optimizer for interactive SQL on anything
Piotr Findeisen
•
Hace 5 años
Trillion Dollar Coach Book (Bill Campbell)
Eric Schmidt
•
Hace 5 años
"Smooth Operator" [Bay Area NewSQL meetup]
Kevin Xu
•
Hace 5 años
Dynamic pricing of Lyft rides using streaming
Amar Pai
•
Hace 5 años
YugaByte DB Internals - Storage Engine and Transactions
Yugabyte
•
Hace 5 años
What’s new in Apache Spark 2.3
DataWorks Summit
•
Hace 5 años
ORC improvement in Apache Spark 2.3
DataWorks Summit
•
Hace 6 años
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Dremio Corporation
•
Hace 6 años
Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache
Dremio Corporation
•
Hace 6 años
Apache Arrow: In Theory, In Practice
Dremio Corporation
•
Hace 6 años
What is Artificial Intelligence | Artificial Intelligence Tutorial For Beginners | Edureka
Edureka!
•
Hace 6 años
Top 5 Deep Learning and AI Stories - October 6, 2017
NVIDIA
•
Hace 6 años
Deep Dive into Project Tungsten: Bringing Spark Closer to Bare Metal-(Josh Rosen, Databricks)
Spark Summit
•
Hace 8 años
Handling Data Skew Adaptively In Spark Using Dynamic Repartitioning
Spark Summit
•
Hace 7 años
Scala Reflection & Runtime MetaProgramming
Meir Maor
•
Hace 7 años
What to Expect for Big Data and Apache Spark in 2017
Databricks
•
Hace 7 años
Hive: Loading Data
Benjamin Leonhardi
•
Hace 8 años
Tuning Java for Big Data
Scott Seighman
•
Hace 9 años
Deep Dive Into Catalyst: Apache Spark 2.0'S Optimizer
Spark Summit
•
Hace 7 años
Introducing Neo4j 3.0
Neo4j
•
Hace 8 años
File Format Benchmark - Avro, JSON, ORC & Parquet
DataWorks Summit/Hadoop Summit
•
Hace 7 años
Dongwon Kim – A Comparative Performance Evaluation of Flink
Flink Forward
•
Hace 8 años
Why apache Flink is the 4G of Big Data Analytics Frameworks
Slim Baltagi
•
Hace 8 años
Apache Hive Hook
Minwoo Kim
•
Hace 10 años
Spark etl
Imran Rashid
•
Hace 8 años
Hive tuning
Michael Zhang
•
Hace 10 años
Personal Information
Organización/Lugar de trabajo
San Francisco Bay Area, CA United States
Ocupación
Data Expert with System Architecture Insight
Sector
Technology / Software / Internet
Sitio web
goldenorbit.wordpress.com
Acerca de
With the thorough understandings of data, application & network architecture, Eric has developed & proven a set of approaches to improve the performance & ROI by 50%~200% based on the company's existing DW/BI infrastructure.
His 1st philosophy is to make the best use of the tools and to create better tools, as he has witnessed many poor project results simply because everyone expects the out-of-box features to satisfy all the requirements, yet few are willing to to deep dive into the tool and explore its full potential.
We often debates about which tool is the best, yet Eric believes that it is crucial to provide the valuable consulting and eduction to enable more team members and clien...
Etiquetas
hadoop
incremental
upsert
time travel
data warehouse
hive
hudi
delta
iceberg
data lake
big data
json
etl
nosql
sql
elt
jdbc
fastload
mapreduce
tdch
teradata
Ver más