Personal Information
Organización/Lugar de trabajo
San Francisco, CA United States
Ocupación
Senior Data Engineer at Workday
Sector
Technology / Software / Internet
Sitio web
github.com/erenavsarogullari
Acerca de
Eren is highly motivated senior software developer and enthusiast on JVM based technologies.
His areas of interest are Scala, Akka, Apache Spark, Apache Hadoop, Big Data, Distributed & Parallel Computing, High Availability & Scalability.
He hold a B.Sc. degree in Electrical & Electronics Engineering and a M.Sc. degree in Control & Automation Engineering.
Technical Articles : https://dzone.com/users/938353/eren_avsarogullari.html
Github : https://github.com/erenavsarogullari
Etiquetas
apache spark
batch processing
spark on yarn
springone
spring
spring integration
multi tenancy
spark sql metrics
apache spark upgrade
etl
sql on hadoop
distributed computing engine
sql
distributed sql engine
gc policy
storage level
job scheduling
data locality
data skew
serialization
checkpointing
event sourcing
partitioning
persistency
data structures
best practices
apache pulsar
stream processing
streaming
data processing patterns
data pipelines
rdd persistency
catalyst optimizer
tungsten
spark job lifecycle
spark ecosystem
spark internals
dataset
dataframe
rdd
hazelcast
Ver más
Presentaciones
(6)Personal Information
Organización/Lugar de trabajo
San Francisco, CA United States
Ocupación
Senior Data Engineer at Workday
Sector
Technology / Software / Internet
Sitio web
github.com/erenavsarogullari
Acerca de
Eren is highly motivated senior software developer and enthusiast on JVM based technologies.
His areas of interest are Scala, Akka, Apache Spark, Apache Hadoop, Big Data, Distributed & Parallel Computing, High Availability & Scalability.
He hold a B.Sc. degree in Electrical & Electronics Engineering and a M.Sc. degree in Control & Automation Engineering.
Technical Articles : https://dzone.com/users/938353/eren_avsarogullari.html
Github : https://github.com/erenavsarogullari
Etiquetas
apache spark
batch processing
spark on yarn
springone
spring
spring integration
multi tenancy
spark sql metrics
apache spark upgrade
etl
sql on hadoop
distributed computing engine
sql
distributed sql engine
gc policy
storage level
job scheduling
data locality
data skew
serialization
checkpointing
event sourcing
partitioning
persistency
data structures
best practices
apache pulsar
stream processing
streaming
data processing patterns
data pipelines
rdd persistency
catalyst optimizer
tungsten
spark job lifecycle
spark ecosystem
spark internals
dataset
dataframe
rdd
hazelcast
Ver más