SlideShare a Scribd company logo
1 of 227
RAD
How We Replicate Terabytes of Data Around the World Every Day
Jason Koppe
System Administrator
Indeed is the #1
external source of hire
64% of US job searchers search
on indeed each month
Unique Visitors (millions)
Million unique visitors
2009 2011 2012 2013 2014 2015
0
20
40
60
80
100
120
140
160
180
2010
180M
180 million
unique users
80.2M
unique US visitors per month
16M
jobs
50+
countries
28
languages
How We Build Systems
fast simple resilient scalable
fast
Fast
Job Search Browser Rendering
median ~0.5 seconds
Feb 24 Feb 25 Feb 26 Feb 27 Feb 28 Feb 29 Mar 1 Mar 2 Mar 3 Mar 4 Mar 5 Mar 6 Mar 7 Mar 8
0
100
200
300
400
500
600
700
800
milliseconds
simple
2004 launch: a few servers, 1.8m US jobs
2004
Aggregation
MySQL
Job Search
Every job on
the web
relational database,
accessed across the network
NOT fast at full text search
NOT a search engine
2004
Indeed
1999
Lucene
LuceneTM
a high-performance, full featured
text search engine library
LuceneTM
NOT a remote database,
files must be on local disk
MySQL
Database Server Lucene Index Server
Index Builder
/data/jobindex
Index Builder Index Builder Index Builder Index Builder
/data/jobindex /data/jobindex /data/jobindex /data/jobindex
MySQL
MySQL
Database Server Indexer Server
Index Builder
/data/jobindex
Search Engine
/data/jobindex
4 Search Servers
any combination of data, not just lucene
lucene +
model
lucene +
model
bitset
lucene +
model
bitset
lucene +
custom
binary
lucene +
model
bitset
lucene +
custom
binary
json +
csv
MySQL
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
MySQL
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
Artifact
is read-optimized data stored in a directory on the file system
Producer
creates and updates a data artifact
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
MySQL
Consumer
reads a data artifact
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
MySQL
produce once, consume many times
MySQL
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
Benefit: minimize database access
MySQL
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
Benefit: compute artifact once
MySQL
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
Benefit: scale consumers independently
MySQL
Expensive
Index Builder
Producer
Artifact Artifact
Commodity
Search Engine
Benefit: scale consumers independently
MySQL
Database Server
Index Builder
Producer
Artifact Artifact
Consumers
Search Engine
Benefit: separate code deployables
fast resilient scalable
Producer
artifact
Search Engine
Consumers
artifact
Index Builder
Producer
artifact
Search Engine
Consumers
artifact
Index Builder
rsync
efficient point-to-point file transfer utility
1
consumers should
reload data regularly
1
consumers should
reload data regularly
2
roll back
consumers should
reload data regularly
2
roll back
3
data reload should
not interrupt requests
1
artifact versioning
$ ls -d jobindex.*
jobindex.1
jobindex.2
jobindex.3
new directory for new version
$ ls -d jobindex.*
jobindex.1
jobindex.2
jobindex.3
jobindex.latest -> jobindex.3
symlink to know current version
$ ls -d jobindex.*
jobindex.1
jobindex.2
jobindex.3
jobindex.4
jobindex.latest -> jobindex.4
load new data
$ ls -d jobindex.*
jobindex.1
jobindex.2
jobindex.3
jobindex.4
jobindex.latest -> jobindex.3
roll back
each new version takes disk space & time
versions
total bytes on disk
normal disk copy
versions
disk
latency
total bytes on disk
normal disk copy
versions
version
create time
disk
latency
total bytes on disk
normal disk copy
1.8m jobs, change <2% per hour
all jobs
00:00 AM
all jobs
00:00 AM
all jobs
04:00 AM
new jobs
changed jobs
all jobs
00:00 AM
all jobs
04:00 AM
new jobs
changed jobs
unchanged
incremental updates
save disk space & time
share data between versions
file1.bin
file2.bin
file3.bin
3GB
jobindex.1
file1.bin
file2.bin
file3.bin
3GB
jobindex.1
file1.bin
file2.bin
file3.bin
jobindex.2
file1.bin
file2.bin
file3.bin
3GB
jobindex.1
file1.bin
file2.bin
file3.bin
file4.bin
4GB
jobindex.2
file1.bin
file2.bin
file3.bin
3GB
jobindex.1
file1.bin
file2.bin
file3.bin
file4.bin
4GB
jobindex.2
file1.bin
file2.bin
file3.bin
file4.bin
file5.bin
5GB
jobindex.3
file1.bin
file2.bin
file3.bin
3GB
jobindex.1
file1.bin
file2.bin
file3.bin
file4.bin
4GB
jobindex.2
file1.bin
file2.bin
file3.bin
file4.bin
file5.bin
5GB
jobindex.3
= 12GB+ +
5GB
file1.bin
file2.bin
file3.bin
3GB
jobindex.1
file1.bin
file2.bin
file3.bin
file4.bin
1GB
jobindex.2
file1.bin
file2.bin
file3.bin
file4.bin
file5.bin
1GB
jobindex.3
=+ +
file1.bin
file2.bin
file3.bin
file4.bin
jobindex.2
file1.bin
file2.bin
file3.bin
file5.bin
jobindex.3
deleted
1GB 1GB = 5GB+ 2GB
file4.bin
remove referenced file of symlink, data is gone
hardlink
additional name for an existing file
hardlink != symlink
file1.bin
file2.bin
file3.bin
3GB
jobindex.1
file1.bin
file2.bin
file3.bin
file4.bin
1GB
jobindex.2
file1.bin
file2.bin
file3.bin
file4.bin
file5.bin
1GB
jobindex.3
= 5GB+ +
file1.bin
file2.bin
file3.bin
file4.bin
4GB
jobindex.2
file1.bin
file2.bin
file3.bin
file4.bin
file5.bin
1GB
jobindex.3
= 5GB+
file1.bin
file2.bin
file3.bin
file4.bin
file5.bin
5GB
jobindex.3
= 5GB
remove last hardlink, data is gone
artifact versions: symlinks + hardlinks + rsync
scale: single producer, many consumers
Job Search Browser Rendering
median ~0.5 seconds
Feb 24 Feb 25 Feb 26 Feb 27 Feb 28 Feb 29 Mar 1 Mar 2 Mar 3 Mar 4 Mar 5 Mar 6 Mar 7 Mar 8
0
100
200
300
400
500
600
700
800
milliseconds
fast simple resilient scalable
How We Build Systems
2004
Indeed
1999
Lucene
2008
6 countries
2004
Indeed
1999
Lucene
2008
6 countries
2009
23 countries
2004 2008 200920062005
22.5 M5.2 M 7.1 M4.0 M1.8 M
jobs added or modified each month
2004
Indeed
1999
Lucene
2008
6 countries
2009
23 countries
2nd
datacenter
Producer
Consumers
artifacts
DC1
Staging
Consumers
artifacts
DC2
multi-dc rsync
Staging
Consumers
artifacts
DC3
Producer
Consumers
artifacts
DC1
Staging
Consumers
artifacts
DC2
Staging
Consumers
artifacts
DC3
minimize
Internet
bandwidth
2011
52 countries
4 datacenters
2004
Indeed
1999
Lucene
2008
6 countries
2009
23 countries
2004 2008 200920062005
22.5 M5.2 M 7.1 M4.0 M1.8 M
jobs added or modified each month
2011
32.5 M
rsync system growing pains
Simple: serially copy one artifact at a time
DC1
Producer Artifacts
DC2
Staging Artifacts
Problem: serially can cause delays
Producer
Staging
New
New
New
Old
DC1
DC2
smalllarge2large1
smalllarge2large1
Workaround: copy separately in “streams”
DC1
DC2
Staging
Producer
Simple: point-to-point datacenter rsync paths
DC4
DC3
DC2
DC1
Problem: Internet, why did you do that?
Down
DC4
DC3
DC2
DC1
Workaround: shift replication path
DC4
DC3
DC2
DC1
Scale: few consumers with rsync
Producer
Artifacts Consumers
Consumers
Producer
Grow: many consumers with rsync
Artifacts
Consumers
Consumers
Producer
Problem: too many consumers with rsync
Artifacts
Consumers
network
100%
used
Workaround: add more network bandwidth
Consumers
Producer
Artifacts
Consumers
Workaround: add staging tiers
Consumers
Producer
Artifacts
Staging
Artifacts Artifacts
Staging
Artifacts
Staging
Artifacts
Consumers Consumers Consumers Consumers Consumers Consumers Consumers
Staging
rsync growth required sysad intervention
2011
52 countries
2004
Indeed
1999
Lucene
2008
6 countries
2009
23 countries
2014
rsync growth
100 artifacts, adding +1 producer each month
producing 1,761 TB per month
over 200 consumers, +2 each month
replicating over 21,931 TB per month
staging tiers or network bandwidth, quarterly
modify replication path, monthly
requiring too much intervention from system
administrators
sysad
dev
sysad
dev
+50%
+100%
2014
January December
2011
52 countries
2004
Indeed
1999
Lucene
2008
6 countries
2009
23 countries
2014
rsync limits
Julie Scully
Software Engineer
Jobsearch backend team produces a lot of data
RAD
“Resilient Artifact Distribution”
Design GoalsDesign Goals
Minimize network bottlenecks
Loose coupling
Automatic recovery
Developer empowerment
System-wide visibility
1
2
3
4
5
Design Goals
Minimize network bottlenecks
Loose coupling
Automatic recovery
Developer empowerment
System-wide visibility
3
4
5
1
2
Design Goals
Minimize network bottlenecks
Loose coupling
Automatic recovery
Developer empowerment
System-wide visibility
1
2
5
4
3
Design Goals
Minimize network bottlenecks
Loose coupling
Automatic recovery
Developer empowerment
System-wide visibility
1
2
3
5
4
Design Goals
Minimize network bottlenecks
Loose coupling
Automatic recovery
Developer empowerment
System-wide visibility
1
2
3
4
5
Design GoalsDesign Goals
Minimize network bottlenecks
Loose coupling
Automatic recovery
Developer empowerment
System-wide visibility
1
2
3
4
5
No more point-to-point
Measure time and
network traffic
Bittorrent: Would it work?
Sample replication to
3 consumers
https://github.com/shevek/ttorrent
Network Test
Total MB received + transmitted for 700MB artifact
Producer 2,240
Consumer 1 746
Consumer 2 747
Consumer 3 747
machine RSYNC
Network Test
Total MB received + transmitted for 700MB artifact
Producer 2,240 782
Consumer 1 746 1,226
Consumer 2 747 1,225
Consumer 3 747 1,245
machine BITTORRENTRSYNC
Network Test
Total MB received + transmitted for 700MB artifact
Producer 2,240 782
Consumer 1 746 1,226
Consumer 2 747 1,225
Consumer 3 747 1,245
Total 4,481 4,480
machine BITTORRENTRSYNC
24 minutes
rsync
5.5 minutes
bittorrent
Timing Test
How does bittorrent work?
Data split into small pieces of equal size
Hash computed for each piece
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
Piece 1: 75 MB
Piece 2: 75 MB
Piece 3: 75 MB
Piece 4: 75 MB
Piece 5: 25 MB
torrent metadata file
{ files:file1.bin,100MB;
file2.bin,200MB;
file3.bin,50MB }
{ piecelength:75MB }
{
infohash:XSDJSK;JDISJLD;DJKJDB;KDJB
OP;FJEIODK; }
.torrent metadata file:
{ files:file1.bin,100MB;
file2.bin,200MB;
file3.bin,50MB }
{ piecelength:75MB }
{
infohash:XSDJSK;JDISJLD;DJKJDB;KDJB
OP;FJEIODK; }
.torrent metadata file:
{ files:file1.bin,100MB;
file2.bin,200MB;
file3.bin,50MB }
{ piecelength:75MB }
{
infohash:XSDJSK;JDISJLD;DJKJDB;KDJB
OP;FJEIODK; }
.torrent metadata file:
Tracker
Coordinator of the download
Seeder
Any client providing data
Seeder
Data
I have pieces for info hash
Tracker
.torrent
Info Hash
File manifest
Data .torrent
Info Hash
File manifest
Seeder Tracker
Info hash peer
Map
Ok!
I have pieces for info hash
Consumer
Any client downloading data
Peers for infohash
Consumer Tracker
.torrent
Info Hash
File manifest
Tracker URL
Map
Info hash peer
How a consumer gets the first piece
Peers for infohash
Peerlist
Consumer Tracker
.torrent
Info Hash
File manifest
Tracker URL
Map
Info hash peer
How a consumer gets the first piece
Data .torrent
Info Hash
File manifest
Consumer/
Seeder
I have pieces for infohash
Tracker
Info hash peer
Map
It is also a seeder
Consumer 1
Seeding as it downloads
Consumer 2
Seeding as it downloads
Consumer 3
Seeding as it downloads
Seeder
SWARM
Didn’t quite meet our needs
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH5
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
jobindex.2
File4.bin
(50MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
jobindex.2
File4.bin
(50MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH6
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH5
Piece 6: HASH7
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
jobindex.2
File4.bin
(50MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH6
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH5
Piece 6: HASH7
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1 jobindex.2
File4.bin
(50MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.2
File0.bin
(50MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
Piece 1: HASH6
Piece 2: HASH7
Piece 3: HASH8
Piece 4: HASH9
Piece 5: HASH10
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH5
Piece 6: HASH11
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1 jobindex.2
File4.bin
(50MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.2
File0.bin
(50MB)
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
Control sort order?
jobindex.2
File3.bin
(50MB)
File1.bin
(150MB)
File2.bin
(200MB)
Piece 1: HASH6
Piece 2: HASH7
Piece 3: HASH8
Piece 4: HASH9
Piece 5: HASH10
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH5
Piece 6: HASH11
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
File3.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
jobindex.1
Piece 1: HASH6
Piece 2: HASH7
Piece 3: HASH8
Piece 4: HASH9
Piece 5: HASH10
Piece 1: HASH1
Piece 2: HASH2
Piece 3: HASH3
Piece 4: HASH4
Piece 5: HASH5
Piece 6: HASH11
File3.bin
(50MB)
File1.bin
(150MB)
File2.bin
(200MB)
jobindex.2
hash each file?
Compare files not pieces
{ files:file1.bin,100MB,DATETIME;
file2.bin,200MB,DATETIME;
file3.bin,50MB,DATETIME }
{ piecelength:75MB }
...
.torrent metadata file contents:
File1.bin
(100MB)
File2.bin
(200MB)
File3.bin
(50MB)
jobindex.1
Piece 1: File 0, File1
Piece 2: File 1
Piece 3: File 1, File 2
Piece 4: File 2
Piece 5: File 2, File 3
Piece 6: File 3
File1.bin
(100MB)
File2.bin
(200MB)
File3.bin
(50MB)
jobindex.2
File0.bin
(50MB)
File1.bin
(100MB)
File2.bin
(200MB)
File3.bin
(50MB)
jobindex.1
File1.bin
(100MB)
File2.bin
(200MB)
File3.bin
(50MB)
jobindex.2
File0.bin
(50MB) Piece 1: File 0, File1
Piece 2: File 1
Piece 3: File 1, File 2
Piece 4: File 2
Piece 5: File 2, File 3
Piece 6: File 3
Bittorrent Evaluation Result
substantially faster drastically reduces
network load on the
producer machine
horizontally scalable
Design GoalsDesign Goals
Automatic recovery
Developer empowerment
System-wide visibility
3
4
5
Loose coupling2
Minimize network bottlenecks1
Service-oriented architecture
Headwater
The beginning of a river
Headwater
Host
Data
Producer Data
Publish
my data
Headwater takes ownership of the data
(hardlink + read-only)
Headwater
Host
Data
Producer Data
Publish
my data
Will do!
Headwater
Host
Data
Producer Data
create the .torrent metadata file
Headwater
The beginning of a river
River
Course the water carves
across the landscape
Rhone
RhoneRhone
Zookeeper
Rhone: multi-master coordinator service
Rhone
Headwater
Host
Data
Producer Data
Rhone
Headwater
Host
Data
Producer Datadata.version
torrent metadata
Rhone
Headwater
Host
Data
Producer Datadata.version
torrent metadata
Rhone
Headwater
Host
Data
Producer Data
Tracker
.torrent metadata
can be retrieved
data.version
torrent metadata
Headwater
The beginning of a river
River
Course the water carves
across the landscape
Delta
The end of the river
Subscribe
to data!
Delta
Host
Data
Consumer
Make all subscribed artifacts available
RhoneDelta
Host
Data
Consumer
Headwater
Host
Data
Producer Data
Delta
Data
Consumer
Rhone
Host
Tracker
Delta
Host
Data
ConsumerData
/rad/data
Delta
Host
Data
ConsumerData
Where’s
the latest
data?
/rad/data
It’s at
/rad/data
Delta
Host
Data
ConsumerData
Where’s the
latest data?
/rad/data
Delta
Host
Data
ConsumerData
/rad/data
Keep all subscribed artifacts current
Delta
Data
Consumer
Rhone
Host
Rhone
Data
Host
Artifact Availability Flow
Delta Headwater
Host
Data
Consumer
Data
Producer Data
Design GoalsDesign Goals
Automatic recovery
Developer empowerment
System-wide visibility
4
5
Minimize network bottlenecks1
Loose coupling2
3
Rhone
Headwater
Host
Data
Producer Data
Crash!
Rhone
Headwater
Data
Producer Datadata.version
torrent metadata
Tracker
Crash!
Host
Development philosophy:
Make recovery the common case
Durable state with atomic filesystem operations
All service calls are idempotent
RAD handles network recovery
DC4
DC3
DC2
DC1
rsync is point-to-point
DC1
DC4
DC3
DC2
bittorrent peer-to-peer
Down
DC1
DC4
DC3
DC2
No problem with bittorrent swarm
RAD treats artifact independently
Design GoalsDesign Goals
Developer empowerment
System-wide visibility5
Minimize network bottlenecks1
Loose coupling2
Automatic recovery3
4
Adding a new artifact in the rsync system
Ask System Administrators
Adding a new artifact in the RAD system
Declare it in the code
REST API is language agnostic
Design GoalsDesign Goals
System-wide visibility
Minimize network bottlenecks1
Loose coupling2
Automatic recovery3
Developer empowerment4
5
Rhone already knows all artifacts
Rhone stores list of versions by artifact.
version 4
version 5
version 6
artifactA
version 221
version 226
version 227
version 228
artifactB
version 1artifactC
Heartbeats from Delta and Headwater
Rhone has system-wide view
RADAR: Developers can easily see where their data is
RADAR: Developers can easily see where their data is
RADAR: Developers can easily see where their data is
RADAR: Developers can easily see where their data is
start simple and iterate
2011
52 countries
2004
Indeed
2008
6 countries
2009
23 countries
2014
rsync limits
1st artifact
migrated to RAD
Lesson learned: prevent people from
using the system incorrectly
We made configuration TOO easy
New Requirement: protect the disks
Delta
Prevent downloading artifacts that will fill the disk (and alarm)
2011
52 countries
2004
Indeed
2008
6 countries
2009
23 countries
2014
rsync limits
1st artifact
migrated to RAD
2015
critical artifacts
migrated
2011
52 countries
2004
Indeed
2008
6 countries
2009
23 countries
2014
rsync limits
1st artifact
migrated to RAD
2015
critical artifacts
migrated
2016
80 RAD
artifacts
2011
52 countries
2004
Indeed
2008
6 countries
2009
23 countries
2014
rsync limits
1st artifact
migrated to RAD
2015
critical artifacts
migrated
2016
80 RAD
artifacts
100 artifacts in 10 years
100 artifacts in 10 years
2011
52 countries
2004
Indeed
2008
6 countries
2009
23 countries
2014
rsync limits
1st artifact
migrated to RAD
2015
critical artifacts
migrated
2016
80 RAD
artifacts
80 new
artifacts
in 1 year
7,666
versions published
Producer
Consumer
56
unique producers
52,357
versions downloaded
670
unique consumers
RAD Stats
March 23, 2016
Duration of JobIndex replication in RAD v. Rsync
Jan 18 6 AM 12 PM 6 PM Jan 19 6 AM 12 PM 6 PM
1,000
2,000
3,000
RAD rsync
time
replicating over 65,193 TB per month
Learn More
Engineering blog & talks http://indeed.tech
Open Source http://opensource.indeedeng.io
Careers http://indeed.jobs
Twitter @IndeedEng

More Related Content

What's hot

Event Sourcing, Stream Processing and Serverless (Ben Stopford, Confluent) K...
Event Sourcing, Stream Processing and Serverless (Ben Stopford, Confluent)  K...Event Sourcing, Stream Processing and Serverless (Ben Stopford, Confluent)  K...
Event Sourcing, Stream Processing and Serverless (Ben Stopford, Confluent) K...confluent
 
CQRS and Event Sourcing, An Alternative Architecture for DDD
CQRS and Event Sourcing, An Alternative Architecture for DDDCQRS and Event Sourcing, An Alternative Architecture for DDD
CQRS and Event Sourcing, An Alternative Architecture for DDDDennis Doomen
 
Event Sourcing & CQRS, Kafka, Rabbit MQ
Event Sourcing & CQRS, Kafka, Rabbit MQEvent Sourcing & CQRS, Kafka, Rabbit MQ
Event Sourcing & CQRS, Kafka, Rabbit MQAraf Karsh Hamid
 
Streaming Event Time Partitioning with Apache Flink and Apache Iceberg - Juli...
Streaming Event Time Partitioning with Apache Flink and Apache Iceberg - Juli...Streaming Event Time Partitioning with Apache Flink and Apache Iceberg - Juli...
Streaming Event Time Partitioning with Apache Flink and Apache Iceberg - Juli...Flink Forward
 
DoK Talks #91- Leveraging Druid Operator to manage Apache Druid on Kubernetes
DoK Talks #91- Leveraging Druid Operator to manage Apache Druid on KubernetesDoK Talks #91- Leveraging Druid Operator to manage Apache Druid on Kubernetes
DoK Talks #91- Leveraging Druid Operator to manage Apache Druid on KubernetesDoKC
 
What is in a Lucene index?
What is in a Lucene index?What is in a Lucene index?
What is in a Lucene index?lucenerevolution
 
How Netflix Tunes EC2 Instances for Performance
How Netflix Tunes EC2 Instances for PerformanceHow Netflix Tunes EC2 Instances for Performance
How Netflix Tunes EC2 Instances for PerformanceBrendan Gregg
 
Making Apache Spark Better with Delta Lake
Making Apache Spark Better with Delta LakeMaking Apache Spark Better with Delta Lake
Making Apache Spark Better with Delta LakeDatabricks
 
Dynamo and BigTable in light of the CAP theorem
Dynamo and BigTable in light of the CAP theoremDynamo and BigTable in light of the CAP theorem
Dynamo and BigTable in light of the CAP theoremGrisha Weintraub
 
What is the State of my Kafka Streams Application? Unleashing Metrics. | Neil...
What is the State of my Kafka Streams Application? Unleashing Metrics. | Neil...What is the State of my Kafka Streams Application? Unleashing Metrics. | Neil...
What is the State of my Kafka Streams Application? Unleashing Metrics. | Neil...HostedbyConfluent
 
OLTP+OLAP=HTAP
 OLTP+OLAP=HTAP OLTP+OLAP=HTAP
OLTP+OLAP=HTAPEDB
 
An Overview of Spanner: Google's Globally Distributed Database
An Overview of Spanner: Google's Globally Distributed DatabaseAn Overview of Spanner: Google's Globally Distributed Database
An Overview of Spanner: Google's Globally Distributed DatabaseBenjamin Bengfort
 
확장가능한 웹 아키텍쳐 구축 방안
확장가능한 웹 아키텍쳐 구축 방안 확장가능한 웹 아키텍쳐 구축 방안
확장가능한 웹 아키텍쳐 구축 방안 IMQA
 
Running Kafka as a Native Binary Using GraalVM with Ozan Günalp
Running Kafka as a Native Binary Using GraalVM with Ozan GünalpRunning Kafka as a Native Binary Using GraalVM with Ozan Günalp
Running Kafka as a Native Binary Using GraalVM with Ozan GünalpHostedbyConfluent
 
Common Patterns of Multi Data-Center Architectures with Apache Kafka
Common Patterns of Multi Data-Center Architectures with Apache KafkaCommon Patterns of Multi Data-Center Architectures with Apache Kafka
Common Patterns of Multi Data-Center Architectures with Apache Kafkaconfluent
 
MongoDB: Advance concepts - Replication and Sharding
MongoDB: Advance concepts - Replication and ShardingMongoDB: Advance concepts - Replication and Sharding
MongoDB: Advance concepts - Replication and ShardingKnoldus Inc.
 

What's hot (20)

Event Sourcing, Stream Processing and Serverless (Ben Stopford, Confluent) K...
Event Sourcing, Stream Processing and Serverless (Ben Stopford, Confluent)  K...Event Sourcing, Stream Processing and Serverless (Ben Stopford, Confluent)  K...
Event Sourcing, Stream Processing and Serverless (Ben Stopford, Confluent) K...
 
CQRS and Event Sourcing, An Alternative Architecture for DDD
CQRS and Event Sourcing, An Alternative Architecture for DDDCQRS and Event Sourcing, An Alternative Architecture for DDD
CQRS and Event Sourcing, An Alternative Architecture for DDD
 
Event Sourcing & CQRS, Kafka, Rabbit MQ
Event Sourcing & CQRS, Kafka, Rabbit MQEvent Sourcing & CQRS, Kafka, Rabbit MQ
Event Sourcing & CQRS, Kafka, Rabbit MQ
 
Google Spanner
Google SpannerGoogle Spanner
Google Spanner
 
Helix talk at RelateIQ
Helix talk at RelateIQHelix talk at RelateIQ
Helix talk at RelateIQ
 
Streaming Event Time Partitioning with Apache Flink and Apache Iceberg - Juli...
Streaming Event Time Partitioning with Apache Flink and Apache Iceberg - Juli...Streaming Event Time Partitioning with Apache Flink and Apache Iceberg - Juli...
Streaming Event Time Partitioning with Apache Flink and Apache Iceberg - Juli...
 
DoK Talks #91- Leveraging Druid Operator to manage Apache Druid on Kubernetes
DoK Talks #91- Leveraging Druid Operator to manage Apache Druid on KubernetesDoK Talks #91- Leveraging Druid Operator to manage Apache Druid on Kubernetes
DoK Talks #91- Leveraging Druid Operator to manage Apache Druid on Kubernetes
 
What is in a Lucene index?
What is in a Lucene index?What is in a Lucene index?
What is in a Lucene index?
 
How Netflix Tunes EC2 Instances for Performance
How Netflix Tunes EC2 Instances for PerformanceHow Netflix Tunes EC2 Instances for Performance
How Netflix Tunes EC2 Instances for Performance
 
Making Apache Spark Better with Delta Lake
Making Apache Spark Better with Delta LakeMaking Apache Spark Better with Delta Lake
Making Apache Spark Better with Delta Lake
 
Dynamo and BigTable in light of the CAP theorem
Dynamo and BigTable in light of the CAP theoremDynamo and BigTable in light of the CAP theorem
Dynamo and BigTable in light of the CAP theorem
 
CephFS Update
CephFS UpdateCephFS Update
CephFS Update
 
What is the State of my Kafka Streams Application? Unleashing Metrics. | Neil...
What is the State of my Kafka Streams Application? Unleashing Metrics. | Neil...What is the State of my Kafka Streams Application? Unleashing Metrics. | Neil...
What is the State of my Kafka Streams Application? Unleashing Metrics. | Neil...
 
OLTP+OLAP=HTAP
 OLTP+OLAP=HTAP OLTP+OLAP=HTAP
OLTP+OLAP=HTAP
 
An Overview of Spanner: Google's Globally Distributed Database
An Overview of Spanner: Google's Globally Distributed DatabaseAn Overview of Spanner: Google's Globally Distributed Database
An Overview of Spanner: Google's Globally Distributed Database
 
확장가능한 웹 아키텍쳐 구축 방안
확장가능한 웹 아키텍쳐 구축 방안 확장가능한 웹 아키텍쳐 구축 방안
확장가능한 웹 아키텍쳐 구축 방안
 
Running Kafka as a Native Binary Using GraalVM with Ozan Günalp
Running Kafka as a Native Binary Using GraalVM with Ozan GünalpRunning Kafka as a Native Binary Using GraalVM with Ozan Günalp
Running Kafka as a Native Binary Using GraalVM with Ozan Günalp
 
Galera Cluster Best Practices for DBA's and DevOps Part 1
Galera Cluster Best Practices for DBA's and DevOps Part 1Galera Cluster Best Practices for DBA's and DevOps Part 1
Galera Cluster Best Practices for DBA's and DevOps Part 1
 
Common Patterns of Multi Data-Center Architectures with Apache Kafka
Common Patterns of Multi Data-Center Architectures with Apache KafkaCommon Patterns of Multi Data-Center Architectures with Apache Kafka
Common Patterns of Multi Data-Center Architectures with Apache Kafka
 
MongoDB: Advance concepts - Replication and Sharding
MongoDB: Advance concepts - Replication and ShardingMongoDB: Advance concepts - Replication and Sharding
MongoDB: Advance concepts - Replication and Sharding
 

Similar to @Indeedeng: RAD - How We Replicate Terabytes of Data Around the World Every Day

How companies use NoSQL & Couchbase - NoSQL Now 2014
How companies use NoSQL & Couchbase - NoSQL Now 2014How companies use NoSQL & Couchbase - NoSQL Now 2014
How companies use NoSQL & Couchbase - NoSQL Now 2014Dipti Borkar
 
Anton Lytunenko "Data Lake. Make data pleasant to swim in"
Anton Lytunenko "Data Lake. Make data pleasant to swim in"Anton Lytunenko "Data Lake. Make data pleasant to swim in"
Anton Lytunenko "Data Lake. Make data pleasant to swim in"Lviv Startup Club
 
A Call for Sanity in NoSQL
A Call for Sanity in NoSQLA Call for Sanity in NoSQL
A Call for Sanity in NoSQLC4Media
 
Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist SoftServe
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon RedshiftAmazon Web Services
 
Gruter TECHDAY 2014 Realtime Processing in Telco
Gruter TECHDAY 2014 Realtime Processing in TelcoGruter TECHDAY 2014 Realtime Processing in Telco
Gruter TECHDAY 2014 Realtime Processing in TelcoGruter
 
Move a successful onpremise oltp application to the cloud
Move a successful onpremise oltp application to the cloudMove a successful onpremise oltp application to the cloud
Move a successful onpremise oltp application to the cloudIke Ellis
 
Innovation dank DevOps (DevOpsCon Berlin 2015)
Innovation dank DevOps (DevOpsCon Berlin 2015)Innovation dank DevOps (DevOpsCon Berlin 2015)
Innovation dank DevOps (DevOpsCon Berlin 2015)Wooga
 
Data as a Service
Data as a Service Data as a Service
Data as a Service Kyle Hailey
 
Softshake 2013: Introduction to NoSQL with Couchbase
Softshake 2013: Introduction to NoSQL with CouchbaseSoftshake 2013: Introduction to NoSQL with Couchbase
Softshake 2013: Introduction to NoSQL with CouchbaseTugdual Grall
 
Immutable Infrastructure: Rise of the Machine Images
Immutable Infrastructure: Rise of the Machine ImagesImmutable Infrastructure: Rise of the Machine Images
Immutable Infrastructure: Rise of the Machine ImagesC4Media
 
Puppetcamp Melbourne - puppetdb
Puppetcamp Melbourne - puppetdbPuppetcamp Melbourne - puppetdb
Puppetcamp Melbourne - puppetdbm_richardson
 
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDBPuppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDBPuppet
 
Elasticsearch : petit déjeuner du 13 mars 2014
Elasticsearch : petit déjeuner du 13 mars 2014Elasticsearch : petit déjeuner du 13 mars 2014
Elasticsearch : petit déjeuner du 13 mars 2014ALTER WAY
 
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB Puppet
 
Ensuring Quality in Data Lakes (D&D Meetup Feb 22)
Ensuring Quality in Data Lakes  (D&D Meetup Feb 22)Ensuring Quality in Data Lakes  (D&D Meetup Feb 22)
Ensuring Quality in Data Lakes (D&D Meetup Feb 22)lakeFS
 
Patterns of Streaming Applications
Patterns of Streaming ApplicationsPatterns of Streaming Applications
Patterns of Streaming ApplicationsC4Media
 
Scaling Hadoop at LinkedIn
Scaling Hadoop at LinkedInScaling Hadoop at LinkedIn
Scaling Hadoop at LinkedInDataWorks Summit
 
Scaling Slack - The Good, the Unexpected, and the Road Ahead
Scaling Slack - The Good, the Unexpected, and the Road AheadScaling Slack - The Good, the Unexpected, and the Road Ahead
Scaling Slack - The Good, the Unexpected, and the Road AheadC4Media
 

Similar to @Indeedeng: RAD - How We Replicate Terabytes of Data Around the World Every Day (20)

How companies use NoSQL & Couchbase - NoSQL Now 2014
How companies use NoSQL & Couchbase - NoSQL Now 2014How companies use NoSQL & Couchbase - NoSQL Now 2014
How companies use NoSQL & Couchbase - NoSQL Now 2014
 
Anton Lytunenko "Data Lake. Make data pleasant to swim in"
Anton Lytunenko "Data Lake. Make data pleasant to swim in"Anton Lytunenko "Data Lake. Make data pleasant to swim in"
Anton Lytunenko "Data Lake. Make data pleasant to swim in"
 
A Call for Sanity in NoSQL
A Call for Sanity in NoSQLA Call for Sanity in NoSQL
A Call for Sanity in NoSQL
 
Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
Gruter TECHDAY 2014 Realtime Processing in Telco
Gruter TECHDAY 2014 Realtime Processing in TelcoGruter TECHDAY 2014 Realtime Processing in Telco
Gruter TECHDAY 2014 Realtime Processing in Telco
 
WSDM09-keynote
WSDM09-keynoteWSDM09-keynote
WSDM09-keynote
 
Move a successful onpremise oltp application to the cloud
Move a successful onpremise oltp application to the cloudMove a successful onpremise oltp application to the cloud
Move a successful onpremise oltp application to the cloud
 
Innovation dank DevOps (DevOpsCon Berlin 2015)
Innovation dank DevOps (DevOpsCon Berlin 2015)Innovation dank DevOps (DevOpsCon Berlin 2015)
Innovation dank DevOps (DevOpsCon Berlin 2015)
 
Data as a Service
Data as a Service Data as a Service
Data as a Service
 
Softshake 2013: Introduction to NoSQL with Couchbase
Softshake 2013: Introduction to NoSQL with CouchbaseSoftshake 2013: Introduction to NoSQL with Couchbase
Softshake 2013: Introduction to NoSQL with Couchbase
 
Immutable Infrastructure: Rise of the Machine Images
Immutable Infrastructure: Rise of the Machine ImagesImmutable Infrastructure: Rise of the Machine Images
Immutable Infrastructure: Rise of the Machine Images
 
Puppetcamp Melbourne - puppetdb
Puppetcamp Melbourne - puppetdbPuppetcamp Melbourne - puppetdb
Puppetcamp Melbourne - puppetdb
 
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDBPuppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
 
Elasticsearch : petit déjeuner du 13 mars 2014
Elasticsearch : petit déjeuner du 13 mars 2014Elasticsearch : petit déjeuner du 13 mars 2014
Elasticsearch : petit déjeuner du 13 mars 2014
 
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
 
Ensuring Quality in Data Lakes (D&D Meetup Feb 22)
Ensuring Quality in Data Lakes  (D&D Meetup Feb 22)Ensuring Quality in Data Lakes  (D&D Meetup Feb 22)
Ensuring Quality in Data Lakes (D&D Meetup Feb 22)
 
Patterns of Streaming Applications
Patterns of Streaming ApplicationsPatterns of Streaming Applications
Patterns of Streaming Applications
 
Scaling Hadoop at LinkedIn
Scaling Hadoop at LinkedInScaling Hadoop at LinkedIn
Scaling Hadoop at LinkedIn
 
Scaling Slack - The Good, the Unexpected, and the Road Ahead
Scaling Slack - The Good, the Unexpected, and the Road AheadScaling Slack - The Good, the Unexpected, and the Road Ahead
Scaling Slack - The Good, the Unexpected, and the Road Ahead
 

More from indeedeng

Weapons of Math Instruction: Evolving from Data0-Driven to Science-Driven
Weapons of Math Instruction: Evolving from Data0-Driven to Science-DrivenWeapons of Math Instruction: Evolving from Data0-Driven to Science-Driven
Weapons of Math Instruction: Evolving from Data0-Driven to Science-Drivenindeedeng
 
Alchemy and Science: Choosing Metrics That Work
Alchemy and Science: Choosing Metrics That WorkAlchemy and Science: Choosing Metrics That Work
Alchemy and Science: Choosing Metrics That Workindeedeng
 
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...indeedeng
 
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...indeedeng
 
Improving the development process with metrics driven insights presentation
Improving the development process with metrics driven insights presentationImproving the development process with metrics driven insights presentation
Improving the development process with metrics driven insights presentationindeedeng
 
Data-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision Making
Data-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision MakingData-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision Making
Data-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision Makingindeedeng
 
Automation and Developer Infrastructure — Empowering Engineers to Move from I...
Automation and Developer Infrastructure — Empowering Engineers to Move from I...Automation and Developer Infrastructure — Empowering Engineers to Move from I...
Automation and Developer Infrastructure — Empowering Engineers to Move from I...indeedeng
 
Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)
Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)
Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)indeedeng
 
Data Day Texas - Recommendations
Data Day Texas - RecommendationsData Day Texas - Recommendations
Data Day Texas - Recommendationsindeedeng
 
Vectorized VByte Decoding
Vectorized VByte DecodingVectorized VByte Decoding
Vectorized VByte Decodingindeedeng
 
[@IndeedEng] Imhotep Workshop
[@IndeedEng] Imhotep Workshop[@IndeedEng] Imhotep Workshop
[@IndeedEng] Imhotep Workshopindeedeng
 
@IndeedEng: Tokens and Millicents - technical challenges in launching Indeed...
@IndeedEng:  Tokens and Millicents - technical challenges in launching Indeed...@IndeedEng:  Tokens and Millicents - technical challenges in launching Indeed...
@IndeedEng: Tokens and Millicents - technical challenges in launching Indeed...indeedeng
 
[@IndeedEng] Large scale interactive analytics with Imhotep
[@IndeedEng] Large scale interactive analytics with Imhotep[@IndeedEng] Large scale interactive analytics with Imhotep
[@IndeedEng] Large scale interactive analytics with Imhotepindeedeng
 
[@IndeedEng] Logrepo: Enabling Data-Driven Decisions
[@IndeedEng] Logrepo: Enabling Data-Driven Decisions[@IndeedEng] Logrepo: Enabling Data-Driven Decisions
[@IndeedEng] Logrepo: Enabling Data-Driven Decisionsindeedeng
 
[@IndeedEng] Boxcar: A self-balancing distributed services protocol
[@IndeedEng] Boxcar: A self-balancing distributed services protocol [@IndeedEng] Boxcar: A self-balancing distributed services protocol
[@IndeedEng] Boxcar: A self-balancing distributed services protocol indeedeng
 
[@IndeedEng Talk] Diving deeper into data-driven product design
[@IndeedEng Talk] Diving deeper into data-driven product design[@IndeedEng Talk] Diving deeper into data-driven product design
[@IndeedEng Talk] Diving deeper into data-driven product designindeedeng
 
[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctor
[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctor[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctor
[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctorindeedeng
 
[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...
[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...
[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...indeedeng
 
[@IndeedEng] Redundant Array of Inexpensive Datacenters
[@IndeedEng] Redundant Array of Inexpensive Datacenters[@IndeedEng] Redundant Array of Inexpensive Datacenters
[@IndeedEng] Redundant Array of Inexpensive Datacentersindeedeng
 
[@IndeedEng] Building Indeed Resume Search
[@IndeedEng] Building Indeed Resume Search[@IndeedEng] Building Indeed Resume Search
[@IndeedEng] Building Indeed Resume Searchindeedeng
 

More from indeedeng (20)

Weapons of Math Instruction: Evolving from Data0-Driven to Science-Driven
Weapons of Math Instruction: Evolving from Data0-Driven to Science-DrivenWeapons of Math Instruction: Evolving from Data0-Driven to Science-Driven
Weapons of Math Instruction: Evolving from Data0-Driven to Science-Driven
 
Alchemy and Science: Choosing Metrics That Work
Alchemy and Science: Choosing Metrics That WorkAlchemy and Science: Choosing Metrics That Work
Alchemy and Science: Choosing Metrics That Work
 
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
 
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
Indeed Engineering and The Lead Developer Present: Tech Leadership and Manage...
 
Improving the development process with metrics driven insights presentation
Improving the development process with metrics driven insights presentationImproving the development process with metrics driven insights presentation
Improving the development process with metrics driven insights presentation
 
Data-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision Making
Data-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision MakingData-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision Making
Data-Driven off a Cliff: Anti-Patterns in Evidence-Based Decision Making
 
Automation and Developer Infrastructure — Empowering Engineers to Move from I...
Automation and Developer Infrastructure — Empowering Engineers to Move from I...Automation and Developer Infrastructure — Empowering Engineers to Move from I...
Automation and Developer Infrastructure — Empowering Engineers to Move from I...
 
Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)
Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)
Indeed My Jobs: A case study in ReactJS and Redux (Meetup talk March 2016)
 
Data Day Texas - Recommendations
Data Day Texas - RecommendationsData Day Texas - Recommendations
Data Day Texas - Recommendations
 
Vectorized VByte Decoding
Vectorized VByte DecodingVectorized VByte Decoding
Vectorized VByte Decoding
 
[@IndeedEng] Imhotep Workshop
[@IndeedEng] Imhotep Workshop[@IndeedEng] Imhotep Workshop
[@IndeedEng] Imhotep Workshop
 
@IndeedEng: Tokens and Millicents - technical challenges in launching Indeed...
@IndeedEng:  Tokens and Millicents - technical challenges in launching Indeed...@IndeedEng:  Tokens and Millicents - technical challenges in launching Indeed...
@IndeedEng: Tokens and Millicents - technical challenges in launching Indeed...
 
[@IndeedEng] Large scale interactive analytics with Imhotep
[@IndeedEng] Large scale interactive analytics with Imhotep[@IndeedEng] Large scale interactive analytics with Imhotep
[@IndeedEng] Large scale interactive analytics with Imhotep
 
[@IndeedEng] Logrepo: Enabling Data-Driven Decisions
[@IndeedEng] Logrepo: Enabling Data-Driven Decisions[@IndeedEng] Logrepo: Enabling Data-Driven Decisions
[@IndeedEng] Logrepo: Enabling Data-Driven Decisions
 
[@IndeedEng] Boxcar: A self-balancing distributed services protocol
[@IndeedEng] Boxcar: A self-balancing distributed services protocol [@IndeedEng] Boxcar: A self-balancing distributed services protocol
[@IndeedEng] Boxcar: A self-balancing distributed services protocol
 
[@IndeedEng Talk] Diving deeper into data-driven product design
[@IndeedEng Talk] Diving deeper into data-driven product design[@IndeedEng Talk] Diving deeper into data-driven product design
[@IndeedEng Talk] Diving deeper into data-driven product design
 
[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctor
[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctor[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctor
[@IndeedEng] Managing Experiments and Behavior Dynamically with Proctor
 
[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...
[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...
[@IndeedEng] Engineering Velocity: Building Great Software Through Fast Itera...
 
[@IndeedEng] Redundant Array of Inexpensive Datacenters
[@IndeedEng] Redundant Array of Inexpensive Datacenters[@IndeedEng] Redundant Array of Inexpensive Datacenters
[@IndeedEng] Redundant Array of Inexpensive Datacenters
 
[@IndeedEng] Building Indeed Resume Search
[@IndeedEng] Building Indeed Resume Search[@IndeedEng] Building Indeed Resume Search
[@IndeedEng] Building Indeed Resume Search
 

Recently uploaded

WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 

Recently uploaded (20)

WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 

@Indeedeng: RAD - How We Replicate Terabytes of Data Around the World Every Day