SlideShare una empresa de Scribd logo
1 de 22
May 9, 2016
1
May 9, 2016
Deduplication in Resistive CAM
Based SSD
Roman Kaplan, Leonid Yavits,
Amir Morad, Ran Ginosar
2015
May 9, 2016
2
Outline
1. What is ReCAM ?
2. What is deduplication ?
– How is it done today?
3. Deduplication in ReCAM
– How is it simpler?
4. Simulation results
May 9, 2016
3
Resistive CAM – What is it?
• CAM = Content Addressable Memory
1. Search for data in the entire array
2. Store address explicitly  function like RAM
• Memristors:
May 9, 2016
4
ReCAM Crossbar
May 9, 2016
5
Resistive CAM – Operations
What can ReCAM do:
1. Compare all its contents to a specific word
2. Write to specific columns in parallel
3. Write to specific rows in parallel
May 9, 2016
6
What is Deduplication?
1. Data is broken into fixed blocks
2. A fingerprint (FP) is calculated for each block
May 9, 2016
7
What is Deduplication?
1. Data is broken into fixed blocks
2. A fingerprint (FP) is calculated for each block
3. Identical blocks aren’t stored (deduplicated)
May 9, 2016
8
Deduplication Uses
1. Useful when there is repeating data
– Virtual machines
– WAN optimizations (networking)
– Backups
2. Compression ratio depend on type of data –
can reach up to 40x
May 9, 2016
9
Deduplication using RAM+CPU: Write
1. Calculate FP (Hash)
2. Search for it in the
chunk index (takes
very long time)
3. Act accordingly
(next slides)
Data
Hash
2
…
1
…
1
PA(A)
…
PA(B)
…
PA(C)
Hash(A)
….
Hash(B)
…..
Hash(C)
Chunk Index
Fingerprint Physical
Address
?
CNT
1
2
1
2
May 9, 2016
10
RAM+CPU Deduplication: Write (Case 1)
Case 1: If the FP is found
Data block already exists
I. Add LA+PA to ATT
II. Increment FP counter
in chunk index
1
…
1
…
1
PA(A)
…
PA(B)
…
PA(C)
Hash(A)
….
Hash(B)
…..
Hash(C)
Chunk Index
Fingerprint Physical
Address
CNT
Hash(D)
Address
Decoder
A
B
C
Data Blocks
Storage
D
PA(D) 1
Address
Translation
Table
𝐿𝐴(D)
𝐿𝐴(A)
𝐿𝐴 B
𝐿𝐴(C)
PA(D)
PA(A)
PA(B)
PA(C)
Logical
Address
Physical
Address
𝐿𝐴2(D) PA(D) 2A
A
B
B
May 9, 2016
11
RAM+CPU Deduplication: Write (Case 2)
Case 2: If the FP is not
found
A unique data block
I. Write block to storage
II. Add LA+PA to ATT
III. Add FP to chunk index
1
…
1
…
1
PA(A)
…
PA(B)
…
PA(C)
Hash(A)
….
Hash(B)
…..
Hash(C)
Chunk Index
Fingerprint Physical
Address
CNT
Hash(D)
Address
Decoder
A
B
C
Data Blocks
Storage
D
PA(D) 1
A
B
C
A
C
Address
Translation
Table
𝐿𝐴(A)
𝐿𝐴 B
𝐿𝐴(C)
PA(A)
PA(B)
PA(C)
Logical
Address
Physical
Address
𝐿𝐴(D) PA(D)B
May 9, 2016
12
Deduplication is Hard with RAM+CPU
• Delete is even more complicated than write
• Requires complex data structures &
computations Large memory & CPU
• Example: EMC XtremIO Xbrick
• 5TB all-flash storage
• 256GB RAM
• Quad-core CPU
May 9, 2016
13
Deduplication in ReCAM
• Much simpler than with RAM
• Chunk index is not required anymore
• Allows to compare all data blocks in storage
simultaneously
– If found, store only address-pointers
Chunk Index
May 9, 2016
14
Deduplication in ReCAM
1. Search for new data
block in the storage
2. Act accordingly
(next slides)
Data
Hash
A
B
C
Data Blocks
Storage
PA(A)
PA(B)
PA(C)
Physical
Address
?
May 9, 2016
15
Deduplication in ReCAM
Case 1: If the Data is found
Data block already exists
I. Add address to ATT
Storage
PA(A)
PA(B)
PA(C)
Physical
Address
A
B
C
Data Blocks
DPA(D)
Logical
Address
Physical
Address
Address
Translation
Table
𝐿𝐴(D)
𝐿𝐴(A)
𝐿𝐴 B
𝐿𝐴(C)
PA(D)
PA(A)
PA(B)
PA(C)
𝐿𝐴2(D) PA(D)
May 9, 2016
16
Deduplication in ReCAM
Case 2: If the Data is not
found
New Data block
I. Write Data to storage
II. Add address to ATT
Address Translation Table
Storage
PA(A)
PA(B)
PA(C)
Physical
Address
A
B
C
Data Blocks
DPA(D)
A
A
B
𝐿𝐴(A)
𝐿𝐴 B
𝐿𝐴(C)
PA(A)
PA(B)
PA(C)
Logical
Address
Physical
Address
𝐿𝐴(D) PA(D)B
May 9, 2016
17
Deduplication in ReCAM
Much Simpler than with RAM
• Write:
1. Compare the entire array data simultaneously
2. If match, save only a pointer
3. If not, save the data block + pointer
• Delete isn’t more complicated than write
– If no addresses pointing to the data  delete
May 9, 2016
18
Simulations
• ReCAM
– Cycle-accurate simulator: Size = 256GB, Clock = 1GHz
– SPICE  each cycle power + performance
• Opendedup for comparison
– Intel PCM for CPU+DRAM energy
– Only deduplication energy was measured
– Per-block processing time for performance
• 50GB of writes
– Varying % of duplicate data
May 9, 2016
19
Simulations – ReCAM vs. OpenDedup
0 50 100
10
4
10
5
10
6
10
7
10
8
Percentage of deduplicated blocks
Peakwriteperformance(IOPS)
ReCAM 1KB
ReCAM 2KB
ReCAM 4KB
ReCAM 8KB
OPNDDP 1KB
OPNDDP 2KB
OPNDDP 4KB
OPNDDP 8KB
OpenDedup
ReCAM
Throughput vs. duplicate %
May 9, 2016
20
Simulations – ReCAM & OpenDedup
Energy vs. duplicate %
0 20 40 60 80 100
10
3
10
4
10
5
Percentage of deduplicated blocks
EnergyConsumption(Joule)
ReCAM 1KB
ReCAM 2KB
ReCAM 4KB
ReCAM 8KB
OPNDDP 1KB
OPNDDP 2KB
OPNDDP 4KB
OPNDDP 8KB
May 9, 2016
21
Conclusions
• ReCAM has 100x higher throughput than
deduplication with RAM+CPU
• Energy consumption is similar or lower for the
common block sizes (4 & 8KB)
• Can be used as cache in hybrid storage systems
• Future technology may allow for TBs of storage on a
single chip
May 9, 2016
22
Thank you
Questions ?
22

Más contenido relacionado

La actualidad más candente

postgres loader
postgres loaderpostgres loader
postgres loaderINRIA-OAK
 
RDF Join Query Processing with Dual Simulation Pruning
RDF Join Query Processing with Dual Simulation PruningRDF Join Query Processing with Dual Simulation Pruning
RDF Join Query Processing with Dual Simulation Pruningwajrcs
 
ICOS Carbon Data Portal
ICOS Carbon Data PortalICOS Carbon Data Portal
ICOS Carbon Data PortalOscar Fonts
 
BGP Scanner - Isolario BGP-MRT Data Reader C Library and Tool
BGP Scanner - Isolario BGP-MRT Data Reader C Library and ToolBGP Scanner - Isolario BGP-MRT Data Reader C Library and Tool
BGP Scanner - Isolario BGP-MRT Data Reader C Library and ToolAPNIC
 
openCypher: Naming and Addressing Multiple Graphs
openCypher: Naming and Addressing Multiple GraphsopenCypher: Naming and Addressing Multiple Graphs
openCypher: Naming and Addressing Multiple GraphsopenCypher
 
Rdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimationRdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimationINRIA-OAK
 
RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...
RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...
RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...Daniele Dell'Aglio
 
2014.10 - Towards Description Set Profiles for RDF Using SPARQL as Intermedia...
2014.10 - Towards Description Set Profiles for RDF Using SPARQL as Intermedia...2014.10 - Towards Description Set Profiles for RDF Using SPARQL as Intermedia...
2014.10 - Towards Description Set Profiles for RDF Using SPARQL as Intermedia...Dr.-Ing. Thomas Hartmann
 
Map Analytics in Starcraft II
Map Analytics in Starcraft IIMap Analytics in Starcraft II
Map Analytics in Starcraft IIgy8
 
Poster Version 1.7
Poster Version 1.7Poster Version 1.7
Poster Version 1.7Dewey Potts
 
Map Analytics in Starcraft II (2/3/2015)
Map Analytics in Starcraft II (2/3/2015)Map Analytics in Starcraft II (2/3/2015)
Map Analytics in Starcraft II (2/3/2015)gy8
 
Internship - Final Presentation (26-08-2015)
Internship - Final Presentation (26-08-2015)Internship - Final Presentation (26-08-2015)
Internship - Final Presentation (26-08-2015)Sean Krail
 
Strings, C# and Unmanaged Memory
Strings, C# and Unmanaged MemoryStrings, C# and Unmanaged Memory
Strings, C# and Unmanaged MemoryMichael Yarichuk
 
Dynamo db and Cross Region Migration
Dynamo db and Cross Region MigrationDynamo db and Cross Region Migration
Dynamo db and Cross Region MigrationAnamika Gupta
 
Versioned Triple Pattern Fragments
Versioned Triple Pattern FragmentsVersioned Triple Pattern Fragments
Versioned Triple Pattern FragmentsRuben Taelman
 

La actualidad más candente (19)

postgres loader
postgres loaderpostgres loader
postgres loader
 
RDF Join Query Processing with Dual Simulation Pruning
RDF Join Query Processing with Dual Simulation PruningRDF Join Query Processing with Dual Simulation Pruning
RDF Join Query Processing with Dual Simulation Pruning
 
ICOS Carbon Data Portal
ICOS Carbon Data PortalICOS Carbon Data Portal
ICOS Carbon Data Portal
 
BGP Scanner - Isolario BGP-MRT Data Reader C Library and Tool
BGP Scanner - Isolario BGP-MRT Data Reader C Library and ToolBGP Scanner - Isolario BGP-MRT Data Reader C Library and Tool
BGP Scanner - Isolario BGP-MRT Data Reader C Library and Tool
 
No sql
No sqlNo sql
No sql
 
Matlab lecture 4 loops@taj
Matlab lecture 4  loops@tajMatlab lecture 4  loops@taj
Matlab lecture 4 loops@taj
 
openCypher: Naming and Addressing Multiple Graphs
openCypher: Naming and Addressing Multiple GraphsopenCypher: Naming and Addressing Multiple Graphs
openCypher: Naming and Addressing Multiple Graphs
 
Rdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimationRdf conjunctive query selectivity estimation
Rdf conjunctive query selectivity estimation
 
RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...
RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...
RSEP-QL: A Query Model to Capture Event Pattern Matching in RDF Stream Proces...
 
Web scale monitoring
Web scale monitoringWeb scale monitoring
Web scale monitoring
 
Big data quiz
Big data quizBig data quiz
Big data quiz
 
2014.10 - Towards Description Set Profiles for RDF Using SPARQL as Intermedia...
2014.10 - Towards Description Set Profiles for RDF Using SPARQL as Intermedia...2014.10 - Towards Description Set Profiles for RDF Using SPARQL as Intermedia...
2014.10 - Towards Description Set Profiles for RDF Using SPARQL as Intermedia...
 
Map Analytics in Starcraft II
Map Analytics in Starcraft IIMap Analytics in Starcraft II
Map Analytics in Starcraft II
 
Poster Version 1.7
Poster Version 1.7Poster Version 1.7
Poster Version 1.7
 
Map Analytics in Starcraft II (2/3/2015)
Map Analytics in Starcraft II (2/3/2015)Map Analytics in Starcraft II (2/3/2015)
Map Analytics in Starcraft II (2/3/2015)
 
Internship - Final Presentation (26-08-2015)
Internship - Final Presentation (26-08-2015)Internship - Final Presentation (26-08-2015)
Internship - Final Presentation (26-08-2015)
 
Strings, C# and Unmanaged Memory
Strings, C# and Unmanaged MemoryStrings, C# and Unmanaged Memory
Strings, C# and Unmanaged Memory
 
Dynamo db and Cross Region Migration
Dynamo db and Cross Region MigrationDynamo db and Cross Region Migration
Dynamo db and Cross Region Migration
 
Versioned Triple Pattern Fragments
Versioned Triple Pattern FragmentsVersioned Triple Pattern Fragments
Versioned Triple Pattern Fragments
 

Destacado

Adapting Web Mapping Curriculum to Open Source Technologies
Adapting Web Mapping Curriculum to Open Source TechnologiesAdapting Web Mapping Curriculum to Open Source Technologies
Adapting Web Mapping Curriculum to Open Source TechnologiesCarl Sack
 
Open Source Hardware Technologies in Art, Design, Interaction, Learning and T...
Open Source Hardware Technologies in Art, Design, Interaction, Learning and T...Open Source Hardware Technologies in Art, Design, Interaction, Learning and T...
Open Source Hardware Technologies in Art, Design, Interaction, Learning and T...Kieran Nolan
 
Using Open Source Technologies to Spatially Enable Aceh
Using Open Source Technologies to Spatially Enable AcehUsing Open Source Technologies to Spatially Enable Aceh
Using Open Source Technologies to Spatially Enable AcehMaurits van der Vlugt
 
Open source technologies in Microsoft cloud
Open source technologies in Microsoft cloudOpen source technologies in Microsoft cloud
Open source technologies in Microsoft cloudAlexey Bokov
 
Open source technologies
Open source technologiesOpen source technologies
Open source technologiesBrizGo
 
Open source educational technologies for teaching and learning
Open source educational technologies for teaching and learningOpen source educational technologies for teaching and learning
Open source educational technologies for teaching and learningKimera Richard
 
Open source technologies
Open source technologiesOpen source technologies
Open source technologiesSubash V
 
The future of open source and assistive technologies
The future of open source and assistive technologiesThe future of open source and assistive technologies
The future of open source and assistive technologiesDavid Banes
 
OSCon 2011 Talk: The implications of open source technologies in safety criti...
OSCon 2011 Talk: The implications of open source technologies in safety criti...OSCon 2011 Talk: The implications of open source technologies in safety criti...
OSCon 2011 Talk: The implications of open source technologies in safety criti...Shahid Shah
 
Open source technologies
Open source technologiesOpen source technologies
Open source technologiesankita9765
 
RRAM Status and Opportunities
RRAM Status and Opportunities RRAM Status and Opportunities
RRAM Status and Opportunities Deepak Sekar
 
Open Source ERP Technologies for Java Developers
Open Source ERP Technologies for Java DevelopersOpen Source ERP Technologies for Java Developers
Open Source ERP Technologies for Java Developerscboecking
 
Open source technology
Open source technologyOpen source technology
Open source technologyaparnaz1
 
Big Data Processing Utilizing Open-source Technologies - May 2015
Big Data Processing Utilizing Open-source Technologies - May 2015Big Data Processing Utilizing Open-source Technologies - May 2015
Big Data Processing Utilizing Open-source Technologies - May 2015Amir Sedighi
 
Open source Software: pros and cons
Open source Software: pros and consOpen source Software: pros and cons
Open source Software: pros and consygpriya
 
PROPRIETARY AND OPEN SOURCE SOFTWARE
PROPRIETARY AND OPEN SOURCE SOFTWARE PROPRIETARY AND OPEN SOURCE SOFTWARE
PROPRIETARY AND OPEN SOURCE SOFTWARE Kak Yong
 
Monitoring using Open source technologies
Monitoring using Open source technologiesMonitoring using Open source technologies
Monitoring using Open source technologiesUTKARSH BHATNAGAR
 
Power Point Presentation on Open Source Software
Power Point Presentation on Open Source Software Power Point Presentation on Open Source Software
Power Point Presentation on Open Source Software opensourceacademy
 
Open source software vs proprietary software
Open source software vs proprietary softwareOpen source software vs proprietary software
Open source software vs proprietary softwareLavan1997
 

Destacado (20)

Adapting Web Mapping Curriculum to Open Source Technologies
Adapting Web Mapping Curriculum to Open Source TechnologiesAdapting Web Mapping Curriculum to Open Source Technologies
Adapting Web Mapping Curriculum to Open Source Technologies
 
Open Source Hardware Technologies in Art, Design, Interaction, Learning and T...
Open Source Hardware Technologies in Art, Design, Interaction, Learning and T...Open Source Hardware Technologies in Art, Design, Interaction, Learning and T...
Open Source Hardware Technologies in Art, Design, Interaction, Learning and T...
 
Using Open Source Technologies to Spatially Enable Aceh
Using Open Source Technologies to Spatially Enable AcehUsing Open Source Technologies to Spatially Enable Aceh
Using Open Source Technologies to Spatially Enable Aceh
 
Open source technologies in Microsoft cloud
Open source technologies in Microsoft cloudOpen source technologies in Microsoft cloud
Open source technologies in Microsoft cloud
 
Open source technologies
Open source technologiesOpen source technologies
Open source technologies
 
Open source educational technologies for teaching and learning
Open source educational technologies for teaching and learningOpen source educational technologies for teaching and learning
Open source educational technologies for teaching and learning
 
Open source technologies
Open source technologiesOpen source technologies
Open source technologies
 
The future of open source and assistive technologies
The future of open source and assistive technologiesThe future of open source and assistive technologies
The future of open source and assistive technologies
 
OSCon 2011 Talk: The implications of open source technologies in safety criti...
OSCon 2011 Talk: The implications of open source technologies in safety criti...OSCon 2011 Talk: The implications of open source technologies in safety criti...
OSCon 2011 Talk: The implications of open source technologies in safety criti...
 
Open source technologies
Open source technologiesOpen source technologies
Open source technologies
 
RRAM Status and Opportunities
RRAM Status and Opportunities RRAM Status and Opportunities
RRAM Status and Opportunities
 
Open Source ERP Technologies for Java Developers
Open Source ERP Technologies for Java DevelopersOpen Source ERP Technologies for Java Developers
Open Source ERP Technologies for Java Developers
 
Mobile Hardware
Mobile HardwareMobile Hardware
Mobile Hardware
 
Open source technology
Open source technologyOpen source technology
Open source technology
 
Big Data Processing Utilizing Open-source Technologies - May 2015
Big Data Processing Utilizing Open-source Technologies - May 2015Big Data Processing Utilizing Open-source Technologies - May 2015
Big Data Processing Utilizing Open-source Technologies - May 2015
 
Open source Software: pros and cons
Open source Software: pros and consOpen source Software: pros and cons
Open source Software: pros and cons
 
PROPRIETARY AND OPEN SOURCE SOFTWARE
PROPRIETARY AND OPEN SOURCE SOFTWARE PROPRIETARY AND OPEN SOURCE SOFTWARE
PROPRIETARY AND OPEN SOURCE SOFTWARE
 
Monitoring using Open source technologies
Monitoring using Open source technologiesMonitoring using Open source technologies
Monitoring using Open source technologies
 
Power Point Presentation on Open Source Software
Power Point Presentation on Open Source Software Power Point Presentation on Open Source Software
Power Point Presentation on Open Source Software
 
Open source software vs proprietary software
Open source software vs proprietary softwareOpen source software vs proprietary software
Open source software vs proprietary software
 

Similar a Roman Kaplan, Graduate Student,Technion

Optimizing SSD Architecture for Client Workloads
Optimizing SSD Architecture for Client WorkloadsOptimizing SSD Architecture for Client Workloads
Optimizing SSD Architecture for Client WorkloadsJonathan Long
 
Xian He Sun Data-Centric Into
Xian He Sun Data-Centric IntoXian He Sun Data-Centric Into
Xian He Sun Data-Centric IntoSciCompIIT
 
TechEvent Apache Cassandra
TechEvent Apache CassandraTechEvent Apache Cassandra
TechEvent Apache CassandraTrivadis
 
High-Performance Physics Solver Design for Next Generation Consoles
High-Performance Physics Solver Design for Next Generation ConsolesHigh-Performance Physics Solver Design for Next Generation Consoles
High-Performance Physics Solver Design for Next Generation ConsolesSlide_N
 
CS 542 -- Query Execution
CS 542 -- Query ExecutionCS 542 -- Query Execution
CS 542 -- Query ExecutionJ Singh
 
Tuning Solr and its Pipeline for Logs: Presented by Rafał Kuć & Radu Gheorghe...
Tuning Solr and its Pipeline for Logs: Presented by Rafał Kuć & Radu Gheorghe...Tuning Solr and its Pipeline for Logs: Presented by Rafał Kuć & Radu Gheorghe...
Tuning Solr and its Pipeline for Logs: Presented by Rafał Kuć & Radu Gheorghe...Lucidworks
 
Spark Overview - Oleg Mürk
Spark Overview - Oleg MürkSpark Overview - Oleg Mürk
Spark Overview - Oleg MürkPlanet OS
 
Deep Dive: Memory Management in Apache Spark
Deep Dive: Memory Management in Apache SparkDeep Dive: Memory Management in Apache Spark
Deep Dive: Memory Management in Apache SparkDatabricks
 
Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...
Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...
Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...Databricks
 
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...Reynold Xin
 

Similar a Roman Kaplan, Graduate Student,Technion (20)

Optimizing SSD Architecture for Client Workloads
Optimizing SSD Architecture for Client WorkloadsOptimizing SSD Architecture for Client Workloads
Optimizing SSD Architecture for Client Workloads
 
Xian He Sun Data-Centric Into
Xian He Sun Data-Centric IntoXian He Sun Data-Centric Into
Xian He Sun Data-Centric Into
 
TechEvent Apache Cassandra
TechEvent Apache CassandraTechEvent Apache Cassandra
TechEvent Apache Cassandra
 
High-Performance Physics Solver Design for Next Generation Consoles
High-Performance Physics Solver Design for Next Generation ConsolesHigh-Performance Physics Solver Design for Next Generation Consoles
High-Performance Physics Solver Design for Next Generation Consoles
 
CS 542 -- Query Execution
CS 542 -- Query ExecutionCS 542 -- Query Execution
CS 542 -- Query Execution
 
memory.ppt
memory.pptmemory.ppt
memory.ppt
 
memory.ppt
memory.pptmemory.ppt
memory.ppt
 
Tuning Solr & Pipeline for Logs
Tuning Solr & Pipeline for LogsTuning Solr & Pipeline for Logs
Tuning Solr & Pipeline for Logs
 
Tuning Solr and its Pipeline for Logs: Presented by Rafał Kuć & Radu Gheorghe...
Tuning Solr and its Pipeline for Logs: Presented by Rafał Kuć & Radu Gheorghe...Tuning Solr and its Pipeline for Logs: Presented by Rafał Kuć & Radu Gheorghe...
Tuning Solr and its Pipeline for Logs: Presented by Rafał Kuć & Radu Gheorghe...
 
Spark Overview - Oleg Mürk
Spark Overview - Oleg MürkSpark Overview - Oleg Mürk
Spark Overview - Oleg Mürk
 
Deep Dive: Memory Management in Apache Spark
Deep Dive: Memory Management in Apache SparkDeep Dive: Memory Management in Apache Spark
Deep Dive: Memory Management in Apache Spark
 
Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...
Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...
Running Apache Spark on a High-Performance Cluster Using RDMA and NVMe Flash ...
 
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
 
Cache recap
Cache recapCache recap
Cache recap
 
Cache recap
Cache recapCache recap
Cache recap
 
Cache recap
Cache recapCache recap
Cache recap
 
Cache recap
Cache recapCache recap
Cache recap
 
Cache recap
Cache recapCache recap
Cache recap
 
Cache recap
Cache recapCache recap
Cache recap
 
Cache recap
Cache recapCache recap
Cache recap
 

Más de chiportal

Prof. Zhihua Wang, Tsinghua University, Beijing, China
Prof. Zhihua Wang, Tsinghua University, Beijing, China Prof. Zhihua Wang, Tsinghua University, Beijing, China
Prof. Zhihua Wang, Tsinghua University, Beijing, China chiportal
 
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...chiportal
 
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...chiportal
 
Prof. Uri Weiser,Technion
Prof. Uri Weiser,TechnionProf. Uri Weiser,Technion
Prof. Uri Weiser,Technionchiportal
 
Ken Liao, Senior Associate VP, Faraday
Ken Liao, Senior Associate VP, FaradayKen Liao, Senior Associate VP, Faraday
Ken Liao, Senior Associate VP, Faradaychiportal
 
Prof. Danny Raz, Director, Bell Labs Israel, Nokia
 Prof. Danny Raz, Director, Bell Labs Israel, Nokia  Prof. Danny Raz, Director, Bell Labs Israel, Nokia
Prof. Danny Raz, Director, Bell Labs Israel, Nokia chiportal
 
Marco Casale-Rossi, Product Mktg. Manager, Synopsys
Marco Casale-Rossi, Product Mktg. Manager, SynopsysMarco Casale-Rossi, Product Mktg. Manager, Synopsys
Marco Casale-Rossi, Product Mktg. Manager, Synopsyschiportal
 
Dr.Efraim Aharoni, ESD Leader, TowerJazz
Dr.Efraim Aharoni, ESD Leader, TowerJazzDr.Efraim Aharoni, ESD Leader, TowerJazz
Dr.Efraim Aharoni, ESD Leader, TowerJazzchiportal
 
Eddy Kvetny, System Engineering Group Leader, Intel
Eddy Kvetny, System Engineering Group Leader, IntelEddy Kvetny, System Engineering Group Leader, Intel
Eddy Kvetny, System Engineering Group Leader, Intelchiportal
 
Dr. John Bainbridge, Principal Application Architect, NetSpeed
 Dr. John Bainbridge, Principal Application Architect, NetSpeed  Dr. John Bainbridge, Principal Application Architect, NetSpeed
Dr. John Bainbridge, Principal Application Architect, NetSpeed chiportal
 
Xavier van Ruymbeke, App. Engineer, Arteris
Xavier van Ruymbeke, App. Engineer, ArterisXavier van Ruymbeke, App. Engineer, Arteris
Xavier van Ruymbeke, App. Engineer, Arterischiportal
 
Asi Lifshitz, VP R&D, Vtool
Asi Lifshitz, VP R&D, VtoolAsi Lifshitz, VP R&D, Vtool
Asi Lifshitz, VP R&D, Vtoolchiportal
 
Zvika Rozenshein,General Manager, EngineeringIQ
Zvika Rozenshein,General Manager, EngineeringIQZvika Rozenshein,General Manager, EngineeringIQ
Zvika Rozenshein,General Manager, EngineeringIQchiportal
 
Lewis Chu,Marketing Director,GUC
Lewis Chu,Marketing Director,GUC Lewis Chu,Marketing Director,GUC
Lewis Chu,Marketing Director,GUC chiportal
 
Kunal Varshney, VLSI Engineer, Open-Silicon
Kunal Varshney, VLSI Engineer, Open-SiliconKunal Varshney, VLSI Engineer, Open-Silicon
Kunal Varshney, VLSI Engineer, Open-Siliconchiportal
 
Gert Goossens,Sen. Director, ASIP Tools, Synopsys
Gert Goossens,Sen. Director, ASIP Tools, SynopsysGert Goossens,Sen. Director, ASIP Tools, Synopsys
Gert Goossens,Sen. Director, ASIP Tools, Synopsyschiportal
 
Tuvia Liran, Director of VLSI, Nano Retina
Tuvia Liran, Director of VLSI, Nano RetinaTuvia Liran, Director of VLSI, Nano Retina
Tuvia Liran, Director of VLSI, Nano Retinachiportal
 
Sagar Kadam, Lead Software Engineer, Open-Silicon
Sagar Kadam, Lead Software Engineer, Open-SiliconSagar Kadam, Lead Software Engineer, Open-Silicon
Sagar Kadam, Lead Software Engineer, Open-Siliconchiportal
 
Ronen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
Ronen Shtayer,Director of ASG Operations & PMO, NXP SemiconductorRonen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
Ronen Shtayer,Director of ASG Operations & PMO, NXP Semiconductorchiportal
 
Prof. Emanuel Cohen, Technion
Prof. Emanuel Cohen, TechnionProf. Emanuel Cohen, Technion
Prof. Emanuel Cohen, Technionchiportal
 

Más de chiportal (20)

Prof. Zhihua Wang, Tsinghua University, Beijing, China
Prof. Zhihua Wang, Tsinghua University, Beijing, China Prof. Zhihua Wang, Tsinghua University, Beijing, China
Prof. Zhihua Wang, Tsinghua University, Beijing, China
 
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
 
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
Prof. Steve Furber, University of Manchester, Principal Designer of the BBC M...
 
Prof. Uri Weiser,Technion
Prof. Uri Weiser,TechnionProf. Uri Weiser,Technion
Prof. Uri Weiser,Technion
 
Ken Liao, Senior Associate VP, Faraday
Ken Liao, Senior Associate VP, FaradayKen Liao, Senior Associate VP, Faraday
Ken Liao, Senior Associate VP, Faraday
 
Prof. Danny Raz, Director, Bell Labs Israel, Nokia
 Prof. Danny Raz, Director, Bell Labs Israel, Nokia  Prof. Danny Raz, Director, Bell Labs Israel, Nokia
Prof. Danny Raz, Director, Bell Labs Israel, Nokia
 
Marco Casale-Rossi, Product Mktg. Manager, Synopsys
Marco Casale-Rossi, Product Mktg. Manager, SynopsysMarco Casale-Rossi, Product Mktg. Manager, Synopsys
Marco Casale-Rossi, Product Mktg. Manager, Synopsys
 
Dr.Efraim Aharoni, ESD Leader, TowerJazz
Dr.Efraim Aharoni, ESD Leader, TowerJazzDr.Efraim Aharoni, ESD Leader, TowerJazz
Dr.Efraim Aharoni, ESD Leader, TowerJazz
 
Eddy Kvetny, System Engineering Group Leader, Intel
Eddy Kvetny, System Engineering Group Leader, IntelEddy Kvetny, System Engineering Group Leader, Intel
Eddy Kvetny, System Engineering Group Leader, Intel
 
Dr. John Bainbridge, Principal Application Architect, NetSpeed
 Dr. John Bainbridge, Principal Application Architect, NetSpeed  Dr. John Bainbridge, Principal Application Architect, NetSpeed
Dr. John Bainbridge, Principal Application Architect, NetSpeed
 
Xavier van Ruymbeke, App. Engineer, Arteris
Xavier van Ruymbeke, App. Engineer, ArterisXavier van Ruymbeke, App. Engineer, Arteris
Xavier van Ruymbeke, App. Engineer, Arteris
 
Asi Lifshitz, VP R&D, Vtool
Asi Lifshitz, VP R&D, VtoolAsi Lifshitz, VP R&D, Vtool
Asi Lifshitz, VP R&D, Vtool
 
Zvika Rozenshein,General Manager, EngineeringIQ
Zvika Rozenshein,General Manager, EngineeringIQZvika Rozenshein,General Manager, EngineeringIQ
Zvika Rozenshein,General Manager, EngineeringIQ
 
Lewis Chu,Marketing Director,GUC
Lewis Chu,Marketing Director,GUC Lewis Chu,Marketing Director,GUC
Lewis Chu,Marketing Director,GUC
 
Kunal Varshney, VLSI Engineer, Open-Silicon
Kunal Varshney, VLSI Engineer, Open-SiliconKunal Varshney, VLSI Engineer, Open-Silicon
Kunal Varshney, VLSI Engineer, Open-Silicon
 
Gert Goossens,Sen. Director, ASIP Tools, Synopsys
Gert Goossens,Sen. Director, ASIP Tools, SynopsysGert Goossens,Sen. Director, ASIP Tools, Synopsys
Gert Goossens,Sen. Director, ASIP Tools, Synopsys
 
Tuvia Liran, Director of VLSI, Nano Retina
Tuvia Liran, Director of VLSI, Nano RetinaTuvia Liran, Director of VLSI, Nano Retina
Tuvia Liran, Director of VLSI, Nano Retina
 
Sagar Kadam, Lead Software Engineer, Open-Silicon
Sagar Kadam, Lead Software Engineer, Open-SiliconSagar Kadam, Lead Software Engineer, Open-Silicon
Sagar Kadam, Lead Software Engineer, Open-Silicon
 
Ronen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
Ronen Shtayer,Director of ASG Operations & PMO, NXP SemiconductorRonen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
Ronen Shtayer,Director of ASG Operations & PMO, NXP Semiconductor
 
Prof. Emanuel Cohen, Technion
Prof. Emanuel Cohen, TechnionProf. Emanuel Cohen, Technion
Prof. Emanuel Cohen, Technion
 

Último

It will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayIt will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayNZSG
 
Insurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageInsurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageMatteo Carbone
 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear RegressionRavindra Nath Shukla
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMRavindra Nath Shukla
 
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...lizamodels9
 
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLMONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLSeo
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Dipal Arora
 
KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...
KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...
KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...Any kyc Account
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfPaul Menig
 
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdfDr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdfAdmir Softic
 
RSA Conference Exhibitor List 2024 - Exhibitors Data
RSA Conference Exhibitor List 2024 - Exhibitors DataRSA Conference Exhibitor List 2024 - Exhibitors Data
RSA Conference Exhibitor List 2024 - Exhibitors DataExhibitors Data
 
A DAY IN THE LIFE OF A SALESMAN / WOMAN
A DAY IN THE LIFE OF A  SALESMAN / WOMANA DAY IN THE LIFE OF A  SALESMAN / WOMAN
A DAY IN THE LIFE OF A SALESMAN / WOMANIlamathiKannappan
 
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communicationskarancommunications
 
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best ServicesMysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best ServicesDipal Arora
 
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Dave Litwiller
 
Call Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine ServiceCall Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine Serviceritikaroy0888
 
John Halpern sued for sexual assault.pdf
John Halpern sued for sexual assault.pdfJohn Halpern sued for sexual assault.pdf
John Halpern sued for sexual assault.pdfAmzadHosen3
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Lviv Startup Club
 
B.COM Unit – 4 ( CORPORATE SOCIAL RESPONSIBILITY ( CSR ).pptx
B.COM Unit – 4 ( CORPORATE SOCIAL RESPONSIBILITY ( CSR ).pptxB.COM Unit – 4 ( CORPORATE SOCIAL RESPONSIBILITY ( CSR ).pptx
B.COM Unit – 4 ( CORPORATE SOCIAL RESPONSIBILITY ( CSR ).pptxpriyanshujha201
 

Último (20)

It will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 MayIt will be International Nurses' Day on 12 May
It will be International Nurses' Day on 12 May
 
Insurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageInsurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usage
 
Regression analysis: Simple Linear Regression Multiple Linear Regression
Regression analysis:  Simple Linear Regression Multiple Linear RegressionRegression analysis:  Simple Linear Regression Multiple Linear Regression
Regression analysis: Simple Linear Regression Multiple Linear Regression
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSM
 
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
Russian Call Girls In Gurgaon ❤️8448577510 ⊹Best Escorts Service In 24/7 Delh...
 
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLMONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
 
KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...
KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...
KYC-Verified Accounts: Helping Companies Handle Challenging Regulatory Enviro...
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdf
 
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdfDr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
 
RSA Conference Exhibitor List 2024 - Exhibitors Data
RSA Conference Exhibitor List 2024 - Exhibitors DataRSA Conference Exhibitor List 2024 - Exhibitors Data
RSA Conference Exhibitor List 2024 - Exhibitors Data
 
A DAY IN THE LIFE OF A SALESMAN / WOMAN
A DAY IN THE LIFE OF A  SALESMAN / WOMANA DAY IN THE LIFE OF A  SALESMAN / WOMAN
A DAY IN THE LIFE OF A SALESMAN / WOMAN
 
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Pharma Works Profile of Karan Communications
Pharma Works Profile of Karan CommunicationsPharma Works Profile of Karan Communications
Pharma Works Profile of Karan Communications
 
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best ServicesMysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
Mysore Call Girls 8617370543 WhatsApp Number 24x7 Best Services
 
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
Enhancing and Restoring Safety & Quality Cultures - Dave Litwiller - May 2024...
 
Call Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine ServiceCall Girls In Panjim North Goa 9971646499 Genuine Service
Call Girls In Panjim North Goa 9971646499 Genuine Service
 
John Halpern sued for sexual assault.pdf
John Halpern sued for sexual assault.pdfJohn Halpern sued for sexual assault.pdf
John Halpern sued for sexual assault.pdf
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
 
B.COM Unit – 4 ( CORPORATE SOCIAL RESPONSIBILITY ( CSR ).pptx
B.COM Unit – 4 ( CORPORATE SOCIAL RESPONSIBILITY ( CSR ).pptxB.COM Unit – 4 ( CORPORATE SOCIAL RESPONSIBILITY ( CSR ).pptx
B.COM Unit – 4 ( CORPORATE SOCIAL RESPONSIBILITY ( CSR ).pptx
 

Roman Kaplan, Graduate Student,Technion

  • 1. May 9, 2016 1 May 9, 2016 Deduplication in Resistive CAM Based SSD Roman Kaplan, Leonid Yavits, Amir Morad, Ran Ginosar 2015
  • 2. May 9, 2016 2 Outline 1. What is ReCAM ? 2. What is deduplication ? – How is it done today? 3. Deduplication in ReCAM – How is it simpler? 4. Simulation results
  • 3. May 9, 2016 3 Resistive CAM – What is it? • CAM = Content Addressable Memory 1. Search for data in the entire array 2. Store address explicitly  function like RAM • Memristors:
  • 5. May 9, 2016 5 Resistive CAM – Operations What can ReCAM do: 1. Compare all its contents to a specific word 2. Write to specific columns in parallel 3. Write to specific rows in parallel
  • 6. May 9, 2016 6 What is Deduplication? 1. Data is broken into fixed blocks 2. A fingerprint (FP) is calculated for each block
  • 7. May 9, 2016 7 What is Deduplication? 1. Data is broken into fixed blocks 2. A fingerprint (FP) is calculated for each block 3. Identical blocks aren’t stored (deduplicated)
  • 8. May 9, 2016 8 Deduplication Uses 1. Useful when there is repeating data – Virtual machines – WAN optimizations (networking) – Backups 2. Compression ratio depend on type of data – can reach up to 40x
  • 9. May 9, 2016 9 Deduplication using RAM+CPU: Write 1. Calculate FP (Hash) 2. Search for it in the chunk index (takes very long time) 3. Act accordingly (next slides) Data Hash 2 … 1 … 1 PA(A) … PA(B) … PA(C) Hash(A) …. Hash(B) ….. Hash(C) Chunk Index Fingerprint Physical Address ? CNT 1 2 1 2
  • 10. May 9, 2016 10 RAM+CPU Deduplication: Write (Case 1) Case 1: If the FP is found Data block already exists I. Add LA+PA to ATT II. Increment FP counter in chunk index 1 … 1 … 1 PA(A) … PA(B) … PA(C) Hash(A) …. Hash(B) ….. Hash(C) Chunk Index Fingerprint Physical Address CNT Hash(D) Address Decoder A B C Data Blocks Storage D PA(D) 1 Address Translation Table 𝐿𝐴(D) 𝐿𝐴(A) 𝐿𝐴 B 𝐿𝐴(C) PA(D) PA(A) PA(B) PA(C) Logical Address Physical Address 𝐿𝐴2(D) PA(D) 2A A B B
  • 11. May 9, 2016 11 RAM+CPU Deduplication: Write (Case 2) Case 2: If the FP is not found A unique data block I. Write block to storage II. Add LA+PA to ATT III. Add FP to chunk index 1 … 1 … 1 PA(A) … PA(B) … PA(C) Hash(A) …. Hash(B) ….. Hash(C) Chunk Index Fingerprint Physical Address CNT Hash(D) Address Decoder A B C Data Blocks Storage D PA(D) 1 A B C A C Address Translation Table 𝐿𝐴(A) 𝐿𝐴 B 𝐿𝐴(C) PA(A) PA(B) PA(C) Logical Address Physical Address 𝐿𝐴(D) PA(D)B
  • 12. May 9, 2016 12 Deduplication is Hard with RAM+CPU • Delete is even more complicated than write • Requires complex data structures & computations Large memory & CPU • Example: EMC XtremIO Xbrick • 5TB all-flash storage • 256GB RAM • Quad-core CPU
  • 13. May 9, 2016 13 Deduplication in ReCAM • Much simpler than with RAM • Chunk index is not required anymore • Allows to compare all data blocks in storage simultaneously – If found, store only address-pointers Chunk Index
  • 14. May 9, 2016 14 Deduplication in ReCAM 1. Search for new data block in the storage 2. Act accordingly (next slides) Data Hash A B C Data Blocks Storage PA(A) PA(B) PA(C) Physical Address ?
  • 15. May 9, 2016 15 Deduplication in ReCAM Case 1: If the Data is found Data block already exists I. Add address to ATT Storage PA(A) PA(B) PA(C) Physical Address A B C Data Blocks DPA(D) Logical Address Physical Address Address Translation Table 𝐿𝐴(D) 𝐿𝐴(A) 𝐿𝐴 B 𝐿𝐴(C) PA(D) PA(A) PA(B) PA(C) 𝐿𝐴2(D) PA(D)
  • 16. May 9, 2016 16 Deduplication in ReCAM Case 2: If the Data is not found New Data block I. Write Data to storage II. Add address to ATT Address Translation Table Storage PA(A) PA(B) PA(C) Physical Address A B C Data Blocks DPA(D) A A B 𝐿𝐴(A) 𝐿𝐴 B 𝐿𝐴(C) PA(A) PA(B) PA(C) Logical Address Physical Address 𝐿𝐴(D) PA(D)B
  • 17. May 9, 2016 17 Deduplication in ReCAM Much Simpler than with RAM • Write: 1. Compare the entire array data simultaneously 2. If match, save only a pointer 3. If not, save the data block + pointer • Delete isn’t more complicated than write – If no addresses pointing to the data  delete
  • 18. May 9, 2016 18 Simulations • ReCAM – Cycle-accurate simulator: Size = 256GB, Clock = 1GHz – SPICE  each cycle power + performance • Opendedup for comparison – Intel PCM for CPU+DRAM energy – Only deduplication energy was measured – Per-block processing time for performance • 50GB of writes – Varying % of duplicate data
  • 19. May 9, 2016 19 Simulations – ReCAM vs. OpenDedup 0 50 100 10 4 10 5 10 6 10 7 10 8 Percentage of deduplicated blocks Peakwriteperformance(IOPS) ReCAM 1KB ReCAM 2KB ReCAM 4KB ReCAM 8KB OPNDDP 1KB OPNDDP 2KB OPNDDP 4KB OPNDDP 8KB OpenDedup ReCAM Throughput vs. duplicate %
  • 20. May 9, 2016 20 Simulations – ReCAM & OpenDedup Energy vs. duplicate % 0 20 40 60 80 100 10 3 10 4 10 5 Percentage of deduplicated blocks EnergyConsumption(Joule) ReCAM 1KB ReCAM 2KB ReCAM 4KB ReCAM 8KB OPNDDP 1KB OPNDDP 2KB OPNDDP 4KB OPNDDP 8KB
  • 21. May 9, 2016 21 Conclusions • ReCAM has 100x higher throughput than deduplication with RAM+CPU • Energy consumption is similar or lower for the common block sizes (4 & 8KB) • Can be used as cache in hybrid storage systems • Future technology may allow for TBs of storage on a single chip
  • 22. May 9, 2016 22 Thank you Questions ? 22