SlideShare una empresa de Scribd logo
1 de 33
Descargar para leer sin conexión
www.bsc.es
Comprehensive Evaluation of Supply Voltage
Underscaling in FPGA on-chip Memories
Behzad Salami, Osman Unsal, and Adrian Cristal
Barcelona Supercomptuing Center (BSC)
The 51st Annual IEEE/ACM International Symposium on Microarchitecture ® (MICOR),
Fukuoka City, Japan.
2
Aggressive Undervolting
 Aggressive undervolting- Underscaling the supply voltage below the nominal
and safe level:
 Power/Energy Efficiency: Reduces dynamic and static power quadratically and
linearly, respectively.
 Reliability: Increases the circuit delay and in turn, causes timing faults.
 Dual/Multi-Vdd, DVS, and DVFS: Similar but different mechanisms to
aggressive undervolting:
 Similarity: Underscaling the supply voltage.
 Difference: Undervolting is until a certain safe level, usually constrained by
vendors.
Reliability
Power/Energy
Efficiency
3
State-of-the-art
1. Aggressive undervolting has shown significant efficiency
to reduce the energy consumption.
 Devices:
 CPUs: Itanium II (ISCA2014), X86 (IOLTS2017), ARM (HPCA2017)
 GPUs: NVidia (Micro2015)
 DRAMs: Multiple Brands (Sigmetrics2017)
 FPGA: This work
 Focus of the previous works:
 Voltage guardband
 Minimum safe voltage, i.e., Vmin prediction
 Fault characterization and mitigation
 Chip-to-chip, core-to-core, and workload-to-workload variation
 ….
2. More straightforward and more parameters
but less precise
 ASIC DNN: Minerva (Micro2016), Thundervolt (DAC2018)
 CPU: Bravo (HPCA2017 )
 Network On-Chip (HPCA2014)
Real hardware:
Simulation-based studies:
4
Undervolting on FPGAs: Motivation
Contribution of FPGAs in large data centers is growing, expected to
be in 30% of datacenter servers by 2020 (Top500 news).
 In comparison to ASICs,
energy efficiency of FPGAs is
a serious concern, i.e., 10X-
100X less-efficient.
 Nominal voltage reduction
of FPGAs is naturally applied
for different generations.
Undervolting
[Intel/Altera]
[Xilinx]
5
Contributions
1. Undervolting FPGAs
 Voltage guardband
 Overall power and reliability trade-off
2. Fault characterization in FPGA on-chip memories
 Fault type, location, and rate
 Temperature, Chip
3. Low-voltage FPGA-based Neural Network (NN)
 Power consumption and NN accuracy characterization
 Fault mitigation techniques
 Application-aware technique
 Built-in ECC
6
Voltage Scaling Capability in Xilinx
VC707: performance-efficient design
KC705: power-efficient design (A & B)
Evaluated Xilinx platforms
VC707
Voltage distribution on Xilinx platforms
Voltage regulator
 Power Management Bus (PMBus).
 Hardwired to the host.
ZC702: ARM integrated with FPGA
VCCINT
VCCBRAM
7
Overall Voltage Behavior
 FPGA stops operating below
Vcrash, min operating voltageCRASH
 No observable fault
 Voltage Guardband below Vnom
SAFE
 Faults manifest
 Below Vmin, min safe voltageCRITICAL
 Voltage guardband: to ensure
the worst-case environmental
and process technologies.
 Experimental conditions: At
ambient temperature and
maximum operating frequency.
We performed more detailed studies on FPGA on-chip memories (BRAMs).
Vnom
Vmin
Vcrash
0
0.2
0.4
0.6
0.8
1
VC707 ZC702 KC705-A KC705-B
VCCBRAM(V)
Platform
SAFE (Guardband)
CRITICAL
CRASH
0
0.2
0.4
0.6
0.8
1
VC707 ZC702 KC705-A KC705-B
VCCINT(V)
Platform
SAFE (Guardband)
CRITICAL
CRASH
8
Floorplan of VC707
Experimental Methodology
 FPGA BRAMs:
 Hierarchy of set of bit-cells
distributed over the chip.
 Size of each BRAM: 16-kbits
 Experimental Methodology:
 HW: Transfer content of BRAMs to the host.
 SW: Analyze data, and adjust voltage of BRAMs. (2060 BRAMs)
9
0
200
400
600
800
0
1
2
3
1 0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55
FaultRate(per1Mbit)
BRAMPower(Watts)
VCCBRAM (V)
BRAM Power
Fault Rate
Vmin=0.61V
Vcrash=0.54V
0
400
800
0
0.1
0.2
0.3
0.4
0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54
per1Mbit
Watts
Vnom=1V
Overall Trade-offs on BRAMs- Power & Reliability
VC707
10
0
150
300
0
0.05
0.1
0.15
0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54
per1Mbit
Watts
Overall Trade-offs on BRAMs- Multiple Platforms
0
200
400
600
800
0
1
2
3
1 0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55
FaultRate
(per1Mbit)
BRAMPower
(Watts)
VCCBRAM (V)
Vnom=1V
Vmin=0.61V
0
400
800
0
0.2
0.4
0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54
per1Mbit
Watts
VC707
0
50
100
150
200
0
10
20
30
1 0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55
FaultRate
(per1Mbit)
BRAMPower
(mWatts)
VCCBRAM (V)
Vnom=1V
Vcrash=0.53V
0
100
200
0
2
4
0.59 0.58 0.57 0.56 0.55 0.54 0.53
per1Mbit
mWatts
ZC702
0
100
200
300
0
1
2
3
1 0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55
FaultRate
(per1Mbit)
BRAMPower
(Watts)
VCCBRAM (V)
Vnom=1V
Vcrash=0.54V
Vmin=0.61V
KC705-A
0
20
40
60
80
0
1
2
3
1 0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55
FaultRate
(per1Mbit)
BRAMPower
(Watts)
VCCBRAM (V)
Vnom=1V
Vmin=0.57V
Vcrash=0.54V
0
40
80
0
0.05
0.1
0.15
0.57 0.56 0.55 0.54
per1Mbit
Watts
KC705-B
Vmin=0.59V
Vcrash=0.54V
11
Key Points of the First Contribution
 Voltage regions: Safe, Critical, and Crash voltage regions exist
for all platforms, slightly different among studied platforms.
 Voltage guardbands: Large voltage guardband confirmed for
all platforms on the studied voltage rails, i.e., VCCBRAM and
VCCINT.
 Power reduction: There is significant power reduction through
aggressive undervolting, with more details studied for BRAMs.
 Reliability costs: Fault rates exponentially increase in the
Critical voltage region.
12
Contributions
1. Undervolting FPGAs
 Voltage guardband
 Overall power and reliability trade-off
2. Fault characterization in FPGA on-chip memories
 Fault type, location, and rate
 Temperature, Chip
3. Low-voltage FPGA-based Neural Network (NN)
 Power consumption and NN accuracy characterization
 Fault mitigation techniques
 Application-aware technique
 Built-in ECC
13
Fault Characterization at CRITICAL Region
 Fully non-uniform fault distribution.
 Majority of BRAMs do not experience many faults.
Fault variability among FPGA BRAMs:
Fully non-uniform fault distribution
VC707 (2060 BRAMs)
VCCBRAM@ Vcrash= 0.54V
Temperature@ Ambient
0.0%
0.3%
0.6%
0.9%
1.2%
1.5%
BRAMFaultRate(%)
%BRAMs Average Fault Rate (%)
1.8% 0.86%
High-vulnerable
9.4% 0.24%
Mid-vulnerable
52.3% 0.03%
Low-vulnerable
36.3% 0.0%
Zero-vulnerable
K-means clustering
14
Fault Characterization at CRITICAL Region
Type of undervolting faults:
Permanent faults at specific voltage
 There is no considerable change on the rate and location of faults over time.
 Validated by repeating experiments for 100 times.
 The physical location of BRAMs is extracted using Vivado.
 Fault Variation Map (FVM): Fault rate mapped to the physical location of BRAMs.
FVM can be potentially used in fault mitigation techniques!
FPGA x-axis
FPGAy-axis
BRAMFaultRate(%)
FVM @
(VCCBRAM @Vcrash, T= ambient, chip= VC707)
1 10 20 30 40 50 60 70 80 90 100
0
200
400
600
800
1000
1 11 21 31 41 51 61 71 81 91
FaultRate(per1Mbit)
#Run
Individual Run Cumulative Median
Key observations discussed:
1. Fault rate exponentially increases by further undervolting.
2. BRAMs have fully different reliability behavior against undervolting faults.
3. The fault rate and location is deterministic over the time.
Three parameters orthogonally have significant impact on the rate
and location of faults:
1. Voltage
2. Temperature
3. Chip
15
Fault Characterization (Voltage Impacts)
Location of undervolting faults:
Fault Inclusion Property (FIP)
 FIP: A corrupted bit at a specific voltage stays faulty in lower voltages as well.
 FIP can be used in mitigation techniques.
0.1
1
10
100
1000
10000
0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54
FaultRate(per1Mbit)
logscale
VCCBRAM (V)
Illustration of FIP FIP shown as fault rate for VC707
16
Fault Characterization (Temperature Impacts)
 Methodology: Adjusting environmental temperature, monitoring on-board
temperature via PMBus.
 Experimental Observation:
 At higher temperatures, fault rate is significantly reduced.
 Inverse Temperature Dependency :
 For nano-scale technology nodes, under ultra low-voltage operations, the circuit
delay reduces at higher temperatures since supply voltage approaches the
threshold voltage.
* x-axis: VCCBRAM (V). * y-axis: fault rate (per 1Mbit).
50 60 70 80
Practical confirmation of Inverse Temperature Dependency (ITD)
(1) Neshatpour, K., Burleson, W., Khajeh, A., & Homayoun, H. (2018). Enhancing Power, Performance, and Energy Efficiency in Chip
Multiprocessors Exploiting Inverse Thermal Dependence. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, (4), 778-791.
17
Fault Characterization (Chip Impacts)
KC705-BKC705-A
 Methodology: Repeating experiments on two identical samples of KC705
(A&B).
 Observations:
 Fault rates significantly vary, more than 4X.
 Fault Variation Maps (FVMs) are entirely different.
Fault location Fault location
@VCCBRAM= Vcrash @VCCBRAM= Vcrash
Even identical samples of same chips have totally different reliability
behavior, due to the process variation/aging effects.
Fault rate
0
100
200
300
0.57 0.56 0.55 0.54
Per1Mbit VCCBRAM (V)
Fault rate
0
100
200
300
0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54
Per1Mbit
VCCBRAM (V)
18
Key Points of the Second Contribution
 Fault rate: The increase of the fault rate by further undervolting is
exponential.
 Non-uniform fault distribution among BRAMs: BRAMs do not have similar
sensitivity against undervolting.
 Deterministic behavior of faults: The location of faults does not change
over the time, at certain voltage and temperature, and for a certain chip.
 Reliability behavior over different voltage levels: There is Fault Inclusion
Property (FIP).
 Environmental temperature: At higher temperatures, FPGA BRAMs shows
better reliability behavior, i.e., less fault rate.
 Reliability differences for chips: Even identical chips shows fully different
reliability behaviors.
19
Contributions
1. Undervolting FPGAs
 Voltage guardband
 Overall power and reliability trade-off
2. Fault characterization in FPGA on-chip memories
 Fault type, location, and rate
 Temperature, Chip
3. Low-voltage FPGA-based Neural Network (NN)
 Power consumption and NN accuracy characterization
 Fault mitigation techniques
 Application-aware technique
 Built-in ECC
20
Experimental Methodology
Neural Network (NN)
Type Fully-connected classifier
Total number of weights ~1.5 millions
Activation function Logsig (logarithmic sigmoid)
Major benchmark
Name-type MNIST- handwritten digit images
Number of images Training: 60000, Classification: 10000
Number of pixels per image 28*28=256
Number of output classes 10
Additional benchmarks
Names Forest and Reuters
Data representation model
Type 16-bits fixed-point
Precision Minimum sign and digit per layer
An example implementation on VC707
Frequency 100 Mhz
BRAM usage (total: 2060) 70.8%
21
NN Implementation on FPGA
 Input data: off-chip DDR memory.
 Weights: on-chip FPGA BRAM.
 Computation: Streaming data
onto DSPs and LUTs.
 We undervolt VCCBRAM:
 Weights of the NN are
potentially affected.
FPGA Implementation
22
Low-Voltage FPGA-based NN
 Significant power reduction until
the minimum safe voltage, i.e., Vmin
(By eliminating the voltage
guardband).
 Additional 40% power reduction
below the voltage guardband.
 The NN classification error exponentially
increases from 2.56% (inherent
classification error) to 6.74% through
undervolting BRAMs beyond Vmin.
 Fault mitigation techniques to prevent
the accuracy loss:
 Application-aware mechanism
 Built-in ECC
Power saving
NN accuracy loss
2.39
0.25 0.15
6.47
6.47 6.47
0
2
4
6
8
10
Vnom= 1 V Vmin= 0.61V Vcrash= 0.54V
On-chipPower(Watts)
BRAM Rest
0
100
200
300
400
0%
2%
4%
6%
8%
0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54
FaultRatio(per1Mbit)
ClassificationError(%)
VCCBRAM (V)
NN Error Ratio Fault Ratio
Inherent Error rate: 2.56%
23
Intelligently-Constrained BRAM placement (ICBP)
 Below voltage guardband level at CRITICAL voltage region, we present ICBP
to prevent NN classification error rate loss.
 Core Idea: Map most-sensitive weights to faults into robust BRAMs.
 Q: Which are the most-sensitive NN weights? A: Deeper Layers.
ICBP-Additional
ConstraintsintheFPGA
placementstage
1 1.4
2.1
3
5.7
LAYER0 LAYER1 LAYER2 LAYER3 LAYER4
Normalized
Vulnerability
NN Layers
24
ICBP Evaluation
 Pros:
 Significant accuracy loss prevention.
 No power and performance overhead.
 Cons:
 Needs the FVM as a pre-process step  Built-in ECC is evaluated
without having this cost.
0
0.1
0.2
0.3
0.4
0%
2%
4%
6%
8%
10%
0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54
BRAMsPower(Watt)
NNClassificationError(%)
VCCBRAM (V)
NN Error by Default Placement NN Error by ICBP BRAM Power
Inherent NN Error: 2.56%
25
Built-in ECC
 Built-in ECC of FPGA BRAMs:
 Hamming-code.
 Two (2) additional bits per row are reserved as parities.
 SECDED (Single-Error Correction and Double-Error Detection).
 Experimental Methodology:
 Activate built-in ECC under low-voltage read operations.
 Experimental Observations:
 >90% fault correction
 >7% fault detection (not correction)
0
200
400
600
800
0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54
Faultrate(per1Mbit)
VCCBRAM (V)
Without ECC With ECC
Parity Bits
single-bit
double-bit
multiple-bit
26
ECC for NN Accelerator
0%
2%
4%
6%
8%
0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54
ClassificationError(%)
VCCBRAM (V)
Without ECC With ECC
Inherent NN Error: 2.56%
Area Utilization (%)
BRAM LUT FF
Without ECC 96% 3% 0.25%
With ECC 100% 12% 0.25%
BRAM Power (W)
Vnom= 1V Vmin= 0.61V Vcrash= 0.54V
Without ECC 2.4 0.31 0.198
With ECC ---- ---- 0.211
ECC efficiency to prevent NN accuracy loss
ECC area and power costs
 Pros:
 Significant accuracy loss prevention.
 Negligible power and performance overhead.
 Cons:
 Requires larger data rows/lines.
 Not all FPGAs are equipped with this technique.
27
Key Points of the Third Contribution
 Power reduction for FPGA-based accelerates: Significant
energy improvement can be achieved for FPGA-based
accelerators (studied for typical NN) through undervolting:
 By eliminating the voltage guardband
 By further undervolting in the critical voltage region
 Cost of undervolting: Accuracy loss is also significant but
controllable at the critical voltage region.
 Fault mitigation techniques: According to the fault
characterization study, efficient mitigation techniques can be
deployed to prevent the NN accuracy loss.
28
Wrap up
Summary
Conclusion
Potential Next Steps
….
29
Summary
 We experimentally showed how Xilinx FPGAs work under
aggressive low-voltage operations.
 There is a conservative voltage guardband below the nominal
voltage level, i.e., Vnom.
 BRAMs power significantly reduces through undervolting;
however, reliability degrades below the minimum safe voltage,
i.e., Vmin.
 We characterized the behavior of undervolting faults at the
critical region.
 We evaluated FPGA undervolting for a typical NN accelerator.
30
Conclusion
 There is significant potential in commercial FPGAs to improve
the energy efficiency through aggressive undervolting.
 By eliminating the conservative voltage guardband
 By further undervolting into the voltage critical region
 Undervolting faults manifest deterministic behaviors.
 Efficient fault mitigation techniques can be deployed which
can allow to further energy saving.
 State-of-the-art FPGA-based accelerators can be adapted by
undervolting approach.
31
Constraints of the Xilinx FPGAs for Undervolting
 Many FPGA platforms, e.g., Zynq are not equipped with voltage scaling
capability.
 There is no standard about the voltage distribution among platform
components.
 In Xilinx products, voltage regulators are hardwired to the host through
PMBus interface.
 In many cases, several components on the FPGA platform share a single
voltage rail.
 Vendors set unnecessarily conservative voltage guardbands that increase
the energy.
 There is no publicly-available circuit-level information of FPGAs.
32
Potential Next Steps
 Different computing paradigms, e.g., Heterogeneous Computing,
Approximate Computing, Stochastic Computing, among others.
 Generalizing observations by extending the experiments to other
FPGA vendors like Intel/Altera.
 Evaluation the undervolting in noisy and harsh environments.
 More advanced designs, where other components such as I/O and
DSP are undervolted.
 Application profiling to analyze workload-to-workload variation.
 Dynamic Vmin prediction and scaling, adapted by frequency and
temperature.
www.bsc.es
Thanks!
Any Question/Comment?
Contact:
behzad.salami@bsc.es

Más contenido relacionado

La actualidad más candente

Surge Protect, a critical issue you could not fail to know while implementing...
Surge Protect, a critical issue you could not fail to know while implementing...Surge Protect, a critical issue you could not fail to know while implementing...
Surge Protect, a critical issue you could not fail to know while implementing...Victor Lee
 
ECET 220 Enthusiastic Study / snaptutorial.com
ECET 220 Enthusiastic Study / snaptutorial.comECET 220 Enthusiastic Study / snaptutorial.com
ECET 220 Enthusiastic Study / snaptutorial.comStephenson28
 
1.2V Analog I/O with full local ESD protection for TSMC 65nm technology
1.2V Analog I/O with full local ESD protection for TSMC 65nm technology1.2V Analog I/O with full local ESD protection for TSMC 65nm technology
1.2V Analog I/O with full local ESD protection for TSMC 65nm technologySofics
 
1.2V Over-voltage tolerant Analog I/O for TSMC 65nm technology
1.2V Over-voltage tolerant Analog I/O for TSMC 65nm technology1.2V Over-voltage tolerant Analog I/O for TSMC 65nm technology
1.2V Over-voltage tolerant Analog I/O for TSMC 65nm technologySofics
 
Tsmc65 1v2 full local protection analog io + cdm
Tsmc65 1v2 full local protection analog io + cdmTsmc65 1v2 full local protection analog io + cdm
Tsmc65 1v2 full local protection analog io + cdmSofics
 
1.2V core power clamp for TSMC 65nm technology
1.2V core power clamp for TSMC 65nm technology1.2V core power clamp for TSMC 65nm technology
1.2V core power clamp for TSMC 65nm technologySofics
 
Original Power Supply IC MIP2C2 2C2 DIP-7 New
Original Power Supply IC MIP2C2 2C2 DIP-7 NewOriginal Power Supply IC MIP2C2 2C2 DIP-7 New
Original Power Supply IC MIP2C2 2C2 DIP-7 NewAUTHELECTRONIC
 
DS-00593-14-XTR70020-High-temperature, High Input Voltage, 2A Positive LDO Re...
DS-00593-14-XTR70020-High-temperature, High Input Voltage, 2A Positive LDO Re...DS-00593-14-XTR70020-High-temperature, High Input Voltage, 2A Positive LDO Re...
DS-00593-14-XTR70020-High-temperature, High Input Voltage, 2A Positive LDO Re...Benjamin Parpillon
 
Dr.Efraim Aharoni, ESD Leader, TowerJazz
Dr.Efraim Aharoni, ESD Leader, TowerJazzDr.Efraim Aharoni, ESD Leader, TowerJazz
Dr.Efraim Aharoni, ESD Leader, TowerJazzchiportal
 
Residual current circuit_breakers
Residual current circuit_breakersResidual current circuit_breakers
Residual current circuit_breakersMinka Grdesic
 
As built and proposed SLD
As built and proposed SLDAs built and proposed SLD
As built and proposed SLDQasim Muhammad
 
2011 Latch-up immune ESD Protection Clamp for High Voltage optimized on TSMC ...
2011 Latch-up immune ESD Protection Clamp for High Voltage optimized on TSMC ...2011 Latch-up immune ESD Protection Clamp for High Voltage optimized on TSMC ...
2011 Latch-up immune ESD Protection Clamp for High Voltage optimized on TSMC ...Sofics
 
SPICE MODEL of 02DZ2.4_X (Standard Model) in SPICE PARK
SPICE MODEL of 02DZ2.4_X (Standard Model) in SPICE PARKSPICE MODEL of 02DZ2.4_X (Standard Model) in SPICE PARK
SPICE MODEL of 02DZ2.4_X (Standard Model) in SPICE PARKTsuyoshi Horigome
 
Original Transistor PNP TIP42C TIP42 6A 100V TO-220 New
Original Transistor PNP TIP42C TIP42 6A 100V TO-220 NewOriginal Transistor PNP TIP42C TIP42 6A 100V TO-220 New
Original Transistor PNP TIP42C TIP42 6A 100V TO-220 NewAUTHELECTRONIC
 
Schnieder mccb li st
Schnieder mccb li stSchnieder mccb li st
Schnieder mccb li stRamesh Meti
 

La actualidad más candente (18)

Surge Protect, a critical issue you could not fail to know while implementing...
Surge Protect, a critical issue you could not fail to know while implementing...Surge Protect, a critical issue you could not fail to know while implementing...
Surge Protect, a critical issue you could not fail to know while implementing...
 
ECET 220 Enthusiastic Study / snaptutorial.com
ECET 220 Enthusiastic Study / snaptutorial.comECET 220 Enthusiastic Study / snaptutorial.com
ECET 220 Enthusiastic Study / snaptutorial.com
 
1.2V Analog I/O with full local ESD protection for TSMC 65nm technology
1.2V Analog I/O with full local ESD protection for TSMC 65nm technology1.2V Analog I/O with full local ESD protection for TSMC 65nm technology
1.2V Analog I/O with full local ESD protection for TSMC 65nm technology
 
1.2V Over-voltage tolerant Analog I/O for TSMC 65nm technology
1.2V Over-voltage tolerant Analog I/O for TSMC 65nm technology1.2V Over-voltage tolerant Analog I/O for TSMC 65nm technology
1.2V Over-voltage tolerant Analog I/O for TSMC 65nm technology
 
Tsmc65 1v2 full local protection analog io + cdm
Tsmc65 1v2 full local protection analog io + cdmTsmc65 1v2 full local protection analog io + cdm
Tsmc65 1v2 full local protection analog io + cdm
 
1.2V core power clamp for TSMC 65nm technology
1.2V core power clamp for TSMC 65nm technology1.2V core power clamp for TSMC 65nm technology
1.2V core power clamp for TSMC 65nm technology
 
Original Power Supply IC MIP2C2 2C2 DIP-7 New
Original Power Supply IC MIP2C2 2C2 DIP-7 NewOriginal Power Supply IC MIP2C2 2C2 DIP-7 New
Original Power Supply IC MIP2C2 2C2 DIP-7 New
 
ELVD Shock Preventer
ELVD Shock Preventer ELVD Shock Preventer
ELVD Shock Preventer
 
DS-00593-14-XTR70020-High-temperature, High Input Voltage, 2A Positive LDO Re...
DS-00593-14-XTR70020-High-temperature, High Input Voltage, 2A Positive LDO Re...DS-00593-14-XTR70020-High-temperature, High Input Voltage, 2A Positive LDO Re...
DS-00593-14-XTR70020-High-temperature, High Input Voltage, 2A Positive LDO Re...
 
Dr.Efraim Aharoni, ESD Leader, TowerJazz
Dr.Efraim Aharoni, ESD Leader, TowerJazzDr.Efraim Aharoni, ESD Leader, TowerJazz
Dr.Efraim Aharoni, ESD Leader, TowerJazz
 
Thông số kĩ thuật MCCB LS
Thông số kĩ thuật MCCB LSThông số kĩ thuật MCCB LS
Thông số kĩ thuật MCCB LS
 
Residual current circuit_breakers
Residual current circuit_breakersResidual current circuit_breakers
Residual current circuit_breakers
 
MCCB Easy Pact ECLB
MCCB Easy Pact ECLBMCCB Easy Pact ECLB
MCCB Easy Pact ECLB
 
As built and proposed SLD
As built and proposed SLDAs built and proposed SLD
As built and proposed SLD
 
2011 Latch-up immune ESD Protection Clamp for High Voltage optimized on TSMC ...
2011 Latch-up immune ESD Protection Clamp for High Voltage optimized on TSMC ...2011 Latch-up immune ESD Protection Clamp for High Voltage optimized on TSMC ...
2011 Latch-up immune ESD Protection Clamp for High Voltage optimized on TSMC ...
 
SPICE MODEL of 02DZ2.4_X (Standard Model) in SPICE PARK
SPICE MODEL of 02DZ2.4_X (Standard Model) in SPICE PARKSPICE MODEL of 02DZ2.4_X (Standard Model) in SPICE PARK
SPICE MODEL of 02DZ2.4_X (Standard Model) in SPICE PARK
 
Original Transistor PNP TIP42C TIP42 6A 100V TO-220 New
Original Transistor PNP TIP42C TIP42 6A 100V TO-220 NewOriginal Transistor PNP TIP42C TIP42 6A 100V TO-220 New
Original Transistor PNP TIP42C TIP42 6A 100V TO-220 New
 
Schnieder mccb li st
Schnieder mccb li stSchnieder mccb li st
Schnieder mccb li st
 

Similar a Comprehensive evaluation of supply voltage underscaling in FPGA on chip memories

Understanding the Reliability and Power-Efficiency Trade-offs of Modern FPGAs...
Understanding the Reliability and Power-Efficiency Trade-offs of Modern FPGAs...Understanding the Reliability and Power-Efficiency Trade-offs of Modern FPGAs...
Understanding the Reliability and Power-Efficiency Trade-offs of Modern FPGAs...Behzad Salami
 
How to protect electronic systems against esd
How to protect electronic systems against esdHow to protect electronic systems against esd
How to protect electronic systems against esdMohamed Saadna
 
ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...
ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...
ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...Behzad Salami
 
FPL 2018: Fault Characterization Through FPGAs Undervolting
FPL 2018: Fault Characterization Through FPGAs UndervoltingFPL 2018: Fault Characterization Through FPGAs Undervolting
FPL 2018: Fault Characterization Through FPGAs UndervoltingLEGATO project
 
On-Chip ESD Protection Achieving 8kV HBM Without Compromising the 3.4Gbps HDM...
On-Chip ESD Protection Achieving 8kV HBM Without Compromising the 3.4Gbps HDM...On-Chip ESD Protection Achieving 8kV HBM Without Compromising the 3.4Gbps HDM...
On-Chip ESD Protection Achieving 8kV HBM Without Compromising the 3.4Gbps HDM...Sofics
 
Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...
Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...
Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...iosrjce
 
testing purpose for ic testing and othettastinng
testing purpose for ic testing and othettastinngtesting purpose for ic testing and othettastinng
testing purpose for ic testing and othettastinngPvaveen
 
FPL 2018: A Demo of FPGA Aggressive Voltage Downscaling: Power and Reliabilit...
FPL 2018: A Demo of FPGA Aggressive Voltage Downscaling: Power and Reliabilit...FPL 2018: A Demo of FPGA Aggressive Voltage Downscaling: Power and Reliabilit...
FPL 2018: A Demo of FPGA Aggressive Voltage Downscaling: Power and Reliabilit...LEGATO project
 
Power system-protection-presentation-dated-03-10-2013-integrated-protection-c...
Power system-protection-presentation-dated-03-10-2013-integrated-protection-c...Power system-protection-presentation-dated-03-10-2013-integrated-protection-c...
Power system-protection-presentation-dated-03-10-2013-integrated-protection-c...jbpatel7290
 
Emmvee Crystal Off-grid Module:
Emmvee Crystal Off-grid Module:Emmvee Crystal Off-grid Module:
Emmvee Crystal Off-grid Module:Emmvee India
 
ACS37800-Datasheet.pdf
ACS37800-Datasheet.pdfACS37800-Datasheet.pdf
ACS37800-Datasheet.pdfSvenSong
 
42jssc07-pfeiffer-proof
42jssc07-pfeiffer-proof42jssc07-pfeiffer-proof
42jssc07-pfeiffer-proofDavid Goren
 
Steper Motor Control Through Wireless
Steper Motor Control Through WirelessSteper Motor Control Through Wireless
Steper Motor Control Through WirelessPawan Bahuguna
 
under grund fault ppt (1).pptx
under grund fault ppt (1).pptxunder grund fault ppt (1).pptx
under grund fault ppt (1).pptxPoojaG86
 

Similar a Comprehensive evaluation of supply voltage underscaling in FPGA on chip memories (20)

Understanding the Reliability and Power-Efficiency Trade-offs of Modern FPGAs...
Understanding the Reliability and Power-Efficiency Trade-offs of Modern FPGAs...Understanding the Reliability and Power-Efficiency Trade-offs of Modern FPGAs...
Understanding the Reliability and Power-Efficiency Trade-offs of Modern FPGAs...
 
How to protect electronic systems against esd
How to protect electronic systems against esdHow to protect electronic systems against esd
How to protect electronic systems against esd
 
ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...
ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...
ISCA2021 Tutorial-Methods for Characterization and Analysis of Voltage Margin...
 
FPL 2018: Fault Characterization Through FPGAs Undervolting
FPL 2018: Fault Characterization Through FPGAs UndervoltingFPL 2018: Fault Characterization Through FPGAs Undervolting
FPL 2018: Fault Characterization Through FPGAs Undervolting
 
On-Chip ESD Protection Achieving 8kV HBM Without Compromising the 3.4Gbps HDM...
On-Chip ESD Protection Achieving 8kV HBM Without Compromising the 3.4Gbps HDM...On-Chip ESD Protection Achieving 8kV HBM Without Compromising the 3.4Gbps HDM...
On-Chip ESD Protection Achieving 8kV HBM Without Compromising the 3.4Gbps HDM...
 
Fsq100
Fsq100Fsq100
Fsq100
 
FirmLeak
FirmLeakFirmLeak
FirmLeak
 
Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...
Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...
Vlsi Design of Low Transition Low Power Test Pattern Generator Using Fault Co...
 
H010613642
H010613642H010613642
H010613642
 
Abb uno-6
Abb uno-6Abb uno-6
Abb uno-6
 
testing purpose for ic testing and othettastinng
testing purpose for ic testing and othettastinngtesting purpose for ic testing and othettastinng
testing purpose for ic testing and othettastinng
 
FPL 2018: A Demo of FPGA Aggressive Voltage Downscaling: Power and Reliabilit...
FPL 2018: A Demo of FPGA Aggressive Voltage Downscaling: Power and Reliabilit...FPL 2018: A Demo of FPGA Aggressive Voltage Downscaling: Power and Reliabilit...
FPL 2018: A Demo of FPGA Aggressive Voltage Downscaling: Power and Reliabilit...
 
Power system-protection-presentation-dated-03-10-2013-integrated-protection-c...
Power system-protection-presentation-dated-03-10-2013-integrated-protection-c...Power system-protection-presentation-dated-03-10-2013-integrated-protection-c...
Power system-protection-presentation-dated-03-10-2013-integrated-protection-c...
 
Emmvee Crystal Off-grid Module:
Emmvee Crystal Off-grid Module:Emmvee Crystal Off-grid Module:
Emmvee Crystal Off-grid Module:
 
ACS37800-Datasheet.pdf
ACS37800-Datasheet.pdfACS37800-Datasheet.pdf
ACS37800-Datasheet.pdf
 
42jssc07-pfeiffer-proof
42jssc07-pfeiffer-proof42jssc07-pfeiffer-proof
42jssc07-pfeiffer-proof
 
Steper Motor Control Through Wireless
Steper Motor Control Through WirelessSteper Motor Control Through Wireless
Steper Motor Control Through Wireless
 
under grund fault ppt (1).pptx
under grund fault ppt (1).pptxunder grund fault ppt (1).pptx
under grund fault ppt (1).pptx
 
O0704080084
O0704080084O0704080084
O0704080084
 
PMIC_V3
PMIC_V3PMIC_V3
PMIC_V3
 

Más de LEGATO project

Scrooge Attack: Undervolting ARM Processors for Profit
Scrooge Attack: Undervolting ARM Processors for ProfitScrooge Attack: Undervolting ARM Processors for Profit
Scrooge Attack: Undervolting ARM Processors for ProfitLEGATO project
 
A practical approach for updating an integrity-enforced operating system
A practical approach for updating an integrity-enforced operating systemA practical approach for updating an integrity-enforced operating system
A practical approach for updating an integrity-enforced operating systemLEGATO project
 
TEEMon: A continuous performance monitoring framework for TEEs
TEEMon: A continuous performance monitoring framework for TEEsTEEMon: A continuous performance monitoring framework for TEEs
TEEMon: A continuous performance monitoring framework for TEEsLEGATO project
 
secureTF: A Secure TensorFlow Framework
secureTF: A Secure TensorFlow FrameworksecureTF: A Secure TensorFlow Framework
secureTF: A Secure TensorFlow FrameworkLEGATO project
 
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...LEGATO project
 
LEGaTO: Machine Learning Use Case
LEGaTO: Machine Learning Use CaseLEGaTO: Machine Learning Use Case
LEGaTO: Machine Learning Use CaseLEGATO project
 
Smart Home AI at the edge
Smart Home AI at the edgeSmart Home AI at the edge
Smart Home AI at the edgeLEGATO project
 
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the projectLEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the projectLEGATO project
 
LEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming ModelsLEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming ModelsLEGATO project
 
LEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGATO project
 
LEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGATO project
 
LEGaTO: Low-Energy Heterogeneous Computing Workshop
LEGaTO: Low-Energy Heterogeneous Computing WorkshopLEGaTO: Low-Energy Heterogeneous Computing Workshop
LEGaTO: Low-Energy Heterogeneous Computing WorkshopLEGATO project
 
TZ4Fabric: Executing Smart Contracts with ARM TrustZone
TZ4Fabric: Executing Smart Contracts with ARM TrustZoneTZ4Fabric: Executing Smart Contracts with ARM TrustZone
TZ4Fabric: Executing Smart Contracts with ARM TrustZoneLEGATO project
 
Infection Research with Maxeler Dataflow Computing
Infection Research with Maxeler Dataflow ComputingInfection Research with Maxeler Dataflow Computing
Infection Research with Maxeler Dataflow ComputingLEGATO project
 
Smart Home - AI at the edge
Smart Home - AI at the edgeSmart Home - AI at the edge
Smart Home - AI at the edgeLEGATO project
 
Device Data Directory and Asynchronous execution: A path to heterogeneous com...
Device Data Directory and Asynchronous execution: A path to heterogeneous com...Device Data Directory and Asynchronous execution: A path to heterogeneous com...
Device Data Directory and Asynchronous execution: A path to heterogeneous com...LEGATO project
 
Scheduling Task-parallel Applications in Dynamically Asymmetric Environments
Scheduling Task-parallel Applications in Dynamically Asymmetric EnvironmentsScheduling Task-parallel Applications in Dynamically Asymmetric Environments
Scheduling Task-parallel Applications in Dynamically Asymmetric EnvironmentsLEGATO project
 
RECS – Cloud to Edge Microserver Platform for Energy-Efficient Computing
RECS – Cloud to Edge Microserver Platform for Energy-Efficient ComputingRECS – Cloud to Edge Microserver Platform for Energy-Efficient Computing
RECS – Cloud to Edge Microserver Platform for Energy-Efficient ComputingLEGATO project
 

Más de LEGATO project (20)

Scrooge Attack: Undervolting ARM Processors for Profit
Scrooge Attack: Undervolting ARM Processors for ProfitScrooge Attack: Undervolting ARM Processors for Profit
Scrooge Attack: Undervolting ARM Processors for Profit
 
A practical approach for updating an integrity-enforced operating system
A practical approach for updating an integrity-enforced operating systemA practical approach for updating an integrity-enforced operating system
A practical approach for updating an integrity-enforced operating system
 
TEEMon: A continuous performance monitoring framework for TEEs
TEEMon: A continuous performance monitoring framework for TEEsTEEMon: A continuous performance monitoring framework for TEEs
TEEMon: A continuous performance monitoring framework for TEEs
 
secureTF: A Secure TensorFlow Framework
secureTF: A Secure TensorFlow FrameworksecureTF: A Secure TensorFlow Framework
secureTF: A Secure TensorFlow Framework
 
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...
 
LEGaTO: Machine Learning Use Case
LEGaTO: Machine Learning Use CaseLEGaTO: Machine Learning Use Case
LEGaTO: Machine Learning Use Case
 
Smart Home AI at the edge
Smart Home AI at the edgeSmart Home AI at the edge
Smart Home AI at the edge
 
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the projectLEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
 
LEGaTO Integration
LEGaTO IntegrationLEGaTO Integration
LEGaTO Integration
 
LEGaTO: Use cases
LEGaTO: Use casesLEGaTO: Use cases
LEGaTO: Use cases
 
LEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming ModelsLEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming Models
 
LEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGaTO: Software Stack Runtimes
LEGaTO: Software Stack Runtimes
 
LEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous Hardware
 
LEGaTO: Low-Energy Heterogeneous Computing Workshop
LEGaTO: Low-Energy Heterogeneous Computing WorkshopLEGaTO: Low-Energy Heterogeneous Computing Workshop
LEGaTO: Low-Energy Heterogeneous Computing Workshop
 
TZ4Fabric: Executing Smart Contracts with ARM TrustZone
TZ4Fabric: Executing Smart Contracts with ARM TrustZoneTZ4Fabric: Executing Smart Contracts with ARM TrustZone
TZ4Fabric: Executing Smart Contracts with ARM TrustZone
 
Infection Research with Maxeler Dataflow Computing
Infection Research with Maxeler Dataflow ComputingInfection Research with Maxeler Dataflow Computing
Infection Research with Maxeler Dataflow Computing
 
Smart Home - AI at the edge
Smart Home - AI at the edgeSmart Home - AI at the edge
Smart Home - AI at the edge
 
Device Data Directory and Asynchronous execution: A path to heterogeneous com...
Device Data Directory and Asynchronous execution: A path to heterogeneous com...Device Data Directory and Asynchronous execution: A path to heterogeneous com...
Device Data Directory and Asynchronous execution: A path to heterogeneous com...
 
Scheduling Task-parallel Applications in Dynamically Asymmetric Environments
Scheduling Task-parallel Applications in Dynamically Asymmetric EnvironmentsScheduling Task-parallel Applications in Dynamically Asymmetric Environments
Scheduling Task-parallel Applications in Dynamically Asymmetric Environments
 
RECS – Cloud to Edge Microserver Platform for Energy-Efficient Computing
RECS – Cloud to Edge Microserver Platform for Energy-Efficient ComputingRECS – Cloud to Edge Microserver Platform for Energy-Efficient Computing
RECS – Cloud to Edge Microserver Platform for Energy-Efficient Computing
 

Último

Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLkantirani197
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Creating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsCreating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsNurulAfiqah307317
 

Último (20)

Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Creating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsCreating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening Designs
 

Comprehensive evaluation of supply voltage underscaling in FPGA on chip memories

  • 1. www.bsc.es Comprehensive Evaluation of Supply Voltage Underscaling in FPGA on-chip Memories Behzad Salami, Osman Unsal, and Adrian Cristal Barcelona Supercomptuing Center (BSC) The 51st Annual IEEE/ACM International Symposium on Microarchitecture ® (MICOR), Fukuoka City, Japan.
  • 2. 2 Aggressive Undervolting  Aggressive undervolting- Underscaling the supply voltage below the nominal and safe level:  Power/Energy Efficiency: Reduces dynamic and static power quadratically and linearly, respectively.  Reliability: Increases the circuit delay and in turn, causes timing faults.  Dual/Multi-Vdd, DVS, and DVFS: Similar but different mechanisms to aggressive undervolting:  Similarity: Underscaling the supply voltage.  Difference: Undervolting is until a certain safe level, usually constrained by vendors. Reliability Power/Energy Efficiency
  • 3. 3 State-of-the-art 1. Aggressive undervolting has shown significant efficiency to reduce the energy consumption.  Devices:  CPUs: Itanium II (ISCA2014), X86 (IOLTS2017), ARM (HPCA2017)  GPUs: NVidia (Micro2015)  DRAMs: Multiple Brands (Sigmetrics2017)  FPGA: This work  Focus of the previous works:  Voltage guardband  Minimum safe voltage, i.e., Vmin prediction  Fault characterization and mitigation  Chip-to-chip, core-to-core, and workload-to-workload variation  …. 2. More straightforward and more parameters but less precise  ASIC DNN: Minerva (Micro2016), Thundervolt (DAC2018)  CPU: Bravo (HPCA2017 )  Network On-Chip (HPCA2014) Real hardware: Simulation-based studies:
  • 4. 4 Undervolting on FPGAs: Motivation Contribution of FPGAs in large data centers is growing, expected to be in 30% of datacenter servers by 2020 (Top500 news).  In comparison to ASICs, energy efficiency of FPGAs is a serious concern, i.e., 10X- 100X less-efficient.  Nominal voltage reduction of FPGAs is naturally applied for different generations. Undervolting [Intel/Altera] [Xilinx]
  • 5. 5 Contributions 1. Undervolting FPGAs  Voltage guardband  Overall power and reliability trade-off 2. Fault characterization in FPGA on-chip memories  Fault type, location, and rate  Temperature, Chip 3. Low-voltage FPGA-based Neural Network (NN)  Power consumption and NN accuracy characterization  Fault mitigation techniques  Application-aware technique  Built-in ECC
  • 6. 6 Voltage Scaling Capability in Xilinx VC707: performance-efficient design KC705: power-efficient design (A & B) Evaluated Xilinx platforms VC707 Voltage distribution on Xilinx platforms Voltage regulator  Power Management Bus (PMBus).  Hardwired to the host. ZC702: ARM integrated with FPGA VCCINT VCCBRAM
  • 7. 7 Overall Voltage Behavior  FPGA stops operating below Vcrash, min operating voltageCRASH  No observable fault  Voltage Guardband below Vnom SAFE  Faults manifest  Below Vmin, min safe voltageCRITICAL  Voltage guardband: to ensure the worst-case environmental and process technologies.  Experimental conditions: At ambient temperature and maximum operating frequency. We performed more detailed studies on FPGA on-chip memories (BRAMs). Vnom Vmin Vcrash 0 0.2 0.4 0.6 0.8 1 VC707 ZC702 KC705-A KC705-B VCCBRAM(V) Platform SAFE (Guardband) CRITICAL CRASH 0 0.2 0.4 0.6 0.8 1 VC707 ZC702 KC705-A KC705-B VCCINT(V) Platform SAFE (Guardband) CRITICAL CRASH
  • 8. 8 Floorplan of VC707 Experimental Methodology  FPGA BRAMs:  Hierarchy of set of bit-cells distributed over the chip.  Size of each BRAM: 16-kbits  Experimental Methodology:  HW: Transfer content of BRAMs to the host.  SW: Analyze data, and adjust voltage of BRAMs. (2060 BRAMs)
  • 9. 9 0 200 400 600 800 0 1 2 3 1 0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55 FaultRate(per1Mbit) BRAMPower(Watts) VCCBRAM (V) BRAM Power Fault Rate Vmin=0.61V Vcrash=0.54V 0 400 800 0 0.1 0.2 0.3 0.4 0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54 per1Mbit Watts Vnom=1V Overall Trade-offs on BRAMs- Power & Reliability VC707
  • 10. 10 0 150 300 0 0.05 0.1 0.15 0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54 per1Mbit Watts Overall Trade-offs on BRAMs- Multiple Platforms 0 200 400 600 800 0 1 2 3 1 0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55 FaultRate (per1Mbit) BRAMPower (Watts) VCCBRAM (V) Vnom=1V Vmin=0.61V 0 400 800 0 0.2 0.4 0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54 per1Mbit Watts VC707 0 50 100 150 200 0 10 20 30 1 0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55 FaultRate (per1Mbit) BRAMPower (mWatts) VCCBRAM (V) Vnom=1V Vcrash=0.53V 0 100 200 0 2 4 0.59 0.58 0.57 0.56 0.55 0.54 0.53 per1Mbit mWatts ZC702 0 100 200 300 0 1 2 3 1 0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55 FaultRate (per1Mbit) BRAMPower (Watts) VCCBRAM (V) Vnom=1V Vcrash=0.54V Vmin=0.61V KC705-A 0 20 40 60 80 0 1 2 3 1 0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55 FaultRate (per1Mbit) BRAMPower (Watts) VCCBRAM (V) Vnom=1V Vmin=0.57V Vcrash=0.54V 0 40 80 0 0.05 0.1 0.15 0.57 0.56 0.55 0.54 per1Mbit Watts KC705-B Vmin=0.59V Vcrash=0.54V
  • 11. 11 Key Points of the First Contribution  Voltage regions: Safe, Critical, and Crash voltage regions exist for all platforms, slightly different among studied platforms.  Voltage guardbands: Large voltage guardband confirmed for all platforms on the studied voltage rails, i.e., VCCBRAM and VCCINT.  Power reduction: There is significant power reduction through aggressive undervolting, with more details studied for BRAMs.  Reliability costs: Fault rates exponentially increase in the Critical voltage region.
  • 12. 12 Contributions 1. Undervolting FPGAs  Voltage guardband  Overall power and reliability trade-off 2. Fault characterization in FPGA on-chip memories  Fault type, location, and rate  Temperature, Chip 3. Low-voltage FPGA-based Neural Network (NN)  Power consumption and NN accuracy characterization  Fault mitigation techniques  Application-aware technique  Built-in ECC
  • 13. 13 Fault Characterization at CRITICAL Region  Fully non-uniform fault distribution.  Majority of BRAMs do not experience many faults. Fault variability among FPGA BRAMs: Fully non-uniform fault distribution VC707 (2060 BRAMs) VCCBRAM@ Vcrash= 0.54V Temperature@ Ambient 0.0% 0.3% 0.6% 0.9% 1.2% 1.5% BRAMFaultRate(%) %BRAMs Average Fault Rate (%) 1.8% 0.86% High-vulnerable 9.4% 0.24% Mid-vulnerable 52.3% 0.03% Low-vulnerable 36.3% 0.0% Zero-vulnerable K-means clustering
  • 14. 14 Fault Characterization at CRITICAL Region Type of undervolting faults: Permanent faults at specific voltage  There is no considerable change on the rate and location of faults over time.  Validated by repeating experiments for 100 times.  The physical location of BRAMs is extracted using Vivado.  Fault Variation Map (FVM): Fault rate mapped to the physical location of BRAMs. FVM can be potentially used in fault mitigation techniques! FPGA x-axis FPGAy-axis BRAMFaultRate(%) FVM @ (VCCBRAM @Vcrash, T= ambient, chip= VC707) 1 10 20 30 40 50 60 70 80 90 100 0 200 400 600 800 1000 1 11 21 31 41 51 61 71 81 91 FaultRate(per1Mbit) #Run Individual Run Cumulative Median Key observations discussed: 1. Fault rate exponentially increases by further undervolting. 2. BRAMs have fully different reliability behavior against undervolting faults. 3. The fault rate and location is deterministic over the time. Three parameters orthogonally have significant impact on the rate and location of faults: 1. Voltage 2. Temperature 3. Chip
  • 15. 15 Fault Characterization (Voltage Impacts) Location of undervolting faults: Fault Inclusion Property (FIP)  FIP: A corrupted bit at a specific voltage stays faulty in lower voltages as well.  FIP can be used in mitigation techniques. 0.1 1 10 100 1000 10000 0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54 FaultRate(per1Mbit) logscale VCCBRAM (V) Illustration of FIP FIP shown as fault rate for VC707
  • 16. 16 Fault Characterization (Temperature Impacts)  Methodology: Adjusting environmental temperature, monitoring on-board temperature via PMBus.  Experimental Observation:  At higher temperatures, fault rate is significantly reduced.  Inverse Temperature Dependency :  For nano-scale technology nodes, under ultra low-voltage operations, the circuit delay reduces at higher temperatures since supply voltage approaches the threshold voltage. * x-axis: VCCBRAM (V). * y-axis: fault rate (per 1Mbit). 50 60 70 80 Practical confirmation of Inverse Temperature Dependency (ITD) (1) Neshatpour, K., Burleson, W., Khajeh, A., & Homayoun, H. (2018). Enhancing Power, Performance, and Energy Efficiency in Chip Multiprocessors Exploiting Inverse Thermal Dependence. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, (4), 778-791.
  • 17. 17 Fault Characterization (Chip Impacts) KC705-BKC705-A  Methodology: Repeating experiments on two identical samples of KC705 (A&B).  Observations:  Fault rates significantly vary, more than 4X.  Fault Variation Maps (FVMs) are entirely different. Fault location Fault location @VCCBRAM= Vcrash @VCCBRAM= Vcrash Even identical samples of same chips have totally different reliability behavior, due to the process variation/aging effects. Fault rate 0 100 200 300 0.57 0.56 0.55 0.54 Per1Mbit VCCBRAM (V) Fault rate 0 100 200 300 0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54 Per1Mbit VCCBRAM (V)
  • 18. 18 Key Points of the Second Contribution  Fault rate: The increase of the fault rate by further undervolting is exponential.  Non-uniform fault distribution among BRAMs: BRAMs do not have similar sensitivity against undervolting.  Deterministic behavior of faults: The location of faults does not change over the time, at certain voltage and temperature, and for a certain chip.  Reliability behavior over different voltage levels: There is Fault Inclusion Property (FIP).  Environmental temperature: At higher temperatures, FPGA BRAMs shows better reliability behavior, i.e., less fault rate.  Reliability differences for chips: Even identical chips shows fully different reliability behaviors.
  • 19. 19 Contributions 1. Undervolting FPGAs  Voltage guardband  Overall power and reliability trade-off 2. Fault characterization in FPGA on-chip memories  Fault type, location, and rate  Temperature, Chip 3. Low-voltage FPGA-based Neural Network (NN)  Power consumption and NN accuracy characterization  Fault mitigation techniques  Application-aware technique  Built-in ECC
  • 20. 20 Experimental Methodology Neural Network (NN) Type Fully-connected classifier Total number of weights ~1.5 millions Activation function Logsig (logarithmic sigmoid) Major benchmark Name-type MNIST- handwritten digit images Number of images Training: 60000, Classification: 10000 Number of pixels per image 28*28=256 Number of output classes 10 Additional benchmarks Names Forest and Reuters Data representation model Type 16-bits fixed-point Precision Minimum sign and digit per layer An example implementation on VC707 Frequency 100 Mhz BRAM usage (total: 2060) 70.8%
  • 21. 21 NN Implementation on FPGA  Input data: off-chip DDR memory.  Weights: on-chip FPGA BRAM.  Computation: Streaming data onto DSPs and LUTs.  We undervolt VCCBRAM:  Weights of the NN are potentially affected. FPGA Implementation
  • 22. 22 Low-Voltage FPGA-based NN  Significant power reduction until the minimum safe voltage, i.e., Vmin (By eliminating the voltage guardband).  Additional 40% power reduction below the voltage guardband.  The NN classification error exponentially increases from 2.56% (inherent classification error) to 6.74% through undervolting BRAMs beyond Vmin.  Fault mitigation techniques to prevent the accuracy loss:  Application-aware mechanism  Built-in ECC Power saving NN accuracy loss 2.39 0.25 0.15 6.47 6.47 6.47 0 2 4 6 8 10 Vnom= 1 V Vmin= 0.61V Vcrash= 0.54V On-chipPower(Watts) BRAM Rest 0 100 200 300 400 0% 2% 4% 6% 8% 0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54 FaultRatio(per1Mbit) ClassificationError(%) VCCBRAM (V) NN Error Ratio Fault Ratio Inherent Error rate: 2.56%
  • 23. 23 Intelligently-Constrained BRAM placement (ICBP)  Below voltage guardband level at CRITICAL voltage region, we present ICBP to prevent NN classification error rate loss.  Core Idea: Map most-sensitive weights to faults into robust BRAMs.  Q: Which are the most-sensitive NN weights? A: Deeper Layers. ICBP-Additional ConstraintsintheFPGA placementstage 1 1.4 2.1 3 5.7 LAYER0 LAYER1 LAYER2 LAYER3 LAYER4 Normalized Vulnerability NN Layers
  • 24. 24 ICBP Evaluation  Pros:  Significant accuracy loss prevention.  No power and performance overhead.  Cons:  Needs the FVM as a pre-process step  Built-in ECC is evaluated without having this cost. 0 0.1 0.2 0.3 0.4 0% 2% 4% 6% 8% 10% 0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54 BRAMsPower(Watt) NNClassificationError(%) VCCBRAM (V) NN Error by Default Placement NN Error by ICBP BRAM Power Inherent NN Error: 2.56%
  • 25. 25 Built-in ECC  Built-in ECC of FPGA BRAMs:  Hamming-code.  Two (2) additional bits per row are reserved as parities.  SECDED (Single-Error Correction and Double-Error Detection).  Experimental Methodology:  Activate built-in ECC under low-voltage read operations.  Experimental Observations:  >90% fault correction  >7% fault detection (not correction) 0 200 400 600 800 0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54 Faultrate(per1Mbit) VCCBRAM (V) Without ECC With ECC Parity Bits single-bit double-bit multiple-bit
  • 26. 26 ECC for NN Accelerator 0% 2% 4% 6% 8% 0.61 0.6 0.59 0.58 0.57 0.56 0.55 0.54 ClassificationError(%) VCCBRAM (V) Without ECC With ECC Inherent NN Error: 2.56% Area Utilization (%) BRAM LUT FF Without ECC 96% 3% 0.25% With ECC 100% 12% 0.25% BRAM Power (W) Vnom= 1V Vmin= 0.61V Vcrash= 0.54V Without ECC 2.4 0.31 0.198 With ECC ---- ---- 0.211 ECC efficiency to prevent NN accuracy loss ECC area and power costs  Pros:  Significant accuracy loss prevention.  Negligible power and performance overhead.  Cons:  Requires larger data rows/lines.  Not all FPGAs are equipped with this technique.
  • 27. 27 Key Points of the Third Contribution  Power reduction for FPGA-based accelerates: Significant energy improvement can be achieved for FPGA-based accelerators (studied for typical NN) through undervolting:  By eliminating the voltage guardband  By further undervolting in the critical voltage region  Cost of undervolting: Accuracy loss is also significant but controllable at the critical voltage region.  Fault mitigation techniques: According to the fault characterization study, efficient mitigation techniques can be deployed to prevent the NN accuracy loss.
  • 29. 29 Summary  We experimentally showed how Xilinx FPGAs work under aggressive low-voltage operations.  There is a conservative voltage guardband below the nominal voltage level, i.e., Vnom.  BRAMs power significantly reduces through undervolting; however, reliability degrades below the minimum safe voltage, i.e., Vmin.  We characterized the behavior of undervolting faults at the critical region.  We evaluated FPGA undervolting for a typical NN accelerator.
  • 30. 30 Conclusion  There is significant potential in commercial FPGAs to improve the energy efficiency through aggressive undervolting.  By eliminating the conservative voltage guardband  By further undervolting into the voltage critical region  Undervolting faults manifest deterministic behaviors.  Efficient fault mitigation techniques can be deployed which can allow to further energy saving.  State-of-the-art FPGA-based accelerators can be adapted by undervolting approach.
  • 31. 31 Constraints of the Xilinx FPGAs for Undervolting  Many FPGA platforms, e.g., Zynq are not equipped with voltage scaling capability.  There is no standard about the voltage distribution among platform components.  In Xilinx products, voltage regulators are hardwired to the host through PMBus interface.  In many cases, several components on the FPGA platform share a single voltage rail.  Vendors set unnecessarily conservative voltage guardbands that increase the energy.  There is no publicly-available circuit-level information of FPGAs.
  • 32. 32 Potential Next Steps  Different computing paradigms, e.g., Heterogeneous Computing, Approximate Computing, Stochastic Computing, among others.  Generalizing observations by extending the experiments to other FPGA vendors like Intel/Altera.  Evaluation the undervolting in noisy and harsh environments.  More advanced designs, where other components such as I/O and DSP are undervolted.  Application profiling to analyze workload-to-workload variation.  Dynamic Vmin prediction and scaling, adapted by frequency and temperature.