SlideShare una empresa de Scribd logo
1 de 13
Descargar para leer sin conexión
Wavelet Lifting on
Application Specific Vector Processor
David Barina Pavel Zemcik
Faculty of Information Technology, BUT, Czech Republic
September 17, 2013
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 1 / 13
Motivation
discrete wavelet transform (DWT)
compression, e.g. JPEG 2000, Dirac
lifting scheme is SIMD-friendly
feasible for embedded systems
evaluated on Application Specific Vector Processor (ASVP)
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 2 / 13
Separability
2D using 1D
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 3 / 13
Lifting CDF 9/7
lifting scheme
P(z) =
1 α(1 + z−1)
0 1
1 0
β(1 + z) 1
1 γ(1 + z−1)
0 1
1 0
δ(1 + z) 1
ζ 0
0 1/ζ
α, β, γ, δ
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 4 / 13
Lifting CDF 9/7
lifting scheme practically
s
(0)
l = x2l
d
(0)
l = x2l+1
d
(1)
l = d
(0)
l +α (s
(0)
l + s
(0)
l+1)
s
(1)
l = s
(0)
l +β (d
(1)
l + d
(1)
l−1)
d
(2)
l = d
(1)
l +γ (s
(1)
l + s
(1)
l+1)
s
(2)
l = s
(1)
l +δ (d
(2)
l + d
(2)
l−1)
sl = ζ s
(2)
l
dl = d
(2)
l /ζ
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 5 / 13
Data flow graph
α
β
γ
δ
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 6 / 13
Horizontal vectorisation
α
β
γ
δ
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 7 / 13
Vertical vectorisation
α
β
γ
δ
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 8 / 13
ASVP platform
bus
CPU
BCE
DMA
banks
SDRAM
sCPU VPU
memory
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 9 / 13
FPGA kit SP605
Figure : Xilinx Spartan-6 FPGA SP605 Evaluation Kit
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 10 / 13
ASVP platform
two C source codes (MicroBlaze + PicoBlaze)
accelerate operations on vectors of floats
Vector operations
VCOPY M0[i] ← M1[j]
VADD M0[i] ← M1[j] + M2[k]
VSUB M0[i] ← M1[j] − M2[k]
VMUL M0[i] ← M1[j] · M2[k]
VMAC M0[i] ← M3[l] + (M1[j] · M2[k])
VMSUBAC M0[i] ← M3[l] − (M1[j] · M2[k])
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 11 / 13
Evaluation
100n
1µ
10µ
100µ
10.0 100.0 1.0k 10.0k 100.0k 1.0M
seconds/sample
horizontal BCE
vertical
samples
horizontal CPU
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 12 / 13
Summary
two methods compared on ASVP platform
achieved speedup is up to 2.6×
next research: an adaptation to the 2-D wavelet transform
vectorisation µs/sample speedup
CPU horizontal 1.1 1.0
CPU vertical 0.8 1.4
BCE horizontal 0.4 2.6
David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 13 / 13

Más contenido relacionado

Similar a Wavelet Lifting on ASVP

Costin, francillon ghost is in the air(traffic)
Costin, francillon   ghost is in the air(traffic)Costin, francillon   ghost is in the air(traffic)
Costin, francillon ghost is in the air(traffic)DefconRussia
 
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdf
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdfDell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdf
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdfAhmedMaher181036
 
Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015
Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015
Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015Sckipio
 
Assessment of Real-Time Networks and Timing for Process Bus Applications
Assessment of Real-Time Networks and Timing for Process Bus ApplicationsAssessment of Real-Time Networks and Timing for Process Bus Applications
Assessment of Real-Time Networks and Timing for Process Bus ApplicationsDavid Ingram
 
Booster fan cr1 fn02 platform & duct support calculation note
Booster fan cr1 fn02 platform & duct support calculation noteBooster fan cr1 fn02 platform & duct support calculation note
Booster fan cr1 fn02 platform & duct support calculation noteAli Elkashef
 

Similar a Wavelet Lifting on ASVP (6)

Costin, francillon ghost is in the air(traffic)
Costin, francillon   ghost is in the air(traffic)Costin, francillon   ghost is in the air(traffic)
Costin, francillon ghost is in the air(traffic)
 
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdf
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdfDell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdf
Dell Inspiron 17R N7110 Quanta R03_V03 UMA DAORO3MB6E0 Rev E Schematics.pdf
 
RTL2838 DVB-T Deep dive
RTL2838 DVB-T Deep diveRTL2838 DVB-T Deep dive
RTL2838 DVB-T Deep dive
 
Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015
Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015
Sckipio G.fast Presentation at TNO Ultrafast Broadband 2015
 
Assessment of Real-Time Networks and Timing for Process Bus Applications
Assessment of Real-Time Networks and Timing for Process Bus ApplicationsAssessment of Real-Time Networks and Timing for Process Bus Applications
Assessment of Real-Time Networks and Timing for Process Bus Applications
 
Booster fan cr1 fn02 platform & duct support calculation note
Booster fan cr1 fn02 platform & duct support calculation noteBooster fan cr1 fn02 platform & duct support calculation note
Booster fan cr1 fn02 platform & duct support calculation note
 

Más de David Bařina

Lossy Light Field Compression
Lossy Light Field CompressionLossy Light Field Compression
Lossy Light Field CompressionDavid Bařina
 
Mathematical curiosities
Mathematical curiositiesMathematical curiosities
Mathematical curiositiesDavid Bařina
 
New Transforms for JPEG Format
New Transforms for JPEG FormatNew Transforms for JPEG Format
New Transforms for JPEG FormatDavid Bařina
 
Discrete Wavelet Transforms on Parallel Architectures
Discrete Wavelet Transforms on Parallel ArchitecturesDiscrete Wavelet Transforms on Parallel Architectures
Discrete Wavelet Transforms on Parallel ArchitecturesDavid Bařina
 
Parallel Implementation of the 2-D Discrete Wavelet Transform
Parallel Implementation of the 2-D Discrete Wavelet TransformParallel Implementation of the 2-D Discrete Wavelet Transform
Parallel Implementation of the 2-D Discrete Wavelet TransformDavid Bařina
 
Parallel Wavelet Schemes for Images
Parallel Wavelet Schemes for ImagesParallel Wavelet Schemes for Images
Parallel Wavelet Schemes for ImagesDavid Bařina
 
Single-Loop Software Architecture for JPEG 2000
Single-Loop Software Architecture for JPEG 2000Single-Loop Software Architecture for JPEG 2000
Single-Loop Software Architecture for JPEG 2000David Bařina
 
Lifting Scheme Cores for Wavelet Transform
Lifting Scheme Cores for Wavelet TransformLifting Scheme Cores for Wavelet Transform
Lifting Scheme Cores for Wavelet TransformDavid Bařina
 
Real-Time 3-D Wavelet Lifting
Real-Time 3-D Wavelet LiftingReal-Time 3-D Wavelet Lifting
Real-Time 3-D Wavelet LiftingDavid Bařina
 
IIR aproximace Gaussovy funkce
IIR aproximace Gaussovy funkceIIR aproximace Gaussovy funkce
IIR aproximace Gaussovy funkceDavid Bařina
 
Akcelerace DWT pomocí SIMD
Akcelerace DWT pomocí SIMDAkcelerace DWT pomocí SIMD
Akcelerace DWT pomocí SIMDDavid Bařina
 
Bit Twiddling Hacks: Integers
Bit Twiddling Hacks: IntegersBit Twiddling Hacks: Integers
Bit Twiddling Hacks: IntegersDavid Bařina
 
Fixed-point arithmetic
Fixed-point arithmeticFixed-point arithmetic
Fixed-point arithmeticDavid Bařina
 

Más de David Bařina (20)

CCSDS 122.0
CCSDS 122.0CCSDS 122.0
CCSDS 122.0
 
Lossy Light Field Compression
Lossy Light Field CompressionLossy Light Field Compression
Lossy Light Field Compression
 
Mathematical curiosities
Mathematical curiositiesMathematical curiosities
Mathematical curiosities
 
C/C++ tricks
C/C++ tricksC/C++ tricks
C/C++ tricks
 
New Transforms for JPEG Format
New Transforms for JPEG FormatNew Transforms for JPEG Format
New Transforms for JPEG Format
 
JPEG
JPEGJPEG
JPEG
 
Discrete Wavelet Transforms on Parallel Architectures
Discrete Wavelet Transforms on Parallel ArchitecturesDiscrete Wavelet Transforms on Parallel Architectures
Discrete Wavelet Transforms on Parallel Architectures
 
Parallel Implementation of the 2-D Discrete Wavelet Transform
Parallel Implementation of the 2-D Discrete Wavelet TransformParallel Implementation of the 2-D Discrete Wavelet Transform
Parallel Implementation of the 2-D Discrete Wavelet Transform
 
Parallel Wavelet Schemes for Images
Parallel Wavelet Schemes for ImagesParallel Wavelet Schemes for Images
Parallel Wavelet Schemes for Images
 
Single-Loop Software Architecture for JPEG 2000
Single-Loop Software Architecture for JPEG 2000Single-Loop Software Architecture for JPEG 2000
Single-Loop Software Architecture for JPEG 2000
 
Lifting Scheme Cores for Wavelet Transform
Lifting Scheme Cores for Wavelet TransformLifting Scheme Cores for Wavelet Transform
Lifting Scheme Cores for Wavelet Transform
 
Real-Time 3-D Wavelet Lifting
Real-Time 3-D Wavelet LiftingReal-Time 3-D Wavelet Lifting
Real-Time 3-D Wavelet Lifting
 
Wavelet News
Wavelet NewsWavelet News
Wavelet News
 
IIR aproximace Gaussovy funkce
IIR aproximace Gaussovy funkceIIR aproximace Gaussovy funkce
IIR aproximace Gaussovy funkce
 
Akcelerace DWT pomocí SIMD
Akcelerace DWT pomocí SIMDAkcelerace DWT pomocí SIMD
Akcelerace DWT pomocí SIMD
 
GStreamer
GStreamerGStreamer
GStreamer
 
FFmpeg
FFmpegFFmpeg
FFmpeg
 
Bit Twiddling Hacks: Integers
Bit Twiddling Hacks: IntegersBit Twiddling Hacks: Integers
Bit Twiddling Hacks: Integers
 
Fixed-point arithmetic
Fixed-point arithmeticFixed-point arithmetic
Fixed-point arithmetic
 
Wavelets @ CPU
Wavelets @ CPUWavelets @ CPU
Wavelets @ CPU
 

Último

Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsssuserddc89b
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxAArockiyaNisha
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)Jshifa
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.PraveenaKalaiselvan1
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxAleenaTreesaSaji
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)DHURKADEVIBASKAR
 

Último (20)

Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physics
 
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptxPhysiochemical properties of nanomaterials and its nanotoxicity.pptx
Physiochemical properties of nanomaterials and its nanotoxicity.pptx
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)Recombination DNA Technology (Microinjection)
Recombination DNA Technology (Microinjection)
 
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
BIOETHICS IN RECOMBINANT DNA TECHNOLOGY.
 
Luciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptxLuciferase in rDNA technology (biotechnology).pptx
Luciferase in rDNA technology (biotechnology).pptx
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)
 

Wavelet Lifting on ASVP

  • 1. Wavelet Lifting on Application Specific Vector Processor David Barina Pavel Zemcik Faculty of Information Technology, BUT, Czech Republic September 17, 2013 David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 1 / 13
  • 2. Motivation discrete wavelet transform (DWT) compression, e.g. JPEG 2000, Dirac lifting scheme is SIMD-friendly feasible for embedded systems evaluated on Application Specific Vector Processor (ASVP) David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 2 / 13
  • 3. Separability 2D using 1D David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 3 / 13
  • 4. Lifting CDF 9/7 lifting scheme P(z) = 1 α(1 + z−1) 0 1 1 0 β(1 + z) 1 1 γ(1 + z−1) 0 1 1 0 δ(1 + z) 1 ζ 0 0 1/ζ α, β, γ, δ David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 4 / 13
  • 5. Lifting CDF 9/7 lifting scheme practically s (0) l = x2l d (0) l = x2l+1 d (1) l = d (0) l +α (s (0) l + s (0) l+1) s (1) l = s (0) l +β (d (1) l + d (1) l−1) d (2) l = d (1) l +γ (s (1) l + s (1) l+1) s (2) l = s (1) l +δ (d (2) l + d (2) l−1) sl = ζ s (2) l dl = d (2) l /ζ David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 5 / 13
  • 6. Data flow graph α β γ δ David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 6 / 13
  • 7. Horizontal vectorisation α β γ δ David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 7 / 13
  • 8. Vertical vectorisation α β γ δ David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 8 / 13
  • 9. ASVP platform bus CPU BCE DMA banks SDRAM sCPU VPU memory David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 9 / 13
  • 10. FPGA kit SP605 Figure : Xilinx Spartan-6 FPGA SP605 Evaluation Kit David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 10 / 13
  • 11. ASVP platform two C source codes (MicroBlaze + PicoBlaze) accelerate operations on vectors of floats Vector operations VCOPY M0[i] ← M1[j] VADD M0[i] ← M1[j] + M2[k] VSUB M0[i] ← M1[j] − M2[k] VMUL M0[i] ← M1[j] · M2[k] VMAC M0[i] ← M3[l] + (M1[j] · M2[k]) VMSUBAC M0[i] ← M3[l] − (M1[j] · M2[k]) David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 11 / 13
  • 12. Evaluation 100n 1µ 10µ 100µ 10.0 100.0 1.0k 10.0k 100.0k 1.0M seconds/sample horizontal BCE vertical samples horizontal CPU David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 12 / 13
  • 13. Summary two methods compared on ASVP platform achieved speedup is up to 2.6× next research: an adaptation to the 2-D wavelet transform vectorisation µs/sample speedup CPU horizontal 1.1 1.0 CPU vertical 0.8 1.4 BCE horizontal 0.4 2.6 David Barina, Pavel Zemcik (FIT BUT) Wavelet Lifting on ASVP September 17, 2013 13 / 13