SlideShare una empresa de Scribd logo
1 de 25
Efficient Floating-Point Texture Decompression Tomi Aarnio (NRC Tampere) Claudio Brunelli (NRC Tampere) Timo Viitanen (TUT)
Texturing pipeline in a GPU
Texturing pipeline in a GPU Memory bandwidth is the worst bottleneck
Texturing pipeline in a GPU Cache size is another Memory bandwidth is the worst bottleneck
Texturing pipeline in a GPU Cache size is another Memory bandwidth is the worst bottleneck Texture compression can alleviate both!
Texturing pipeline in a GPU Must be very fast: ~40 gigatexels/sec
The established solution Nearly all existing schemes work the same way Partition the image into blocks of 4 x 4 pixels Compress each block independently Use a fixed compression ratio (6:1) Our focus is on high dynamic range (HDR) textures RGB colors in 16-bit floating-point (FP16) Compressed from 48 bits per pixel, down to 8 bpp
FP16 texture compression Roimela et al. [SIGGRAPH 2006, I3D 2008] Munkberg et al. [SIGGRAPH 2006, CGF 2008] Sun et al. [Graphics Hardware 2008,  IEEE TVCG 2010] BC6H/BPTC [DirectX 11, OpenGL 4]
FP16 texture compression Roimela et al. [SIGGRAPH 2006, I3D 2008] Munkberg et al. [SIGGRAPH 2006, CGF 2008] Sun et al. [Graphics Hardware 2008,  IEEE TVCG 2010] BC6H/BPTC [DirectX 11, OpenGL 4] Far too high complexity
FP16 texture compression Roimela et al. [SIGGRAPH 2006, I3D 2008] Munkberg et al. [SIGGRAPH 2006, CGF 2008] Sun et al. [Graphics Hardware 2008,  IEEE TVCG 2010] BC6H/BPTC [DirectX 11, OpenGL 4] Our contribution Implemented and optimized #1 (a.k.a. ”NXR”) Benchmarked against #4
Red Baseline decoder Extract bitfields R, B, Lexponent Lmantissa int-to-fp16 converter fp16 multiplier R R 210             Green int-to-fp16 converter fp16 multiplier G Blue int-to-fp16 converter fp16 multiplier B B Lexponent fp16 normalizer Lmantissa
Optimizations Simplify this Red Extract bitfields R, B, Lexponent Lmantissa int-to-fp16 converter fp16 multiplier R R 210             Green int-to-fp16 converter fp16 multiplier G Blue int-to-fp16 converter fp16 multiplier B B Simplify this Lexponent fp16 normalizer Lmantissa
Optimizations (Part 1) Red and Blue are in 0.10-bit fixed point  Can be treated as fp16 denormals with no conversion logic Simplify the multipliers (L*R and L*B) Exponent can’t increase – remove biasing and overflow logic Mantissa will fit in 1.20 fixed point – remove overflow logic At most 10 leading zeros – truncate post-normalizers No need to deal with signs, infinities and NaNs
Red Extract bitfields R, B, Lexponent Lmantissa Green Blue Optimized decoder
Optimized decoder CLZ Count Leading Zeros << Shift Left 10 x 11 -bit multiplier Extract bitfields R, B, Lexponent Lmantissa Red Clamp, Shift & Pack Rexponent Lexponent R R CLZ Rmantissa << Green Lmantissa Blue
Optimized decoder CLZ Count Leading Zeros << Shift Left 10 x 11 -bit multiplier Extract bitfields R, B, Lexponent Lmantissa Red Clamp, Shift & Pack Rexponent Lexponent R R CLZ Rmantissa << Green Lmantissa Blue << Clamp, Shift & Pack Bmantissa B B CLZ Lexponent Bexponent
Optimizations (Part 2) Eliminate the green channel multiplier LG = L (1024 – (R + B)) = 1024L – (LR + LB) Two 20-bit adders are much cheaper than a 10-bit multiplier Round to zero instead of nearest Introduces a maximum of 1-bit error Compression error is much larger, 4-8 bits
Optimized decoder CLZ Count Leading Zeros << Shift Left 10 x 11 -bit multiplier Extract bitfields R, B, Lexponent Lmantissa Red Clamp, Shift & Pack Rexponent Lexponent R R CLZ Rmantissa << Green Lexponent Clamp, Shift & Pack 220 Gexponent Lmantissa G CLZ Gmantissa << Blue << Clamp, Shift & Pack Bmantissa B B CLZ Lexponent Bexponent
FPGA synthesis (Altera Stratix III)
ASIC synthesis @ 180 nm (Synopsys)
ASIC synthesis @ 180 nm (Synopsys) Only one of 14 modes. A complete decoder would be somewhat larger.
ASIC synthesis @ 180 nm (Synopsys) Relatively long critical path, due to leading-zero counters.
Summary VHDL implementation of a floating-point texture decoder Our optimizations reduced area by ~50% Competing decoder turned out 75% larger Main weakness: long critical path Completely feasible to put on real hardware
Future work Measure power consumption More important than silicon area Optimize the long latency Can also help reduce area & power Implement an encoder in ASIC Textures are increasingly generated in real time
Efficient Floating-Point Texture Decompression Tomi Aarnio (NRC Tampere) Claudio Brunelli (NRC Tampere) Timo Viitanen (TUT)

Más contenido relacionado

Último

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 

Último (20)

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

Destacado

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 

Destacado (20)

How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 

Efficient floating-point texture decompression

  • 1. Efficient Floating-Point Texture Decompression Tomi Aarnio (NRC Tampere) Claudio Brunelli (NRC Tampere) Timo Viitanen (TUT)
  • 3. Texturing pipeline in a GPU Memory bandwidth is the worst bottleneck
  • 4. Texturing pipeline in a GPU Cache size is another Memory bandwidth is the worst bottleneck
  • 5. Texturing pipeline in a GPU Cache size is another Memory bandwidth is the worst bottleneck Texture compression can alleviate both!
  • 6. Texturing pipeline in a GPU Must be very fast: ~40 gigatexels/sec
  • 7. The established solution Nearly all existing schemes work the same way Partition the image into blocks of 4 x 4 pixels Compress each block independently Use a fixed compression ratio (6:1) Our focus is on high dynamic range (HDR) textures RGB colors in 16-bit floating-point (FP16) Compressed from 48 bits per pixel, down to 8 bpp
  • 8. FP16 texture compression Roimela et al. [SIGGRAPH 2006, I3D 2008] Munkberg et al. [SIGGRAPH 2006, CGF 2008] Sun et al. [Graphics Hardware 2008, IEEE TVCG 2010] BC6H/BPTC [DirectX 11, OpenGL 4]
  • 9. FP16 texture compression Roimela et al. [SIGGRAPH 2006, I3D 2008] Munkberg et al. [SIGGRAPH 2006, CGF 2008] Sun et al. [Graphics Hardware 2008, IEEE TVCG 2010] BC6H/BPTC [DirectX 11, OpenGL 4] Far too high complexity
  • 10. FP16 texture compression Roimela et al. [SIGGRAPH 2006, I3D 2008] Munkberg et al. [SIGGRAPH 2006, CGF 2008] Sun et al. [Graphics Hardware 2008, IEEE TVCG 2010] BC6H/BPTC [DirectX 11, OpenGL 4] Our contribution Implemented and optimized #1 (a.k.a. ”NXR”) Benchmarked against #4
  • 11. Red Baseline decoder Extract bitfields R, B, Lexponent Lmantissa int-to-fp16 converter fp16 multiplier R R 210 Green int-to-fp16 converter fp16 multiplier G Blue int-to-fp16 converter fp16 multiplier B B Lexponent fp16 normalizer Lmantissa
  • 12. Optimizations Simplify this Red Extract bitfields R, B, Lexponent Lmantissa int-to-fp16 converter fp16 multiplier R R 210 Green int-to-fp16 converter fp16 multiplier G Blue int-to-fp16 converter fp16 multiplier B B Simplify this Lexponent fp16 normalizer Lmantissa
  • 13. Optimizations (Part 1) Red and Blue are in 0.10-bit fixed point  Can be treated as fp16 denormals with no conversion logic Simplify the multipliers (L*R and L*B) Exponent can’t increase – remove biasing and overflow logic Mantissa will fit in 1.20 fixed point – remove overflow logic At most 10 leading zeros – truncate post-normalizers No need to deal with signs, infinities and NaNs
  • 14. Red Extract bitfields R, B, Lexponent Lmantissa Green Blue Optimized decoder
  • 15. Optimized decoder CLZ Count Leading Zeros << Shift Left 10 x 11 -bit multiplier Extract bitfields R, B, Lexponent Lmantissa Red Clamp, Shift & Pack Rexponent Lexponent R R CLZ Rmantissa << Green Lmantissa Blue
  • 16. Optimized decoder CLZ Count Leading Zeros << Shift Left 10 x 11 -bit multiplier Extract bitfields R, B, Lexponent Lmantissa Red Clamp, Shift & Pack Rexponent Lexponent R R CLZ Rmantissa << Green Lmantissa Blue << Clamp, Shift & Pack Bmantissa B B CLZ Lexponent Bexponent
  • 17. Optimizations (Part 2) Eliminate the green channel multiplier LG = L (1024 – (R + B)) = 1024L – (LR + LB) Two 20-bit adders are much cheaper than a 10-bit multiplier Round to zero instead of nearest Introduces a maximum of 1-bit error Compression error is much larger, 4-8 bits
  • 18. Optimized decoder CLZ Count Leading Zeros << Shift Left 10 x 11 -bit multiplier Extract bitfields R, B, Lexponent Lmantissa Red Clamp, Shift & Pack Rexponent Lexponent R R CLZ Rmantissa << Green Lexponent Clamp, Shift & Pack 220 Gexponent Lmantissa G CLZ Gmantissa << Blue << Clamp, Shift & Pack Bmantissa B B CLZ Lexponent Bexponent
  • 19. FPGA synthesis (Altera Stratix III)
  • 20. ASIC synthesis @ 180 nm (Synopsys)
  • 21. ASIC synthesis @ 180 nm (Synopsys) Only one of 14 modes. A complete decoder would be somewhat larger.
  • 22. ASIC synthesis @ 180 nm (Synopsys) Relatively long critical path, due to leading-zero counters.
  • 23. Summary VHDL implementation of a floating-point texture decoder Our optimizations reduced area by ~50% Competing decoder turned out 75% larger Main weakness: long critical path Completely feasible to put on real hardware
  • 24. Future work Measure power consumption More important than silicon area Optimize the long latency Can also help reduce area & power Implement an encoder in ASIC Textures are increasingly generated in real time
  • 25. Efficient Floating-Point Texture Decompression Tomi Aarnio (NRC Tampere) Claudio Brunelli (NRC Tampere) Timo Viitanen (TUT)

Notas del editor

  1. The latest NVIDIA GeForce GTX 480 can fetch 42 billion texels per second, and the decoder must keep up with that.