SlideShare a Scribd company logo
1 of 19
Download to read offline
Tools for Creating
Next-Gen Computer Vision
Apps on Snapdragon
Judd Heape
VP Product Management for Camera,
Computer Vision and Video Technology
Computer Vision in Snapdragon
Three function levels to provide comprehensive CV solutions
2
CV hardware
Acceleration blocks to
support and enable
hardware, software and
system designs in
Snapdragon platforms
1
CV Algorithms to
demonstrate complete
workflows that provide
state-of-the-art
solutions to certain
perception problems
2
CV end-to-end
Applications in
mobile, XR,
Automotive and IOT
market segments to
enable unique and
enhanced user
experiences
3
© 2022 Qualcomm
Engine for Visual Analytics (EVA):
Computer Vision Hardware Blocks
3
EVA1.x
EVA2.x
EVA3.x
Object/Face
Detection
Mobile Camera / Video XR Auto IoT
Optical Flow Depth Estimation
Feature
Extraction
Geometry
Correction
XR & 3DR
Object Detection
HOG/SVM
Feature – video encode
with 30% BR reduction
HCD/NCC
Semi-dense OF
GMO for video encode
DFS 1080p@30
Video Bokeh
Lens Distortion
Correction
ACF/RDF Face
Detection
Dense OF (SGM based)
Dense motion map for
multi-frame processing,
sensor alignment
DFS (SGM based)
• Bokeh better quality
• Visual special effect
• XR 3D reconstruction
HCD/ORB – Centralized
ME
for camera
Flow improvement
for XR 6DoF/VIO
Exposure
Compensation
Motion and depth
map warping
LSR (in EVAa 3.5)
XRA - DoH,DoG
FREAK, R-BRIEF
(In EVAa/EVAv 3.5)
© 2022 Qualcomm
Optical Flow
4
Semi-Dense OF Dense OF
Motion Density Every 2x2 block Every pixel
Motion Accuracy 1/8 pixel 1/16 pixel
Motion Range (X,Y) ±128, ±64 ±64, ±32
Max Resolution 1920x1080 1152x648
Confidence Map 8-bit 8-bit
Frames per Second 60 60
Sparse Motion
• Feature Point Detection, Local and Global Motion
• Various Detector and Descriptors
(Harris, DoH, DoG, FREAK)
Dense Motion
• Semi-dense Optical Flow (sDOF)
• Dense Optical Flow (DOF)
• Hybrid Deep Learning based Motion + OF
Segmentation Enhanced
© 2022 Qualcomm
Depth from Stereo Estimation
5
Depth from Stereo (DFS)
• Super-pixel Segmentation on SLIC
• Feature Extraction and Matching
• Confidence Map and Post Processing
DFS Engine
Depth Density Every pixel
Disparity Accuracy 1/16 pixel
Disparity Level [0,63]
Max Resolution 720P@60FPS
Input Images Flat Area Detection Depth Map
SLIC Map
Confidence Measure Post Processing
SLIC
(Simple Linear
Iterative Clustering)
Census Feature SGM
© 2022 Qualcomm
Geometric Correction Engine (GCE)
6
Low-power High-quality Warping
• ICA maps output pixels to input pixels
Effective Transformation
• Sparse grid transformation (35x27 or 67x51)
• Dense grid transformation ( 8 pixel grid )
• Perspective transformation (3x3 transform)
Output domain
Upscale +
Offset
Input/IFE domain
Virtual domain
Effective
transform
Virtual domain
Offset +
Downscale
Effective Transform
Sparse Grid
Transformation
Dense Grid
Transformation
Perspective
Transformation
GCE Use Cases
• Lens distortion correction
• Motion vector grid composition
• Rectification
Rectification
Lens Distortion Correction
© 2022 Qualcomm
Normalized Cross Correlation
7
NCC Supports Two Modes
• Patch to Frame Mode
• Frame to Frame Mode
18
8
8x8 templates: Prepared
by application, can come
from different sources
Reference frame
Patch-to-frame mode
Reference frame Current frame
Templates: All
in the same
frame
Frame-to-frame mode
Frame Matching Using Harris Corners and NCC
© 2022 Qualcomm
Face Detection
8
Deep Learning based Face Detection (FD)
• Min Face Size: 32x32
• Detection Accuracy: 95%
• 1080p@60FPS
• Multiple cameras supported
Under Non-Ideal Conditions
• Strong Backlight
• Full Profile
• Occlusions – Face Masks, Hats, Glasses, Sunglasses
Strong Backlight Full Profile Occlusions
© 2022 Qualcomm
EVA Architecture and Access
9
• The EVA APIs are exposed both from
the CPU and Hexagon Processor sides
• It includes both synchronous APIs
and asynchronous APIs
• There are direct interrupts between
the Hexagon Processor and EVA cores
for low latency communication
• EVA includes embedded CPU primarily
for task scheduling and hardware
pipes
• EVA hardware pipes are shared
between certain functions
Hardware Pipes
EVA
CPU
Data
API & Control
Hexagon Processor
EVA API
CV App
CV Engine CV Engine
EVA Driver EVA Driver
Firmware CPU
OF/DFS GCE HCD, NCC, ORB, DS
DDR
© 2022 Qualcomm
EVA Feature APIs
10
EVA3.0 Features EVA API
Image Warping evaWarp_Sync / evaWarp_Async
Depth from Stereo (DFS) evaDfs_Sync / evaDfs_Async
Normalized Cross Correlation (NCC) evaNccFrame_Sync / evaNccFrame_Async
Optical Flow (OF) evaOF_Sync / evaOF_Async
Feature Extraction (HCD) evaFeaturePoint_Sync / evaFeaturePoint_Async
Feature Descriptor Calc & Matching evaDcm_Sync / evaDcm_Async
Downscaler evaScaledown_Sync / evaScaledown_Async
Pyramid Image evaPyramidImage_Sync / evaPyramidImage_Async
© 2022 Qualcomm
EVA SDK Simulator
11
© 2022 Qualcomm
CV Use Case 1
Depth Map from Stereo Cameras (DFS)
12
Applications
• Accurate Camera/Video Bokeh effect
• Background replacement in video
recording or Zoom call
• AR/VR
(3D Reconstruction, Video
Passthrough, Occlusion)
© 2022 Qualcomm
CV Use Case 2
Real Time Bokeh Effect using Depth Map
from Stereo Cameras (DFS)
13
Applications
• Accurate Camera/Video Bokeh effect
© 2022 Qualcomm
CV Use Case 3
Dense Motion Map (DMM) for Video MCTF
14
Key Benefits of EVA
• Register multiple frames with
local motion compensated
• Remove ghosting artifacts in
combined video frames
© 2022 Qualcomm
CV Use Case 4
Dense Motion Map (DMM) for Video MFHDR
15
Key Benefits of EVA
• Estimating and compensating
for motion is key to achieve
high quality HDR video
• Remove ghosting artifacts in
combined video frames
• Running global motion and
local motion estimation
simultaneously requires large
amount of computation power
© 2022 Qualcomm
CV Use Case 5
Face Detection (FD) and
Face Landmark Detection (FLD)
16
Applications
• Gender/Expression/
Emotion/Gaze detection
• Avatar animation
• Geometric personalization
Qualcomm Deep Learning-based
3D face landmark detection reaches
high accuracy in locating
115or 300facial landmarks
© 2022 Qualcomm
Start Developing on Snapdragon
17
Capture at higher FPS Extend battery life
Tap into hardware-accelerated
CV features with an SDK not
previously available
© 2022 Qualcomm
Start Developing on Snapdragon
18
Xin Zhong
Director, Product Management
xzhong@qti.qualcomm.com
For access to the SDK contact:
© 2022 Qualcomm
Thank You
19

More Related Content

What's hot

Deep learning on face recognition (use case, development and risk)
Deep learning on face recognition (use case, development and risk)Deep learning on face recognition (use case, development and risk)
Deep learning on face recognition (use case, development and risk)Herman Kurnadi
 
Lecture 2 Presence and Perception
Lecture 2 Presence and PerceptionLecture 2 Presence and Perception
Lecture 2 Presence and PerceptionMark Billinghurst
 
Object tracking presentation
Object tracking  presentationObject tracking  presentation
Object tracking presentationMrsShwetaBanait1
 
입문 Visual SLAM 14강 - 2장 Introduction to slam
입문 Visual SLAM 14강  - 2장 Introduction to slam입문 Visual SLAM 14강  - 2장 Introduction to slam
입문 Visual SLAM 14강 - 2장 Introduction to slamjdo
 
Sign Language Recognition using Machine Learning
Sign Language Recognition using Machine LearningSign Language Recognition using Machine Learning
Sign Language Recognition using Machine LearningIRJET Journal
 
Multimodal Biometric Systems
Multimodal Biometric SystemsMultimodal Biometric Systems
Multimodal Biometric SystemsPiyush Mittal
 
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...Edge AI and Vision Alliance
 
Object Detection and Recognition
Object Detection and Recognition Object Detection and Recognition
Object Detection and Recognition Intel Nervana
 
2022 COMP4010 Lecture2: Perception
2022 COMP4010 Lecture2: Perception2022 COMP4010 Lecture2: Perception
2022 COMP4010 Lecture2: PerceptionMark Billinghurst
 
Neural Scene Representation & Rendering: Introduction to Novel View Synthesis
Neural Scene Representation & Rendering: Introduction to Novel View SynthesisNeural Scene Representation & Rendering: Introduction to Novel View Synthesis
Neural Scene Representation & Rendering: Introduction to Novel View SynthesisVincent Sitzmann
 
FACE RECOGNITION USING NEURAL NETWORK
FACE RECOGNITION USING NEURAL NETWORKFACE RECOGNITION USING NEURAL NETWORK
FACE RECOGNITION USING NEURAL NETWORKTITHI DAN
 
Iris recognition system
Iris recognition systemIris recognition system
Iris recognition systemNilu Desai
 
“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...
“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...
“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...Edge AI and Vision Alliance
 
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Semantic Segmentation - Fully Convolutional Networks for Semantic SegmentationSemantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation岳華 杜
 
Face detection and recognition
Face detection and recognitionFace detection and recognition
Face detection and recognitionPankaj Thakur
 
Biometric recognition using deep learning
Biometric recognition using deep learningBiometric recognition using deep learning
Biometric recognition using deep learningSwatiNarkhede1
 
Moving Object Detection And Tracking Using CNN
Moving Object Detection And Tracking Using CNNMoving Object Detection And Tracking Using CNN
Moving Object Detection And Tracking Using CNNNITISHKUMAR1401
 

What's hot (20)

Deep learning on face recognition (use case, development and risk)
Deep learning on face recognition (use case, development and risk)Deep learning on face recognition (use case, development and risk)
Deep learning on face recognition (use case, development and risk)
 
Lecture 2 Presence and Perception
Lecture 2 Presence and PerceptionLecture 2 Presence and Perception
Lecture 2 Presence and Perception
 
Object tracking presentation
Object tracking  presentationObject tracking  presentation
Object tracking presentation
 
입문 Visual SLAM 14강 - 2장 Introduction to slam
입문 Visual SLAM 14강  - 2장 Introduction to slam입문 Visual SLAM 14강  - 2장 Introduction to slam
입문 Visual SLAM 14강 - 2장 Introduction to slam
 
Face recognisation system
Face recognisation systemFace recognisation system
Face recognisation system
 
Sign Language Recognition using Machine Learning
Sign Language Recognition using Machine LearningSign Language Recognition using Machine Learning
Sign Language Recognition using Machine Learning
 
Multimodal Biometric Systems
Multimodal Biometric SystemsMultimodal Biometric Systems
Multimodal Biometric Systems
 
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
“Responsible AI: Tools and Frameworks for Developing AI Solutions,” a Present...
 
Object Detection and Recognition
Object Detection and Recognition Object Detection and Recognition
Object Detection and Recognition
 
2022 COMP4010 Lecture2: Perception
2022 COMP4010 Lecture2: Perception2022 COMP4010 Lecture2: Perception
2022 COMP4010 Lecture2: Perception
 
=SLAM ppt.pdf
=SLAM ppt.pdf=SLAM ppt.pdf
=SLAM ppt.pdf
 
Neural Scene Representation & Rendering: Introduction to Novel View Synthesis
Neural Scene Representation & Rendering: Introduction to Novel View SynthesisNeural Scene Representation & Rendering: Introduction to Novel View Synthesis
Neural Scene Representation & Rendering: Introduction to Novel View Synthesis
 
FACE RECOGNITION USING NEURAL NETWORK
FACE RECOGNITION USING NEURAL NETWORKFACE RECOGNITION USING NEURAL NETWORK
FACE RECOGNITION USING NEURAL NETWORK
 
Iris Recognition
Iris RecognitionIris Recognition
Iris Recognition
 
Iris recognition system
Iris recognition systemIris recognition system
Iris recognition system
 
“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...
“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...
“CMOS Image Sensors: A Guide to Building the Eyes of a Vision System,” a Pres...
 
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Semantic Segmentation - Fully Convolutional Networks for Semantic SegmentationSemantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
Semantic Segmentation - Fully Convolutional Networks for Semantic Segmentation
 
Face detection and recognition
Face detection and recognitionFace detection and recognition
Face detection and recognition
 
Biometric recognition using deep learning
Biometric recognition using deep learningBiometric recognition using deep learning
Biometric recognition using deep learning
 
Moving Object Detection And Tracking Using CNN
Moving Object Detection And Tracking Using CNNMoving Object Detection And Tracking Using CNN
Moving Object Detection And Tracking Using CNN
 

Similar to “Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentation from Qualcomm

“Develop Next-gen Camera Apps Using Snapdragon Computer Vision Technologies,”...
“Develop Next-gen Camera Apps Using Snapdragon Computer Vision Technologies,”...“Develop Next-gen Camera Apps Using Snapdragon Computer Vision Technologies,”...
“Develop Next-gen Camera Apps Using Snapdragon Computer Vision Technologies,”...Edge AI and Vision Alliance
 
20200509 sid china digital optics and digital modulation_v5.0
20200509 sid china digital optics and digital modulation_v5.020200509 sid china digital optics and digital modulation_v5.0
20200509 sid china digital optics and digital modulation_v5.0Chun-Wei Tsai
 
AWS_Re_invent_22_VNova.pdf
AWS_Re_invent_22_VNova.pdfAWS_Re_invent_22_VNova.pdf
AWS_Re_invent_22_VNova.pdfV-Nova
 
Imaging automotive 2015 addfor v002
Imaging automotive 2015   addfor v002Imaging automotive 2015   addfor v002
Imaging automotive 2015 addfor v002Enrico Busto
 
Imaging automotive 2015 addfor v002
Imaging automotive 2015   addfor v002Imaging automotive 2015   addfor v002
Imaging automotive 2015 addfor v002Enrico Busto
 
"High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro...
"High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro..."High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro...
"High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro...Edge AI and Vision Alliance
 
“Efficient Video Perception Through AI,” a Presentation from Qualcomm
“Efficient Video Perception Through AI,” a Presentation from Qualcomm“Efficient Video Perception Through AI,” a Presentation from Qualcomm
“Efficient Video Perception Through AI,” a Presentation from QualcommEdge AI and Vision Alliance
 
Cloud Graphical Rendering: A New Paradigm
Cloud Graphical Rendering:  A New ParadigmCloud Graphical Rendering:  A New Paradigm
Cloud Graphical Rendering: A New ParadigmJoel Isaacson
 
XLcloud 3-d remote rendering
XLcloud 3-d remote renderingXLcloud 3-d remote rendering
XLcloud 3-d remote renderingMarius Preda PhD
 
Real-time Bangla License Plate Recognition System for Low Resource Video-base...
Real-time Bangla License Plate Recognition System for Low Resource Video-base...Real-time Bangla License Plate Recognition System for Low Resource Video-base...
Real-time Bangla License Plate Recognition System for Low Resource Video-base...MD Abdullah Al Nasim
 
Multi Processor Architecture for image processing
Multi Processor Architecture for image processingMulti Processor Architecture for image processing
Multi Processor Architecture for image processingideas2ignite
 
[03 1][gpu용 개발자 도구 - parallel nsight 및 axe] miller axe
[03 1][gpu용 개발자 도구 - parallel nsight 및 axe] miller axe[03 1][gpu용 개발자 도구 - parallel nsight 및 axe] miller axe
[03 1][gpu용 개발자 도구 - parallel nsight 및 axe] miller axelaparuma
 
"Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an...
"Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an..."Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an...
"Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an...Edge AI and Vision Alliance
 
“Jumpstart Your Edge AI Vision Application with New Development Kits from Avn...
“Jumpstart Your Edge AI Vision Application with New Development Kits from Avn...“Jumpstart Your Edge AI Vision Application with New Development Kits from Avn...
“Jumpstart Your Edge AI Vision Application with New Development Kits from Avn...Edge AI and Vision Alliance
 
“Tensilica Processor Cores Enable Sensor Fusion for Robust Perception,” a Pre...
“Tensilica Processor Cores Enable Sensor Fusion for Robust Perception,” a Pre...“Tensilica Processor Cores Enable Sensor Fusion for Robust Perception,” a Pre...
“Tensilica Processor Cores Enable Sensor Fusion for Robust Perception,” a Pre...Edge AI and Vision Alliance
 
Introduction to Software Defined Visualization (SDVis)
Introduction to Software Defined Visualization (SDVis)Introduction to Software Defined Visualization (SDVis)
Introduction to Software Defined Visualization (SDVis)Intel® Software
 
2008 brokerage 04 smart vision system [compatibility mode]
2008 brokerage 04 smart vision system [compatibility mode]2008 brokerage 04 smart vision system [compatibility mode]
2008 brokerage 04 smart vision system [compatibility mode]imec.archive
 

Similar to “Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentation from Qualcomm (20)

“Develop Next-gen Camera Apps Using Snapdragon Computer Vision Technologies,”...
“Develop Next-gen Camera Apps Using Snapdragon Computer Vision Technologies,”...“Develop Next-gen Camera Apps Using Snapdragon Computer Vision Technologies,”...
“Develop Next-gen Camera Apps Using Snapdragon Computer Vision Technologies,”...
 
20200509 sid china digital optics and digital modulation_v5.0
20200509 sid china digital optics and digital modulation_v5.020200509 sid china digital optics and digital modulation_v5.0
20200509 sid china digital optics and digital modulation_v5.0
 
AWS_Re_invent_22_VNova.pdf
AWS_Re_invent_22_VNova.pdfAWS_Re_invent_22_VNova.pdf
AWS_Re_invent_22_VNova.pdf
 
Imaging automotive 2015 addfor v002
Imaging automotive 2015   addfor v002Imaging automotive 2015   addfor v002
Imaging automotive 2015 addfor v002
 
Imaging automotive 2015 addfor v002
Imaging automotive 2015   addfor v002Imaging automotive 2015   addfor v002
Imaging automotive 2015 addfor v002
 
"High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro...
"High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro..."High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro...
"High-resolution 3D Reconstruction on a Mobile Processor," a Presentation fro...
 
“Efficient Video Perception Through AI,” a Presentation from Qualcomm
“Efficient Video Perception Through AI,” a Presentation from Qualcomm“Efficient Video Perception Through AI,” a Presentation from Qualcomm
“Efficient Video Perception Through AI,” a Presentation from Qualcomm
 
Cloud Graphical Rendering: A New Paradigm
Cloud Graphical Rendering:  A New ParadigmCloud Graphical Rendering:  A New Paradigm
Cloud Graphical Rendering: A New Paradigm
 
XLcloud 3-d remote rendering
XLcloud 3-d remote renderingXLcloud 3-d remote rendering
XLcloud 3-d remote rendering
 
Real-time Bangla License Plate Recognition System for Low Resource Video-base...
Real-time Bangla License Plate Recognition System for Low Resource Video-base...Real-time Bangla License Plate Recognition System for Low Resource Video-base...
Real-time Bangla License Plate Recognition System for Low Resource Video-base...
 
Multi Processor Architecture for image processing
Multi Processor Architecture for image processingMulti Processor Architecture for image processing
Multi Processor Architecture for image processing
 
Resume marky20181025
Resume marky20181025Resume marky20181025
Resume marky20181025
 
[03 1][gpu용 개발자 도구 - parallel nsight 및 axe] miller axe
[03 1][gpu용 개발자 도구 - parallel nsight 및 axe] miller axe[03 1][gpu용 개발자 도구 - parallel nsight 및 axe] miller axe
[03 1][gpu용 개발자 도구 - parallel nsight 및 axe] miller axe
 
Real Time Video Processing in FPGA
Real Time Video Processing in FPGA Real Time Video Processing in FPGA
Real Time Video Processing in FPGA
 
"Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an...
"Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an..."Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an...
"Computer-vision-based 360-degree Video Systems: Architectures, Algorithms an...
 
“Jumpstart Your Edge AI Vision Application with New Development Kits from Avn...
“Jumpstart Your Edge AI Vision Application with New Development Kits from Avn...“Jumpstart Your Edge AI Vision Application with New Development Kits from Avn...
“Jumpstart Your Edge AI Vision Application with New Development Kits from Avn...
 
“Tensilica Processor Cores Enable Sensor Fusion for Robust Perception,” a Pre...
“Tensilica Processor Cores Enable Sensor Fusion for Robust Perception,” a Pre...“Tensilica Processor Cores Enable Sensor Fusion for Robust Perception,” a Pre...
“Tensilica Processor Cores Enable Sensor Fusion for Robust Perception,” a Pre...
 
Introduction to Software Defined Visualization (SDVis)
Introduction to Software Defined Visualization (SDVis)Introduction to Software Defined Visualization (SDVis)
Introduction to Software Defined Visualization (SDVis)
 
Cuda project paper
Cuda project paperCuda project paper
Cuda project paper
 
2008 brokerage 04 smart vision system [compatibility mode]
2008 brokerage 04 smart vision system [compatibility mode]2008 brokerage 04 smart vision system [compatibility mode]
2008 brokerage 04 smart vision system [compatibility mode]
 

More from Edge AI and Vision Alliance

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...Edge AI and Vision Alliance
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...Edge AI and Vision Alliance
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...Edge AI and Vision Alliance
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...Edge AI and Vision Alliance
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...Edge AI and Vision Alliance
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...Edge AI and Vision Alliance
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...Edge AI and Vision Alliance
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsightsEdge AI and Vision Alliance
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...Edge AI and Vision Alliance
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...Edge AI and Vision Alliance
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...Edge AI and Vision Alliance
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...Edge AI and Vision Alliance
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...Edge AI and Vision Alliance
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...Edge AI and Vision Alliance
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...Edge AI and Vision Alliance
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from SamsaraEdge AI and Vision Alliance
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...Edge AI and Vision Alliance
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...Edge AI and Vision Alliance
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...Edge AI and Vision Alliance
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...Edge AI and Vision Alliance
 

More from Edge AI and Vision Alliance (20)

“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
“Learning Compact DNN Models for Embedded Vision,” a Presentation from the Un...
 
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
“Introduction to Computer Vision with CNNs,” a Presentation from Mohammad Hag...
 
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
“Selecting Tools for Developing, Monitoring and Maintaining ML Models,” a Pre...
 
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
“Building Accelerated GStreamer Applications for Video and Audio AI,” a Prese...
 
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...
 
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
“Introduction to Modern LiDAR for Machine Perception,” a Presentation from th...
 
“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...“Vision-language Representations for Robotics,” a Presentation from the Unive...
“Vision-language Representations for Robotics,” a Presentation from the Unive...
 
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
“ADAS and AV Sensors: What’s Winning and Why?,” a Presentation from TechInsights
 
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
“Computer Vision in Sports: Scalable Solutions for Downmarkets,” a Presentati...
 
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
“Detecting Data Drift in Image Classification Neural Networks,” a Presentatio...
 
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
“Deep Neural Network Training: Diagnosing Problems and Implementing Solutions...
 
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
“AI Start-ups: The Perils of Fishing for Whales (War Stories from the Entrepr...
 
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
“A Computer Vision System for Autonomous Satellite Maneuvering,” a Presentati...
 
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
“Bias in Computer Vision—It’s Bigger Than Facial Recognition!,” a Presentatio...
 
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
“Sensor Fusion Techniques for Accurate Perception of Objects in the Environme...
 
“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara“Updating the Edge ML Development Process,” a Presentation from Samsara
“Updating the Edge ML Development Process,” a Presentation from Samsara
 
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
“Combating Bias in Production Computer Vision Systems,” a Presentation from R...
 
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
“Developing an Embedded Vision AI-powered Fitness System,” a Presentation fro...
 
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
“Navigating the Evolving Venture Capital Landscape for Edge AI Start-ups,” a ...
 
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
“Advanced Presence Sensing: What It Means for the Smart Home,” a Presentation...
 

Recently uploaded

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 

Recently uploaded (20)

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 

“Tools for Creating Next-Gen Computer Vision Apps on Snapdragon,” a Presentation from Qualcomm

  • 1. Tools for Creating Next-Gen Computer Vision Apps on Snapdragon Judd Heape VP Product Management for Camera, Computer Vision and Video Technology
  • 2. Computer Vision in Snapdragon Three function levels to provide comprehensive CV solutions 2 CV hardware Acceleration blocks to support and enable hardware, software and system designs in Snapdragon platforms 1 CV Algorithms to demonstrate complete workflows that provide state-of-the-art solutions to certain perception problems 2 CV end-to-end Applications in mobile, XR, Automotive and IOT market segments to enable unique and enhanced user experiences 3 © 2022 Qualcomm
  • 3. Engine for Visual Analytics (EVA): Computer Vision Hardware Blocks 3 EVA1.x EVA2.x EVA3.x Object/Face Detection Mobile Camera / Video XR Auto IoT Optical Flow Depth Estimation Feature Extraction Geometry Correction XR & 3DR Object Detection HOG/SVM Feature – video encode with 30% BR reduction HCD/NCC Semi-dense OF GMO for video encode DFS 1080p@30 Video Bokeh Lens Distortion Correction ACF/RDF Face Detection Dense OF (SGM based) Dense motion map for multi-frame processing, sensor alignment DFS (SGM based) • Bokeh better quality • Visual special effect • XR 3D reconstruction HCD/ORB – Centralized ME for camera Flow improvement for XR 6DoF/VIO Exposure Compensation Motion and depth map warping LSR (in EVAa 3.5) XRA - DoH,DoG FREAK, R-BRIEF (In EVAa/EVAv 3.5) © 2022 Qualcomm
  • 4. Optical Flow 4 Semi-Dense OF Dense OF Motion Density Every 2x2 block Every pixel Motion Accuracy 1/8 pixel 1/16 pixel Motion Range (X,Y) ±128, ±64 ±64, ±32 Max Resolution 1920x1080 1152x648 Confidence Map 8-bit 8-bit Frames per Second 60 60 Sparse Motion • Feature Point Detection, Local and Global Motion • Various Detector and Descriptors (Harris, DoH, DoG, FREAK) Dense Motion • Semi-dense Optical Flow (sDOF) • Dense Optical Flow (DOF) • Hybrid Deep Learning based Motion + OF Segmentation Enhanced © 2022 Qualcomm
  • 5. Depth from Stereo Estimation 5 Depth from Stereo (DFS) • Super-pixel Segmentation on SLIC • Feature Extraction and Matching • Confidence Map and Post Processing DFS Engine Depth Density Every pixel Disparity Accuracy 1/16 pixel Disparity Level [0,63] Max Resolution 720P@60FPS Input Images Flat Area Detection Depth Map SLIC Map Confidence Measure Post Processing SLIC (Simple Linear Iterative Clustering) Census Feature SGM © 2022 Qualcomm
  • 6. Geometric Correction Engine (GCE) 6 Low-power High-quality Warping • ICA maps output pixels to input pixels Effective Transformation • Sparse grid transformation (35x27 or 67x51) • Dense grid transformation ( 8 pixel grid ) • Perspective transformation (3x3 transform) Output domain Upscale + Offset Input/IFE domain Virtual domain Effective transform Virtual domain Offset + Downscale Effective Transform Sparse Grid Transformation Dense Grid Transformation Perspective Transformation GCE Use Cases • Lens distortion correction • Motion vector grid composition • Rectification Rectification Lens Distortion Correction © 2022 Qualcomm
  • 7. Normalized Cross Correlation 7 NCC Supports Two Modes • Patch to Frame Mode • Frame to Frame Mode 18 8 8x8 templates: Prepared by application, can come from different sources Reference frame Patch-to-frame mode Reference frame Current frame Templates: All in the same frame Frame-to-frame mode Frame Matching Using Harris Corners and NCC © 2022 Qualcomm
  • 8. Face Detection 8 Deep Learning based Face Detection (FD) • Min Face Size: 32x32 • Detection Accuracy: 95% • 1080p@60FPS • Multiple cameras supported Under Non-Ideal Conditions • Strong Backlight • Full Profile • Occlusions – Face Masks, Hats, Glasses, Sunglasses Strong Backlight Full Profile Occlusions © 2022 Qualcomm
  • 9. EVA Architecture and Access 9 • The EVA APIs are exposed both from the CPU and Hexagon Processor sides • It includes both synchronous APIs and asynchronous APIs • There are direct interrupts between the Hexagon Processor and EVA cores for low latency communication • EVA includes embedded CPU primarily for task scheduling and hardware pipes • EVA hardware pipes are shared between certain functions Hardware Pipes EVA CPU Data API & Control Hexagon Processor EVA API CV App CV Engine CV Engine EVA Driver EVA Driver Firmware CPU OF/DFS GCE HCD, NCC, ORB, DS DDR © 2022 Qualcomm
  • 10. EVA Feature APIs 10 EVA3.0 Features EVA API Image Warping evaWarp_Sync / evaWarp_Async Depth from Stereo (DFS) evaDfs_Sync / evaDfs_Async Normalized Cross Correlation (NCC) evaNccFrame_Sync / evaNccFrame_Async Optical Flow (OF) evaOF_Sync / evaOF_Async Feature Extraction (HCD) evaFeaturePoint_Sync / evaFeaturePoint_Async Feature Descriptor Calc & Matching evaDcm_Sync / evaDcm_Async Downscaler evaScaledown_Sync / evaScaledown_Async Pyramid Image evaPyramidImage_Sync / evaPyramidImage_Async © 2022 Qualcomm
  • 11. EVA SDK Simulator 11 © 2022 Qualcomm
  • 12. CV Use Case 1 Depth Map from Stereo Cameras (DFS) 12 Applications • Accurate Camera/Video Bokeh effect • Background replacement in video recording or Zoom call • AR/VR (3D Reconstruction, Video Passthrough, Occlusion) © 2022 Qualcomm
  • 13. CV Use Case 2 Real Time Bokeh Effect using Depth Map from Stereo Cameras (DFS) 13 Applications • Accurate Camera/Video Bokeh effect © 2022 Qualcomm
  • 14. CV Use Case 3 Dense Motion Map (DMM) for Video MCTF 14 Key Benefits of EVA • Register multiple frames with local motion compensated • Remove ghosting artifacts in combined video frames © 2022 Qualcomm
  • 15. CV Use Case 4 Dense Motion Map (DMM) for Video MFHDR 15 Key Benefits of EVA • Estimating and compensating for motion is key to achieve high quality HDR video • Remove ghosting artifacts in combined video frames • Running global motion and local motion estimation simultaneously requires large amount of computation power © 2022 Qualcomm
  • 16. CV Use Case 5 Face Detection (FD) and Face Landmark Detection (FLD) 16 Applications • Gender/Expression/ Emotion/Gaze detection • Avatar animation • Geometric personalization Qualcomm Deep Learning-based 3D face landmark detection reaches high accuracy in locating 115or 300facial landmarks © 2022 Qualcomm
  • 17. Start Developing on Snapdragon 17 Capture at higher FPS Extend battery life Tap into hardware-accelerated CV features with an SDK not previously available © 2022 Qualcomm
  • 18. Start Developing on Snapdragon 18 Xin Zhong Director, Product Management xzhong@qti.qualcomm.com For access to the SDK contact: © 2022 Qualcomm