Deep learning based object detection

•Descargar como PPTX, PDF•

0 recomendaciones•486 vistas

This document summarizes deep learning based object detection. It describes popular datasets like PASCAL VOC, COCO, and others that are used for training and evaluating object detection models. It also explains different types of object detection models including two-stage detectors like R-CNN, Fast R-CNN, Faster R-CNN, Mask R-CNN and one-stage detectors like YOLO, YOLO v2, YOLO v3, SSD, and DSSD. It discusses the methodology and improvements of these models and concludes that while detecting all objects is an endless task, improved targeted detection is already possible and will continue to progress.

Educación

A SURVEY OF
DEEP
LEARNING
BASED OBJECT
DETECTION
Chetan Kulkarni

PASCAL VOC
• 20 object categories as 4 main branches-vehicles, animals,
household objects, and people
• spread over 11,000 images.
• Over 27,000 object instance bounding boxes are labeled
• 7,000 have detailed segmentations.

COCO DATASET
• 91 common object categories
• 82 of them having more than 5,000 labeled instances.
• These categories cover the 20 categories in the PASCAL VOC
dataset.
• 2,500,000 labeled instances in 328,000 images

OBJECT DETECTION
Identify and locate objects in an image
or video
Source : https://www.fritz.ai/object-
detection/#:~:text=Object%20detection%20is%20a%20computer,all%20while%20accurately%
20labeling%20them.

KINDS
OF
OBJECT
DETECTI
ON
Two-Stage
Detector
One Stage
Detector

EXAMPLE
S OF TWO
STAGE
DETECTO
RS
1.R-CNN
2.Fast R-CNN
3.Faster R-CNN
4.Masked R-CNN

R-CNN
1. Generates category-independent region proposals.
2. Extract a fixed-length feature vector from each region proposal.
3. Set of class-specific linear SVMs to classify the objects in one image.
4. Bounding-box regressor for precisely bounding-box prediction.

FAST R-
CNN• Fast R-CNN produces Region
of Interest(RoI) using the Max
Pooling layer
• the SVM layer is replaced
with SVD which fastens the
process even further.

FASTER R-
CNN:
• The Region interested in Fast
R-CNN was based on a
selective search using Max
Pooling layers, this was slow.
• So in Faster R-CNN replaces
the region selection method
with a novel RPN

MASK R-CNN
• The faster R-CNN performs well, but it has an Instance
Segmentation Problem.
• It generates proposals about the regions where there might be
an object based on the input image.
• It predicts the class of the object, refines the bounding box, and
generates a mask in the pixel level of the object based on the
first stage proposal.

ONE-STAGE DETECTORS
1. Yolo
2. Yolo v2
3. Yolo V3
4. SSD
5. DSSD

YOLO
• There is no region
creation and then again
processing on top of that
• Rather there is one
convolution network that
creates boxes and class
predictions for each box.

YOLO V2
Following were introduced
Batch Normalization
High-Resolution Classifier
Use Anchor Boxes For Bounding
Boxes

YOLO V3
This has the following updated changes:
1. Multi-Label Classification
2. Use of Feature Maps to predict Bounding Boxes
3. Uses Darknet as final Feature Extractor

SINGLE SHOT DETECTOR
(SSD)
Single Shot: this means
that the tasks of object
localization and
classification are done in
a single forward pass of
the network
01
MultiBox: this is the name
of a technique for
bounding box regression
02
Detector: The network is
an object detector that
also classifies those
detected objects
03

DECONVOLUTIONAL SINGLE SHOT
DETECTOR (DSSD)
Gradual
deconvolution to
enlarge the feature
maps
Feature Combination
from convolution
path and
deconvolution path

APPLICATIONS
1. Pedestrian Detection
2. Face Detection
3. Generic Object Detection
4. Theft Detection

CONCLUSION
There are unimaginable
number of objects and building
a framework capable to detect
them is going to be never
ending task.
But more improved targeted
application is already possible
and will be more robust in
coming days

Más contenido relacionado

La actualidad más candente

[PR12] You Only Look Once (YOLO): Unified Real-Time Object DetectionTaegyun Jeon

#10 pydata warsaw object detection with dn nsAndrew Brozek

You only look once (YOLO) : unified real time object detectionEntrepreneur / Startup

You only look onceGin Kyeng Lee

Yolo v2 ai_tech_20190421穗碧陳

Object detectionROUSHAN RAJ KUMAR

You Only Look Once: Unified, Real-Time Object DetectionDADAJONJURAKUZIEV

Object detectionJksuryawanshi

Object detectionSomesh Vyas

Machine Learning - Object Detection and ClassificationVikas Jain

YoloNEHA Kapoor

Object Detection Using R-CNN Deep Learning FrameworkNader Karimi

A Brief History of Object Detection / Tommi KerolaPreferred Networks

YOLO v1오 혜린

PR-207: YOLOv3: An Incremental ImprovementJinwon Lee

CNN TutorialSungjoon Choi

Object Detection & TrackingAkshay Gujarathi

Recent Progress on Object Detection_20170331Jihong Kang

Object Detection and Recognition Intel Nervana

YoloBang Tsui Liou

La actualidad más candente (20)

[PR12] You Only Look Once (YOLO): Unified Real-Time Object Detection

#10 pydata warsaw object detection with dn ns

You only look once (YOLO) : unified real time object detection

You only look once

Yolo v2 ai_tech_20190421

Object detection

You Only Look Once: Unified, Real-Time Object Detection

Object detection

Machine Learning - Object Detection and Classification

Yolo

Object Detection Using R-CNN Deep Learning Framework

A Brief History of Object Detection / Tommi Kerola

YOLO v1

PR-207: YOLOv3: An Incremental Improvement

CNN Tutorial

Object Detection & Tracking

Recent Progress on Object Detection_20170331

Object Detection and Recognition

Yolo

Similar a Deep learning based object detection

Deep learning based object detectionMonicaDommaraju

Deep Learning AtoC with Image PerspectiveDong Heon Cho

Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...Sergey Karayev

SimCLR: A Simple Framework for Contrastive Learning of Visual Representationsynxm25hpxp

Object Detection - Míriam Bellver - UPC Barcelona 2018Universitat Politècnica de Catalunya

IRJET- Real-Time Object Detection using Deep Learning: A SurveyIRJET Journal

object-detection.pptxMohamedAliHabib3

YOLOgeothomas18

“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...Edge AI and Vision Alliance

“Understanding DNN-Based Object Detectors,” a Presentation from Au-Zone Techn...Edge AI and Vision Alliance

20220811 - computer visionJamie (Taka) Wang

Introducción a las redes convolucionalesJoseAlGarcaGutierrez

IISc Internship ReportHarshilJain26

Yolo releases gianmariaDeep Learning Italia

MLIP - Chapter 5 - Detection, Segmentation, CaptioningCharles Deledalle

A-13 Iomp-1.pptxJayendranath3

Review: You Only Look One-level FeatureDongmin Choi

Objects as points (CenterNet) review [CDM]Dongmin Choi

object detection paper reviewYoonho Na

Object Detection An Overviewijtsrd

Similar a Deep learning based object detection (20)

Deep learning based object detection

Deep Learning AtoC with Image Perspective

Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...

SimCLR: A Simple Framework for Contrastive Learning of Visual Representations

Object Detection - Míriam Bellver - UPC Barcelona 2018

IRJET- Real-Time Object Detection using Deep Learning: A Survey

object-detection.pptx

YOLO

“Understanding, Selecting and Optimizing Object Detectors for Edge Applicatio...

“Understanding DNN-Based Object Detectors,” a Presentation from Au-Zone Techn...

20220811 - computer vision

Introducción a las redes convolucionales

IISc Internship Report

Yolo releases gianmaria

MLIP - Chapter 5 - Detection, Segmentation, Captioning

A-13 Iomp-1.pptx

Review: You Only Look One-level Feature

Objects as points (CenterNet) review [CDM]

object detection paper review

Object Detection An Overview

Último

Alper Gobel In Media Res Media ComponentInMediaRes1

Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth

18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching

MENTAL STATUS EXAMINATION format.docxPoojaSen20

Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle

Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD

PSYCHIATRIC History collection FORMAT.pptxPoojaSen20

Introduction to AI in Higher Education_draft.pptxpboyjonauth

BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy

Arihant handbook biology for class 11 .pdfchloefrazer622

Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique

microwave assisted reaction. General introductionMaksud Ahmed

Measures of Central Tendency: Mean, Median and ModeThiyagu K

“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr

Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732

Mastering the Unannounced Regulatory InspectionSafetyChain Software

Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre

Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron

The Most Excellent Way | 1 Corinthians 13Steve Thomason

Deep learning based object detection

1. A SURVEY OF DEEP LEARNING BASED OBJECT DETECTION Chetan Kulkarni

3. DATASETS

4. PASCAL VOC • 20 object categories as 4 main branches-vehicles, animals, household objects, and people • spread over 11,000 images. • Over 27,000 object instance bounding boxes are labeled • 7,000 have detailed segmentations.

5. COCO DATASET • 91 common object categories • 82 of them having more than 5,000 labeled instances. • These categories cover the 20 categories in the PASCAL VOC dataset. • 2,500,000 labeled instances in 328,000 images

6. OTHER DATASE TS Open Images ImageNet

7. OBJECT DETECTION Identify and locate objects in an image or video Source : https://www.fritz.ai/object- detection/#:~:text=Object%20detection%20is%20a%20computer,all%20while%20accurately% 20labeling%20them.

8. KINDS OF OBJECT DETECTI ON Two-Stage Detector One Stage Detector

9. EXAMPLE S OF TWO STAGE DETECTO RS 1.R-CNN 2.Fast R-CNN 3.Faster R-CNN 4.Masked R-CNN

10. R-CNN 1. Generates category-independent region proposals. 2. Extract a fixed-length feature vector from each region proposal. 3. Set of class-specific linear SVMs to classify the objects in one image. 4. Bounding-box regressor for precisely bounding-box prediction.

11. FAST R- CNN• Fast R-CNN produces Region of Interest(RoI) using the Max Pooling layer • the SVM layer is replaced with SVD which fastens the process even further.

12. FASTER R- CNN: • The Region interested in Fast R-CNN was based on a selective search using Max Pooling layers, this was slow. • So in Faster R-CNN replaces the region selection method with a novel RPN

13. MASK R-CNN • The faster R-CNN performs well, but it has an Instance Segmentation Problem. • It generates proposals about the regions where there might be an object based on the input image. • It predicts the class of the object, refines the bounding box, and generates a mask in the pixel level of the object based on the first stage proposal.

14. ONE-STAGE DETECTORS 1. Yolo 2. Yolo v2 3. Yolo V3 4. SSD 5. DSSD

15. YOLO • There is no region creation and then again processing on top of that • Rather there is one convolution network that creates boxes and class predictions for each box.

16. YOLO V2 Following were introduced Batch Normalization High-Resolution Classifier Use Anchor Boxes For Bounding Boxes

17. YOLO V3 This has the following updated changes: 1. Multi-Label Classification 2. Use of Feature Maps to predict Bounding Boxes 3. Uses Darknet as final Feature Extractor

18. SINGLE SHOT DETECTOR (SSD) Single Shot: this means that the tasks of object localization and classification are done in a single forward pass of the network 01 MultiBox: this is the name of a technique for bounding box regression 02 Detector: The network is an object detector that also classifies those detected objects 03

19. DECONVOLUTIONAL SINGLE SHOT DETECTOR (DSSD) Gradual deconvolution to enlarge the feature maps Feature Combination from convolution path and deconvolution path

20. APPLICATIONS 1. Pedestrian Detection 2. Face Detection 3. Generic Object Detection 4. Theft Detection

21. CONCLUSION There are unimaginable number of objects and building a framework capable to detect them is going to be never ending task. But more improved targeted application is already possible and will be more robust in coming days

Deep learning based object detection

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Similar a Deep learning based object detection

Similar a Deep learning based object detection (20)

Último

Último (20)

Deep learning based object detection