SlideShare una empresa de Scribd logo
1 de 14
Descargar para leer sin conexión
TRECVID-2016
Concept Localization : Overview
George Awad
National Institute of Standards and Technology
Dakota Consulting, Inc
1TRECVID 2016
•  Goal
•  Make concept detection more precise in time and space than
current shot-level evaluation.
•  Encourage context independent concepts design to increase their
reusability.
•  Task set up
•  For each of the 10 new test concepts, NIST provided set of ≈1000
shots.
•  Any shot may or may not contain the target concept.
•  Task
•  For each I-Frame within the shot that contains the target, return
the x,y coordinates of the (UL,LR) vertices of a bounding
rectangle containing all of the target concept and as little more as
possible.
•  Systems were allowed to submit more than 1 bounding box per I-
frame but only the ones with maximum f-score were scored.
2TRECVID 2016
3
10 New evaluated concepts
TRECVID 2016
Non action concepts New action concepts
Animal Bicycling
Boy Dancing
Baby Instrumental_musician
Running
Sitting_down
Skier
Explosion_fire
NIST Evaluation framework
•  Testing data
•  IACC.2.A-C (600 h, used between 2013 to 2015 in semantic indexing
task).
•  About 1000 shots per concept were sampled from the ground truth (with
true positive (TP) clips of max = 300, avg = 178, min = 12).
•  Total of 9587 shots and 2205140 i-frames were distributed to systems.
•  Human assessors were given all the i-frames (total of 55789 images) of
all TP shots to create the ground truth (drawing bounding box around the
concept if it exists).
•  Human assessors had to watch the video clips of the images to verify the
concepts.
4TRECVID 2016
Evaluation metrics
• Temporal localization: precision, recall and f-score
based on the judged I-frames.
• Spatial localization: precision, recall and f-score
based on the located pixels representing the
concept.
• An average of precision, recall and f-score for
temporal and spatial localization across all I-frames
for each concept and for each run.
5TRECVID 2016
Participants (Finishers: 3 out of 21)
•  3 teams submitted 11 runs
•  TokyoTech (4 runs)
•  Tokyo Institute of Technology
•  NII_Hitachi_UIT (3 runs)
•  National Institute of Informatics; Hitachi, Ltd; University of Information Technology
•  UTS_CMU_D2DCRC (4 runs)
•  University of Technology, Sydney; Carnegie Mellon University; D2DCRC
6TRECVID 2016
Temporal localization results by run (sorted by F-score)
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Meanperrunacrossallconcepts
I-frame F-score
I-frame Precision
I-frame Recall
7TRECVID 2016
TRECVID 2016 8
0
0.2
0.4
0.6
0.8
1
Meanperrunacrossallconcepts
2013
0
0.2
0.4
0.6
0.8
1
Meanperrunacrossallconcepts
2014
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
CCNY_sub1.result.txt
CCNY_sub2.result.txt
CCNY_sub3.result.txt
CCNY_sub4.result.txt
insightdcu.DCU_Loc
MediaMill_Qualcomm
MediaMill_Qualcomm
MediaMill_Qualcomm
MediaMill_Qualcomm
PicSOM.PicSOM_LO
PicSOM.PicSOM_LO
PicSOM.PicSOM_LO
PicSOM.PicSOM_LO
TokyoTech.run_tokyo
TokyoTech.run_tokyo
TokyoTech.run_tokyo
TokyoTech.run_tokyo
Trimps_1.txt
Trimps_2_NEG_04.tx
Trimps_3_NEG_NOC
Trimps_3_NOC_015.
Meanperrunacrossallconcepts
2015
2016 (mainly action) >> 2013 & 2014
(mainly objects)
ONLY TP shots were given
to systems to localize.
Temporal Localization results
Spatial Localization results by run (sorted by F-score)
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1Meanperrunacrossallconcepts
Mean Pixel F-score
Mean Pixel Precision
Mean Pixel Recall
9
Harder than
temporal
localization
TRECVID 2016
TRECVID 2016 10
0
0.2
0.4
0.6
0.8
1Meanperrunacrossallconcepts
2013
0
0.2
0.4
0.6
0.8
1
Meanperrunacrossallconcepts
2014
0
0.2
0.4
0.6
0.8
1
CCNY_sub1.result.txt
CCNY_sub2.result.txt
CCNY_sub3.result.txt
CCNY_sub4.result.txt
insightdcu.DCU_Loca
MediaMill_Qualcomm
MediaMill_Qualcomm
MediaMill_Qualcomm
MediaMill_Qualcomm
PicSOM.PicSOM_LO
PicSOM.PicSOM_LO
PicSOM.PicSOM_LO
PicSOM.PicSOM_LO
TokyoTech.run_tokyot
TokyoTech.run_tokyot
TokyoTech.run_tokyot
TokyoTech.run_tokyot
Trimps_1.txt
Trimps_2_NEG_04.tx
Trimps_3_NEG_NOC
Trimps_3_NOC_015.t
Meanperrunacrossallconcepts
2015 2016 (actions) > 2013 (objects)
2016 (actions) ~ 2014 (objects)
ONLY TP shots were given
to systems to localize.
Spatial Localization results
Results per concept
top 10 runs
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
F-score
Median
10
9
8
7
6
5
4
3
2
1
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
MeanF-score
Median
10
9
8
7
6
5
4
3
2
1
Temporal localization Spatial localization
Most concepts perform better in temporal compared to spatial localization
A lot of resemblance between same concepts
11TRECVID 2016
Results per concept across all runs
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Recall
Precision
Temporal localization
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 0.10.20.30.40.50.60.70.80.9 1
MeanRecall
Mean precision
Spatial localization
12
submitted bounding boxes
approximate the size of ground truth
boxes and overlap with them. Many
systems are good in finding the real
box sizes.
Many	systems	submi-ed	a	lot	
of	non-target	I-frames,	while	
few	found	a	good	balance.	
	
TRECVID 2016
baby
Inst_musi
bicycling
General Observations
• Consistent observations in the last 4 years
ü Temporal localization is easier than spatial localization.
ü Systems report approximate G.T box sizes.
• Performance of action/dynamic concepts are higher
than object concepts tested in 2013 to 2014.
• Assessment of action/dynamic concepts proved to be
challenging in many cases to the human assessors.
• Lower finishing% of teams compared to signups.
13TRECVID 2016
Next team talks
• TokyoTech
• UTS_CMU_D2DCRC
TRECVID 2016 14

Más contenido relacionado

Destacado

David Walker's Appeal
David Walker's AppealDavid Walker's Appeal
David Walker's Appeal
Bro Cadence
 
Boletin digital 03
Boletin digital 03Boletin digital 03
Boletin digital 03
chemadelaf
 
Website management model
Website management modelWebsite management model
Website management model
adgclarke
 
диаграмм 5
диаграмм  5диаграмм  5
диаграмм 5
ttuya
 
Contextualization and-location-ntot-ap-g10
Contextualization and-location-ntot-ap-g10Contextualization and-location-ntot-ap-g10
Contextualization and-location-ntot-ap-g10
Jared Ram Juezan
 

Destacado (12)

Spratne montazne kuce
Spratne montazne kuceSpratne montazne kuce
Spratne montazne kuce
 
David Walker's Appeal
David Walker's AppealDavid Walker's Appeal
David Walker's Appeal
 
TRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event DetectionTRECVID 2016 : Multimedia event Detection
TRECVID 2016 : Multimedia event Detection
 
Boletin digital 03
Boletin digital 03Boletin digital 03
Boletin digital 03
 
Tertulia dialógica
Tertulia dialógicaTertulia dialógica
Tertulia dialógica
 
Eko montazne kuce
Eko montazne kuceEko montazne kuce
Eko montazne kuce
 
Website management model
Website management modelWebsite management model
Website management model
 
диаграмм 5
диаграмм  5диаграмм  5
диаграмм 5
 
Kabinet kerja
Kabinet kerjaKabinet kerja
Kabinet kerja
 
Localization of brain lesion by Prof Dr Bashir Ahmed Dar Sopore Kashmir
Localization of brain lesion by Prof Dr Bashir Ahmed Dar Sopore KashmirLocalization of brain lesion by Prof Dr Bashir Ahmed Dar Sopore Kashmir
Localization of brain lesion by Prof Dr Bashir Ahmed Dar Sopore Kashmir
 
Contextualization and-location-ntot-ap-g10
Contextualization and-location-ntot-ap-g10Contextualization and-location-ntot-ap-g10
Contextualization and-location-ntot-ap-g10
 
Women in Localization 2017 "Be Bold for Change" Award Winners
Women in Localization 2017 "Be Bold for Change" Award WinnersWomen in Localization 2017 "Be Bold for Change" Award Winners
Women in Localization 2017 "Be Bold for Change" Award Winners
 

Similar a TRECVID 2016 : Concept Localization

Similar a TRECVID 2016 : Concept Localization (20)

Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...Sparse representation based human action recognition using an action region-a...
Sparse representation based human action recognition using an action region-a...
 
2019 cvpr paper_overview
2019 cvpr paper_overview2019 cvpr paper_overview
2019 cvpr paper_overview
 
2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee2019 cvpr paper overview by Ho Seong Lee
2019 cvpr paper overview by Ho Seong Lee
 
The Open-Source Approach for Computational Modeling and Simulation for Earthq...
The Open-Source Approach for Computational Modeling and Simulation for Earthq...The Open-Source Approach for Computational Modeling and Simulation for Earthq...
The Open-Source Approach for Computational Modeling and Simulation for Earthq...
 
IRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box DetectorIRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
IRJET- Object Detection and Recognition using Single Shot Multi-Box Detector
 
Applications of Deep Learning in Construction Industry
Applications of Deep Learning in Construction IndustryApplications of Deep Learning in Construction Industry
Applications of Deep Learning in Construction Industry
 
NMSL_2017summer
NMSL_2017summerNMSL_2017summer
NMSL_2017summer
 
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
Lecture 2.B: Computer Vision Applications - Full Stack Deep Learning - Spring...
 
Tracing Tuples Across Dimensions: A Comparison of Scatterplots and Parallel C...
Tracing Tuples Across Dimensions: A Comparison of Scatterplots and Parallel C...Tracing Tuples Across Dimensions: A Comparison of Scatterplots and Parallel C...
Tracing Tuples Across Dimensions: A Comparison of Scatterplots and Parallel C...
 
Taming Cross-Tool Traceability in the Wild.pptx
Taming Cross-Tool Traceability in the Wild.pptxTaming Cross-Tool Traceability in the Wild.pptx
Taming Cross-Tool Traceability in the Wild.pptx
 
Image Object Detection Pipeline
Image Object Detection PipelineImage Object Detection Pipeline
Image Object Detection Pipeline
 
Fractional step discriminant pruning
Fractional step discriminant pruningFractional step discriminant pruning
Fractional step discriminant pruning
 
People counting in low density video sequences2
People counting in low density video sequences2People counting in low density video sequences2
People counting in low density video sequences2
 
Ontology-based data access: why it is so cool!
Ontology-based data access: why it is so cool!Ontology-based data access: why it is so cool!
Ontology-based data access: why it is so cool!
 
Gunjan insight student conference v2
Gunjan insight student conference v2Gunjan insight student conference v2
Gunjan insight student conference v2
 
“Understanding DNN-Based Object Detectors,” a Presentation from Au-Zone Techn...
“Understanding DNN-Based Object Detectors,” a Presentation from Au-Zone Techn...“Understanding DNN-Based Object Detectors,” a Presentation from Au-Zone Techn...
“Understanding DNN-Based Object Detectors,” a Presentation from Au-Zone Techn...
 
Inspection Systems for Additive Manufacturing
Inspection Systems for Additive ManufacturingInspection Systems for Additive Manufacturing
Inspection Systems for Additive Manufacturing
 
Tackling Open Images Challenge (2019)
Tackling Open Images Challenge (2019)Tackling Open Images Challenge (2019)
Tackling Open Images Challenge (2019)
 
Deep learning fundamental and Research project on IBM POWER9 system from NUS
Deep learning fundamental and Research project on IBM POWER9 system from NUSDeep learning fundamental and Research project on IBM POWER9 system from NUS
Deep learning fundamental and Research project on IBM POWER9 system from NUS
 
Transformer in Vision
Transformer in VisionTransformer in Vision
Transformer in Vision
 

Más de George Awad

Más de George Awad (19)

Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022Movie Summarization at TRECVID 2022
Movie Summarization at TRECVID 2022
 
Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022Disaster Scene Indexing at TRECVID 2022
Disaster Scene Indexing at TRECVID 2022
 
Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022Activity Detection at TRECVID 2022
Activity Detection at TRECVID 2022
 
Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022Multimedia Understanding at TRECVID 2022
Multimedia Understanding at TRECVID 2022
 
Video Search at TRECVID 2022
Video Search at TRECVID 2022Video Search at TRECVID 2022
Video Search at TRECVID 2022
 
Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022Video Captioning at TRECVID 2022
Video Captioning at TRECVID 2022
 
TRECVID 2019 Video to Text
TRECVID 2019 Video to TextTRECVID 2019 Video to Text
TRECVID 2019 Video to Text
 
TRECVID 2019 introduction slides
TRECVID 2019 introduction slidesTRECVID 2019 introduction slides
TRECVID 2019 introduction slides
 
TRECVID 2019 Instance Search
TRECVID 2019 Instance SearchTRECVID 2019 Instance Search
TRECVID 2019 Instance Search
 
TRECVID 2019 Ad-hoc Video Search
TRECVID 2019 Ad-hoc Video SearchTRECVID 2019 Ad-hoc Video Search
TRECVID 2019 Ad-hoc Video Search
 
[TRECVID 2018] Video to Text
[TRECVID 2018] Video to Text[TRECVID 2018] Video to Text
[TRECVID 2018] Video to Text
 
[TRECVID 2018] Instance Search
[TRECVID 2018] Instance Search[TRECVID 2018] Instance Search
[TRECVID 2018] Instance Search
 
[TRECVID 2018] Ad-hoc Video Search
[TRECVID 2018] Ad-hoc Video Search[TRECVID 2018] Ad-hoc Video Search
[TRECVID 2018] Ad-hoc Video Search
 
Instance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVIDInstance Search (INS) task at TRECVID
Instance Search (INS) task at TRECVID
 
Activity detection at TRECVID
Activity detection at TRECVIDActivity detection at TRECVID
Activity detection at TRECVID
 
The Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVIDThe Video-to-Text task evaluation at TRECVID
The Video-to-Text task evaluation at TRECVID
 
Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)Introduction about TRECVID (The Video Retreival benchmark)
Introduction about TRECVID (The Video Retreival benchmark)
 
TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)
TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)
TRECVID ECCV2018 Tutorial (Ad-hoc Video Search)
 
TRECVID 2016 workshop
TRECVID 2016 workshopTRECVID 2016 workshop
TRECVID 2016 workshop
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 

TRECVID 2016 : Concept Localization