retv-project.eu @ReTV_EU @ReTVproject retv-project retv_project
Implementing artificial intelligence strategies for content annotation and publication online
Vasileios Mezaris, CERTH-ITI
Johan Oomen, NISV
Archives’ needs
Fundamental need:
- Generate value out of your own AV content; nothing good comes out of just
keeping the content locked in your digital basement
Technology-wise, this requires:
- Understanding the content / making it discoverable
- Adapting / re-purposing the (discovered) content; generating video summaries
This is where AI can step in!
Understanding the content / making it discoverable
Content fragmentation and annotation:
- Identify the different temporal fragments of a video (subshots/shots/scenes)
- Annotate fragments with concept labels that describe them (many thousand labels)
- Generate descriptive captions for each fragment
Research (and business) challenges:
- Accuracy
- Computational efficiency / compactness of the deep networks -> affects costs!
(faster than real-time for a bundle of analysis methods that include fragmentation,
concept detection, brand and logo detection, ad detection,...)
Understanding the content / making it discoverable
[Figure: temporal fragmentation of a video into scenes (Scene #4, Scene #5), shots (Shot #11 to Shot #18) and subshots (Subshot #58 to Subshot #60)]
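The scene / shot / subshot hierarchy can be pictured as a small tree of timed fragments. The sketch below is only an illustration of that idea: the `Fragment` class, its fields and all timings and labels are assumptions made for this example, not the data model of the ReTV services.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Fragment:
    """A temporal fragment of a video; start/end are in seconds (hypothetical model)."""
    kind: str                 # "scene", "shot" or "subshot"
    index: int                # running number of the fragment
    start: float
    end: float
    concepts: List[str] = field(default_factory=list)       # concept labels
    children: List["Fragment"] = field(default_factory=list)

    @property
    def duration(self) -> float:
        return self.end - self.start

    def find(self, t: float) -> List["Fragment"]:
        """Return the chain of fragments (outermost first) that contain time t."""
        if not (self.start <= t < self.end):
            return []
        for child in self.children:
            chain = child.find(t)
            if chain:
                return [self] + chain
        return [self]


# Invented values, used only to exercise the structure.
subshot = Fragment("subshot", 3, 61.0, 63.5, concepts=["outdoor"])
shot = Fragment("shot", 2, 61.0, 66.0, children=[subshot])
scene = Fragment("scene", 1, 40.0, 66.0, children=[shot])

chain = scene.find(62.0)
labels = [f"{f.kind} #{f.index}" for f in chain]
```

Here `labels` lists the scene, shot and subshot that cover second 62 of the video, which is the kind of lookup a search interface needs when it jumps to an annotated fragment.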
Understanding the content / making it discoverable
[Figure: sample video frame shown next to its top detected concepts]
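As a rough illustration of how a list of "top detected concepts" can be produced, the sketch below turns per-concept scores into independent probabilities (multi-label annotation, so a sigmoid per concept rather than a softmax over all concepts) and keeps the highest-ranked ones. The function name, the threshold and the scores are assumptions for this example; the slides do not specify how the ranking is done.

```python
import math
from typing import Dict, List, Tuple


def top_concepts(scores: Dict[str, float], k: int = 3,
                 min_prob: float = 0.5) -> List[Tuple[str, float]]:
    """Return up to k (concept, probability) pairs, most probable first.

    `scores` are raw per-concept network outputs (logits). Each concept is
    scored independently, so several labels can apply to the same frame.
    """
    probs = {c: 1.0 / (1.0 + math.exp(-z)) for c, z in scores.items()}
    ranked = sorted(probs.items(), key=lambda item: item[1], reverse=True)
    return [(c, p) for c, p in ranked[:k] if p >= min_prob]


# Invented scores for one frame.
frame_scores = {"crowd": 2.2, "stadium": 1.4, "indoor": -3.0, "grass": 0.3, "car": -1.2}
detected = top_concepts(frame_scores)
names = [concept for concept, _ in detected]
```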
Understanding the content / making it discoverable
Web application for video analysis and search (try it with your video!):
http://multimedia2.iti.gr/onlinevideoanalysis/service/start.html
Demo video:
https://youtu.be/mO-NRpIJ9UU
REST service available (for integration
in different applications / CMSs)
Understanding the content / making it discoverable
Behind the scenes:
- Frame-comparison-based methods for video fragmentation [1]; soon to be
augmented with a deep-learning-based method
- Elaborate deep-convolutional-neural-network architectures for concept-based
annotation [2][3] (and for video captioning; not shown in the demo)
[1] E. Apostolidis, V. Mezaris, "Fast Shot Segmentation Combining Global and Local Visual Descriptors", Proc. IEEE Int. Conf. on Acoustics, Speech and Signal
Processing (ICASSP), Florence, Italy, May 2014. Software available at https://mklab.iti.gr/results/video-shot-and-scene-segmentation/.
[2] F. Markatopoulou, V. Mezaris, I. Patras, "Implicit and Explicit Concept Relations in Deep Neural Networks for Multi-Label Video/Image Annotation", IEEE
Transactions on Circuits and Systems for Video Technology, vol. 29, no. 6, pp. 1631-1644, June 2019. DOI:10.1109/TCSVT.2018.2848458. Software available at
https://github.com/markatopoulou/fvmtl-ccelc.
[3] N. Gkalelis, V. Mezaris, "Subclass deep neural networks: re-enabling neglected classes in deep network training for multimedia classification", Proc. 26th Int.
Conf. on Multimedia Modeling (MMM2020), Daejeon, Korea, Jan. 2020.
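To give a feel for what "frame-comparison-based" fragmentation means, here is a deliberately simplified sketch that flags an abrupt cut when the intensity histograms of two consecutive frames differ strongly. It is a stand-in for illustration, not the method of [1] (which combines global and local visual descriptors); the bin count, the threshold and the synthetic frames are assumptions.

```python
from typing import List, Sequence


def histogram(frame: Sequence[int], bins: int = 16) -> List[float]:
    """Normalised intensity histogram of a frame given as 0-255 grey values."""
    counts = [0] * bins
    for value in frame:
        counts[min(value * bins // 256, bins - 1)] += 1
    total = len(frame)
    return [c / total for c in counts]


def shot_boundaries(frames: Sequence[Sequence[int]],
                    threshold: float = 0.5) -> List[int]:
    """Return the indices i at which frame i starts a new shot.

    Consecutive frames are compared by the L1 distance of their histograms
    (range 0 to 2); a distance above `threshold` is treated as an abrupt cut.
    """
    if not frames:
        return []
    boundaries = []
    previous = histogram(frames[0])
    for i in range(1, len(frames)):
        current = histogram(frames[i])
        distance = sum(abs(a - b) for a, b in zip(previous, current))
        if distance > threshold:
            boundaries.append(i)
        previous = current
    return boundaries


# Synthetic "video": three dark frames followed by three bright frames.
dark = [20, 25, 30, 22] * 4
bright = [200, 210, 220, 205] * 4
video = [dark, dark, dark, bright, bright, bright]
cuts = shot_boundaries(video)
```

A single global threshold like this one misses gradual transitions and reacts to flashes, which is one reason practical systems use richer descriptors or learned models.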
Adapting / re-purposing the content
Main requirements:
- Target distribution platforms & devices have varying requirements (e.g. the
optimal duration of a video differs from one platform to another)
- Target audiences have different preferences / information needs
Video summarization:
- Create editions of the content that are adapted to different platforms and
audiences
- Post these versions on different platforms: generate value from your content;
attract more audience to it!
Adapting / re-purposing the content
Example
- Original video (1’38’’)
- 14’’ summary
- Fully automatic summary generation; an editor-in-the-loop mode is
also supported
- REST service available (for
integration in applications / CMSs)
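One common way to turn per-fragment importance scores into a summary of a given length is a 0/1 knapsack selection: pick the fragments with the highest total score whose durations fit the target. The sketch below illustrates that step under stated assumptions: the slides do not say how fragments are selected, and the fragment durations and scores are invented (chosen only so that they add up to the 1'38'' of the example, with a 14'' budget).

```python
from typing import List, Tuple


def select_fragments(fragments: List[Tuple[int, float]], budget: int) -> List[int]:
    """Pick fragments maximising total importance within a duration budget.

    `fragments` holds (duration_in_seconds, importance_score) pairs; durations
    are whole seconds so that a standard 0/1 knapsack table can be used.
    Returns the chosen fragment indices in temporal order.
    """
    n = len(fragments)
    # best[i][b]: best total score using the first i fragments within b seconds
    best = [[0.0] * (budget + 1) for _ in range(n + 1)]
    for i, (duration, score) in enumerate(fragments, start=1):
        for b in range(budget + 1):
            best[i][b] = best[i - 1][b]
            if duration <= b:
                candidate = best[i - 1][b - duration] + score
                if candidate > best[i][b]:
                    best[i][b] = candidate
    # Walk back through the table to recover which fragments were taken.
    chosen, b = [], budget
    for i in range(n, 0, -1):
        if best[i][b] != best[i - 1][b]:
            chosen.append(i - 1)
            b -= fragments[i - 1][0]
    return sorted(chosen)


# Invented fragments of a 98-second video: (duration, importance).
video = [(12, 0.2), (8, 0.9), (20, 0.4), (6, 0.8), (15, 0.3), (10, 0.5), (27, 0.6)]
summary = select_fragments(video, budget=14)
summary_length = sum(video[i][0] for i in summary)
```

Changing only `budget` yields versions of different length from the same scores, which matches the idea of adapting one video to several platforms.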
Adapting / re-purposing the content
Behind the scenes:
- Elaborate generative adversarial learning architectures (GANs) for
unsupervised learning [4][5]
- Can be trained separately for different content, e.g. a separate trained model
can be used for each show; creating these models does not require
manually-generated training data (it’s (almost) for free!)
[4] E. Apostolidis, A. Metsai, E. Adamantidou, V. Mezaris, I. Patras, "A Stepwise, Label-based Approach for Improving the Adversarial Training in Unsupervised
Video Summarization", Proc. 1st Int. Workshop on AI for Smart TV Content Production, Access and Delivery (AI4TV'19) at ACM Multimedia 2019, Nice, France,
October 2019.
[5] E. Apostolidis, E. Adamantidou, A. Metsai, V. Mezaris, I. Patras, "Unsupervised Video Summarization via Attention-Driven Adversarial Learning", Proc. 26th Int.
Conf. on Multimedia Modeling (MMM2020), Daejeon, Korea, Jan. 2020.
ReTV: Audiovisual Content Adaptation,
Repurposing and Publication across Digital Vectors
Professional use case: editorial workflow support
Consumer use case: chatbot
Editorial workflow for content publication
Topic Selection
Content Adaptation
Optimal Publication
Engagement Monitoring
Topic Selection:
- real-time monitoring of trends in the
media
- prediction of trending topics related to
your collection
- suggestions for topics in the editorial
calendar
example: trends at IFA
Content Adaptation:
- automated video summarisation replacing
manual video editing
- adaptation for specific social media
platforms (different length, cropping,
format)
Optimal Publication:
- publishing time tailored for each vector
based on audience behaviour
- text suggestions for creating stories with
impact
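A very simple way to tailor the publishing time per vector is to look at past posts and pick, for each vector, the hour of day with the highest average engagement. The sketch below shows only that baseline idea; the record format, the vector names and the numbers are assumptions for illustration, and the ReTV tools may well use a different prediction model.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def best_hours(posts: List[Tuple[str, int, float]]) -> Dict[str, int]:
    """Map each vector to the hour of day with the highest mean engagement.

    `posts` holds (vector, hour_of_day, engagement) records from past posts.
    """
    sums: Dict[Tuple[str, int], float] = defaultdict(float)
    counts: Dict[Tuple[str, int], int] = defaultdict(int)
    for vector, hour, engagement in posts:
        sums[(vector, hour)] += engagement
        counts[(vector, hour)] += 1

    best: Dict[str, Tuple[int, float]] = {}
    for (vector, hour), total in sums.items():
        mean = total / counts[(vector, hour)]
        if vector not in best or mean > best[vector][1]:
            best[vector] = (hour, mean)
    return {vector: hour for vector, (hour, _) in best.items()}


# Invented engagement history for two hypothetical vectors.
history = [
    ("platform_a", 9, 120.0), ("platform_a", 18, 300.0), ("platform_a", 18, 260.0),
    ("platform_b", 9, 80.0), ("platform_b", 12, 150.0), ("platform_b", 12, 90.0),
]
schedule = best_hours(history)
```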
Engagement Monitoring:
- improving future posts by monitoring
audience engagement
ReTV Chatbot
- Bringing TV content via channels convenient to audiences
- Delivering content tailored for online consumption
- Creating engagement
- Content personalisation for each user via interaction with the chatbot
Vasileios Mezaris, CERTH-ITI
bmezaris@iti.gr
Johan Oomen, NISV
joomen@beeldengeluid.nl
@johanoomen
This work was supported by the EU's Horizon 2020
research and innovation programme under grant
agreement H2020-780656 ReTV


Editor's notes

  1. https://www.storypact.com/