Kim Steenstrup Pedersen, lektor, Image Section, Department of Computer Science, København Universitet
Overblik over kunstig intelligens og digital billedanalyse. For øjeblikket sker der en rivende udvikling indenfor kunstig intelligens og især inden for analyse af digitale billeder og film. Vi ser jævnlige historier i pressen om nye fantastiske gennembrud indenfor kunstig intelligens (en del af disse historier udspringer fra store virksomheder som Google, Facebook og Amazon). Det er nærliggende at spørge – kan jeg anvende kunstig intelligens på min billedsamling? I dette foredrag vil jeg give et overblik over hvad kunstig intelligens og digital billedanalyse er og hvad det kan anvendes til. Jeg vil også give et indblik i styrker og svagheder ved eksisterende metoder og specielt hvad man skal være opmærksom på hvis man ønsker at anvende kunstig intelligens på sine billedsamlinger.
Dubai Call Girl Number # 0522916705 # Call Girl Number In Dubai # (UAE)
Overblik over kunstig intelligens og digital billedanalyse
1. 30/01/20 1
Kim Steenstrup Pedersen
Associate Professor, Ph.D.
Head of IMAGE Section
Dept. of Computer Science
The pros and cons
of modern AI &
computer vision
2. What is modern Artificial Intelligence (AI)?
30/01/20 2
Machine Learning
Bioinformatics
Information
Retrieval
Medical Image
Analysis
Audio /
Speech
Analysis
And more …
Computer
Vision
Natural
Language
Processing
I am here
Robotics
Oxford Dict. Def.: The theory and development of computer systems able to
perform tasks normally requiring human intelligence, such as visual perception,
speech recognition, decision-making, and translation between languages.
There is no clear definition (that all agrees upon) of what AI is!
3. What is machine learning?
30/01/20 3
Data is represented as points x in some high dimensional space.
Learn = Estimate a function (model) that groups / separate items in
this space.
y(x;w) ! Apple,Pear{ } A classification
problem with two
classes
4. • We need data in order to learn the function /
parameters.
• We call this training data and we also talk about
validation and test data.
• Approaches to machine learning:
• Supervised learning: We have training data examples of both
the input and wanted output
• Unsupervised learning: We only have training data examples
of the input
• And combinations thereof: Semi-supervised, active learning,
reinforcement learning, …
What is machine learning?
30/01/20 4
x1= , y1=Apple x2= , y2=Pear x3= , y3=Pear
x1= , y1=? x2= , y2=? x3= , y3=?
5. What is machine learning?
30/01/20 5
The goal is to make as few mistakes on the training set as possible
and to make good predictions on new unseen data (test data)
y(x;w) ! Apple,Pear{ }
?
6. What is Computer Vision?
My definition: The design of algorithms for interpreting visual data by
mimicking (and surpassing) the human visual perceptual system.
7. Computer Vision
30/01/20 7
Railroad asset mapping: Detection of
signals, signs, cabinets etc.
Articulated tracking of humans
Some research highlights:
Digital natural history: Specimen
detection, species recognition,
automatic label reading
8. Object recognition and detection
What is in this image?
“Car”
“Window”
“Lamp post”
Where?“a car” or “street scene”
Recognition: Detection:
10. Marr’s layers of abstraction (1982)
Primal sketch
Layers of Abstractions
3D Model
2 1/2D sketchDepth
Orientation
Occlusions
Features
Segments
Information reduction by abstraction is
important in computer vision
But what abstractions should we use?
Construct by design or learn from data?
In CV we don’t use the pixels directly,
we use local descriptors often called
features
13. Commercial solutions can do a lot, but …
(e.g. Google Vision API, Amazon Rekognition)
28/01/20 13
Lerbæk herregård
Dansk bondegård
Output from Google Vision API
14. Bias in public training data: Mainly data from
North America and limited object categories
28/01/20 14
You need huge training datasets
in order to train deep learning
models.
15. 30/01/20 15
Vision is full of hard
problems!
The human visual system is
also challenged
17. Humans have a bias when interpreting
scenes – here we have orientation bias
30/01/20 17
18. Humans have a bias when interpreting
scenes – here we have orientation bias
30/01/20 18
We prefer interpretations we have used before!
And the same holds for a computer vision system!
Choice of training data is important!
19. Vondrick et al 2013
Do you see the same objects as the computer?
What’s the problem? The representation is not strong enough!
22. Photos & film archives: Potential problems
• Automated meta-data generation:
• How do we ensure meaningful labeling / meta-data
generation?
• Potential problems with bias – both geographically (cultural
bias) and temporally (objects change appearance over time).
• Train on your own data: Where do we get training data labels
from? Manual labor? Crowdsourcing? Other idea?
• Reading and interpreting text in photos or film might help. This
is a natural language processing problem.
• Geo-location:
• Where was this photo taken? Some solutions exist, but does it
work on your data?
• Search for similar images in archive:
• There are commercial solutions for this.
28/01/20 22
23. Summary
• AI & Computer vision algorithms are getting more and
more advanced and we can solve many hard
problems.
• However, just as with humans, computer vision
systems can be fooled and do fail.
• A computer vision system is based on a set of
assumptions – they may be wrong or broken!
• Learning-based computer vision systems can only be
as good as the training data we use when building the
system. Garbage in = garbage out.
• Maybe you should join forces and use your combined
archives for improved training and quality of outcome.
28/01/20 23
25. Literature
• LeCun, Bengio, Hinton: Deep learning. Nature review,
Vol. 521: 436 – 444, 2015.
• Ponti et al.: Everything you wanted to know about
Deep Learning for Computer Vision but were afraid to
ask. Online, 2017.
• Goodfellow et al.: Deep Learning. MIT Press, 2016.
http://www.deeplearningbook.org
25