Computer Vision:
Extracting Data from the
Visual World
A Brief Example...
Steven Mitchell, Ph.D.
Componica, LLC
About us.
Componica, LLC (http://www.componica.com/)

Strong Background in Computer Vision
Copyright 2011 - Componica, LLC (http://www.componica.com/)
About us.
Componica seamlessly combines machine learning, computer vision,
and mobile development, applying the latest vision technology
to the world of mobile media.

Words for Spanish / Russian / French

Why is computer vision relevant?
How do these things work?

Should I be concerned?
In this slideshow:
Facial Detection - Find me a face.

Facial Recognition - Whose face is it?

Image Registration - Aligning pictures together.

...which leads to augmented reality.

QR Codes - They’re everywhere.

Optical Character Recognition - Reading Stuff.
Face Detection
This is NOT facial recognition.

Developed by Viola and Jones in 2001. A major breakthrough in
image recognition: real-time face detection was not practical before this.

How much does a cow weigh?

An army of simple face detectors.

"Robust Real-time Object Detection"
Paul Viola and Michael Jones
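
To make "an army of simple face detectors" concrete, here is a minimal sketch of the two ingredients usually associated with the Viola-Jones approach: an integral image that makes rectangle sums cheap, and a weighted vote over many weak yes/no tests. The function names, the single two-rectangle feature, and the thresholds are assumptions for illustration only; this is not the authors' trained detector.

```python
from typing import List, Sequence

Image = List[List[int]]  # grayscale pixel rows


def integral_image(img: Image) -> List[List[int]]:
    """Summed-area table with an extra zero row and column."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii: List[List[int]], x: int, y: int, w: int, h: int) -> int:
    """Sum of the w-by-h rectangle with top-left corner (x, y): 4 lookups."""
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]


def two_rect_feature(ii: List[List[int]], x: int, y: int, w: int, h: int) -> int:
    """Top half minus bottom half of a window (h should be even)."""
    half = h // 2
    return rect_sum(ii, x, y, w, half) - rect_sum(ii, x, y + half, w, half)


def weak_vote(feature_value: int, threshold: int) -> int:
    """One simple detector: votes 1 (face-like) or 0."""
    return 1 if feature_value > threshold else 0


def strong_vote(votes: Sequence[int], weights: Sequence[float]) -> bool:
    """Weighted majority: accept if the yes-votes carry at least half the weight."""
    yes_weight = sum(w for v, w in zip(votes, weights) if v)
    return yes_weight >= 0.5 * sum(weights)


# Hypothetical 2x2 image, just to exercise the functions.
ii = integral_image([[1, 2], [3, 4]])
print(rect_sum(ii, 0, 0, 2, 2))          # 10
print(two_rect_feature(ii, 0, 0, 2, 2))  # -4
```

Each weak test alone is barely better than guessing, much like one person's guess at a cow's weight; the weighted combination is what makes the detector useful.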
BTW, a closely related approach is how the Kinect sees people.
"Real-Time Human Pose Recognition in Parts from Single Depth Images"
Shotton, Fitzgibbon, Cook, Sharp, Finocchio, Moore, Kipman, Blake
Microsoft Research Cambridge & Xbox Incubation
Facial Recognition
Remove effects caused by lighting and
perspective.

After you find a face, reduce it to numbers.
"Statistical Models of Appearance for Computer Vision"
T.F. Cootes and C.J. Taylor
Facial Recognition
Let’s mix some paint...

Comparing numbers in hyperspace
k-Nearest Neighbor, Wikipedia
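
"Comparing numbers in hyperspace" can be sketched as a k-nearest-neighbor vote: once each face is reduced to a vector of numbers, a new face takes the name that is most common among its closest stored vectors. The gallery contents, the two-number feature vectors, and the function names below are made up for illustration; a real system would use many more dimensions.

```python
import math
from collections import Counter
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def euclidean(a: Vector, b: Vector) -> float:
    """Straight-line distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_identify(query: Vector, gallery: List[Tuple[Vector, str]], k: int = 3) -> str:
    """Return the most common name among the k gallery faces closest to query."""
    nearest = sorted(gallery, key=lambda item: euclidean(query, item[0]))[:k]
    votes = Counter(name for _, name in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical gallery of already-labelled faces.
gallery = [
    ((0.0, 0.0), "alice"),
    ((0.1, 0.2), "alice"),
    ((5.0, 5.0), "bob"),
    ((5.1, 4.9), "bob"),
    ((4.8, 5.2), "bob"),
]
print(knn_identify((0.2, 0.1), gallery))  # alice
```

The paint-mixing picture fits here: each stored face is a recipe of a few base colors, and recognition asks which known recipes sit closest to the new one.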
Image Registration - Interesting Points
The most common way to register images: find the most
interesting points in the two images.

Compare all the interesting points from one image with those of the
other, forming matching pairs of points between the images.
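
The pairing step can be sketched as a brute-force mutual nearest-neighbor search. The sketch assumes each interesting point has already been summarized as a short descriptor vector (the slide does not say which descriptor is used), and it keeps a pair only when each point is the other's closest match. All names and values are illustrative.

```python
from typing import List, Sequence, Tuple

Descriptor = Sequence[float]


def squared_distance(a: Descriptor, b: Descriptor) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest_index(desc: Descriptor, candidates: List[Descriptor]) -> int:
    """Index of the candidate descriptor closest to desc."""
    return min(range(len(candidates)),
               key=lambda i: squared_distance(desc, candidates[i]))


def match_points(desc_a: List[Descriptor],
                 desc_b: List[Descriptor]) -> List[Tuple[int, int]]:
    """Pairs (i, j) where desc_a[i] and desc_b[j] are each other's nearest."""
    pairs = []
    for i, d in enumerate(desc_a):
        j = nearest_index(d, desc_b)
        if nearest_index(desc_b[j], desc_a) == i:
            pairs.append((i, j))
    return pairs


# Hypothetical descriptors for three points in each image.
image_a = [(0, 0), (10, 10), (20, 0)]
image_b = [(10, 11), (0, 1), (50, 50)]
print(match_points(image_a, image_b))  # [(0, 1), (1, 0)]
```

Comparing every point with every other point costs time proportional to the product of the two point counts, which is why real-time systems lean on compact descriptors and indexed lookups.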
Augmented Reality
Stage                            Time
FAST interest point detection    0.55 ms
Building query bit masks         0.12 ms
Matching into database           0.35 ms
Robust pose estimation           0.10 ms
Total frame time                 1.12 ms
Table 1. Timings for the stages of our approach on a dataset with images taken from within the range of trained viewpoints.
Figure 5. Increasing the range of viewpoint bins in the training set allows more viewpoint invariance to be added in a straightforward manner.
...suggests that the bit count dissimilarity score provides a reasonable way of scoring matches. To confirm this we computed the average number of inlier and outlier matches over all of the frames in the two sequences, and plotted these against the dissimilarity score obtained for the match in Figure 4. For the sequence on the left where the viewpoints are included in the training set many good matches are found in ...
Once you have correspondence, you
can compute 3D geometry.
http://mi.eng.cam.ac.uk/~er258/work/fast.html
http://nghiaho.com
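
As a simplified stand-in for "once you have correspondence, you can compute geometry", the sketch below fits a 2D similarity transform (scale, rotation, translation) to matched point pairs by least squares, treating points as complex numbers. This is an assumption-laden illustration: real augmented reality estimates a full 3D pose and rejects bad matches robustly, which this sketch does not attempt. Function names are hypothetical.

```python
import cmath
from typing import List, Tuple

Point = Tuple[float, float]


def fit_similarity(src: List[Point], dst: List[Point]) -> Tuple[complex, complex]:
    """Least-squares a, b such that dst ~= a * src + b (points as complex numbers).

    abs(a) is the scale and cmath.phase(a) the rotation angle in radians.
    Needs at least two distinct source points.
    """
    z = [complex(x, y) for x, y in src]
    w = [complex(x, y) for x, y in dst]
    n = len(z)
    z_mean = sum(z) / n
    w_mean = sum(w) / n
    num = sum((wi - w_mean) * (zi - z_mean).conjugate() for zi, wi in zip(z, w))
    den = sum(abs(zi - z_mean) ** 2 for zi in z)
    a = num / den
    b = w_mean - a * z_mean
    return a, b


def apply_similarity(a: complex, b: complex, point: Point) -> Point:
    p = a * complex(*point) + b
    return (p.real, p.imag)


# Hypothetical matches: rotate by 90 degrees, scale by 2, shift by (3, 4).
src = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
dst = [(3.0, 4.0), (3.0, 6.0), (1.0, 4.0)]
a, b = fit_similarity(src, dst)
print(round(abs(a), 6), round(cmath.phase(a), 6))  # 2.0 1.570796
```

With the transform in hand, a virtual object drawn in the first image's coordinates can be placed in the second image by passing its corners through `apply_similarity`.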
QR Codes
http://en.wikipedia.org/wiki/QR_Code
"Quick Response code" invented
by Toyota subsidiary Denso
Wave in 1994.

Open License

Up to about 2.9 KB of binary data (2,953 bytes at the lowest error-correction level)

Error Correction

Easy to read and generate:

ZXing library
Optical Character Recognition
iPhone 4th Gen
iPod Touch 4th Gen
Commentary
Ubiquitous Surveillance...extreme dislike.

Birthday Paradox...the probability that, in a set of
n randomly chosen people, some pair of them will
have the same birthday. It passes 50% at just n = 23.
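
The birthday paradox is easy to check numerically. The sketch below computes the chance that at least two of n people share a birthday, assuming 365 equally likely birthdays; the function name and the `days` parameter are illustrative. Swapping birthdays for "face signatures" is one way to read the slide's worry: the more people a system compares, the faster coincidental matches appear.

```python
def shared_birthday_probability(n: int, days: int = 365) -> float:
    """Probability that at least two of n people share a birthday."""
    if n > days:
        return 1.0  # pigeonhole: more people than possible birthdays
    p_all_distinct = 1.0
    for i in range(n):
        p_all_distinct *= (days - i) / days
    return 1.0 - p_all_distinct


for n in (10, 23, 50):
    print(n, round(shared_birthday_probability(n), 3))
# 23 people already give a probability of about 0.507
```

The same function with a larger `days` value gives a rough feel for how many faces can be enrolled before two of them collide by chance.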
Commentary
Video cameras may meet the criteria for legal blindness.
Conclusion
Computer vision technology and society:
opportunities and possibilities.

Smartphones that ID diseases, plants, insects.

Robotic lawnmowers that don’t run over the
neighbor’s cat.

Computers that judge emotions by reading your
face.

Keyless entry based on face, iris.

Automated inspection of manufactured parts.
