SlideShare una empresa de Scribd logo
1 de 31
Introduction
Physiological Characteristics
Behavioral Characteristic
 Biometrics are automated methods of
recognizing a person based on a physiological
or behavioral characteristic.
 Physiological characteristics are related with
the shape of the body.
 Behavioral charcteristics are related with
behavior of a person included but not limited to
voice recognition.

IQBAL
Reg # 9952
MBA(M) – Section A
 Speech Recognition Simply is the process of
converting spoken input to text.
 It is also known as Speech-to-Text and Voice
Recognition.
 Technically Speech recognition is the process of
converting an acoustic signal, captured by a
microphone or a telephone, to a set of words.
 Dragon Naturally Speaking developed and
acquired by Dragon Systems and Nuance
Communications respectively.
 Microsoft Speech Recognition by Microsoft.
 Via Voice by IBM
 NUANCE COMMUNICATIONS:-
 This Nuance Communications is a 
multinational computer software technology
 corporation, headquartered in Burlington,
Massachusetts, USA, that provides speech and
imaging applications.
Current business products focus on server & embedded
speech recognition, telephone call steering systems,
automated telephone directory services, medical
transcription software & systems, optical character
recognition software, and desktop imaging software.
ScanSoft and Nuance merged in October 2005;
before the merger, the two companies competed in
the commercial large scale speech application
business.
 Nuance was founded in 1994 as a spinoff
of SRI International's Speech Technology
and Research (STAR) Laboratory to
commercialise the speaker-independent
speech recognition technology developed for
the US government at SRI.
 Based in Menlo Park, California, Nuance
deployed their first commercial large-scale
speech application in 1996.
1994 – Nuance spun off from SRI's
STAR Lab.
1996 – Nuance deployed its first
commercial speech application.
2000 April 13 – Nuance files initial
public offering on the Nasdaq under the
symbol NUANE
 Dragon speech recognition software is a
Naturally Speaking Language.
 This software has three primary features of
functionality.
 Dictation
 Text-To-Speech
 Command Input
 Dictation
 As user dictates the words it will converts it into
text and it displays.
 Text-To-Speech
 And as text what is present or selected can be
converted to speech.
 Command Input
 User can control the operations by means of
his voice without using keyboard by just giving
commands.
 TRANSLATION
 It cannot translate from one language to
another language here comes translation
problem.
 UNTRAINED
 It cannot work without training ,training is
required,dynamic acceptance is not present.
 PLATFORM DEPENDENT
 It cannot work on another platforms other than
windows like mac o.s,ubuntu etc.
• To develop a translation feature in near
future to spread the availabilty of
product to all type of users.
• To make the system platform
independent.
• Home Automation
There is a lot of interest in the use of SR in
domestic appliances such as ovens,
refrigerators, dishwashers and washing
machines.
• Wearable Computers
The most futuristic application is in the use
and functionality of wearable computers.
The most futuristic application is in the
use and functionality of wearable
computers. These would allow people
to go about their everyday lives, but
still store information (thoughts, notes, to-do lists)
verbally, or communicate via email, phone or videophone,
through wearable devices. Crucially, this would be done
without having to interact with the device, or even
remember that it is there; the user would just speak, the
device would know what to do with the speech, and would
carry out the appropriate task.
• People with Disabilities
Speech recognition technology helps people with
disabilities interact with computers more easily.
People with motor limitations, who cannot use a
standard keyboard and mouse, can use their voices
to navigate the computer and create documents.
• Dyslexic People
Speech Recognition Technology is helpful for people
with learning disabilities, who experience difficulty
with spelling and writing.
 Speech to text module
 Command Input module
 Input predefined execute
command commands command
define
command |
 Sound Cards
soundcard with the cleanest A/D (Analog
to Digital) conversions are recommended.
 Microphone
The best choice for microphone is the
headset style.
 Computers / Processors
The more the speed the better Speech
Recognition would work. For good Speech
Recognition you should be having 1 GHz
processor and 1 GB of RAM.
 Windows Operating System(NT,XP,7,8).
 Audio Driver Software
 As for a bussiness like online
shopping,organisations like amazon etc have
separate dept for replying to customers in that
place of replying e-mails this can be used to
minimisation of time.
 Cost required for developing the product is
more.
 Time required for developing the product is
medium.
• Speech recognition will revolutionize the way
people conduct business over the Web and will,
ultimately, differentiate world-class e-
businesses. VoiceXML ties speech recognition
and telephony together and provides the
technology with which businesses can develop
and deploy voice-enabled Web solutions
TODAY!
 These solutions can greatly expand the
accessibility of Web-based self-service
transactions to customers who would otherwise
not have access, and, at the same time,
leverage a business’ existing Web investments.
 Speech recognition and VoiceXML clearly
represent the next wave of the Web. In near
future people will be using their home and
business computers by speech not by keyboard
or mouse. Home automation will be completely
based on speech recognition system. 
Abstract of speech recognition

Más contenido relacionado

La actualidad más candente

Voice Recognition
Voice RecognitionVoice Recognition
Voice Recognition
Amrita More
 
Download as PPTX - PowerPoint Presentation
Download as PPTX - PowerPoint PresentationDownload as PPTX - PowerPoint Presentation
Download as PPTX - PowerPoint Presentation
butest
 

La actualidad más candente (20)

Speech recognition An overview
Speech recognition An overviewSpeech recognition An overview
Speech recognition An overview
 
Voice Recognition
Voice RecognitionVoice Recognition
Voice Recognition
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Voice recognition
Voice recognitionVoice recognition
Voice recognition
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technology
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Silent sound technologyrevathippt
Silent sound technologyrevathipptSilent sound technologyrevathippt
Silent sound technologyrevathippt
 
Artificial intelligence Speech recognition system
Artificial intelligence Speech recognition systemArtificial intelligence Speech recognition system
Artificial intelligence Speech recognition system
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 
The State of Automatic Speech Recognition 2022 (2).pdf
The State of Automatic Speech Recognition 2022 (2).pdfThe State of Automatic Speech Recognition 2022 (2).pdf
The State of Automatic Speech Recognition 2022 (2).pdf
 
biometric technology
biometric technologybiometric technology
biometric technology
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
 
E0ad silent sound technology
E0ad silent  sound technologyE0ad silent  sound technology
E0ad silent sound technology
 
Text to-speech & voice recognition
Text to-speech & voice recognitionText to-speech & voice recognition
Text to-speech & voice recognition
 
Biometric technology
Biometric technologyBiometric technology
Biometric technology
 
Download as PPTX - PowerPoint Presentation
Download as PPTX - PowerPoint PresentationDownload as PPTX - PowerPoint Presentation
Download as PPTX - PowerPoint Presentation
 
Speech Recognition by Iqbal
Speech Recognition by IqbalSpeech Recognition by Iqbal
Speech Recognition by Iqbal
 
Smart note taker ppt
Smart note taker ppt  Smart note taker ppt
Smart note taker ppt
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
 
Ppt presentation
Ppt presentationPpt presentation
Ppt presentation
 

Destacado (7)

Speech recognition project report
Speech recognition project reportSpeech recognition project report
Speech recognition project report
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentation
 
Speech Recognition System By Matlab
Speech Recognition System By MatlabSpeech Recognition System By Matlab
Speech Recognition System By Matlab
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Sample project abstract
Sample project abstractSample project abstract
Sample project abstract
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
Slideshare Powerpoint presentation
Slideshare Powerpoint presentationSlideshare Powerpoint presentation
Slideshare Powerpoint presentation
 

Similar a Abstract of speech recognition

Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech Recognition
Thejus Joby
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
ankit_saluja
 

Similar a Abstract of speech recognition (20)

10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
10 World’s Leading Speech or Voice Recognition Software That Can 3X Your Prod...
 
Presentation.ai
Presentation.aiPresentation.ai
Presentation.ai
 
Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software Top 10 Best Speech Recognition Software
Top 10 Best Speech Recognition Software
 
Speech recognizers & generators
Speech recognizers & generatorsSpeech recognizers & generators
Speech recognizers & generators
 
ICT, Importance of programming and programming languages
ICT, Importance of programming and programming languagesICT, Importance of programming and programming languages
ICT, Importance of programming and programming languages
 
Wake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phoneWake-up-word speech recognition using GPS on smart phone
Wake-up-word speech recognition using GPS on smart phone
 
Artificial Intelligence for Speech Recognition
Artificial Intelligence for Speech RecognitionArtificial Intelligence for Speech Recognition
Artificial Intelligence for Speech Recognition
 
Paper on Speech Recognition
Paper on Speech RecognitionPaper on Speech Recognition
Paper on Speech Recognition
 
Google Voice-to-text
Google Voice-to-textGoogle Voice-to-text
Google Voice-to-text
 
Speech to text conversion
Speech to text conversionSpeech to text conversion
Speech to text conversion
 
30
3030
30
 
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
IRJET- Communication System for Blind, Deaf and Dumb People using Internet of...
 
Instant speech translation 10BM60080 - VGSOM
Instant speech translation   10BM60080 - VGSOMInstant speech translation   10BM60080 - VGSOM
Instant speech translation 10BM60080 - VGSOM
 
Seminar
SeminarSeminar
Seminar
 
Speechrecognition 100423091251-phpapp01
Speechrecognition 100423091251-phpapp01Speechrecognition 100423091251-phpapp01
Speechrecognition 100423091251-phpapp01
 
Noise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech RecognitionNoise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech Recognition
 
Computer system
Computer systemComputer system
Computer system
 
Speech Recognition
Speech Recognition Speech Recognition
Speech Recognition
 
AI for voice recognition.pptx
AI for voice recognition.pptxAI for voice recognition.pptx
AI for voice recognition.pptx
 
D1803041822
D1803041822D1803041822
D1803041822
 

Último

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Último (20)

MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 

Abstract of speech recognition

  • 1.
  • 3.  Biometrics are automated methods of recognizing a person based on a physiological or behavioral characteristic.  Physiological characteristics are related with the shape of the body.  Behavioral charcteristics are related with behavior of a person included but not limited to voice recognition. 
  • 4.
  • 5. IQBAL Reg # 9952 MBA(M) – Section A
  • 6.
  • 7.  Speech Recognition Simply is the process of converting spoken input to text.  It is also known as Speech-to-Text and Voice Recognition.  Technically Speech recognition is the process of converting an acoustic signal, captured by a microphone or a telephone, to a set of words.
  • 8.  Dragon Naturally Speaking developed and acquired by Dragon Systems and Nuance Communications respectively.
  • 9.  Microsoft Speech Recognition by Microsoft.  Via Voice by IBM
  • 10.  NUANCE COMMUNICATIONS:-  This Nuance Communications is a  multinational computer software technology  corporation, headquartered in Burlington, Massachusetts, USA, that provides speech and imaging applications.
  • 11. Current business products focus on server & embedded speech recognition, telephone call steering systems, automated telephone directory services, medical transcription software & systems, optical character recognition software, and desktop imaging software. ScanSoft and Nuance merged in October 2005; before the merger, the two companies competed in the commercial large scale speech application business.
  • 12.  Nuance was founded in 1994 as a spinoff of SRI International's Speech Technology and Research (STAR) Laboratory to commercialise the speaker-independent speech recognition technology developed for the US government at SRI.  Based in Menlo Park, California, Nuance deployed their first commercial large-scale speech application in 1996.
  • 13. 1994 – Nuance spun off from SRI's STAR Lab. 1996 – Nuance deployed its first commercial speech application. 2000 April 13 – Nuance files initial public offering on the Nasdaq under the symbol NUANE
  • 14.  Dragon speech recognition software is a Naturally Speaking Language.  This software has three primary features of functionality.  Dictation  Text-To-Speech  Command Input
  • 15.  Dictation  As user dictates the words it will converts it into text and it displays.  Text-To-Speech  And as text what is present or selected can be converted to speech.  Command Input  User can control the operations by means of his voice without using keyboard by just giving commands.
  • 16.  TRANSLATION  It cannot translate from one language to another language here comes translation problem.  UNTRAINED  It cannot work without training ,training is required,dynamic acceptance is not present.
  • 17.  PLATFORM DEPENDENT  It cannot work on another platforms other than windows like mac o.s,ubuntu etc.
  • 18. • To develop a translation feature in near future to spread the availabilty of product to all type of users. • To make the system platform independent.
  • 19. • Home Automation There is a lot of interest in the use of SR in domestic appliances such as ovens, refrigerators, dishwashers and washing machines. • Wearable Computers The most futuristic application is in the use and functionality of wearable computers.
  • 20. The most futuristic application is in the use and functionality of wearable computers. These would allow people to go about their everyday lives, but still store information (thoughts, notes, to-do lists) verbally, or communicate via email, phone or videophone, through wearable devices. Crucially, this would be done without having to interact with the device, or even remember that it is there; the user would just speak, the device would know what to do with the speech, and would carry out the appropriate task.
  • 21. • People with Disabilities Speech recognition technology helps people with disabilities interact with computers more easily. People with motor limitations, who cannot use a standard keyboard and mouse, can use their voices to navigate the computer and create documents. • Dyslexic People Speech Recognition Technology is helpful for people with learning disabilities, who experience difficulty with spelling and writing.
  • 22.  Speech to text module
  • 23.  Command Input module  Input predefined execute command commands command define command |
  • 24.  Sound Cards soundcard with the cleanest A/D (Analog to Digital) conversions are recommended.  Microphone The best choice for microphone is the headset style.
  • 25.  Computers / Processors The more the speed the better Speech Recognition would work. For good Speech Recognition you should be having 1 GHz processor and 1 GB of RAM.
  • 26.  Windows Operating System(NT,XP,7,8).  Audio Driver Software
  • 27.  As for a bussiness like online shopping,organisations like amazon etc have separate dept for replying to customers in that place of replying e-mails this can be used to minimisation of time.  Cost required for developing the product is more.  Time required for developing the product is medium.
  • 28. • Speech recognition will revolutionize the way people conduct business over the Web and will, ultimately, differentiate world-class e- businesses. VoiceXML ties speech recognition and telephony together and provides the technology with which businesses can develop and deploy voice-enabled Web solutions TODAY!
  • 29.  These solutions can greatly expand the accessibility of Web-based self-service transactions to customers who would otherwise not have access, and, at the same time, leverage a business’ existing Web investments.
  • 30.  Speech recognition and VoiceXML clearly represent the next wave of the Web. In near future people will be using their home and business computers by speech not by keyboard or mouse. Home automation will be completely based on speech recognition system.