Abstract of speech recognition

Introduction
Physiological Characteristics
Behavioral Characteristic

 Biometrics are automated methods of
recognizing a person based on a physiological
or behavioral characteristic.
 Physiological characteristics are related with
the shape of the body.
 Behavioral charcteristics are related with
behavior of a person included but not limited to
voice recognition.


IQBAL
Reg # 9952
MBA(M) – Section A

 Speech Recognition Simply is the process of
converting spoken input to text.
 It is also known as Speech-to-Text and Voice
Recognition.
 Technically Speech recognition is the process of
converting an acoustic signal, captured by a
microphone or a telephone, to a set of words.

 Dragon Naturally Speaking developed and
acquired by Dragon Systems and Nuance
Communications respectively.

 Microsoft Speech Recognition by Microsoft.
 Via Voice by IBM

 NUANCE COMMUNICATIONS:-
 This Nuance Communications is a
multinational computer software technology
corporation, headquartered in Burlington,
Massachusetts, USA, that provides speech and
imaging applications.

Current business products focus on server & embedded
speech recognition, telephone call steering systems,
automated telephone directory services, medical
transcription software & systems, optical character
recognition software, and desktop imaging software.
ScanSoft and Nuance merged in October 2005;
before the merger, the two companies competed in
the commercial large scale speech application
business.

 Nuance was founded in 1994 as a spinoff
of SRI International's Speech Technology
and Research (STAR) Laboratory to
commercialise the speaker-independent
speech recognition technology developed for
the US government at SRI.
 Based in Menlo Park, California, Nuance
deployed their first commercial large-scale
speech application in 1996.

1994 – Nuance spun off from SRI's
STAR Lab.
1996 – Nuance deployed its first
commercial speech application.
2000 April 13 – Nuance files initial
public offering on the Nasdaq under the
symbol NUANE

 Dragon speech recognition software is a
Naturally Speaking Language.
 This software has three primary features of
functionality.
 Dictation
 Text-To-Speech
 Command Input

 Dictation
 As user dictates the words it will converts it into
text and it displays.
 Text-To-Speech
 And as text what is present or selected can be
converted to speech.
 Command Input
 User can control the operations by means of
his voice without using keyboard by just giving
commands.

 TRANSLATION
 It cannot translate from one language to
another language here comes translation
problem.
 UNTRAINED
 It cannot work without training ,training is
required,dynamic acceptance is not present.

 PLATFORM DEPENDENT
 It cannot work on another platforms other than
windows like mac o.s,ubuntu etc.

• To develop a translation feature in near
future to spread the availabilty of
product to all type of users.
• To make the system platform
independent.

• Home Automation
There is a lot of interest in the use of SR in
domestic appliances such as ovens,
refrigerators, dishwashers and washing
machines.
• Wearable Computers
The most futuristic application is in the use
and functionality of wearable computers.

The most futuristic application is in the
use and functionality of wearable
computers. These would allow people
to go about their everyday lives, but
still store information (thoughts, notes, to-do lists)
verbally, or communicate via email, phone or videophone,
through wearable devices. Crucially, this would be done
without having to interact with the device, or even
remember that it is there; the user would just speak, the
device would know what to do with the speech, and would
carry out the appropriate task.

• People with Disabilities
Speech recognition technology helps people with
disabilities interact with computers more easily.
People with motor limitations, who cannot use a
standard keyboard and mouse, can use their voices
to navigate the computer and create documents.
• Dyslexic People
Speech Recognition Technology is helpful for people
with learning disabilities, who experience difficulty
with spelling and writing.

 Command Input module
 Input predefined execute
command commands command
define
command |

 Sound Cards
soundcard with the cleanest A/D (Analog
to Digital) conversions are recommended.
 Microphone
The best choice for microphone is the
headset style.

 Computers / Processors
The more the speed the better Speech
Recognition would work. For good Speech
Recognition you should be having 1 GHz
processor and 1 GB of RAM.

 Windows Operating System(NT,XP,7,8).
 Audio Driver Software

 As for a bussiness like online
shopping,organisations like amazon etc have
separate dept for replying to customers in that
place of replying e-mails this can be used to
minimisation of time.
 Cost required for developing the product is
more.
 Time required for developing the product is
medium.

• Speech recognition will revolutionize the way
people conduct business over the Web and will,
ultimately, differentiate world-class e-
businesses. VoiceXML ties speech recognition
and telephony together and provides the
technology with which businesses can develop
and deploy voice-enabled Web solutions
TODAY!

 These solutions can greatly expand the
accessibility of Web-based self-service
transactions to customers who would otherwise
not have access, and, at the same time,
leverage a business’ existing Web investments.

 Speech recognition and VoiceXML clearly
represent the next wave of the Web. In near
future people will be using their home and
business computers by speech not by keyboard
or mouse. Home automation will be completely
based on speech recognition system.

Abstract of speech recognition

Abstract of speech recognition

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Destacado

Destacado (7)

Similar a Abstract of speech recognition

Similar a Abstract of speech recognition (20)

Último

Último (20)

Abstract of speech recognition