SlideShare una empresa de Scribd logo
1 de 33
Major Project Mid-Term Presentation :Speaker Verification for Remote Authentication Members:  Ganesh Tiwari (063BCT510) MadhavPandey(063BCT514) ManojShrestha(063BCT518) Supervisor :  Dr. SubarnaShakya Associate Professor
Introduction Voice biometric system user login Text-Prompted system The claimant is asked to speak a prompted text  Speech and Speaker Recognition/Verification More secure to playback attack. Web Application Client (Adobe Flex) : Voice Capture, preprocessing and feature extraction  Server (JAVA) : Training / Classification BlazeDS RPC for JAVA-Flex Connectivity
Block Diagram of Speaker / Speech Recognition System
Signal Capture and Pre-Processing
Capture and Preprocessing Get the audio signal i.e., ADC Make suitable for feature extraction
Capture and Preprocessing :Capture 22050 Hz 16-bits,Signed Little Endian Mono Uncompressed PCM
Capture and Preprocessing :PCM Extract
Capture and Preprocessing : Silence Removal Algorithm described in paper ‘a new method for silence removal and endpoint detection’ † †G. Saha, SandipanChakroborty, SumanSenapati of Department of Electronics and Electrical Communication Engineering, Indian Institute of Technology, Khragpur, India
Capture and Preprocessing :Pre-Emphasis Boosting the high frequency energy In time domain, y[n] = x[n]−αx[n−1], 0.9 ≤ α≤ 1.0
Capture and Preprocessing : Framing Speech Signal is stationary (statistical properties) for 10-30 ms 50% overlapped frames  each of 23ms is used
Capture and Preprocessing :Windowing Windowing is done on the frame blocked signal Hamming window
Feature Extraction
Feature Extraction Transform the input audio signal into a sequence of acoustic feature vectors MFCC : Mel Filter CepstralCoefficients as Feature Perceptual approach  Human Ear processes audio signal in Mel scale Mel scale : linear up to 1KHz and logarithmic after 1KHz MFCC gives distribution of energy in Mel frequency band Calculated for each frame
Feature Extraction : Fourier Transform Gives information about the amount of energy at each frequency band FFT used
Feature Extraction : Mel Filter We used filter bank of triangular filters spaced in Mel scale
Feature Extraction : Mel Filter (contd..) Mel Filter Where,
Feature Extraction :Log, IFT(DCT) Log DCT 	MFCC
Feature Extraction : Cepstral Mean Subtraction CMS: for minimizing channel effect
Feature Extraction : Energy and Deltas For completeness of feature vector and for achieving high recognition rate A Energy Feature A delta or velocity feature, and a double delta or acceleration feature Calculated by linear regression of regression window M
Composition of Feature Vector 12 MFCC Features 12 Delta MFCC 12 Delta-Delta MFCC 1 Energy Feature 1 Delta Energy Feature 1 Delta-Delta Energy Feature  39 Features from each frame
Speaker Recognition/Verification by GMM
Gaussian Mixture Model Parametric probability density function Based on clustering technique M Gaussian components 𝑝(𝑥/)= 𝑚=1𝑀𝑤𝑚 .  𝑔𝑚(𝑥/𝜇𝑚 , 𝐶𝑚) 𝑥: a k-dimensional random vector 𝑤𝑚: mixture weight of mth component 𝑔𝑚 : k-dimensional Gaussian function (pdf)  𝑔𝑚𝑥/𝜇𝑚 , 𝐶𝑚  = 12𝜋𝐾.|𝐶𝑚| exp{−12𝑥−𝜇𝑚 .(𝐶𝑚−1(𝑥−𝜇𝑚 ))}  = (𝑤𝑚, 𝜇𝑚 ,𝐶𝑚)  
GMM Training Goal: estimate the parameters Method: Maximum Likelihood estimation Input: X = {𝑥1,𝑥1,…,𝑥𝑇} P(X/) =𝑡=1𝑇𝑝(𝑥𝑡/) Maximize with Expectation Maximization algorithm  Iterative process:  initial model: 𝑖 new model: 𝑖+1 P(X/ 𝑖+1) ≥ P(X/ 𝑖)  
Verification Decision: Hypothesis Test 	H0: the speaker is the claimed speaker 	H1: the speaker is an imposter Based on likelihood ratio 		 = P(X/)P(X/) Decision by threshold < 𝜃𝑇reject identity claim  > 𝜃𝑇 accept identity claim  
Speech Recognition by HMM/VQ
Hidden Markov Model :Definition Hidden Markov Model (HMM) is the statistical model HMM is the extension of Markov Process HMM has hidden states and observable symbols per states HMM Model : Observed data : feature vector  Hidden states : phonemes
Codebook Generation K-Means Clustering Clustering the whole database & Codebook Generation VQ : Vector Quantization is used for mapping each input feature vector to discrete quantized symbols Codebook for each incoming feature vector is built  Compare it to each of the prototype vectors in codebook  Select the one which is closest (by some distancemetric) Replace the input vector by the index of this prototype vector observation sequence
Speech Recognition System: By : HMM / VQ
Hidden Markov Model :Training Training by:  Forward backward (Baum-Welch) algorithm Forward-backward algorithm iteratively re-estimates the parameters and improves the probability that given observation are generated  by the new parameters Three parameters need to be re-estimated: Initial state distribution: πi Transition probabilities: ai,j Emission probabilities: bi(ot) Input is observation sequence, given by VQ
Hidden Markov Model :Verification/Matching Viterbi algorithm is used Input is  Observation sequence, given by VQ HMM model of the word Best matched word is returned
Problem Faced Learning curve Complex Mathematics Flex & Java  Connectivity (initially) Data conversion
Remaining Tasks Speech Training Data Collection Model Training (HMM, GMM) Module Integration Testing
Thanks

Más contenido relacionado

La actualidad más candente

Automatic Speaker Recognition system using MFCC and VQ approach
Automatic Speaker Recognition system using MFCC and VQ approachAutomatic Speaker Recognition system using MFCC and VQ approach
Automatic Speaker Recognition system using MFCC and VQ approachAbdullah al Mamun
 
Text-Independent Speaker Verification Report
Text-Independent Speaker Verification ReportText-Independent Speaker Verification Report
Text-Independent Speaker Verification ReportCody Ray
 
SPEKER RECOGNITION UNDER LIMITED DATA CODITION
SPEKER RECOGNITION UNDER LIMITED DATA CODITIONSPEKER RECOGNITION UNDER LIMITED DATA CODITION
SPEKER RECOGNITION UNDER LIMITED DATA CODITIONniranjan kumar
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognitionRichie
 
Voice biometric recognition
Voice biometric recognitionVoice biometric recognition
Voice biometric recognitionphyuhsan
 
COLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech AnalysisCOLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech AnalysisRushin Shah
 
Isolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkIsolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkeSAT Journals
 
LPC for Speech Recognition
LPC for Speech RecognitionLPC for Speech Recognition
LPC for Speech RecognitionDr. Uday Saikia
 
Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...
Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...
Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...IJERA Editor
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...ijceronline
 
Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...
Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...
Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...Ahmed Ayman
 
Konversa.docx - konversa.googlecode.com
Konversa.docx - konversa.googlecode.comKonversa.docx - konversa.googlecode.com
Konversa.docx - konversa.googlecode.combutest
 
Comparative evaluation of bit error rate for different ofdm subcarriers in ra...
Comparative evaluation of bit error rate for different ofdm subcarriers in ra...Comparative evaluation of bit error rate for different ofdm subcarriers in ra...
Comparative evaluation of bit error rate for different ofdm subcarriers in ra...ijmnct
 

La actualidad más candente (19)

Automatic Speaker Recognition system using MFCC and VQ approach
Automatic Speaker Recognition system using MFCC and VQ approachAutomatic Speaker Recognition system using MFCC and VQ approach
Automatic Speaker Recognition system using MFCC and VQ approach
 
Text-Independent Speaker Verification Report
Text-Independent Speaker Verification ReportText-Independent Speaker Verification Report
Text-Independent Speaker Verification Report
 
SPEKER RECOGNITION UNDER LIMITED DATA CODITION
SPEKER RECOGNITION UNDER LIMITED DATA CODITIONSPEKER RECOGNITION UNDER LIMITED DATA CODITION
SPEKER RECOGNITION UNDER LIMITED DATA CODITION
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
Voice biometric recognition
Voice biometric recognitionVoice biometric recognition
Voice biometric recognition
 
Speaker recognition.
Speaker recognition.Speaker recognition.
Speaker recognition.
 
COLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech AnalysisCOLEA : A MATLAB Tool for Speech Analysis
COLEA : A MATLAB Tool for Speech Analysis
 
Isolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkIsolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural network
 
LPC for Speech Recognition
LPC for Speech RecognitionLPC for Speech Recognition
LPC for Speech Recognition
 
Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...
Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...
Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
 
Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...
Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...
Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...
 
Konversa.docx - konversa.googlecode.com
Konversa.docx - konversa.googlecode.comKonversa.docx - konversa.googlecode.com
Konversa.docx - konversa.googlecode.com
 
Matlab: Speech Signal Analysis
Matlab: Speech Signal AnalysisMatlab: Speech Signal Analysis
Matlab: Speech Signal Analysis
 
Speech Signal Analysis
Speech Signal AnalysisSpeech Signal Analysis
Speech Signal Analysis
 
Speech compression-using-gsm
Speech compression-using-gsmSpeech compression-using-gsm
Speech compression-using-gsm
 
Comparative evaluation of bit error rate for different ofdm subcarriers in ra...
Comparative evaluation of bit error rate for different ofdm subcarriers in ra...Comparative evaluation of bit error rate for different ofdm subcarriers in ra...
Comparative evaluation of bit error rate for different ofdm subcarriers in ra...
 
Speaker Segmentation (2006)
Speaker Segmentation (2006)Speaker Segmentation (2006)
Speaker Segmentation (2006)
 

Destacado

Speech Recognition , Noise Filtering and Content Search Engine , Research Do...
Speech Recognition , Noise Filtering and  Content Search Engine , Research Do...Speech Recognition , Noise Filtering and  Content Search Engine , Research Do...
Speech Recognition , Noise Filtering and Content Search Engine , Research Do...Gayan Kalanamith Mannapperuma
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCCHira Shaukat
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologySeminar Links
 
Introductory Lecture to Audio Signal Processing
Introductory Lecture to Audio Signal ProcessingIntroductory Lecture to Audio Signal Processing
Introductory Lecture to Audio Signal ProcessingAngelo Salatino
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminarDiptimaya Sarangi
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentationhimanshubhatti
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognitionCharu Joshi
 
Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks Chiranjeevi Adi
 
Voice wiki on mobile project report
Voice wiki on mobile project reportVoice wiki on mobile project report
Voice wiki on mobile project reportRahul E
 
Anna Denisova - Two Realizations of Probability Anomaly Detector with Differ...
Anna Denisova - Two Realizations of Probability Anomaly Detector with  Differ...Anna Denisova - Two Realizations of Probability Anomaly Detector with  Differ...
Anna Denisova - Two Realizations of Probability Anomaly Detector with Differ...AIST
 
Speech recognition using vector quantization through modified k means lbg alg...
Speech recognition using vector quantization through modified k means lbg alg...Speech recognition using vector quantization through modified k means lbg alg...
Speech recognition using vector quantization through modified k means lbg alg...Alexander Decker
 
MFCC Malta - MICE Presentation 2017
MFCC Malta - MICE Presentation 2017MFCC Malta - MICE Presentation 2017
MFCC Malta - MICE Presentation 2017MICEboard
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognitionboddu syamprasad
 
Lossy Compression Using Stationary Wavelet Transform and Vector Quantization
Lossy Compression Using Stationary Wavelet Transform and Vector QuantizationLossy Compression Using Stationary Wavelet Transform and Vector Quantization
Lossy Compression Using Stationary Wavelet Transform and Vector QuantizationOmar Ghazi
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech RecognitionAhmed Moawad
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition systemAlok Tiwari
 
Voice Identification And Recognition System, Matlab
Voice Identification And Recognition System, MatlabVoice Identification And Recognition System, Matlab
Voice Identification And Recognition System, MatlabSohaib Tallat
 
AUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEYAUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEYIJCERT
 

Destacado (20)

Speech Recognition , Noise Filtering and Content Search Engine , Research Do...
Speech Recognition , Noise Filtering and  Content Search Engine , Research Do...Speech Recognition , Noise Filtering and  Content Search Engine , Research Do...
Speech Recognition , Noise Filtering and Content Search Engine , Research Do...
 
Speaker recognition using MFCC
Speaker recognition using MFCCSpeaker recognition using MFCC
Speaker recognition using MFCC
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Introductory Lecture to Audio Signal Processing
Introductory Lecture to Audio Signal ProcessingIntroductory Lecture to Audio Signal Processing
Introductory Lecture to Audio Signal Processing
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminar
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentation
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 
Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks Hand Written Character Recognition Using Neural Networks
Hand Written Character Recognition Using Neural Networks
 
Voice wiki on mobile project report
Voice wiki on mobile project reportVoice wiki on mobile project report
Voice wiki on mobile project report
 
Anna Denisova - Two Realizations of Probability Anomaly Detector with Differ...
Anna Denisova - Two Realizations of Probability Anomaly Detector with  Differ...Anna Denisova - Two Realizations of Probability Anomaly Detector with  Differ...
Anna Denisova - Two Realizations of Probability Anomaly Detector with Differ...
 
Speech recognition using vector quantization through modified k means lbg alg...
Speech recognition using vector quantization through modified k means lbg alg...Speech recognition using vector quantization through modified k means lbg alg...
Speech recognition using vector quantization through modified k means lbg alg...
 
MFCC Malta - MICE Presentation 2017
MFCC Malta - MICE Presentation 2017MFCC Malta - MICE Presentation 2017
MFCC Malta - MICE Presentation 2017
 
Hidden markov model
Hidden markov modelHidden markov model
Hidden markov model
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
Lossy Compression Using Stationary Wavelet Transform and Vector Quantization
Lossy Compression Using Stationary Wavelet Transform and Vector QuantizationLossy Compression Using Stationary Wavelet Transform and Vector Quantization
Lossy Compression Using Stationary Wavelet Transform and Vector Quantization
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
 
Voice Identification And Recognition System, Matlab
Voice Identification And Recognition System, MatlabVoice Identification And Recognition System, Matlab
Voice Identification And Recognition System, Matlab
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
AUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEYAUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEY
 

Similar a Text Prompted Remote Speaker Authentication : Joint Speech and Speaker Recognition/Verification System Mid-term Project Presentation

Making Custom Oscilloscope Measurements
Making Custom Oscilloscope MeasurementsMaking Custom Oscilloscope Measurements
Making Custom Oscilloscope Measurementsteledynelecroy
 
TinyML - 4 speech recognition
TinyML - 4 speech recognition TinyML - 4 speech recognition
TinyML - 4 speech recognition 艾鍗科技
 
Multiuser Detection with Decision-feedback Detectors and PIC in MC-CDMA System
Multiuser Detection with Decision-feedback Detectors and PIC in MC-CDMA SystemMultiuser Detection with Decision-feedback Detectors and PIC in MC-CDMA System
Multiuser Detection with Decision-feedback Detectors and PIC in MC-CDMA SystemTELKOMNIKA JOURNAL
 
Speech recognition final
Speech recognition finalSpeech recognition final
Speech recognition finalArchit Vora
 
Real-time Implementation of Sphere Decoder-based MIMO Wireless System (EUSIPC...
Real-time Implementation of Sphere Decoder-based MIMO Wireless System (EUSIPC...Real-time Implementation of Sphere Decoder-based MIMO Wireless System (EUSIPC...
Real-time Implementation of Sphere Decoder-based MIMO Wireless System (EUSIPC...TSC University of Mondragon
 
damaro.ppt
damaro.pptdamaro.ppt
damaro.pptVideoguy
 
BER performance simulation in a multi user MIMO Presentation
BER performance simulation in a multi user MIMO PresentationBER performance simulation in a multi user MIMO Presentation
BER performance simulation in a multi user MIMO PresentationVikas Pandey
 
Speech Recognition System By Matlab
Speech Recognition System By MatlabSpeech Recognition System By Matlab
Speech Recognition System By MatlabAnkit Gujrati
 
Performance evaluation of multicast video distribution using lte a in vehicul...
Performance evaluation of multicast video distribution using lte a in vehicul...Performance evaluation of multicast video distribution using lte a in vehicul...
Performance evaluation of multicast video distribution using lte a in vehicul...Communication Systems & Networks
 
Melp codec optimization using DSP kit
Melp codec optimization using DSP kitMelp codec optimization using DSP kit
Melp codec optimization using DSP kitsohaibaslam207
 

Similar a Text Prompted Remote Speaker Authentication : Joint Speech and Speaker Recognition/Verification System Mid-term Project Presentation (20)

Project sinopsis final M.TECH ( PDF FILE )
Project sinopsis final M.TECH ( PDF FILE )Project sinopsis final M.TECH ( PDF FILE )
Project sinopsis final M.TECH ( PDF FILE )
 
Project sinopsis final M.TECH ( M S WORD FILE )
Project sinopsis final M.TECH ( M S WORD FILE )Project sinopsis final M.TECH ( M S WORD FILE )
Project sinopsis final M.TECH ( M S WORD FILE )
 
Making Custom Oscilloscope Measurements
Making Custom Oscilloscope MeasurementsMaking Custom Oscilloscope Measurements
Making Custom Oscilloscope Measurements
 
4g lte matlab
4g lte matlab4g lte matlab
4g lte matlab
 
M.sc. m kamel
M.sc. m kamelM.sc. m kamel
M.sc. m kamel
 
N017428692
N017428692N017428692
N017428692
 
Resume
ResumeResume
Resume
 
G0361052058
G0361052058G0361052058
G0361052058
 
TinyML - 4 speech recognition
TinyML - 4 speech recognition TinyML - 4 speech recognition
TinyML - 4 speech recognition
 
Multiuser Detection with Decision-feedback Detectors and PIC in MC-CDMA System
Multiuser Detection with Decision-feedback Detectors and PIC in MC-CDMA SystemMultiuser Detection with Decision-feedback Detectors and PIC in MC-CDMA System
Multiuser Detection with Decision-feedback Detectors and PIC in MC-CDMA System
 
Speech recognition final
Speech recognition finalSpeech recognition final
Speech recognition final
 
Real-time Implementation of Sphere Decoder-based MIMO Wireless System (EUSIPC...
Real-time Implementation of Sphere Decoder-based MIMO Wireless System (EUSIPC...Real-time Implementation of Sphere Decoder-based MIMO Wireless System (EUSIPC...
Real-time Implementation of Sphere Decoder-based MIMO Wireless System (EUSIPC...
 
F0331031037
F0331031037F0331031037
F0331031037
 
damaro.ppt
damaro.pptdamaro.ppt
damaro.ppt
 
BER performance simulation in a multi user MIMO Presentation
BER performance simulation in a multi user MIMO PresentationBER performance simulation in a multi user MIMO Presentation
BER performance simulation in a multi user MIMO Presentation
 
MIMO OFDM
MIMO OFDMMIMO OFDM
MIMO OFDM
 
Speech Recognition System By Matlab
Speech Recognition System By MatlabSpeech Recognition System By Matlab
Speech Recognition System By Matlab
 
Digital communication unit II
Digital communication unit IIDigital communication unit II
Digital communication unit II
 
Performance evaluation of multicast video distribution using lte a in vehicul...
Performance evaluation of multicast video distribution using lte a in vehicul...Performance evaluation of multicast video distribution using lte a in vehicul...
Performance evaluation of multicast video distribution using lte a in vehicul...
 
Melp codec optimization using DSP kit
Melp codec optimization using DSP kitMelp codec optimization using DSP kit
Melp codec optimization using DSP kit
 

Último

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 

Último (20)

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 

Text Prompted Remote Speaker Authentication : Joint Speech and Speaker Recognition/Verification System Mid-term Project Presentation

  • 1. Major Project Mid-Term Presentation :Speaker Verification for Remote Authentication Members: Ganesh Tiwari (063BCT510) MadhavPandey(063BCT514) ManojShrestha(063BCT518) Supervisor : Dr. SubarnaShakya Associate Professor
  • 2. Introduction Voice biometric system user login Text-Prompted system The claimant is asked to speak a prompted text Speech and Speaker Recognition/Verification More secure to playback attack. Web Application Client (Adobe Flex) : Voice Capture, preprocessing and feature extraction Server (JAVA) : Training / Classification BlazeDS RPC for JAVA-Flex Connectivity
  • 3. Block Diagram of Speaker / Speech Recognition System
  • 4. Signal Capture and Pre-Processing
  • 5. Capture and Preprocessing Get the audio signal i.e., ADC Make suitable for feature extraction
  • 6. Capture and Preprocessing :Capture 22050 Hz 16-bits,Signed Little Endian Mono Uncompressed PCM
  • 8. Capture and Preprocessing : Silence Removal Algorithm described in paper ‘a new method for silence removal and endpoint detection’ † †G. Saha, SandipanChakroborty, SumanSenapati of Department of Electronics and Electrical Communication Engineering, Indian Institute of Technology, Khragpur, India
  • 9. Capture and Preprocessing :Pre-Emphasis Boosting the high frequency energy In time domain, y[n] = x[n]−αx[n−1], 0.9 ≤ α≤ 1.0
  • 10. Capture and Preprocessing : Framing Speech Signal is stationary (statistical properties) for 10-30 ms 50% overlapped frames each of 23ms is used
  • 11. Capture and Preprocessing :Windowing Windowing is done on the frame blocked signal Hamming window
  • 13. Feature Extraction Transform the input audio signal into a sequence of acoustic feature vectors MFCC : Mel Filter CepstralCoefficients as Feature Perceptual approach Human Ear processes audio signal in Mel scale Mel scale : linear up to 1KHz and logarithmic after 1KHz MFCC gives distribution of energy in Mel frequency band Calculated for each frame
  • 14. Feature Extraction : Fourier Transform Gives information about the amount of energy at each frequency band FFT used
  • 15. Feature Extraction : Mel Filter We used filter bank of triangular filters spaced in Mel scale
  • 16. Feature Extraction : Mel Filter (contd..) Mel Filter Where,
  • 17. Feature Extraction :Log, IFT(DCT) Log DCT MFCC
  • 18. Feature Extraction : Cepstral Mean Subtraction CMS: for minimizing channel effect
  • 19. Feature Extraction : Energy and Deltas For completeness of feature vector and for achieving high recognition rate A Energy Feature A delta or velocity feature, and a double delta or acceleration feature Calculated by linear regression of regression window M
  • 20. Composition of Feature Vector 12 MFCC Features 12 Delta MFCC 12 Delta-Delta MFCC 1 Energy Feature 1 Delta Energy Feature 1 Delta-Delta Energy Feature  39 Features from each frame
  • 22. Gaussian Mixture Model Parametric probability density function Based on clustering technique M Gaussian components 𝑝(𝑥/)= 𝑚=1𝑀𝑤𝑚 .  𝑔𝑚(𝑥/𝜇𝑚 , 𝐶𝑚) 𝑥: a k-dimensional random vector 𝑤𝑚: mixture weight of mth component 𝑔𝑚 : k-dimensional Gaussian function (pdf) 𝑔𝑚𝑥/𝜇𝑚 , 𝐶𝑚  = 12𝜋𝐾.|𝐶𝑚| exp{−12𝑥−𝜇𝑚 .(𝐶𝑚−1(𝑥−𝜇𝑚 ))}  = (𝑤𝑚, 𝜇𝑚 ,𝐶𝑚)  
  • 23. GMM Training Goal: estimate the parameters Method: Maximum Likelihood estimation Input: X = {𝑥1,𝑥1,…,𝑥𝑇} P(X/) =𝑡=1𝑇𝑝(𝑥𝑡/) Maximize with Expectation Maximization algorithm Iterative process: initial model: 𝑖 new model: 𝑖+1 P(X/ 𝑖+1) ≥ P(X/ 𝑖)  
  • 24. Verification Decision: Hypothesis Test H0: the speaker is the claimed speaker H1: the speaker is an imposter Based on likelihood ratio  = P(X/)P(X/) Decision by threshold < 𝜃𝑇reject identity claim > 𝜃𝑇 accept identity claim  
  • 26. Hidden Markov Model :Definition Hidden Markov Model (HMM) is the statistical model HMM is the extension of Markov Process HMM has hidden states and observable symbols per states HMM Model : Observed data : feature vector Hidden states : phonemes
  • 27. Codebook Generation K-Means Clustering Clustering the whole database & Codebook Generation VQ : Vector Quantization is used for mapping each input feature vector to discrete quantized symbols Codebook for each incoming feature vector is built Compare it to each of the prototype vectors in codebook Select the one which is closest (by some distancemetric) Replace the input vector by the index of this prototype vector observation sequence
  • 28. Speech Recognition System: By : HMM / VQ
  • 29. Hidden Markov Model :Training Training by: Forward backward (Baum-Welch) algorithm Forward-backward algorithm iteratively re-estimates the parameters and improves the probability that given observation are generated by the new parameters Three parameters need to be re-estimated: Initial state distribution: πi Transition probabilities: ai,j Emission probabilities: bi(ot) Input is observation sequence, given by VQ
  • 30. Hidden Markov Model :Verification/Matching Viterbi algorithm is used Input is Observation sequence, given by VQ HMM model of the word Best matched word is returned
  • 31. Problem Faced Learning curve Complex Mathematics Flex & Java Connectivity (initially) Data conversion
  • 32. Remaining Tasks Speech Training Data Collection Model Training (HMM, GMM) Module Integration Testing