SlideShare una empresa de Scribd logo
1 de 17
Chapter 6. Language and Communications


                                 Speech Perception




   Engineering Psychology and Human Performance   ㅇㅇ



                                                       Seo-jung ko, Industrial Engineering, Hanyang University
Contents
 1. Speech Perception

 2. Representation of Speech

 3. Units of Speech Perception
       Phonemes
       Syllables
       Words

 4. Top-Down Processing of Speech

 5. Applications of Voice Recognition Research

 6. Communications
       Nonverbal Communications
       Video Mediated Communications
       Crew Resource Management



                                       Seo-jung ko, Industrial Engineering, Hanyang University
Speech Perception

 Example
      In1997, a Tragic event occurred at the Tenerife airport in the Canada Island : A
      KLM Royal Dutch Airlines 747 jumbo jet, accelerating for takeoff, crashed into a
      Pan American 747 taxiing on the same runway.
      → Confusion between the KLM pilot and air traffic control.
 Reading & Speech
      In common with reading, the perception of speech involves both bottom-up
      hierarchical processing and top-down contextual processing.
           reading   :: features세부특징 – letters낱자 – words단어
           Speech    :: phonomes음소- sylables음절 -words단어
      But, reading 과 달리 physical units of speech 분리가 쉽지 않다.


 The perceptual system must undertake some analog to digital conversion to
 translate the continuous speech waveform into the discrete units of speech
 perception.

                                                Seo-jung ko, Industrial Engineering, Hanyang University
Contents
 1. Speech Perception

 2. Representation of Speech

 3. Units of Speech Perception
       Phonemes
       Syllables
       Words

 4. Top-Down Processing of Speech

 5. Applications of Voice Recognition Research

 6. Communications
       Nonverbal Communications
       Video Mediated Communications
       Crew Resource Management



                                       Seo-jung ko, Industrial Engineering, Hanyang University
Representation of Speech

                                     (a) The stimulus of speech is
                                         a continuous variation or
                                         oscillation of the air pressure

                                     (b) Fourier 분석
                                         서로 다른 주파수, 진폭을 갖는
                                         sine wave로 분리시킬 수 있다.

                                     (c) Spectral representation.
                                         (b)의 그래프를 각각
                                         Y축 : Power
                                                    sine wave 진동의 평균 폭or 폭의제곱
                                          X축 : Frequency
                                          로 표현함.

                                     (d) Formants :: two separated tones
                                         Y축 : Frequency
                                         X축 : Time
                                         넓이 : amplitude



                           Seo-jung ko, Industrial Engineering, Hanyang University
Contents
 1. Speech Perception

 2. Representation of Speech

 3. Units of Speech Perception
       Phonemes
       Syllables
       Words

 4. Top-Down Processing of Speech

 5. Applications of Voice Recognition Research

 6. Communications
       Nonverbal Communications
       Video Mediated Communications
       Crew Resource Management



                                       Seo-jung ko, Industrial Engineering, Hanyang University
Units of Speech Perception
Phonemes, Syllables, Words

  Phonemes (음소) – the basic unit of speech
  •    changing a phoneme in a word will change its meaning (or change it to a nonword).
  •    The 38 English phonemes. Ex) [p] [b] [t] [d] [k] [g] [f] [v] [θ] …
  •    실제 지각시 phonemes와 printed letters 가 상당히 다름.
  •    Physical form of a phoneme is highly dependent on the context in which it appears.

  Syllables (음절) – the basic unit of speech perception.
  •    Two of more phonemes generally combine to create syllables.
  •    The syllabic unit is itself relatively invariant in its physical form.
  •    A Study suggests that people are particularly dependent on the syllable unit in speech perception.

  Words (단어) – the smallest cognitive or semantic unit of meaning
  •    Morpheme(형태소)로 이루어져있다. Ex) un- . –ing …
  •    Segmentation problem
             “she uses st*and*ard oil”
             세 단어 사이의 boundary-gap 이외에도 두개의 physical pauses가 있음
             → 순수 Bottom-up processing 에서 의미를 모르는 단어들이 연속적으로 주어진 경우
             단어들의 분리경계를 구분하기 어려워진다.


                                                              Seo-jung ko, Industrial Engineering, Hanyang University
Contents
 1. Speech Perception

 2. Representation of Speech

 3. Units of Speech Perception
       Phonemes
       Syllables
       Words

 4. Top-Down Processing of Speech

 5. Applications of Voice Recognition Research

 6. Communications
       Nonverbal Communications
       Video Mediated Communications
       Crew Resource Management



                                       Seo-jung ko, Industrial Engineering, Hanyang University
Top-Down Processing of Speech
 Contrast speech perception with reading
     (1) invariable problem.
     (2) segmentation problem.
     (3) the serial and transient nature of the auditory message.
     →Bottom-up processing을 어렵게 하며, top-down processing 에 의존하게 한다.


  Demonstrations of top-down or context-dependent processing in
  speech perception are quite robust.
     In one experiment, compare recognition of degraded word strings..

     (1) 무작위 단어들 (2) 문법적 구조이지만, 의미가 없는 단어들 (3) 의미적 맥락이 있는 단어들

      → 문법, 의미 제약이 적을수록 신호강도가 커야만 같은 수준의 인식가능.


  Mixture of bottom-up and top-down processing.
     Bottom-up processing        : 음향적인 세부특징, 음절수준의 하위특징

     Top-down processing         : 의미적, 통사론적 맥락에서 특정 speech의 음이 무엇인지
                                  단어경계에 대한 주관적 특성



                                                 Seo-jung ko, Industrial Engineering, Hanyang University
Contents
 1. Speech Perception

 2. Representation of Speech

 3. Units of Speech Perception
       Phonemes
       Syllables
       Words

 4. Top-Down Processing of Speech

 5. Applications of Voice Recognition Research

 6. Communications
       Nonverbal Communications
       Video Mediated Communications
       Crew Resource Management



                                       Seo-jung ko, Industrial Engineering, Hanyang University
Applications of Voice Recognition Research
Speech perception - Two major categories of applications.

   1.    Understanding of how humans perceive speech and employ
         context-driven top-down processing in recognition.

   2.    Measure and predict the effects on speech comprehension of
         various kind of distortion. (extrinsic or intrinsic distortion)

                                                       Natural speech
                                                        the differing amplitudes of the various
                                                       phonemes distributed across a wide rage
                                                       of frequencies.
                                                         → spectrum 형성가능

                                                       Figure 6.12
                                                       Typical power spectra of speech

                                                       Noise & frequency
                                                        동일 주파수대의 noise 가 이해를 더 떨어트림


                                                  Seo-jung ko, Industrial Engineering, Hanyang University
Applications of Voice Recognition Research
Articulation index (AI) : Predict the effects of background noise on speech understanding




                                                                                  Signal
                                                                                   Noise




                                                                                   hearing.
                                                                        It is not comprehension.

                                                                  So, AI provided measure of
                                                                  Only bottom-up stimulus quality.


                                                   Seo-jung ko, Industrial Engineering, Hanyang University
Applications of Voice Recognition Research
Speech intelligibility명백 :: Vocal material of particular level of redundancy over the speech
     channel in question and computing the percentage of words understood correctly.

     정보내용/ redundancy / 청자의 top-down processing 에 따라 다른 이해 정도를 표현할 수 있음.

    제한된 단어 > 제한이 없는 단어 (표준화 등등)

    의미 있는 단어 > 무의미한 음절

    고빈도 단어 > 저빈도 단어

    맥락 있는 문장 > 맥락 없는 문장

    Figure 6.14



The important implications

1.   Either the AI or the speech-intelligibility Measures by themselves are inherently Ambiguous
     unless the redundancy of the transmitted material is carefully specified
2.   data-driven, bottom-up processing may trade off with context-driven, top-down processing.



                                                        Seo-jung ko, Industrial Engineering, Hanyang University
Applications of Voice Recognition Research
The ability to “guess” the massage

Limitations in signal quality can be compensated for by augmenting top-down processing

 –   creating the ability to “guess” the message without actually                   (or completely)   hearing it.

Ex) 표준화된 어휘만 이용, 중복되는 quot;carrier” sentences 사용



The effect of redundant carrier sentences on comprehension.
소음이 있는 상태에서 비행기조조사에게 음성경고를 보내는 실험.

경고형태 :: “ fuel low” ,             “your fuel is low”

        →recognition performance : “fuel low” < “you fuel is low”
        →carrier sentences           : one-syllable words > multi-syllable words




                                                       Seo-jung ko, Industrial Engineering, Hanyang University
Contents
 1.   Speech Perception

 2.   Representation of Speech

 3.   Units of Speech Perception
           Phonemes
           Syllables
           Words

 4.   Top-Down Processing of Speech

 5.   Applications of Voice Recognition Research

 6. Communications
       : there is more to communications than simply understanding
      the words and sentences in speech.
      ex) gestures, pauses, and voice inflection …

           Nonverbal Communications
           Video Mediated Communications
           Crew Resource Management




                                                   Seo-jung ko, Industrial Engineering, Hanyang University
Communications

Communications
:: there is more to communications than simply understanding the words
and sentences in speech.
ex) gestures, pauses, and voice inflection …
Nonverbal Communications
1. Visualizing the mouth.
     화자의 입 움직임과 단어를 발음하는 모양을 보는 것. 유용한 중복적 단서.
     (특히 음성의 질이 좋지 않을 때)

2. Nonverbal cues.
     화자의 끄덕임이나 곤혹스런 표정 등과 같은 표정을 통해 얻는 단서, 제스처 등의 부가정보.
3. Disambiguity.
     화자입장에서 청자의 곤혹스러운 표정이나 끄덕임을 통해 내 말을 이해 했는지 못 했는지
     알 수 있기 때문에 메시지의 모호성을 알 수 있음.
     청각에만 의존하고 시각적 피드백이 없는 경우, 전체적으로 단서 수가 많아지고 정식으로
     이루어지는 대화의 주고받기 빈도가 늘어남.
4. Shared knowledge of action.
    팀 수행에서는, 팀 구성원들이 수행하는 혹은 실패하는 행위를 단지 지켜보는 것 만으로도
    많은 정보가 교환되고 공유됨

                                               Seo-jung ko, Industrial Engineering, Hanyang University
Communications
Video Mediated communications
Video 가 face to face communication의 장점을 가질 것이다.
하지만 비디오와 청각정보의 질이 떨어지고, 이 두 채널을 동기화 시켜하 하는데 문제가 있을 수 있다.
또한 질이 좋고 동기화가 잘 되더라도, face-to-face communication에 비해 원격 비디오 조건에서는
더 많은 단어가 필요했고, 소통의 방식도 청각만 사용하는 소통과 비슷하게 더 많은 공식적인 주고받기와 더 적은수
      의 중단을 보였다.


Crew Resource Management
대화에 참여하는 사람들간의 사회적 분위기의 특성이 의사소통의 패턴을 촉진하거나 떨어뜨릴수 있다.
     조종사의 실수를 보았거나 의심을 갖고 있는 부조종사의 경우.
      1) 조종사가 주의를 기울이도록 만드는데 실패
      2) 너무 모호하게 말해서 실수가 교정되지 못함
     정보를 교환해야 하는 두 오퍼레이터 간에 분명한 지위차이가 있는 경우. (Ex.사장과 비서)


조종사들의 협동을 필요로하는, 효율적인 의사소통을 위한 시뮬레이션 실험 결과
승무원들이 의사소통을 많이 공유하고, 더 빈번히 소통내용을 확인하며 명령 또는 단정적인 문장을 많이 사용.
공식적인 지휘체계의 양방향으로 (조종사->부 조종사. 부 조종사->조종사) 단정적인 진술을 사용.
      - 각 구성원이 분명하게 정해진 책임감을 자각하고 의사소통을 하고 있음을 뜻함
오랜시간 함께 수행한 승무원들의 수행이 더 우수.


                                                  Seo-jung ko, Industrial Engineering, Hanyang University

Más contenido relacionado

Último

DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 

Último (20)

DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 

Destacado

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by HubspotMarius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

Destacado (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Speech and Communication Chapter

  • 1. Chapter 6. Language and Communications Speech Perception Engineering Psychology and Human Performance ㅇㅇ Seo-jung ko, Industrial Engineering, Hanyang University
  • 2. Contents 1. Speech Perception 2. Representation of Speech 3. Units of Speech Perception Phonemes Syllables Words 4. Top-Down Processing of Speech 5. Applications of Voice Recognition Research 6. Communications Nonverbal Communications Video Mediated Communications Crew Resource Management Seo-jung ko, Industrial Engineering, Hanyang University
  • 3. Speech Perception Example In1997, a Tragic event occurred at the Tenerife airport in the Canada Island : A KLM Royal Dutch Airlines 747 jumbo jet, accelerating for takeoff, crashed into a Pan American 747 taxiing on the same runway. → Confusion between the KLM pilot and air traffic control. Reading & Speech In common with reading, the perception of speech involves both bottom-up hierarchical processing and top-down contextual processing. reading :: features세부특징 – letters낱자 – words단어 Speech :: phonomes음소- sylables음절 -words단어 But, reading 과 달리 physical units of speech 분리가 쉽지 않다. The perceptual system must undertake some analog to digital conversion to translate the continuous speech waveform into the discrete units of speech perception. Seo-jung ko, Industrial Engineering, Hanyang University
  • 4. Contents 1. Speech Perception 2. Representation of Speech 3. Units of Speech Perception Phonemes Syllables Words 4. Top-Down Processing of Speech 5. Applications of Voice Recognition Research 6. Communications Nonverbal Communications Video Mediated Communications Crew Resource Management Seo-jung ko, Industrial Engineering, Hanyang University
  • 5. Representation of Speech (a) The stimulus of speech is a continuous variation or oscillation of the air pressure (b) Fourier 분석 서로 다른 주파수, 진폭을 갖는 sine wave로 분리시킬 수 있다. (c) Spectral representation. (b)의 그래프를 각각 Y축 : Power sine wave 진동의 평균 폭or 폭의제곱 X축 : Frequency 로 표현함. (d) Formants :: two separated tones Y축 : Frequency X축 : Time 넓이 : amplitude Seo-jung ko, Industrial Engineering, Hanyang University
  • 6. Contents 1. Speech Perception 2. Representation of Speech 3. Units of Speech Perception Phonemes Syllables Words 4. Top-Down Processing of Speech 5. Applications of Voice Recognition Research 6. Communications Nonverbal Communications Video Mediated Communications Crew Resource Management Seo-jung ko, Industrial Engineering, Hanyang University
  • 7. Units of Speech Perception Phonemes, Syllables, Words Phonemes (음소) – the basic unit of speech • changing a phoneme in a word will change its meaning (or change it to a nonword). • The 38 English phonemes. Ex) [p] [b] [t] [d] [k] [g] [f] [v] [θ] … • 실제 지각시 phonemes와 printed letters 가 상당히 다름. • Physical form of a phoneme is highly dependent on the context in which it appears. Syllables (음절) – the basic unit of speech perception. • Two of more phonemes generally combine to create syllables. • The syllabic unit is itself relatively invariant in its physical form. • A Study suggests that people are particularly dependent on the syllable unit in speech perception. Words (단어) – the smallest cognitive or semantic unit of meaning • Morpheme(형태소)로 이루어져있다. Ex) un- . –ing … • Segmentation problem “she uses st*and*ard oil” 세 단어 사이의 boundary-gap 이외에도 두개의 physical pauses가 있음 → 순수 Bottom-up processing 에서 의미를 모르는 단어들이 연속적으로 주어진 경우 단어들의 분리경계를 구분하기 어려워진다. Seo-jung ko, Industrial Engineering, Hanyang University
  • 8. Contents 1. Speech Perception 2. Representation of Speech 3. Units of Speech Perception Phonemes Syllables Words 4. Top-Down Processing of Speech 5. Applications of Voice Recognition Research 6. Communications Nonverbal Communications Video Mediated Communications Crew Resource Management Seo-jung ko, Industrial Engineering, Hanyang University
  • 9. Top-Down Processing of Speech Contrast speech perception with reading (1) invariable problem. (2) segmentation problem. (3) the serial and transient nature of the auditory message. →Bottom-up processing을 어렵게 하며, top-down processing 에 의존하게 한다. Demonstrations of top-down or context-dependent processing in speech perception are quite robust. In one experiment, compare recognition of degraded word strings.. (1) 무작위 단어들 (2) 문법적 구조이지만, 의미가 없는 단어들 (3) 의미적 맥락이 있는 단어들 → 문법, 의미 제약이 적을수록 신호강도가 커야만 같은 수준의 인식가능. Mixture of bottom-up and top-down processing. Bottom-up processing : 음향적인 세부특징, 음절수준의 하위특징 Top-down processing : 의미적, 통사론적 맥락에서 특정 speech의 음이 무엇인지 단어경계에 대한 주관적 특성 Seo-jung ko, Industrial Engineering, Hanyang University
  • 10. Contents 1. Speech Perception 2. Representation of Speech 3. Units of Speech Perception Phonemes Syllables Words 4. Top-Down Processing of Speech 5. Applications of Voice Recognition Research 6. Communications Nonverbal Communications Video Mediated Communications Crew Resource Management Seo-jung ko, Industrial Engineering, Hanyang University
  • 11. Applications of Voice Recognition Research Speech perception - Two major categories of applications. 1. Understanding of how humans perceive speech and employ context-driven top-down processing in recognition. 2. Measure and predict the effects on speech comprehension of various kind of distortion. (extrinsic or intrinsic distortion) Natural speech the differing amplitudes of the various phonemes distributed across a wide rage of frequencies. → spectrum 형성가능 Figure 6.12 Typical power spectra of speech Noise & frequency 동일 주파수대의 noise 가 이해를 더 떨어트림 Seo-jung ko, Industrial Engineering, Hanyang University
  • 12. Applications of Voice Recognition Research Articulation index (AI) : Predict the effects of background noise on speech understanding Signal Noise hearing. It is not comprehension. So, AI provided measure of Only bottom-up stimulus quality. Seo-jung ko, Industrial Engineering, Hanyang University
  • 13. Applications of Voice Recognition Research Speech intelligibility명백 :: Vocal material of particular level of redundancy over the speech channel in question and computing the percentage of words understood correctly. 정보내용/ redundancy / 청자의 top-down processing 에 따라 다른 이해 정도를 표현할 수 있음.  제한된 단어 > 제한이 없는 단어 (표준화 등등)  의미 있는 단어 > 무의미한 음절  고빈도 단어 > 저빈도 단어  맥락 있는 문장 > 맥락 없는 문장  Figure 6.14 The important implications 1. Either the AI or the speech-intelligibility Measures by themselves are inherently Ambiguous unless the redundancy of the transmitted material is carefully specified 2. data-driven, bottom-up processing may trade off with context-driven, top-down processing. Seo-jung ko, Industrial Engineering, Hanyang University
  • 14. Applications of Voice Recognition Research The ability to “guess” the massage Limitations in signal quality can be compensated for by augmenting top-down processing – creating the ability to “guess” the message without actually (or completely) hearing it. Ex) 표준화된 어휘만 이용, 중복되는 quot;carrier” sentences 사용 The effect of redundant carrier sentences on comprehension. 소음이 있는 상태에서 비행기조조사에게 음성경고를 보내는 실험. 경고형태 :: “ fuel low” , “your fuel is low” →recognition performance : “fuel low” < “you fuel is low” →carrier sentences : one-syllable words > multi-syllable words Seo-jung ko, Industrial Engineering, Hanyang University
  • 15. Contents 1. Speech Perception 2. Representation of Speech 3. Units of Speech Perception Phonemes Syllables Words 4. Top-Down Processing of Speech 5. Applications of Voice Recognition Research 6. Communications : there is more to communications than simply understanding the words and sentences in speech. ex) gestures, pauses, and voice inflection … Nonverbal Communications Video Mediated Communications Crew Resource Management Seo-jung ko, Industrial Engineering, Hanyang University
  • 16. Communications Communications :: there is more to communications than simply understanding the words and sentences in speech. ex) gestures, pauses, and voice inflection … Nonverbal Communications 1. Visualizing the mouth. 화자의 입 움직임과 단어를 발음하는 모양을 보는 것. 유용한 중복적 단서. (특히 음성의 질이 좋지 않을 때) 2. Nonverbal cues. 화자의 끄덕임이나 곤혹스런 표정 등과 같은 표정을 통해 얻는 단서, 제스처 등의 부가정보. 3. Disambiguity. 화자입장에서 청자의 곤혹스러운 표정이나 끄덕임을 통해 내 말을 이해 했는지 못 했는지 알 수 있기 때문에 메시지의 모호성을 알 수 있음. 청각에만 의존하고 시각적 피드백이 없는 경우, 전체적으로 단서 수가 많아지고 정식으로 이루어지는 대화의 주고받기 빈도가 늘어남. 4. Shared knowledge of action. 팀 수행에서는, 팀 구성원들이 수행하는 혹은 실패하는 행위를 단지 지켜보는 것 만으로도 많은 정보가 교환되고 공유됨 Seo-jung ko, Industrial Engineering, Hanyang University
  • 17. Communications Video Mediated communications Video 가 face to face communication의 장점을 가질 것이다. 하지만 비디오와 청각정보의 질이 떨어지고, 이 두 채널을 동기화 시켜하 하는데 문제가 있을 수 있다. 또한 질이 좋고 동기화가 잘 되더라도, face-to-face communication에 비해 원격 비디오 조건에서는 더 많은 단어가 필요했고, 소통의 방식도 청각만 사용하는 소통과 비슷하게 더 많은 공식적인 주고받기와 더 적은수 의 중단을 보였다. Crew Resource Management 대화에 참여하는 사람들간의 사회적 분위기의 특성이 의사소통의 패턴을 촉진하거나 떨어뜨릴수 있다.  조종사의 실수를 보았거나 의심을 갖고 있는 부조종사의 경우. 1) 조종사가 주의를 기울이도록 만드는데 실패 2) 너무 모호하게 말해서 실수가 교정되지 못함  정보를 교환해야 하는 두 오퍼레이터 간에 분명한 지위차이가 있는 경우. (Ex.사장과 비서) 조종사들의 협동을 필요로하는, 효율적인 의사소통을 위한 시뮬레이션 실험 결과 승무원들이 의사소통을 많이 공유하고, 더 빈번히 소통내용을 확인하며 명령 또는 단정적인 문장을 많이 사용. 공식적인 지휘체계의 양방향으로 (조종사->부 조종사. 부 조종사->조종사) 단정적인 진술을 사용. - 각 구성원이 분명하게 정해진 책임감을 자각하고 의사소통을 하고 있음을 뜻함 오랜시간 함께 수행한 승무원들의 수행이 더 우수. Seo-jung ko, Industrial Engineering, Hanyang University