Acoustics final ch. 4 and 5

  1. What is acoustic ouput for speech determined by?
    • source characteristics
    • the transfer function of the vocal tract
    • the radiation characteristic as the sound moves from the enclosed vocal tract to the environment
  2. What is a transfer function?
    tells you what will happen to certain frequencies- how energy is transferred from input to output
  3. How do you create a model of a vowel?
    • start with a tube of:
    • uniform cross-sectional diameter
    • open at one end and closed at the other
    • set length of tube to be appropriate for an adult male or female (male- 17 cm, female- 14.5 cm, child- 8.75 cm)
    • the output will resemble the vowel "uh"
    • this model is valid out to 5 kHz
    • *long wavelengths act the same way in a straight line as they do in a curved tube
  4. What are resonances in relation to a tube?
    • a function of the length of the tube
    • best excited at a frequency of 1/4wavelength (f = c/wavelength)
    • odd multiples of this lowest resonant frequency
    • changing with changes in the length (men vs women, adults vs children, Fn = (2n-1)c/4l)
    • resonant frequency will occur at the odd quarter length formula
    • /uh/ has least amount of constriction
  5. What is lip radiation?
    • the mouth opening is a boundary
    • higher frequencies are resonated with greater amplitude than lower frequencies (atmosphere offers greater impedance to lower frequencies which results in larger particle displacement)
  6. What happens since there is less impedance to higher frequencies?
    • displacement is not affected as much
    • newton's third law of motion (for every action, there is an equal and opposite reaction)
  7. What is the effect of lip radiation?
    • **trick! (talking about atmosphere?)
    • -12 dB/octave leaving the larynx
    • +6 dB/octave leaving the lips
    • -12 + 6 = -6 dB/octave in the atmosphere
  8. What is a standing wave?
    • a wave that remains in constant position
    • constructive interference
  9. What is reflection?
    change in direction of the wave
  10. What is destructive interference?
    positive and negative values are less positive or less negative than normal and lose amplitude
  11. Where do reflections occur?
    where the lips meet the atmosphere (a boundary)
  12. Where will there be standing waves?
    at particular frequencies determined by the volume velocity (U) or pressure (P)
  13. Where is U at a minimum and what is P there?
    • at the closed end of the tube
    • P is at a maximum
  14. Where is U at a maximum and what is P there?
    • at the open end of the tube
    • P is at a minimum
  15. Where do air particles vibrate most effectively?
    at the open end of the tube model
  16. Where do air particles vibrate least effectively?
    at the closed end of the tube model
  17. Where do waves vibrate better?
    as they come up and out of the mouth
  18. What happens if wavelength matches the resonances of the tube?
    • vibration will be reinforced
    • frequencies at which the wavelengths have a maximum velocity (or minimum pressure) at the tube opening (lips) will be transmitted most effectively (with the greatest relative amplitude)
  19. Where do the maxima of vibration occur?
    • 1/4, 3/4, 5/4, etc times the length of the tube (quarter length multiples)- the resonances of the tube
    • these are the only wavelengths with the appropriate max (min) pattern
  20. Where is there a volume velocity maxima?
  21. Where is there a minimum amplitude of vibration?
    • nodes
    • max amp of vibration at antinodes
  22. What are resonances a property of?
    • the vocal tract
    • they exist whether they are energized or not
  23. What do resonances do to energy?
    • they modify it
    • they DO NOT add energy to the speech signal
    • resonances do not make you louder
  24. What can resonances tell us?
    the transfer function- tell you what will happen to that frequency whether its there or not
  25. What are the odds that there would be a formant frequency near a harmonic?
  26. What are constrictions created by?
  27. What will a constriction near a node or an antinode do?
    • change the frequency of the formant
    • is a perturbation of the standing wave in the vocal tract
  28. What is the relationship between contriction and perturbation?
    they are used interchangeably
  29. What happens when there is a constriction near an antinode?
    • volume velocity is maximum
    • pressure is minimum
    • formant frequency will be lowered
  30. What happens when there is a constriction near a node?
    • volume velocity is at a minimum
    • pressure is maximum
    • formant frequency is raised
  31. How is F1 affected by nodes and antinodes?
    • antinode- lowered by a constriction in the oral cavity near a volume velocity maximum (lips)
    • node- raised by a constriction in the pharynx
  32. How is F2 affected by nodes and antinodes?
    • antinode- lowered by a constriction (at the lips or in the oropharynx)
    • node- raised by a constriction (in the anterior oral cavity)
    • *pg 31
  33. How is F3 affected by nodes and antinodes?
    • antinode- lowered by a constriction (at the lips or in the middle of the oral cavity)
    • node- raised by a constriction (in the oropharynx or in the anterior oral cavity
  34. What are all three formant frequencies lowered by?
    labial constriction
  35. What are all three formant frequencies raised by?
    a constriction near the larynx
  36. What is point vowel?
    cover range of articulations needed to produce all vowels in English
  37. What is included in the vowel quadrilateral?
    • tongue height
    • tongue advancement
    • lip rounding
    • corner and point vowels
  38. Do articulators affect each other?
    • they have a fair degree of independence, but articulator movements affect each other
    • thus, formant frequencies are a product of the entire length and shape of the vocal tract
  39. What can two different people say the vowel /i/ and it still sounds like the same vowel?
    because it's not the absolute values of the frequency that help us to distinguish, but the relationship of the formant frequencies
  40. What is the duration for tense and lax vowels?
    • tense vowels are relatively long
    • lax vowels are relatively short
  41. What are tense and lax vowels?
    • a phonetic description
    • supposedly correlated with articulatory effort
    • actually correlated with duration
  42. What is a diphthong?
    • two vowels forming a single nucleus
    • on-glide and off-glide
    • should see movement in formants
  43. What is the relationship between high vowels and findamental frequency?
    high vowels have a relatively higher fundamental frequency
  44. /How is the tongue linked to the larynx?
    • when you raise your tongue, your larynx raises too (makes pitch higher)
    • hypoglossus pulls larynx up for high vowels, up to a 20 Hz difference
  45. How are consonants usually described?
    in groups according to their significant acoustic and articulatory properties
  46. What are the groups consonants fall into?
    • stops
    • fricatives
    • affricates
    • nasals
    • glides
    • liquids
  47. How are stop consonants characterized?
    by a complete closure somewhere in the vocal tract
  48. What are the three phases of a stop consonant?
    • closure
    • release
    • transition
    • reverse the steps for postvocalic stops
  49. What is a stop gap?
    • corresponds to the complete closure of the vocal tract (slience)
    • minimum radiated acoustic energy (silence for voiceless stops, voice bar for voiced stops, 50-150 ms*)
  50. What is a stop release (burst)?
    • pressure has been rising behind the obstruction
    • rapid release produces a transient (20-30 ms)
  51. What follows the burst for voiceless stops?
    • frication (air/noise)
    • low freq. for /p/ (500-1500 Hz) (falling spectrum)
    • high freq. for /t/ (above 4 kHz) (rising spectrum)
    • mid-freq. for /k/ (1.5-4 kHz) (peaked spectrum)
  52. What is a transient?
    a short amount of noise
  53. What are acoustic cues?
    • stop gap
    • release of pressure
    • transition
  54. How are /p t k/ distinguished from /b d g/?
    by voicing
  55. What is voice onset time (VOT)?
    • the interval between the release of the stop and the onset of vocal fold vibration
    • for /b d g/ VOT from -20 to +20 ms with a mean of 10 ms
    • for /p t k/ VOT from 25 to 80 ms with a mean of 45 ms
  56. What are some cues for voicing?
    • VOT
    • voice bar for intervocalic stops
    • length of preceding vowel for final stops
  57. Why can voiced stops have a negative onset time?
    because you are measuring from where voicing starts which is actually before the burst (it can be up to 20 ms behind where the burst is)- you make a sound before actually saying the consonant /b/ (called voice bar)
  58. When will vowels be longer in duration?
    if they are followed by voiced consonants instead of voiceless
  59. What are formant transitions?
    • articulatory movement from stop to vowel entails a formant movement
    • as the resonating chamber of the vocal tract changes, the formant frequencies change
    • formant transitions are important for perception
    • formant transitions are approximately 50 ms in duration
  60. Why do formants move?
    you are changing your vocal tract
  61. Why do men have lower formant frequencies than women?
    their vocal tracts are longer
  62. How does F1 move for stop consonants?
    usually rises
  63. How do F2 and F3 work for stop consonants?
    • for /p b/ F2 and F3 rise slightly
    • for /t d/ F2 falls and F3 rises slightly
    • for /k g/ F2 and F3 separate steeply and rapidly
    • however, a given stop is associated with a variety of transitions (there is no fixed pattern for perception)
  64. What are the characteristics of articulation for fricatives?
    • narrow constriction in the vocal tract (not a complete closure like stops)- no moment of silence in spectrum
    • when are flow rate is high, turbulence results
    • turbulence is complex, unpredictable air flow
    • turbulent airflow is perceived as turbulent noise
    • fricatives have a relatively long duration
  65. What are fricatives divided into?
    • sibilants (stridents)- greater noise energy (s, z, esh, yogh)
    • nonsibilants (nonstridents)- f, v, theta, eth, h
  66. What is laminar airflow?
    airflow is neat and organized (predictable)
  67. What is turbulent airflow?
    airflow is messy and unorganized, noise (fricatives)
  68. How are sibilants differentiated among themselves?
    • voicing- pulses (glottal closures) for /z yogh/, no pulses for /s esh/
    • noise spectrum- alveolar sibilants have higher frequency energy range from 4 kHz to 12 kHz, palatal sibilants have energy down to 3 kHz, spectral irregularities aren't important in perception
  69. What are formant transition roles for sibilants?
    formant transition locations depend on the articulation, but the transitions are not important perceptually for sibilants
  70. Does /s/ have high frequency or low frequency energy?
    • high
    • /esh/ has more low frequency energy
  71. What are nonsibilants?
    • /f v theta eth h/
    • less noise energy than sibilants
    • voiced nonsibilants will have quasi-periodic pulses (some periodicity)
    • noise spectra are fairly flat and diffuse
  72. What is the relationship between noise spectrum nonsibilant identification?
    it is not known
  73. What is the role of formant transitions for nonsibilants?
    formant transitions are primary acoustical cue (noise spectrum may play secondary role)
  74. What are affricates?
    • described as a combination of stop and fricative
    • /t-esh d-yogh/
  75. What is the articulation for affricates?
    • complete obstruction in the vocal tract
    • intraoral pressure builds up
    • release to generate fricative noise
  76. What are acoustic features of affricates?
    • rise time- time it takes amplitude measure/envelope to go from 0 to maximum level
    • duration of frication
    • relative amplitude in third formant region
    • stop gap- complete obstruction in vocal tract and it shows up as silence
  77. What is the articulation for nasals?
    • complete closure in vocal tract
    • sound radiated through nasal cavities
    • sometimes called nasal stops
  78. What is a nasal murmur?
    sound of a nasal, acoustic waveform of nasal consonants
  79. What are acoustics of nasals?
    • nasal murmur
    • associated strictly with nasal radiation of sound
    • there are many spectral peaks, but most have low amplitude
    • antiformants- loss of energy
    • nasal formant (low frequency ~300 Hz, highest energy)
  80. Why is consonant energy reduced for nasal formant?
    because higher formants have reduced energy
  81. What is the bandwidth for nasal formants?
    narrower than for vowels
  82. What are some acoustic features of nasals?
    • highly damped formants (broad bandwidths compared to other consonants)
    • formant transitions in connected speech
  83. Why are nasal sounds dampened?
    because of mucous in nose
  84. What are glide consonants?
    • also called approximants and semivowels
    • /w j/
  85. What is the articulation of glides?
    • gradual articulatory motion
    • narrow, but not closed, vocal tract
  86. What are the acoustics of glides?
    • formants for /w/ F1 and F2 are both low
    • formants for /j/ low F1 and high F2
  87. What distinguishes between glides?
    F2 and F3
  88. How are glides like vowels?
    they have formants that we can see
  89. What are liquid consonants?
    • also included as semivowels
    • /r l/
  90. What are liquids characterized by?
    • rapid movements
    • formant structure
    • F3 is the main difference
    • antiformants for /l/
  91. How do the 3rd and 4th formant frequencies act for /r/?
    closer together
  92. How do the 3rd and 4th formant frequencies act for /l/?
  93. What do liquids, glides, and nasals depend on?
    formant frequencies (ear looks for changes)
  94. What do stops, fricatives, and affricates depend on?
    their disruption (either the complete closure or the constriction)
  95. What do nasals depend on?
    their antiformants (more than formants)
Card Set
Acoustics final ch. 4 and 5