previous page table of contents next page


Plosive voice /ga/ /ka/ sound waveform generation by pseudo blast impulse, noise source, and two tubes model




This is an experiment to generate plosive voice /ga/ /ka/ sound by pseudo blast impulse waveform, noise source instead of turbulent sound, and two tubes model with loss.
In this method, the sound will be generated from resonance effected mixed sound of pseudo blast impulse and noise, and following vowel sound portion controlling glottal sound source.

Generation of pseudo blast impulse waveform

It is said that waveform generated by blast phenomenon has a characteristic of 1/f frequency response .
If it uses 1/f frequency response as it is, resonance effect is difficult to be appear, since the amplitude becomes too small even in low frequency component. For that reason, generation uses frequency response from DC to a certain frequency is flat and after that closes to 1/f.
Following figures is the waveform of generated pseudo blast impulse waveform by Hilbert transform and minimum phase condition. It sounds (wav file)


Next figure shows frequency response.


Generation of noise sound

Perlin noise method is used as noise source instead of turbulent sound. This is same as fricative voice /sa/ sound waveform generation. The difference from fricative is that noise frequency is low.
Following figure is the waveform of generated noise sound. It sounds (wav file)




Mix the noise sound with pseudo blast impulse waveform

Assuming that a blast and subsequent breath will generate a turbulent sound, mix the noise sound with pseudo blast impulse waveform.
Following figure is the waveform of the mixed. It sounds (wav file)


Apply resonance effect

By use two tubes model of which input source is the mixed waveform, it applies resonance effect to the mixed sound.
Following figure is two waveforms which is applied different resonance effect. First one sounds (wav file)



Generation of following vowel sound

To generate vowel sound, it uses two tubes model that can vary cross section area and tube length every time step. They are varied from one neutral state to vowel /a/ state as following figures.


Following figure shows cross section area and tube length per time. They start to vary after one pitch duration, of which initial reflection is not yet seen.



Voiced sound and unvoiced sound

Voiced sound or unvoiced sound, also depends on the strength of resonance effect.
Loss passing tube defined as follows is used to control resonance effect.

For example, loss adjustment can vary peak strength of resonance, resonance effect, as shown below two tubes model frequency response comparison for loss.



And also, glottal sound source is controlled to start slowly, fade-in, like red waveform in following figure.


As following vowel portion to combine, two waveforms are prepared. One of which resonance effect is strong (no loss) is for voiced sound. Another of which resonance effect is weak is for unvoiced sound.
Following figure is the waveform of which resonance effect is strong (no loss). It sounds (wav file).


And next figure is the waveform of which resonance effect is weak. It sounds (wav file).


For combination, waveform which is removed initial one pitch duration, green portion in the figure, is used.

Combination waveforms

Finally, the mixed waveform of pseudo blast impulse and noise, which has resonance effect, is combined with following vowel portion, smoothing connection edge. Following figure is the waveform of combination with following vowel portion of which resonance effect is strong (no loss). This waveform (wav file) sounds similar to voice /ga/ sound.


Next figure is the waveform of combination with following vowel portion of which resonance effect is weak. This waveform (wav file) becomes unvoiced sound.



For reference, there is Python program to compute above waveforms by Python. Please see README.txt in the zip file about usage.





No.2b, 18  February 2019