The phase vocoder usually refers to the general technique of passing from (complex-valued) channel amplitudes to pairs consisting of (real-valued) magnitudes and phase precession rates (``frequencies"), and back, as described in Figure 9.11 (Section 9.5). In Example I07.phase.vocoder.pd (Figure 9.18), we use this technique for the specific goal of time-stretching and/or time-contracting a recorded sound under real-time control. That is, we control, at any moment in real time, the location in the recorded sound we hear. One new object is used:
: Shift a block left or right according to its argument.
If the argument is positive, each block of the output is the input
shifted that number of spaces to the right, filling zeros in as
needed on the left. A negative argument shifts to the left, filling
zeros in at the right.
The process starts with a sub-patch, pd read-windows, which outputs two Hann-windowed blocks of the recorded sound, a ``back" one and a ``front" one 1/4 window further forward in the recording. The window shown uses the two outputs of the sub-patch to guide the amplitude and phase change of each channel of its own output.
The top two tabreceive~ objects recall the previous
block of complex amplitudes sent to the rifft~ object at
bottom, corresponding to
in the discussion of
Section 9.5. The patch as a
whole computes
and then its Hann windowed
inverse FT for output.
After normalizing
, its complex
conjugate (equal to its inverse) is multiplied by the windowed
Fourier transform of the ``back" window
, giving
the product
Three other applications of Fourier analysis/resynthesis, not pictured here, are provided in the Pd examples. First, Example I08.pvoc.reverb.pd shows how to make a phase vocoder whose output recirculates as in a reverberator, except that individual channels are replaced by the input when it is more powerful than what is already recirculating. The result is a more coherent-sounding reverberation effect than can be made in the classical way using delay lines.
Example I09.sheep.from.goats.pd demonstrates the (imperfect)
technique of separating pitched signals from noisy ones, channel by
channel, based on the phase coherence we should expect from a
Hann-windowed sinusoid. If three adjacent channels are
approximately
radians out of phase from each other,
they are judged to belong to a sinusoidal peak. Channels belonging
to sinusoidal peaks are replaced with zero to extract the noisy
portion of the signal, or all others are replaced with zero to give
the sinusoidal portion.
Example I10.phase.bash.pd returns to the wavetable looping
sampler of Figure 2.7, and shows
how to align the phases of the sample so that all components of the
signal have zero phase at points 0,
,
, and so on. In this way, two copies of a looping sampler
placed
samples apart can be coherently cross-faded.
A synthetic, pitched version of the original soundfile can be made
using daisy-chained cross-fades.