Virtual Analog Filter Design
Virtual Analog Filter Design
Vadim Zavalishin
ii About this book: the book covers the theoretical and practical aspects of the virtual analog lter design in the music DSP context. Only a basic amount of DSP knowledge is assumed as a prerequisite. For digital musical instrument and eect developers. Front picture: BLT integrator. DISCLAIMER: THIS BOOK IS PROVIDED AS IS, SOLELY AS AN EXPRESSION OF THE AUTHORS BELIEFS AND OPINIONS AT THE TIME OF THE WRITING, AND IS INTENDED FOR THE INFORMATIONAL PURPOSES ONLY.
c Vadim Zavalishin. The right is hereby granted to freely copy this revision of the book in software or hard-copy form, as long as the book is copied in its full entirety (including this copyright note) and its contents are not modied.
To the memory of Elena Golushko, may her soul travel the happiest path. . .
iv
Contents
Preface 1 Fourier theory 1.1 Complex sinusoids . 1.2 Fourier series . . . . 1.3 Fourier integral . . . 1.4 Dirac delta function 1.5 Laplace transform . . vii 1 1 2 3 4 5 7 7 8 9 12 13 14 15 17 18 19 20 22 24 27 27 29 29 30 32 33 35 38 41 42 46 50 53
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
2 Analog 1-pole lters 2.1 RC lter . . . . . . . . . . . . . 2.2 Block diagrams . . . . . . . . . 2.3 Transfer function . . . . . . . . 2.4 Complex impedances . . . . . . 2.5 Amplitude and phase responses 2.6 Lowpass ltering . . . . . . . . 2.7 Cuto parametrization . . . . . 2.8 Highpass lter . . . . . . . . . . 2.9 Poles, zeros and stability . . . . 2.10 Multimode lter . . . . . . . . 2.11 Shelving lters . . . . . . . . . 2.12 Allpass lter . . . . . . . . . . . 2.13 Transposed multimode lter . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
3 Time-discretization 3.1 Discrete-time signals . . . . . . . . 3.2 Naive integration . . . . . . . . . . 3.3 Naive lowpass lter . . . . . . . . . 3.4 Block diagrams . . . . . . . . . . . 3.5 Transfer function . . . . . . . . . . 3.6 Poles . . . . . . . . . . . . . . . . . 3.7 Trapezoidal integration . . . . . . . 3.8 Bilinear transform . . . . . . . . . 3.9 Cuto prewarping . . . . . . . . . 3.10 Zero-delay feedback . . . . . . . . . 3.11 Direct forms . . . . . . . . . . . . . 3.12 Other replacement techniques . . . 3.13 Instantaneously unstable feedback v
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
. . . . . . . . . . . . .
vi 4 Ladder lter 4.1 Linear analog model . . . 4.2 Linear digital model . . . 4.3 Feedback shaping . . . . . 4.4 Multimode ladder lter . . 4.5 Simple nonlinear model . 4.6 Advanced nonlinear model 4.7 Diode ladder . . . . . . . 5 2-pole lters 5.1 Linear analog model . . . 5.2 Linear digital model . . . 5.3 Further lter types . . . . 5.4 Nonlinear model . . . . . 5.5 Cascading decomposition .
CONTENTS 59 59 61 62 62 65 67 68 77 77 80 81 84 86
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
. . . . .
Preface
The classical way of presentation of the DSP theory is not very well suitable for the purposes of virtual analog lter design. The linearity and time-invariance of structures are not assumed merely to simplify certain analysis and design aspects, but are handled more or less as an ultimate truth. The connection to the continuous-time (analog) world is lost most of the time. The key focus points, particularly the discussed lter types, are of little interest to a digital music instrument developer. This makes it dicult to apply the obtained knowledge in the music DSP context, especially in the virtual analog lter design. This book attempts to amend this deciency. The concepts are introduced with the musical VA lter design in mind. The depth of theoretical explanation is restricted to an intuitive and practically applicable amount. The focus of the book is the design of digital models of classical musical analog lter structures using the topology-preserving transform approach, which can be considered as a generalization of bilinear transform, zero-delay feedback and trapezoidal integration methods. This results in digital lters having nice amplitude and phase responses, nice time-varying behavior and plenty of options for nonlinearities. In a way, this book can be seen as a detailed explanation of the materials provided in the authors article Preserving the LTI system topology in s- to z -plane transforms. The prerequisites for the reader include familiarity with the basic DSP concepts, complex algebra and the basic ideas of mathematical analysis. Some basic knowledge of electronics may be helpful at one or two places, but is not critical for the understanding of the presented materials. The author would like to apologize for possible mistakes and messy explanations. This book has been written in a large haste, just in a few days and didnt go through any serious proofreading yet.
vii
viii
PREFACE
Acknowledgements
The author would like to express his gratitude to a number of people who work (or worked at a certain time) at NI and helped him with the matters related to the creation of this book in one or another way: Daniel Haver, Mate Galic, Tom Kurth, Nicolas Gross, Maike Weber, Martijn Zwartjes, and Mike Daliot. Special thanks to Stephan Schmitt, Egbert J urgens and Maximilian Zagler. The author is also grateful to a number of people on the KVR Audio DSP forum and the music DSP mailing list for productive discussions regarding the matters discussed in the book. Particulary to Martin Eisenberg for the detailed and extensive discussion of the delayless feedback, to Dominique Wurtz for the idea of the full equivalence of dierent BLT integrators and to Teemu Voipio for the introduction of the transposed direct form II BLT integrator in the TPT context. Thanks to Robin Schmidt and Richard Homann for reporting a number of mistakes in the book text. One shouldnt underestimate the small but invaluable contribution by Helene Kolpakova, whose questions and interest in the VA lter design matters have triggered the idea to quickly write this book. Last, but most importantly, big thanks to Bob Moog for inventing the voltage-controlled transistor ladder lter.
Chapter 1
Fourier theory
When we are talking about lters we say that lters modify the frequency content of the signal. E.g. a lowpass lter lets the low frequencies through, while suppressing the high frequencies, a highpass lter does vice versa etc. In this chapter we are going to develop a formal denition1 of the concept of frequencies contained in a signal. We will later use this concept to analyse the behavior of the lters.
1.1
Complex sinusoids
In order to talk about the lter theory we need to introduce complex sinusoidal signals. Consider the complex identity: ejt = cos t + j sin t (t R)
(notice that, if t is the time, then the point ejt is simply moving along a unit circle in the complex plane). Then cos t = and sin t = ejt + ejt 2 ejt ejt 2j
Then a real sinusoidal signal a cos(t + ) where a is the real amplitude and is the initial phase can be represented as a sum of two complex conjugate sinusoidal signals: a cos(t + ) = a j (t+) a j jt a j jt e + ej (t+) = e e + e e 2 2 2
Notice that we have a sum of two complex conjugate sinusoids ejt with respective complex conjugate amplitudes (a/2)ej . So, the complex amplitude simultaneously encodes both the amplitude information (in its absolute magnitude) and the phase information (in its argument). For the positive-frequency component (a/2)ej ejt , the complex amplitude a/2 is a half of the real amplitude and the complex phase is equal to the real phase.
1 More
1.2
Fourier series
Let x(t) be a real periodic signal of a period T: x(t) = x(t + T ) Let = 2/T be the fundamental frequency of that signal. Then x(t) can be represented2 as a sum of a nite or innite number of sinusoidal signals of harmonically related frequencies jn plus the DC oset term3 a0 /2: x(t) = a0 + an cos(jnt + n ) 2 n=1
(1.1)
The representation (1.1) is referred to as real-form Fourier series. The respective sinusoidal terms are referred to as the harmonics or the harmonic partials of the signal. Using the complex sinusoid notation the same can be rewritten as
x(t) =
n=
Xn ejnt
(1.2)
where each harmonic term an cos(jnt + n ) will be represented by a sum of Xn ejnt and Xn ejnt , where Xn and Xn are mutually conjugate: Xn = X n . The representation (1.2) is referred to as complex-form Fourier series. Note that we dont have an explicit DC oset partial in this case, it is implicitly contained in the series as the term for n = 0. It can be easily shown that the real- and complex-form coecients are related as Xn = an jn e 2 a0 X0 = 2 (n > 0)
This means that intuitively we can use the absolute magnitude and the argument of Xn (for positive-frequency terms) as the amplitudes and phases of the real Fourier series partials. Complex-form Fourier series can also be used to represent complex (rather than real) periodic signals in exactly the same way, except that the equality Xn = X n doesnt hold anymore. Thus, any real periodic signal can be represented as a sum of harmonically related real sinusoidal partials plus the DC oset. Alternatively, any periodic signal can be represented as a sum of harmonically related complex sinusoidal partials.
speaking, there are some restrictions on x(t). It would be sucient to require that x(t) is bounded and continuous, except for a nite number of discontinuous jumps per period. 3 The reason the DC oset term is notated as a /2 and not as a has to do with simplifying 0 0 the math notation in other related formulas.
2 Formally
1.3
Fourier integral
While periodic signals are representable as a sum of a countable number of sinusoidal partials, a nonperiodic real signal can be represented4 as a sum of an uncountable number of sinusoidal partials:
x(t) =
0
a( ) cos t + ( )
d 2
(1.3)
The representation (1.3) is referred to as Fourier integral.5 The DC oset term doesnt explicitly appear in this case. The complex-form version of Fourier integral6 is
x(t) =
X ( )ejt
d 2
(1.4)
For real x(t) we have a Hermitian X ( ): X ( ) = X ( ), for complex x(t) there is no such restriction. The function X ( ) is referred to as Fourier transform of x(t).7 It can be easily shown that the relationship between the parameters of the real and complex forms of Fourier transform is X ( ) = a( ) j() e 2 ( > 0)
This means that intuitively we can use the absolute magnitude and the argument of X ( ) (for positive frequencies) as the amplitudes and phases of the real Fourier integral partials. Thus, any timelimited signal can be represented as a sum of an uncountable number of sinusoidal partials of innitely small amplitudes.
4 As with Fourier series, there are some restrictions on x(t). It is sucient to require x(t) to be absolutely integrable, bounded and continuous (except for a nite number of discontinuous jumps per any nite range of the argument value). The most critical requirement here is probably the absolute integrability, which is particularly fullled for the timelimited signals. 5 The 1/2 factor is typically used to simplify the notation in the theoretical analysis involving the computation. Intuitively, the integration is done with respect to the ordinary, rather than circular frequency:
x(t) =
0
a(t) cos 2f t + (f ) df
Some texts do not use the 1/2 factor in this position, in which case it appears in other places instead. 6 A more common term for (1.4) is inverse Fourier transform. However the term inverse Fourier transform stresses the fact that x(t) is obtained by computing the inverse of some transform, whereas in this book we are more interested in the fact that x(t) is representable as a combination of sinusoidal signals. The term Fourier integral better reects this aspect. It also suggests a similarity to the Fourier series representation. 7 The notation X ( ) for Fourier transform shouldnt be confused with the notation X (s) for Laplace transform. Typically one can be told from the other by the semantics and the notation of the argument. Fourier transform has a real argument, most commonly denoted as . Laplace transform has a complex argument, most commonly denoted as s.
1.4
The Dirac delta function (t) is intuitively dened as a very high and a very short symmetric impulse with a unit area (Fig. 1.1): (t) = + if t = 0 0 if t = 0
(t) = (t)
(t) dt = 1
(t) +
Figure 1.1: Dirac delta function. Since the impulse is innitely narrow and since it has a unit area,
f ( ) ( ) d = f (0)
from where it follows that a convolution of any function f (t) with (t) doesnt change f (t):
(f )(t) =
f ( ) (t ) d = f (t)
Dirac delta can be used to represent Fourier series by a Fourier integral. If we let
X ( ) =
n=
2 ( nf )Xn
then
Xn ejnf t =
n=
X ( )ejt
d 2
From now on, well not separately mention Fourier series, assuming that Fourier integral can represent any necessary signal. Thus, most signals can be represented as a sum of (a possibly innite number of ) sinusoidal partials.
1.5
Laplace transform
+j
X (s)est
ds 2j
where the integration is done in the complex plane along the straight line from j to +j (apparently X (s) is a dierent function than X ( )8 ). For timelimited signals the function X (s) can be dened on the entire complex plane in such a way that the integration can be done along any line which is parallel to the imaginary axis:
+j
x(t) =
j
X (s)est
ds 2j
( R)
(1.5)
In many other cases such X (s) can be dened within some strip 1 < Re s < 2 . Such function X (s) is referred to as bilateral Laplace transform of x(t), whereas the representation (1.5) can be referred to as Laplace integral.910 Notice that the complex exponential est is representable as est = eRe st eIm st Considering eRe st as the amplitude of the complex sinusoid eIm st we notice that est is: - an exponentially decaying complex sinusoid if Re s < 0, - an exponentially growing complex sinusoid if Re s > 0, - a complex sinusoid of constant amplitude if Re s = 0. Thus, most signals can be represented as a sum of (a possibly innite number of ) complex exponential partials, where the amplitude growth or decay speed of these partials can be relatively arbitrarily chosen.
8 As already mentioned, the notation X ( ) for Fourier transform shouldnt be confused with the notation X (s) for Laplace transform. Typically one can be told from the other by the semantics and the notation of the argument. Fourier transform has a real argument, most commonly denoted as . Laplace transform has a complex argument, most commonly denoted as s. 9 A more common term for (1.5) is inverse Laplace transform. However the term inverse Laplace transform stresses the fact that x(t) is obtained by computing the inverse of some transform, whereas is this book we are more interested in the fact that x(t) is representable as a combination of exponential signals. The term Laplace integral better reects this aspect. 10 The representation of periodic signals by Laplace integral (using Dirac delta function) is problematic for = 0. Nevertheless, we can represent them by a Laplace integral if we restrict to = 0 (that is Re s = 0 for X (s)).
SUMMARY
The most important conclusion of this chapter is: any signal occurring in practice can be represented as a sum of sinusoidal (real or complex) components. The frequencies of these sinusoids can be referred to as the frequencies contained in the signal. For complex representation, the real amplitude and phase information is encoded in the absolute magnitude and the argument of the complex amplitudes of the positive-frequency partials (where the absolute magnitude of the complex amplitude is a half of the real amplitude). It is also possible to use complex exponentials instead of sinusoids.
Chapter 2
2.1
RC lter
Consider the circuit in Fig. 2.1, where the voltage x(t) is the input signal and the capacitor voltage y (t) is the output signal. This circuit represents the simplest 1-pole lowpass lter, which we are now going to analyse. x(t)
R
y (t)
Figure 2.1: A simple RC lowpass lter. Writing the equations for that circuit we have: x = UR + UC y = UC UR = RI I=q C qC = CUC where UR is the resistor voltage, UC is the capacitor voltage, I is the current through the circuit and qC is the capacitor charge. Reducing the number of variables, we can simplify the equation system to: x = RC y +y or y = 1 (x y ) RC 7
y = y (t0 ) +
t0
1 x( ) y ( ) d RC
where t0 is the initial time moment. Introducing the notation c = 1/RC we have
t
y = y (t0 ) +
t0
c x( ) y ( ) d
(2.1)
We will reintroduce c later as the cuto of the lter. Notice that we didnt factor 1/RC (or c ) out of the integral for the case when the value of R is varying with time. The varying R corresponds to the varying cuto of the lter, and this situation is highly typical in the music DSP context.1
2.2
Block diagrams
G'! &% + y"#$ c M M G M qqq G
The integral equation (2.1) can be expressed in the block diagram form (Fig. 2.2). G G y (t)
x(t)
Figure 2.2: A 1-pole RC lowpass lter in the block diagram form. The meaning of the elements of the diagram should be intuitively clear. The gain element (represented by a triangle) multiplies the input signal by c . Notice the inverting input of the summator, denoted by . The integrator simply integrates the input signal:
t
input ( ) d
The representation of the system by the integral (rather than dierential) equation and the respective usage of the integrator element in the block diagram has an important intuitive meaning. Intuitively, the capacitor integrates the current owing through it, accumulating it as its own charge:
t
qC (t) = qC (t0 ) +
t0
I ( ) d
I ( ) d
t0
One can observe from Fig. 2.2 that the output signal is always trying to reach the input signal. Indeed, the dierence x y is always directed from
1 We didnt assume the varying C because then our simplication of the equation system C in this case. doesnt hold anymore, since q C = C U
y to x. Since c > 0, the integrator will respectively increase or decrease its output value in the respective direction. This corresponds to the fact that the capacitor voltage in Fig. 2.1 is always trying to reach the input voltage. Thus, the circuit works as a kind of smoother of the input signal.
2.3
Transfer function
G G y (t)
Suppose x(t) = est (where s = j or, possibly, another complex value). Then
t
y (t) = y (t0 ) +
t0
1 es d = y (t0 ) + es s
=
=t0
1 st 1 e + y (t0 ) est0 s s
Thus, a complex sinusoid (or exponential) est sent through an integrator comes out as the same signal est just with a dierent amplitude 1/s plus some DC term y (t0 ) est0 /s. Similarly, a signal X (s)est (where X (s) is the complex amplitude of the signal) comes out as (X (s)/s)est plus some DC term. That is, if we forget about the extra DC term, the integrator simply multiplies the amplitudes of complex exponential signals est by 1/s. Now, the good news is: for our purposes of lter analysis we can simply forget about the extra DC term. The reason for this is the following. Suppose the initial time moment t0 was quite long ago (t0 0). Suppose further that the integrator is contained in a stable lter (we will discuss the lter stability later, for now well simply mention that were mostly interested in the stable lters for the purposes of the current discussion). It can be shown that in this case the eect of the extra DC term on the output signal is negligible. Since the initial state y (t0 ) is incorporated into the same DC term, it also means that the eect of the initial state is negligible!2 Thus, we simply write (for an integrator): es d = 1 st e s
This means that est is an eigenfunction of the integrator with the respective eigenvalue 1/s. Since the integrator is linear,3 not only are we able to factor X (s) out of the integration: 1 X (s)es d = X (s) es d = X (s)est s
2 In practice, typically, a zero initial state is assumed. Then, particularly, in the case of absence of the input signal, the output signal of the lter is zero from the very beginning (rather than for t t0 ). 3 The linearity here is understood in the sense of the operator linearity. An operator H is linear, if (1 f1 (t) + 2 f2 (t)) = 1 Hf 1 (t) + 2 Hf 2 (t) H
10
but we can also apply the integration independently to all Fourier (or Laplace) partials of an arbitrary signal x(t):
+j
X (s)es
j +j
ds 2j
+j
d =
j
X (s)es d
ds = 2j
=
j
X (s) s ds e s 2j
That is, the integrator changes the complex amplitude of each partial by a 1/s factor. Consider again the structure in Fig. 2.2. Assuming the input signal x(t) has the form est we can replace the integrator by a gain element with a 1/s factor. We symbolically reect this by replacing the integrator symbol in the diagram with the 1/s fraction (Fig. 2.3).4 G'! &% + y"#$ c M M G M qqq G 1 s G G y (t)
x(t)
Figure 2.3: A 1-pole RC lowpass lter in the block diagram form with a 1/s notation for the integrator. So, suppose x(t) = X (s)est and suppose we know y (t). Then the input signal for the integrator is c (x y ). We now will further take for granted the knowledge that y (t) will be the same signal est with some dierent complex amplitude Y (s), that is y (t) = Y (s)est (notably, this holds only if c is constant, that is, if the system is time-invariant !!!)5 Then the input signal of the integrator is c (X (s) Y (s))est and the integrator simply multiplies its amplitude by 1/s. Thus the output signal of the integrator is c (x y )/s. But, on the other hand y (t) is the output signal of the integrator, thus y (t) = c or Y (s)est = c or Y (s) = c from where sY (s) = c X (s) c Y (s)
4 Often in such cases the input and output signal notation for the block diagram is replaced with X (s) and Y (s). Such diagram then works in terms of Laplace transform, the input of the diagram is the Laplace transform X (s) of the input signal x(t), the output is respectively the Laplace transform Y (s) of the output signal y (t). The integrators can then be seen as s-dependent gain elements, where the gain coecient is 1/s. 5 In other words, we take for granted the fact that est is an eigenfunction of the entire circuit.
x(t) y (t) s
11
Thus, the circuit in Fig. 2.3 (or in Fig. 2.2) simply scales the amplitude of the input sinusoidal (or exponential) signal X (s)est by the c /(s + c ) factor. Lets introduce the notation H (s) = Then Y (s) = H (s)X (s) H (s) is referred to as the transfer function of the structure in Fig. 2.3 (or Fig. 2.2). Notice that H (s) is a complex function of a complex argument. For an arbitrary input signal x(t) we can use the Laplace transform representation +j ds x(t) = X (s)est 2 j j From the linearity 6 of the circuit in Fig. 2.3, it follows that the result of the application of the circuit to a linear combination of some signals is equal to the linear combination of the results of the application of the circuit to the individual signals. That is, for each input signal of the form X (s)est we obtain the output signal H (s)X (s)est . Then for an input signal which is an integral sum of X (s)est , we obtain the output signal which is an integral sum of H (s)X (s)est . That is +j ds y (t) = H (s)X (s)est 2j j So, the circuit in Fig. 2.3 independently modies the complex amplitudes of the sinusoidal (or exponential) partials est by the H (s) factor! Notably, the transfer function can be introduced for any system which is linear and time-invariant. For the systems, whose block diagrams consist of integrators, summators and xed gains, the transfer function is always a nonstrictly proper 7 rational function of s. Particularly, this holds for the electronic circuits, where the dierential elements are capacitors and inductors, since these types of elements logically perform integration (capacitors integrate the current to obtain the voltage, while inductors integrate the voltage to obtain the current). It is important to realize that in the derivation of the transfer function concept we used the linearity and time-invariance (the absence of parameter modulation) of the structure. If these properties do not hold, the transfer function cant be introduced! This means that all transfer function-based analysis holds only in the case of xed parameter values. In practice, if the parameters are not changing too quickly, one can assume that they are approximately constant
6 Here
c s + c
(2.2)
we again understand the linearity in the operator sense: (1 f1 (t) + 2 f2 (t)) = 1 Hf 1 (t) + 2 Hf 2 (t) H
(t) where x(t) and y (t) The operator here corresponds to the circuit in question: y (t) = Hx are the input and output signals of the circuit. 7 A rational function is nonstrictly proper, if the order of its numerator doesnt exceed the order of its denominator.
12
during certain time range. That is we can approximately apply the transfer function concept (and the discussed later derived concepts, such as amplitude and phase responses, poles and zeros, stability criterion etc.) if the modulation of the parameter values is not too fast.
2.4
Complex impedances
Actually, we could have obtained the transfer function of the circuit in Fig. 2.1 using the concept of complex impedances. Consider the capacitor equation: I = CU If I (t) = I (s)est U (t) = U (s)est (where I (t) and I (s) are obviously two dierent functions, the same for U (t) and U (s)), then = sU (s)est = sU (t) U and thus = CsU (s)est = sCU (t) I (t) = I (s)est = C U that is I = sCU 1 I sC Now the latter equation looks almost like Ohms law for a resistor: U = RI . The complex value 1/sC is called the complex impedance of the capacitor. The same equation can be written in the Laplace transform form: U (s) = (1/sC )I (s). and respectively, for I (t) = I (s)est and For an inductor we have U = LI st U (t) = U (s)e we obtain U (t) = sLI (t) or U (s) = sLI (s). Thus, the complex impedance of the inductor is sL. Using the complex impedances as if they were resistances (which we can do, assuming the input signal has the form X (s)est ), we simply write the voltage division formula for the circuit in in Fig. 2.1: U= y (t) = UC x(t) UR + UC or
or, cancelling the common current factor I (t) from the numerator and the denominator, we obtain the impedances instead of voltages: y (t) = from where H (s) = y (t) 1/sC 1 1/RC c = = = = x(t) R + 1/sC 1 + sRC s + 1/RC s + c 1/sC x(t) R + 1/sC
13
2.5
Consider again the structure in Fig. 2.3. Let x(t) be a real signal and let x(t) =
j
X (s)est
ds 2j
be its Laplace integral representation. Let y (t) be the output signal (which is obviously also real) and let
+j
y (t) =
j
Y (s)est
ds 2j
be its Laplace integral representation. As we have shown, Y (s) = H (s)X (s) where H (s) is the transfer function of the circuit. The respective Fourier integral representation of x(t) is apparently
+
x(t) =
X (j )ejt
d 2
where X (j ) is the Laplace transform X (s) evaluated at s = j . The real Fourier integral representation is then obtained as ax ( ) = 2 |X (j )| x ( ) = arg X (j ) For y (t) we respectively have89 ay ( ) = 2 |Y (j )| = 2 |H (j )X (j )| = |H (j )| ax ( ) y ( ) = arg Y (j ) = arg (H (j )X (j )) = x ( ) + arg H (j ) ( 0)
Thus, the amplitudes of the real sinusoidal partials are magnied by the |H (j )| factor and their phases are shifted by arg H (j ) ( 0). The function |H (j )| is referred to as the amplitude response of the circuit and the function arg H (j ) is referred to as the phase response of the circuit. Note that both the amplitude and the phase response are real functions of a real argument . The complex-valued function H (j ) of the real argument is referred to as the frequency response of the circuit. Simply put, the frequency response is equal to the transfer function evaluated on the imaginary axis. Since the transfer function concept works only in the linear time-invariant case, so do the concepts of the amplitude, phase and frequency responses!
8 This relationship holds only if H (j ) is Hermitian: H (j ) = H (j ). If it werent the case, the Hermitian property wouldnt hold for Y (j ) and y (t) couldnt have been a real signal (for a real input x(t)). Fortunately, for real systems H (j ) is always Hermitian. Particularly, rational transfer functions H (s) with real coecients obviously result in Hermitian H (j ). 9 Formally, = 0 requires special treatment in case of a Dirac delta component at = 0 (arising particularly if the Fourier series is represented by a Fourier integral and there is a nonzero DC oset). Nevertheless, the resulting relationship between ay (0) and ax (0) is exactly the same as for > 0, that is ay (0) = H (0)ax (0). A more complicated but same argument holds for the phase.
14
2.6
Lowpass ltering
c s + c
Consider again the transfer function of the structure in Fig. 2.2: H (s) = The respective amplitude response is |H (j )| = c c + j
Apparently at = 0 we have H (0) = 1. On the other hand, as grows, the magnitude of the denominator grows as well and the function decays to zero: H (+j ) = 0. This suggests the lowpass ltering behavior of the circuit: it lets the partials with frequencies c through and stops the partials with frequencies c . The circuit is therefore referred to as a lowpass lter, while the value c is dened as the cuto frequency of the circuit. It is convenient to plot the amplitude response of the lter in a fully logarithmic scale. The amplitude gain will then be plotted in decibels, while the frequency axis will have a uniform spacing of octaves. For H (s) = c /(s + c ) the plot looks like the one in Fig. 2.4. |H (j )|, dB 0
-6
-12
-18 c / 8 c 8c
Figure 2.4: Amplitude response of a 1-pole lowpass lter. Notice that the plot falls o in an almost straight line as . Apparently, at c and respectively |s| c we have H (s) c /s and |H (s)| c / . This is a hyperbola in the linear scale and a straight line in a fully logarithmic scale. If doubles (corresponding to a step up by one octave), the amplitude gain is approximately halved (that is, drops by approximately 6 decibel). We say that this lowpass lter has a rollo of 6dB/oct. Another property of this lter is that the amplitude drop at the cuto is 3dB. Indeed |H (jc )| = 1 c 1 = = 3dB c + jc 1+j 2
15
2.7
Cuto parametrization
1 s+1
Suppose c = 1. Then the lowpass transfer function (2.2) turns into H (s) =
which is again our familiar transfer function of the lowpass lter. Consider the amplitude response graph of 1/(s + 1) in a logarithmic scale. The substitution s s/c simply shifts this graph to the left or to the right (depending on whether c < 1 or c > 1) without changing its shape. Thus, the variation of the cuto parameter doesnt change the shape of the amplitude response graph (Fig. 2.5), or of the phase response graph, for that matter (Fig. 2.6). |H (j )|, dB 0
-6
-12
-18 c /8 c 8c
Figure 2.5: 1-pole lowpass lters amplitude response shift by a cuto change. The substitution s s/c is a generic way to handle cuto parametrization for analog lters, because it doesnt change the response shapes. This has a nice counterpart on the block diagram level. For all types of lters we simply visually combine an c gain and an integrator into a single block:10 c M M G M qqq G G G c s G
10 Notice that including the cuto gain into the integrator makes the integrator block invariant to the choice of the time units: t
y (t) = y (t0 ) +
t0
c x( ) d
because the product c d is invariant to the choice of the time units. This will become important once we start building discrete-time models of lters, where we would often assume unit sampling period.
16 arg H (j ) 0
/4
/2 c /8
8c
Figure 2.6: 1-pole lowpass lters phase response shift by a cuto change. Apparently, the reason for the c /s notation is that this is the transfer function of the serial connection of an c gain and an integrator. Alternatively, we simply assume that the cuto gain is contained inside the integrator: c M M G M qqq G G G G
The internal representation of such integrator block is of course still a cuto gain followed by an integrator. Whether the gain should precede the integrator or follow it may depend on the details of the analog prototype circuit. In the absence of the analog prototype its better to put the integrator after the cuto gain, because then the integrator will smooth the jumps and further artifacts arising out of the cuto modulation. With the cuto gain implied inside the integrator block, the structure from Fig. 2.2 is further simplied to the one in Fig. 2.7: x(t) G'! &% + y"#$ G G G y (t)
Figure 2.7: A 1-pole RC lowpass lter with an implied cuto. As a further shortcut arising out of the just discussed facts, it is common to assume c = 1 during the lter analysis. Particularly, the transfer function of a 1-pole lowpass lter is often written as H (s) = 1 s+1
It is assumed that the reader will perform the s s/c substitution as necessary.
17
2.8
Highpass lter
If instead of the capacitor voltage in Fig. 2.1 we pick up the resistor voltage as the output signal, we obtain the block diagram representation as in Fig. 2.8. G y (t) G'! &% "#$ + y G G
x(t)
Figure 2.8: A 1-pole highpass lter. Obtaining the transfer function of this lter we get s H (s) = s + c (or s/(s + 1) in the unit cuto form). Its easy to see that H (0) = 0 and H (+j ) = 1, whereas the biggest change in the amplitude response occurs again around = c . Thus, we have a highpass lter here. The amplitude response of this lter is shown in Fig. 2.9 (in the logarithmic scale). |H (j )|, dB 0
-6
-12
-18 c /8 c 8c
Figure 2.9: Amplitude response of a 1-pole highpass lter. Its not dicult to observe and not dicult to show that this response is a mirrored version of the one in Fig. 2.4.11 Particularly, at c we have H (s) s/c , so when the frequency is halved (dropped by an octave), the amplitude gain is approximately halved as well (drops by approximately 6dB). Again, we have a 6dB/oct rollo.
11 In
the unit cuto notation, its easy to notice that the highpass transfer function
1 1+s
s 1+s
by the substitution s 1/s. This can be obtained from the lowpass transfer function substitution is referred to as LP to HP (lowpass to highpass) substitution. For Hermitian transfer functions (corresponding to real systems), the LP to HP substitution simply mirrors the amplitude response in the logarithmic frequency scale.
18
2.9
Apparently, this function has a pole in the complex plane at s = c . Similarly, the highpass transfer function H (s) = s s + c
also has a pole at s = c , but it also has a zero at s = 0. Recall that the transfer functions of linear time-invariant dierential systems are nonstrictly proper rational functions of s. Thus they always have poles and often have zeros, the numbers of poles and zeros matching the orders of the numerator and the denominator respectively. The poles and zeros of transfer function (especially the poles) play an important role in the lter analysis. For simplicity they are referred to as the poles and zeros of the lters. The transfer functions of real linear time-invariant dierential systems have real coecients in the numerator and denominator polynomials. Apparently, this doesnt prevent them from having complex poles and zeros, however, being roots of real polynomials, those must come in complex conjugate pairs. E.g. a transfer function with a 3rd order denominator can have either three real poles, or one real and two complex conjugate poles. The lowpass and highpass lters discussed so far, each have one pole. For that reason they are referred to as 1-pole lters. Actually, the number of poles is always equal to the order of the lter or (which is the same) to the number of integrators in the lter.12 Therefore it is common, instead of e.g. a 4th-order lter to say a 4-pole lter. The most important property of the poles is that a lter13 is stable if and only if all its poles are located in the left complex semiplane (that is to the left of the imaginary axis). For our lowpass and highpass lters this is apparently true, as long as c > 0.14 If c < 0, the pole is moved to the right semiplane, the lter becomes unstable and will explode. Also the denition of the frequency response doesnt make much sense in this case. If we put a sinusoidal signal through a stable lter we will (as we have shown) obtain an amplitudemodied and phase-shifted sinusoidal signal of the same frequency.15 If we put a sinusiodal signal through an unstable lter, the lter simply explodes (its
12 In certain singular cases, depending on the particular denition details, these numbers might be not equal to each other. 13 More precisely a linear time-invariant system, which particularly implies xed parameters. This remark is actually unnecessary, since, as we mentioned, the transfer function (and respectively the poles) are dened only for the linear time-invariant case. 14 Notably, the same condition ensures the stability of the 1-pole RC lowpass and highpass lters in the time-varying case, which can be directly seen from the fact that the lowpass lters output never exceeds the maximum level of its input. 15 Strictly speaking, this will happen only after the lter has stabilized itself to the new signal. This takes a certain amount of time. The closer the poles are to the imaginary axis (from the left), the larger is this stabilization time. The characteristic time value of the stabilization has the order of magnitude of 1/ max {Re pn }, where pn are the poles. Actually the eects of the transition (occurring at the moment of the appearance of the sinusoidal signal) decay exponentially as et max{Re pn } .
19
output grows innitely), thus it makes no sense to talk of amplitude and phase responses. It is also possible to obtain an intuitive understanding of the eect of the pole position on the lter stability. Consider a transfer function of the form H (s) = F (s)
N
(s pn )
n=1
where F (s) is the numerator of the transfer function and pn are the poles. Suppose all poles are initially in the left complex semiplane and now one of the poles (lets say p1 ) starts moving towards the imaginary axis. As the pole gets closer to the axis, the amplitude response at = Im p1 grows. When p1 gets onto the axis, the amplitude response at = Im p1 is innitely large (since j = p1 , we have H (j ) = H (p1 ) = ). This corresponds to the lter getting unstable.1617 The poles and zeros also dene the rollo speed of the amplitude response. Let Np be the number of poles and Nz be the number of zeros. Since the transfer function must be nonstrictly proper, Np Nz . Its not dicult to see that the amplitude response rollo at + is 6(Np Nz )dB/oct. Respectively, the rollo at 0 is 6Nz0 dB/oct, where Nz0 is the number of zeros at s = 0 (provided there are no poles at s = 0). Considering that 0 Nz0 Nz Np , the rollo speed at + or at 0 cant exceed 6Np dB/oct. Also, if all zeros of a lter are at s = 0 (that is Nz0 = Nz ) then the sum of the rollo speeds at 0 and + is exactly 6Np dB/oct.
2.10
Multimode lter
Actually, we can pick up the lowpass and highpass signals simultaneously from the same structure (Fig. 2.10). This is referred to as a multimode lter. G yHP (t) G'! &% " + y #$ G G G G yLP (t)
x(t)
Figure 2.10: A 1-pole multimode lter. Its easy to observe that yLP (t) + yHP (t) = x(t), that is the input signal is split by the lter into the lowpass and highpass components. In the transfer
16 The reason, why the stable area is the left (and not the right) complex semiplane, falls outside the scope of this book. 17 The discussed 1-pole lowpass lter is actually still kind of stable at = 0 (corresponding to the pole at s = 0. In fact, it has a constant output level (its state is not changing) in this case. However, strictly speaking, this case is not really stable, since all signals in a truely stable lter must decay to zero in the absence of the input signal.
c s + =1 s + c s + c
The multimode lter can be used to implement a 1st-order dierential lter for practically any given transfer function, by simply mixing its outputs. Indeed, let b1 s + b0 (a1 = 0, a0 = 0) H (s) = a1 s + a0 (the case a1 = 0 turns the transfer function into an improper rational function, while a0 = 0 is not dening a stable lter). Dividing the numerator and the denominator by a1 we obtain H (s) = b1 s b0 c (b1 /a1 )s + (b0 /a1 ) = + = s + a0 /a1 a1 s + c a0 s + c b1 b0 = HHP (s) + HLP (s) (where c = a0 /a1 ) a1 a0
Thus we simply need to set the lters cuto to a0 /a1 and take the sum y= as the output signal. b1 a1 yHP (t) + b0 a0 yLP (t)
2.11
Shelving lters
By adding/subtracting the lowpass-ltered signal to/from the unmodied input signal one can build a low-shelving lter: y (t) = x(t) + K yLP (t) The transfer function of the low-shelving lter is respectively: H (s) = 1 + K 1 s+1
The amplitude response is plotted Fig. 2.11. Typically K 1. At K = 0 the signal is unchanged. At K = 1 the lter turns into a highpass. The high-shelving lter is built in a similar way: y (t) = x(t) + K yHP (t) and s s+1 The amplitude response is plotted Fig. 2.12. There are a couple of nontrivial moments here, though. The rst one has to do with the fact that the amplitude boost or drop for the shelf is more convenient to be specied in decibels. Which requires translation of the level change specied in decibels into the K factor. Its not dicult to realize that H (s) = 1 + K dB = 20 log10 (K + 1)
21
-6
-12
-18 c /8 c 8c
Figure 2.11: Amplitude response of a 1-pole low-shelving lter (for various K ). |H (j )|, dB +6
-6
-12
-18 c /8 c 8c
Figure 2.12: Amplitude response of a 1-pole high-shelving lter (for various K ). Indeed, e.g. for the low-shelving lter at = 0 (that is s = 0) we have18 H (0) = 1 + K We also obtain H (+j ) = 1 + K for the high-shelving lter. A further nontrivial moment is that the denition of the cuto at = 1 for such lters is not really convenient. Indeed, looking at the amplitude response
18 H (0) = 1 + K is not a fully trivial result here. We have it only because the lowpass lter doesnt change the signals phase at = 0. If instead it had e.g. inverted the phase, then we would have obtained 1 K here.
22
graphs in Fig. 2.11 and Fig. 2.12 we would rather wish to have the cuto point positioned exactly at the middle of the respective slopes. Lets nd where the middle is. E.g. for the lowpass (and remembering that both scales of the graph are logarithmic) we rst nd the mid-height, which is the geometric average of 1 + K . Then we need to nd at which the shelfs gain and the unit gain: the amplitude response is 1 + K : 1+K from where 1 + 2K + K 2 + 2 = 1 + K + 2 + K 2 and K + K 2 = K 2 from where = 1+K 1 j + 1
2
j + 1 + K j + 1
(1 + K )2 + 2 =1+K 1 + 2
This is the intuitive cuto position for the low-shelving lter built from a unit-cuto lowpass lter. Respectively, given the intuitive cuto position of the low-shelving lter, we need to divide it by 1 + K to obtain the cuto of the underlying lowpass lter. Similarly, for the highpass: 1+K from where 1 + 2 (1 + 2K + K 2 ) = 1 + K + 2 + K 2 and 2 (K + K 2 ) = K from where = 1 1+K j j + 1
2
1 + j (K + 1) j + 1
1 + (1 + K )2 2 =1+K 1 + 2
This is the intuitive cuto position for the high-shelving lter built from a unit-cuto highpass lter. Respectively, given the intuitive cuto position of the high-shelving lter, we need to multiply it by 1 + K to obtain the cuto of the underlying highpass lter.
2.12
Allpass lter
By subtracting the highpass output from the lowpass output of the multimode lter we obtain the allpass lter : H (s) = HLP (s) HHP (s) = 1 s 1s = 1+s 1+s 1+s
23
Indeed, the numerator 1 j and the denominator 1 + j of the frequency response are mutually conjugate, therefore they have equal magnitudes. The allpass lter is used because of its phase response (Fig. 2.13). That is sometimes we wish to change the phases of the signals partials without changing their amplitudes. The most common VA use for the allpass lters is probably in phasers. arg H (j ) 0
/2
c /8
8c
Figure 2.13: Phase response of a 1-pole allpass lter. We could also subtract the lowpass from the highpass: H (s) = 1 s1 s = s+1 s+1 1+s
Apparently the result diers from the previous one only by the inverted phase. In regards to the unit amplitude response of the 1-pole allpass lter, we could have simply noticed that the zero and the pole of the lter are mutually symmetric relatively to the imaginary axis. This is a general property of dierential allpass lters: their poles and zeros always come in pairs, located symmetrically relatively to the imaginary axis (since the poles of a stable lter have to be in the left complex semiplane, the zeros will be in the right complex semiplane). Expressing the transfer functions numerator and denominator in the multiplicative form, we have
N N
(s zn ) |H (s)| =
n=1 N
|s zn | =
n=1 N
(s pn )
n=1 n=1
|s pn |
where pn and zn are poles and zeros. If each pair pn and zn is mutually sym metric relatively to the imaginary axis (pn = zn ), then the factors |j zn | and |j pn | of the amplitude response are always equal, thus the amplitude response is always unity.
24
2.13
We could apply the transposition to the block diagram in Fig. 2.10. The transposition process is dened as reverting the direction of all signal ow, where forks turn into summators and vice versa (Fig. 2.14).19 The transposition keeps the transfer function relationship within each pair of an input and an output (where the input becomes the output and vice versa). Thus in Fig. 2.14 we have a lowpass and a highpass input and a single output. xHP (t) y (t) o o "#$o '! &% + o '! &% + y"#$o
xLP (t)
Figure 2.14: A 1-pole transponsed multimode lter. The transposed multimode lter has less practical use than the nontransposed one in Fig. 2.10. However, one particular usage case is feedback shaping. Imagine we are mixing an input signal xin (t) with a feedback signal xfbk (t), and we wish to lter each one of those by a 1-pole lter, and the cutos of these 1-pole lters are identical. That is, the transfer functions of those lters share a common denominator. Then we could use a single transposed 1-pole multimode lter as in Fig. 2.15. MAM G M qqq MB M G M qqq
C
xin (t)
LP
G TMMF G y (t) G
HP
xfbk (t)
Figure 2.15: A transposed multimode lter (TMMF) used for feedback signal mixing. The mixing coecients A, B , C and D will dene the numerators of the respective two transfer functions (in exactly the same way as we have been mixing
19 The inverting input of the summator in the transposed version was obtained from the respective inverting input of the summator in the non-transposed version as follows. First the inverting input is replaced by an explicit inverting gain element (gain factor 1), then the transposition is performed, then the inverting gain is merged into the new summator.
SUMMARY
25
the outputs of a nontransposed multimode lter), whereas the denominator will be s + c , where c is the cuto of the transposed multimode lter.
SUMMARY
The analog 1-pole lter implementations are built around the idea of the multimode 1-pole lter in Fig. 2.10. The transfer functions of the lowpass and highpass 1-pole lters are c HLP (s) = s + c and HHP (s) = s s + c
respectively. Other 1-pole lter types can be built by combining the lowpass and the highpass signals.
26
Chapter 3
Time-discretization
Now that we have introduced the basic ideas of analog lter analysis, we will develop an approach to convert analog lter models to the discrete time.
3.1
Discrete-time signals
The discussion of the basic concepts of discrete-time signal representation and processing is outside the scope of this book. We are assuming that the reader is familiar with the basic concepts of discrete-time signal processing, such as sampling, sampling rate, sampling period, Nyquist frequency, analog-to-digital and digital-to-analog signal conversion. However we are going to make some remarks in this respect. As many other texts do, we will use the square bracket notation to denote discrete-time signals and round parentheses notation to denote continuous-time signals: e.g. x[n] and x(t). We will often assume a unit sampling rate fs = 1 (and, respectively, a unit sampling period T = 1), which puts the Nyquist frequency at 1/2, or, in the circular frequency terms, at . Apparently, this can be achieved simply by a corresponding choice of time units. Theoretical DSP texts typically state that discrete-time signals have periodic frequency spectra. This might be convenient for certain aspects of theoretical analysis such as analog-to-digital and digital-to-analog signal conversion, but its highly unintuitive otherwise. It would be more intuitive, whenever talking of a discrete-time signal, to imagine an ideal DAC connected to this signal, and think that the discrete-time signal represents the respective continuous-time signal produced by such DAC. Especially, since by sampling this continuous-time signal we obtain the original discrete-time signal again. So the DAC and ADC conversions are exact inverses of each other (in this case). Now, the continuous-time signal produced by such DAC doesnt contain any partials above the Nyquist frequency. Thus, its Fourier integral representation (assuming T = 1) is
x[n] =
X ( )ejn 27
d 2
28
CHAPTER 3. TIME-DISCRETIZATION
x[n] =
j
X (s)esn
ds 2j
Introducing notation z = es and noticing that ds = d(log z ) = we can rewrite the Laplace integral as x[n] = X (z )z n dz 2jz dz z
(where X (z ) is apparently a dierent function than X (s)) where the integration is done counterclockwise along a circle of radius e centered at the complex planes origin:1 z = es = e+j = e ej ( ) (3.1)
We will refer the representation (3.1) as the z -integral.2 The function X (z ) is referred to as the z -transform of x[n]. In case of non-unit sampling period T = 1 the formulas are the same, except that the frequency-related parameters get multiplied by T (or divided by fs ), or equivalently, the n index gets multiplied by T in continuous-time expressions:3
fs
x[n] =
fs
X ( )ejT n
d 2 ds 2j
+jfs
x[n] =
jfs
X (s)esT n z = esT
x[n] =
X (z )z n
dz 2jz
(z = e+jT , fs fs )
The notation z n is commonly used for discrete-time complex exponential signals. A continuous-time signal x(t) = est is written as x[n] = z n in discretetime, where z = esT . The Laplace-integral amplitude coecient X (s) in X (s)est then may be replaced by a z -integral amplitude coecient X (z ) such as in X (z )z n .
1 As with Laplace transform, sometimes there are no restrictions on the radius e of the circle, sometimes there are. 2 A more common term for (3.1) is the inverse z -transform, but we will prefer the z -integral term for the same reason as with Fourier and Laplace integrals. 3 Formally the parameter of the Laplace integral (and z -integral) should have been multiplied by T as well, but it doesnt matter, since this parameter is chosen rather arbitrarily.
29
3.2
Naive integration
The most interesting element of analog lter block diagrams is obviously the integrator. The time-discretization for other elements is trivial, so we should concentrate on building the discrete-time models of the analog integrator. The continuous-time integrator equation is
t
y (t) = y (t0 ) +
t0
x( ) d
In discrete time we could approximate the integration by a summation of the input samples. Assuming for simplicity T = 1, we could have implemented a discrete-time integrator as
n
y [n] = y [n0 1] +
=n0
x[ ]
We will refer to the above as the naive digital integrator. A pseudocode routine for this integrator could simply consist of an accumulating assignment: // perform one sample tick of the integrator integrator_output := integrator_output + integrator_input; It takes the current state of the integrator stored in the integrator output variable and adds the current samples value of the integrator input on top of that. In case of a non-unit sampling period T = 1 we have to multiply the accumulated input values by T :4 // perform one sample tick of the integrator integrator_output := integrator_output + integrator_input*T;
3.3
We could further apply this naive approach to construct a discrete-time model of the lowpass lter in Fig. 2.2. We will use the naive integrator as a basis for this model. Let the x variable contain the current input sample of the lter. Considering that the output of the lter in Fig. 2.2 coincides with the output of the integrator, let the y variable contain the integrator state and simultaneously serve as the output sample. As we begin to process the next input sample, the y variable will contain the previous output value. At the end of the processing of the sample (by the lter model) the y variable will contain the new output sample. In this setup, the input value for the integrator is apparently (x y )c , thus we simply have // perform one sample tick of the lowpass filter y := y + (x-y)*omega_c;
4 Alternatively, we could, of course, scale the integrators output by T , but this is less useful in practice, because the T factor will be usually combined with the cuto gain factor c preceding the integrator.
30
CHAPTER 3. TIME-DISCRETIZATION
(mind that c must have been scaled to the time units corresponding to the unit sample period!) A naive discrete-time model of the multimode lter in Fig. 2.10 could have been implemented as: // perform one sample tick of the multimode filter hp := x-lp; lp := lp + hp*omega_c; where the integrator state is stored in the lp variable. The above naive implementations (and any other similar naive implementations, for that matter) work reasonably well as long as c 1, that is the cuto must be much lower than the sampling rate. At larger c the behavior of the lter becomes rather strange, ultimately the lter gets unstable. We will now develop some theoretical means to analyse the behavior of the discrete-time lter models, gure out what are the problems with the naive implementations, and then introduce another discretization approach.
3.4
Block diagrams
Lets express the naive discrete-time integrator in the form of a discrete-time block diagram. The discrete-time block diagrams are constructed from the same elements as continuous-time block diagrams, except that instead of integrators they have unit delays. A unit delay simply delays the signal by one sample. That is the output of a unit delay comes one sample late compared to the input. Apparently, the implementation of a unit delay requires a variable, which will be used to store the new incoming value and keep it there until the next sample. Thus, a unit delay element has a state, while the other block diagram elements are obviously stateless. This makes the unit delays in a way similar to the integrators in the analog block diagrams, where the integrators are the only elements with a state. A unit delay element in a block diagram is denoted as: G z 1 G
The reason for the notation z 1 will be explained a little bit later. Using a unit delay, we can create a block diagram for our naive integrator (Fig. 3.1). For an arbitrary sampling period we obtain the structure in Fig. 3.2. For an integrator with embedded cuto gain we can combine the c gain element with the T gain element (Fig. 3.3). Notice that the integrator thereby becomes invariant to the choice of the time units, since c T is invariant to this choice. Now lets construct the block diagram of the naive 1-pole lowpass lter. Recalling the implementation routine: // perform one sample tick of the lowpass filter y := y + (x-y)*omega_c; we obtain the diagram in Fig. 3.4. The z 1 element in the feedback from the lters output to the leftmost summator is occurring due to the fact that we are
31
x[n]
G y [n]
x[n]
G y [n]
x[n]
_ _
_ G G y [n]
z 1 o _ _ _ _ _ _ _ _ _ _ _ _ _ _ z 1 o
Figure 3.4: Naive 1-pole lowpass lter (the dashed line denotes the integrator).
picking up the previous value of y in the routine when computing the dierence x y. This unit delay occurring in the discrete-time feedback is a common problem in discrete-time implementations. This problem is solvable, however it doesnt make too much sense to solve it for the naive integrator-based models, as the increased complexity doesnt justify the improvement in sound. We will address the problem of the zero-delay discrete-time feedback later, for now well con-
32
CHAPTER 3. TIME-DISCRETIZATION
centrate on the naive model in Fig. 3.4. This model can be simplied a bit, by combining the two z 1 elements into one (Fig. 3.5), so that the block diagram explicitly contains a single state variable (as does its pseudocode counterpart). _ _ _ _ _ _ _ _ _ _ _ MM M G'! &% G + Gqqq y"#$ T c 1 z _ _ _ _ _ _ _ _ _ _ _
x[n]
G y [n]
Figure 3.5: Naive 1-pole lowpass lter with just one z 1 element (the dashed line denotes the integrator).
3.5
Transfer function
Let x[n] and y [n] be respectively the input and the output signals of a unit delay: x[n] G z 1 G y [n]
For a complex exponential input x[n] = esn = z n we obtain y [n] = es(n1) = esn es = z n z 1 = z 1 x[n] That is y [n] = z 1 x[n] That is, z 1 is the transfer function of the unit delay! It is common to express discrete-time transfer functions as functions of z rather than functions of s. The reason is that in this case the transfer functions are nonstrictly proper5 rational functions, similarly to the continuous-time case, which is pretty convenient. So, for a unit delay we could write H (z ) = z 1 . Now we can obtain the transfer function of the naive integrator in Fig. 3.1. Suppose6 x[n] = X (z )z n and y [n] = Y (z )z n , or shortly, x = X (z )z n and y = Y (z )z n . Then the output of the z 1 element is yz 1 . The output of the summator is then x + yz 1 , thus y = x + yz 1
the assumption of causality, which holds if the system is built of unit delays. in continuous-time case, we take for granted the fact that complex exponentials z n are eigenfunctions of discrete-time linear time-invariant systems.
6 As 5 Under
33
This is the transfer function of the naive integrator (for T = 1). It is relatively common to express discrete-time transfer functions as rational functions of z 1 (like the one above) rather than rational functions of z . However, for the purposes of the analysis it is also often convenient to have them expressed as rational functions of z (particularly, for nding their poles and zeros). We can therefore multiply the numerator and the denominator of the above H (z ) by z , obtaining: H (z ) = z z1
Since z = es , the frequency response is obtained as H (ej ). The amplitude and phase responses are H (ej ) and arg H (ej ) respectively.7 For T = 1 we obtain z H (z ) = T z1 and, since z = esT , the frequency response is H (ejT ). Now lets obtain the transfer function of the naive 1-pole lowpass lter in Fig. 3.5, where, for the simplicity of notation, we assume T = 1. Assuming complex exponentials x = X (z )z n and y = Y (z )z n we have x and yz 1 as the inputs of the rst summator. Respectively the integrators input is c (x yz 1 ). And the integrator output is the sum of yz 1 and the integrators input. Therefore y = yz 1 + c (x yz 1 ) From where 1 (1 c )z 1 y = c x and H (z ) = y c c z = = x 1 (1 c )z 1 z (1 c )
The transfer function for T = 1 can be obtained by simply replacing c by c T . The respective amplitude response is plotted in Fig. 3.6. Comparing it to the amplitude response of the analog prototype we can observe serious deviation closer to the Nyquist frequency. The phase response (Fig. 3.7) has similar deviation problems.
3.6
Poles
Discrete-time block diagrams are diering from continuous-time block diagrams only by having z 1 elements instead of integrators. Recalling that the transfer
7 Another way to look at this is to notice that in order for z n to be a complex sinusoid ejn we need to let z = ej .
34 |H (ej )|, dB 0
CHAPTER 3. TIME-DISCRETIZATION
-6
-12
Figure 3.6: Amplitude response of a naive 1-pole lowpass lter for a number of dierent cutos. Dashed curves represent the respective analog lter responses for the same cutos. arg H (ej ) 0
/4
Figure 3.7: Phase response of a naive 1-pole lowpass lter for a number of dierent cutos. Dashed curves represent the respective analog lter responses for the same cutos.
function of an integrator is s1 , we conclude that from the formal point of view the dierence is purely notational. Now, the transfer functions of continuous-time block diagrams are nonstrictly proper rational functions of s. Respectively, the transfer functions of discrete-time block diagrams are nonstrictly proper rational functions of z . Thus, discrete-time transfer functions will have poles and zeros in a way similar to continuous-time transfer functions. Similarly to continuous-time transfer functions, the poles will dene the stability of a linear time-invariant lter. Consider that z = esT and recall the stability criterion Re s < 0 (where s = pn , where pn are the poles). Apparently, Re s < 0 |z | < 1. We might therefore intuitively expect the discrete-time stability criterion to be |pn | < 1 where pn are the discrete-time poles. This is indeed the case, a linear time-invariant
35
dierence system8 is stable if and only if all its poles are located inside the unit circle.
3.7
Trapezoidal integration
Instead of naive integration, we could attempt using the trapezoidal integration method (T = 1): // perform one sample tick of the integrator integrator_output := integrator_output + (integrator_input + previous_integrator_input)/2; previous_integrator_input := integrator_input; Notice that now we need two state variables per integrator: integrator output and previous integrator input. The block diagram of a trapezoidal integrator is shown in Fig. 3.8. Well refer to this integrator as a direct form I trapezoidal integrator. The reason for this term will be explained later. G'! &% + y"#$ G z 1 G'! &% + y"#$ z 1 o
x[n]
1/2 MMM G qq q
G y [n]
Figure 3.8: Direct form I trapezoidal integrator (T = 1). We could also construct a trapezoidal integrator implementation with only a single state variable. Consider the expression for the trapezoidal integrators output: n x[ 1] + x[ ] y [n] = y [n0 1] + (3.2) 2 =n
0
Suppose y [n0 1] = 0 and x[n0 1]=0, corresponding to a zero initial state (recall that both y [n0 1] and x[n0 1] are technically stored in the z 1 elements). Then
n
y [n] =
=n0
x[ 1] + x[ ] 1 = 2 2
n n
x[ 1] +
=n0 =n0
x[ ]
=
n
= =
1 2
x[ 1] +
=n0 +1 =n0
x[ ]
1 2
n1
x[ ] +
=n0 =n0
x[ ]
u[n 1] + u[n] 2
8 Dierence systems can be dened as those, whose block diagrams consist of gains, summators and unit delays. More precisely those are causal dierence systems. There are also dierence systems with a lookahead into the future, but we dont consider them in this book.
36 where
CHAPTER 3. TIME-DISCRETIZATION
u[n] =
=n0
x[ ]
Now notice that u[n] is the output of a naive integrator, whose input signal is x[n]. At the same time y [n] is the average of the previous and the current output values of the naive integrator. This can be implemented by the structure in Fig. 3.9. Similar considerations apply for nonzero initial state. Well refer to the integrator in Fig. 3.9 as a direct form II or canonical trapezoidal integrator. The reason for this term will be explained later. G'! &% + y"#$ z G'! &% + y"#$ 1/2 MMM G q qq
x[n]
G
1
G y [n]
Figure 3.9: Direct form II (canonical) trapezoidal integrator (T = 1). We can develop yet another form of the bilinear integrator with a single state variable. Lets rewrite (3.2) as y [n] = y [n0 1] + and let u[n 1] = y [n] Notice that y [n] = u[n 1] + and x[n] (3.4) 2 Expressing (3.3) and (3.4) in a graphical form, we obtain the structure in Fig. 3.10, where the output of the z 1 block corresponds to u[n 1]. Well refer to the integrator in Fig. 3.10 as a transposed direct form II or transposed canonical trapezoidal integrator. The reason for this term will be explained later. The positioning of the 1/2 gain prior to the integrator in Fig. 3.10 is quite convenient, because we can combine the 1/2 gain with the cuto gain into a single gain element. In case of an arbitrary sampling period we could also include the T factor into the same gain element, thus obtaining the structure in u[n] = u[n 1] + x[n] = y [n] + x[n] 2 (3.3) x[n] x[n0 1] = y [n0 1] + + x[ ] 2 2 =n
0
x[n0 1] x[n] + x[ ] + 2 2 =n
0
n1
n1
3.7. TRAPEZOIDAL INTEGRATION 1/2 MMM G q qq G'! &% " + y #$ z 1 y G'! &% "#$o +
37
x[n]
G y [n]
Figure 3.10: Transposed direct form II (transposed canonical) trapezoidal integrator (T = 1). Fig. 3.11. A similar trick can be performed for the other two integrators, if we move the 1/2 gain element to the input of the respective integrator. Since the integrator is a linear time-invariant system, this doesnt aect the integrators behavior in a slightest way. c T /2 MMM G q qq G'! &% "#$ + y z 1 y G'! &% "#$o +
x[n]
G y [n]
Figure 3.11: Transposed direct form II (transposed canonical) trapezoidal integrator with embedded cuto gain. Typically one would prefer the direct form II integrators to the direct form I integrator, because the former have only one state variable. In this book we will mostly use the transposed direct form II integrator, because this is resulting in slightly simpler zero-delay feedback equations and also oers a nice possibility for the internal saturation in the integrator. The transfer functions of all three integrators are identical. Lets obtain e.g. the transfer function of the transposed canonical integrator (in Fig. 3.10). Let u be the output signal of the z 1 element. Assuming signals of the exponential form z n , we have x u= + y z 1 2 x y = +u 2 from where u=y and y x 2
x x = + y z 1 2 2
38 or y from where
CHAPTER 3. TIME-DISCRETIZATION x x z = +y 2 2
For an arbitrary T one has to multiply the result by T , to take the respective gain element into account: H (z ) = T z+1 2 z1
One can obtain the same results for the other two integrators. What is so special about this transfer function, that makes the trapezoidal integrator so superior to the naive one, is to be discussed next.
3.8
Bilinear transform
Suppose we take an arbitrary continuous-time block diagram, like the familiar lowpass lter in Fig. 2.2 and replace all continuous-time integrators by discretetime trapezoidal integrators. On the transfer function level, this will correspond z +1 to replacing all s1 with T 2 z 1 . That is, technically we perform a subsitution s1 = T z+1 2 z1
in the transfer function expression. It would be more convenient to write this substitution explicitly as s= 2 z1 T z+1 (3.5)
The substitution (3.5) is referred to as the bilinear transform, or shortly BLT. For that reason we can also refer to trapezoidal integrators as BLT integrators. Lets gure out, how does the bilinear transform aect the frequency response of the lter, that is, what is the relationship between the original continuoustime frequency response prior to the substitution and the resulting discrete-time frequency response after the substitution. Let Ha (s) be the original continuous-time transfer function. Then the respective discrete-time transfer function is Hd (z ) = Ha 2 z1 T z+1 (3.6)
3.8. BILINEAR TRANSFORM Respectively, the discrete-time frequency response is Hd (ejT ) = Ha = Ha 2 ejT 1 T ejT + 1 2 T j tan T 2 = Ha 2 ejT /2 ejT /2 T ejT /2 + ejT /2 =
39
Notice that Ha (s) in the last expression is evaluated on the imaginary axis!!! That is, the bilinear transform maps the imaginary axis in the s-plane to the 2 j tan T unit circle in the z -plane! Now, Ha T is the analog frequency response 2 2 T evaluated at T tan 2 . That is, the digital frequency response at is equal to 2 tan T the analog frequency response at T 2 . This means that the analog frequency response in the range 0 < + is mapped into the digital frequency range 0 T < (0 < fs ), that is from zero to Nyquist!9 Denoting the analog frequency as a and the digital frequency as d we can express the argument mapping of the frequency response function as a = or, in a more symmetrical way a T d T = tan 2 2 (3.8) d T 2 tan T 2 (3.7)
Notice that for frequencies much smaller that Nyquist frequency we have T 1 and respectively a d . This is what is so unique about the bilinear transform. It simply warps the frequency range [0, +) into the zero-to-Nyquist range, but otherwise doesnt change the frequency response at all! Considering in comparison a naive integrator, we would have obtained: s1 = s= z z1 (3.9)
z1 z z1 Hd (z ) = Ha z Hd (ej ) = Ha ej 1 ej = Ha 1 ej
which means that the digital frequency response is equal to the analog transfer function evaluated on a circle of radius 1 centered at s = 1. This hardly denes a clear relationship between the two frequency responses. So, by simply replacing the analog integrators with digital trapezoidal integrators, we obtain a digital lter whose frequency response is essentially the same as the one of the analog prototype, except for the frequency warping. Particularly, the relationship between the amplitude and phase responses of the lter is fully preserved, which is particularly highly important if the lter is to be used as a building block in a larger lter. Very close to perfect!
9A
40
CHAPTER 3. TIME-DISCRETIZATION
Furthermore, the bilinear transform maps the left complex semiplane in the s-domain into the inner region of the unit circle in the z -domain. Indeed, lets obtain the inverse bilinear transform formula. From (3.5) we have (z + 1) from where 1+ and z= 1+ 1 sT =z1 2
sT sT =z 1 2 2
sT 2 sT 2
(3.10)
The equation (3.10) denes the inverse bilinear transform. Now, if Re s < 0, then, obviously sT sT 1+ < 1 2 2 and |z | < 1. Thus, the left complex semiplane in the s-plane is mapped to the inner region of the unit circle in the z -plane. In the same way one can show that the right complex semiplane is mapped to the outer region of the unit circle. And the imaginary axis is mapped to the unit circle itself. Comparing the stability criterion of analog lters (the poles must be in the left complex semiplane) to the one of digital lters (the poles must be inside the unit circle), we conclude that the bilinear transform exactly preserves the stability of the lters! In comparison, for a naive integrator replacement we would have the following. Inverting the (3.9) substitution we obtain sz = z 1 z (1 s) = 1 1 1s Assuming Re s < 0 and considering that in this case z= z 1 1 1 1 1 2 + = = 2 1s 2 1s
s 2
and
1 1+s 1 < 2 1s 2
we conclude that the left semiplane is mapped into a circle of radius 0.5 centered at z = 0.5. So the naive integrator overpreserves the stability, which is not nice, since we would rather have digital lters behaving as closely to their analog prototypes as possible. Considering that this comes in a package with a poor frequency response transformation, we should rather stick with trapezoidal integrators. So, lets replace e.g. the integrator in the familiar lowpass lter structure in Fig. 2.2 with a trapezoidal integrator. Performing the integrator replacement, we obtain the structure in Fig. 3.12. We will refer to the trapezoidal integrator replacement method as the topology-preserving transform (TPT) method. This term will be explained and properly introduced later. For now, before we simply attempt to implement the structure in Fig. 3.12 in code, we should become aware of a few further issues.
41
x[n]
_ _ G G y [n]
Figure 3.12: 1-pole TPT lowpass lter (the dashed line denotes the trapezoidal integrator).
3.9
Cuto prewarping
Suppose we are using the lowpass lter structure in Fig. 3.12 and we wish to have its cuto at c . If we however simply put this c parameter into the respective integrator gain element c T /2, our frequency response at the cuto will be dierent from the expected one. Considering the transfer function of an analog 1-pole lowpass lter (2.2), at the cuto we expect H (jc ) = 1 c = c + jc 1+j
corresponding to a 3dB drop in amplitude and a 45 phase shift. However, letting a = c in (3.8) we will obtain some d = c . That is the cuto point of the analog frequency response will be mapped to some other frequency d in the digital frequency response (Fig. 3.13). This is the result of the frequency axis warping by the bilinear transform.10 However, if we desire to have the 1/(1 + j ) frequency response exactly at d = c , we can simply apply (3.7) to d = c , thereby obtaining some a . This a should be used in the gain element of the integrator, that is the gain should be a T /2 instead of c T /2. This cuto substitution is referred to as the cuto prewarping. The result of the cuto prewarping is illustrated in Fig. 3.14. Apparently, the importance of the cuto prewarping grows as the cuto values get higher. For cuto values much lower than the Nyquist frequency the prewarping has only a minor eect. Notice that its possible to choose any other point for the prewarping, not necessarily the cuto point. That is its possible to make any single chosen point on the analog frequency response to be located at the desired digital frequency. In order to do so we rst choose d of interest, then use (3.7) to nd the respective a . Now we want a particular point on the analog frequency response to be located at a , which can be achieved by a proper choice of the
10 The response dierence at the cuto in Fig. 3.13 might seem negligible. However it will be even higher for cutos closer to Nyquist. Also for lters with strong resonance the detuning of the cuto by frequency warping might be way more noticeable.
42 |H (ej )|, dB 0
CHAPTER 3. TIME-DISCRETIZATION
-6
-12
Figure 3.13: Amplitude response of an unprewarped bilineartransformed 1-pole lowpass lter for a number of dierent cutos. Dashed curves represent the respective analog lter responses for the same cutos. Observe the dierence between the analog and digital responses at each cuto. |H (ej )|, dB 0
-6
-12
Figure 3.14: Amplitude response of an prewarped bilineartransformed 1-pole lowpass lter for a number of dierent cutos. Dashed curves represent the respective analog lter responses for the same cutos. Observe the identical values of the analog and digital responses at each cuto. analog cuto value. Now we put this cuto value into the integrators and thats it!
3.10
Zero-delay feedback
There is a further problem with the trapezoidal integrator replacement in the TPT method. Replacing the integrators with trapezoidal ones introduces delayless feedback loops (that is, feedback loops not containing any delay elements)
43
into the structure. E.g. consider the structure in Fig. 3.12. Carefully examining this structure, we nd that it has a feedback loop which doesnt contain any unit delay elements. This loop goes from the leftmost summator through the gain, through the upper path of the integrator to the lters output and back through the large feedback path to the leftmost summator. Why is this delayless loop a problem? Lets consider for example the naive lowpass lter structure in Fig. 3.5. Suppose we dont have the respective program code representation and wish to obtain it from the block diagram. We could do it in the following way. Consider Fig. 3.15, which is the same as Fig. 3.5, except that it labels all signal points. At the beginning of the computation of a new sample the signals A and B are already known. A = x[n] is the current input sample and B is taken from the internal state memory of the z 1 element. Therefore we can compute C = A B . Then we can compute D = (c T )C and nally E = D + B . The value of E is then stored into the internal memory of the z 1 element (for the next sample computation) and is also sent to the output as the new y [n] value. Easy, right? x[n] A G'! &% "#$ C + y
G y [n]
z 1 B
Figure 3.15: Naive 1-pole lowpass lter and the respective signal computation order. Now the same approach doesnt work for the structure in Fig. 3.12. Because there is a delayless loop, we cant nd a starting point for the computation within that loop. The classical way of solving this problem is exactly the same as what we had in the naive approach: introduce a z 1 into the delayless feedback, turning it into a feedback containing a unit delay (Fig. 3.16). Now there are no delayless feedback paths and we can arrange the computation order in a way similar to Fig. 3.15. This however destroys the resulting frequency response, because the transfer function is now dierent. In fact the obtained result is not signicantly better (if better at all) than the one from the naive approach. There are some serious artifacts in the frequency response closer to the Nyquist frequency, if the lter cuto is suciently high. Therefore we shouldnt introduce any modications into the structure and solve the zero-delay feedback problem instead. The term zero-delay feedback originates from the fact that we avoid introducing a one-sample delay into the feedback (like in Fig. 3.16) and instead keep the feedback delay equal to zero. So, lets solve the zero-delay feedback problem for the structure in Fig. 3.12. Notice that this structure simply consists of a negative feedback loop around
z 1 o
Figure 3.16: Digital 1-pole lowpass lter with a trapezoidal integrator and an extra delay in the feedback.
a trapezoidal integrator, where the trapezoidal integrator structure is exactly the one from Fig. 3.11. We will now introduce the concept of the instantaneous response of this integrator structure. So, consider the integrator structure in Fig. 3.11 and let u[n] denote the input signal of the z 1 element, respectively its output will be u[n 1]. Since there are no delayless loops in the integrator, its not dicult to obtain the following expression for y [n]: y [n] = c T x[n] + u[n 1] 2 (3.11)
Notice that, at the time x[n] arrives at the integrators input, all values in the right-hand side of (3.11) are known (no unknown variables). Introducing notation c T 2 s[n] = u[n 1] g= we have y [n] = gx[n] + s[n] or, dropping the discrete time argument notation for simplicity, y = gx + s That is, at any given time moment n, the output of the integrator y is a linear function of its input x, where the values of the parameters of this linear function are known. The g parameter doesnt depend on the internal state of the integrator, while the s parameter does depend on the internal state of the integrator. We will refer to the linear function f (x) = gx + s as the instantaneous response of the integrator at the respective implied time moment n. The coecient g can be referred to as the instantaneous response gain or simply instantaneous gain. The term s can be referred to as the instantaneous response oset or simply instantaneous oset.
45
Figure 3.17: 1-pole TPT lowpass lter with the integrator in the instantaneous response form. Lets now redraw the lter structure in Fig. 3.12 as in Fig. 3.17. We have changed the notation from x to in the gx + s expression to avoid the confusion with the input signal x[n] of the entire lter. Now we can easily write and solve the zero-delay feedback equation. Indeed, suppose we already know the lter output y [n]. Then the output signal of the feedback summator is x[n] y [n] and the output of the integrator is respectively g (x[n] y [n]) + s. Thus y [n] = g (x[n] y [n]) + s or, dropping the time argument notation for simplicity, y = g (x y ) + s (3.12)
The equation (3.12) is the zero-delay feedback equation for the lter in Fig. 3.17 (or, for that matter, in Fig. 3.12). Solving this equation, we obtain y (1 + g ) = gx + s and respectively y= gx + s 1+g
(3.13)
Having found y (that is, having predicted the output y [n]), we can then proceed with computing the other signals in the structure in Fig. 3.12, beginning with the output of the leftmost summator.11 Its worth mentioning that (3.13) can be used to obtain the instantaneous response of the entire lter from Fig. 3.12. Indeed, rewriting (3.13) as y= and introducing notations G= g 1+g s S= 1+g (3.14) g s x+ 1+g 1+g
we have y = Gx + S So, the instantaneous response of the entire lowpass lter in Fig. 3.12 is again a linear function of the input. We could use the expression (3.14) e.g. to solve the
11 Notice that the choice of the signal point for the prediction is rather arbitrary. We could have chosen any other point within the delayless feedback loop.
46
CHAPTER 3. TIME-DISCRETIZATION
zero-delay feedback problem for some larger feedback loop containing a 1-pole lowpass lter. Lets now convert the structure in Fig. 3.12 into a piece of code. We already know y from (3.14). Lets notice that the output of the c T /2 gain is used twice in the structure. Let v denote the output of this gain. Since g = c T /2, we have v = g (x y ) = g (x Gx S ) = g x =g 1 s x 1+g 1+g =g xs 1+g s g x 1+g 1+g = (3.15)
Recall that s is the output value of the z 1 element and let u denote its input value. Then y =v+s (3.16) and u=y+v (3.17) The equations (3.15), (3.16) and (3.17) can be directly expressed in program code: // perform one sample tick of the lowpass filter v := (x-z1_state)*g/(1+g); y := v + z1_state; z1_state := y + v; or instead expressed in a block diagram form (Fig. 3.18). Notice that the block diagram doesnt contain any delayless loops anymore. G'! &% + y"#$ g/(1 + g ) MMM G q qq G'! &% + y"#$ y
x[n]
G y [n]
Figure 3.18: 1-pole TPT lowpass lter with resolved zero-delay feedback.
3.11
Direct forms
Consider again the equation (3.6), which describes the application of the bilinear transform to convert an analog transfer function to a digital one. There is a
47
classical method of digital lter design which is based directly on this transformation, without using any integrator replacement techniques. In the authors experience, for music DSP needs this method typically has a largely inferior quality, compared to the TPT. Nevertheless we will describe it here for completeness and for a couple of other reasons. Firstly, it would be nice to try to analyse and understand the reasons for the problems of this method. Secondly, this method could be useful once in a while. Particularly, its deciencies mostly disappear in the time-invariant (unmodulated, or suciently slowly modulated) case, while the implementation sometimes could be slightly more ecient. Having obtained a digital transfer function from (3.6), we could observe, that, since the original analog transfer function was a rational function of s, the resulting digital transfer function will necessarily be a rational function of z . E.g. using the familiar 1-pole lowpass transfer function Ha (s) = we obtain Hd (z ) = Ha = 2 z1 T z+1
c T 2
c s + c
2 T
c z 1 z +1
+ c
(z + 1)
c T 2
(z 1) +
(z + 1)
1+
c T 2 (z + 1) c T T z 1 c 2 2
Now, there are standard discrete-time structures allowing an implementation of any given nonstrictly proper rational transfer function. It is easier to use these structures, if the transfer function is expressed as a rational function of z 1 rather than the one of z . In our particular example, we can multiply the numerator and the denominator by z 1 , obtaining Hd (z ) =
c T 2 (1 c T 2
+ z 1 ) 1
c T 2
1+
z 1
The further requirement is to have the constant term in the denominator equal to 1, which can be achieved by dividing everything by 1 + c T /2:
c T 2 T 1+ c 2
(1 + z 1 ) (3.18) z 1
Hd (z ) =
T 1 c 2 c T 1+ 2
Now suppose we have an arbitrary rational nonstrictly proper transfer function of z , expressed via z 1 and having the constant term in the denominator equal to 1:
N
bn z n H (z ) =
n=0 N
(3.19) an z
n
1
n=1
This transfer function can be implemented by the structure in Fig. 3.19 or by the structure in Fig. 3.20. One can verify (by computing the transfer functions
48
CHAPTER 3. TIME-DISCRETIZATION
x(t)
G z 1
G z 1
...
G z 1 11 1 bN
...
y (t) o
G z 1
G z 1
...
G z 1
Figure 3.19: Direct form I (DF1). G'! &% "#$o + '! &% " + y #$o 11 y 1 a1 11 1 b0 y (t) o #$o '! &% " + G z 1 G 11 1 b1 #$o '! &% " + G z 1 '! &% " + y #$o 11 y 1 a2 G 11 1 b2 #$o '! &% " + ... ... G z 1
x(t)
... 11 y 1 aN G 11 1 bN
Figure 3.20: Direct form II (DF2), a.k.a. canonical form. of the respective structures) that they indeed implement the transfer function (3.19). There are also transposed versions of these structures, which the readers should be able to construct on their own. Lets use the direct form II to implement (3.18). Apparently, we have N =1 b0 = b1 = a1 = 1 1+
c T 2 T 1 + c 2 c T 2 c T 2
and the direct form implementation itself is the one in Fig. 3.21 (we have merged the b0 and b1 coecients into a single gain element).
3.11. DIRECT FORMS G'! &% " + y #$ 1 a1 y 11 z G'! &% " + y #$ b0 M M G M qqq
49
x[n]
G
1
G y [n]
Figure 3.21: Direct form II 1-pole lowpass lter. In the time-invariant (unmodulated) case the performance of the direct form lter in Fig. 3.21 should be identical to the TPT lter in Fig. 3.12, since both implement the same bilinear-transformed analog transfer function (2.2). When the cuto is modulated, however, the performance will be dierent. This is very easy to understand intuitively. First, consider the two following analog structures, implementing two dierent ways of combining a cuto gain with an integrator: c M M G M qqq G G and G c M M G M qqq G
Suppose the input signal is a sine and there is a sudden jump in the cuto parameter. In this case there will also be a sudden jump in the input of the rst integrator, however the jump will be converted into a break by the integration. In the second case the jump will not be converted, because it appears after the integrator. Ignoring the problem of a DC oset possibly introduced by such jump in the rst structure (because in real stable lters this DC oset will quickly disappear with time), we should say that the rst structure has a better modulation performance, since the cuto jumps do not produce so audible clicks in the output. Apparently the two structures behave dierently in the time-varying case, even though both have the same transfer function c /s. We say that the two structures have the same transfer function but dierent topology (the latter term referring to the components used in the block diagram and the way they are connected to each other). As mentioned, the transfer function is applicable only to the time-invariant case. No wonder its possible to have structures with identical transfer functions, but dierent time-varying behavior. Now, returning to the comparison of implementations in Fig. 3.21 and Fig. 3.12, we notice that the structure in Fig. 3.21 contains a gain element at the output, the value of this gain being approximately proportional to the cuto (at low cutos). This will particularly produce unsmoothed jumps in the output in response to jumps in the cuto value. In the structure in Fig. 3.12, on the other hand, the cuto jumps will be smoothed by the integrator. Thus, the dierence between the two structures is similar to the just discussed eect of the cuto gain placement with the integrator. We should conclude that, other things being equal, the structure in Fig. 3.21 is inferior to the one in Fig. 3.12 (or Fig. 3.18). In this respect consider that Fig. 3.12 is trying to explicitly emulate the analog integration behavior, preserv-
50
CHAPTER 3. TIME-DISCRETIZATION
ing the topology of the original analog structure, while Fig. 3.21 is concerned solely with implementing a correct transfer function. Since Fig. 3.21 implements a classical approach to the bilinear transform application for digital lter design (which ignores the lter topology) well refer to the trapezoidal integration replacement technique as the topology-preserving bilinear transform (or, shortly, TPBLT). Or, even shorter, we can refer to this technique as simply the topologypreserving transform (TPT), implicitly assuming that the bilinear transform is being used.12 In principle, sometimes there are possibilities to manually x the structures such as in Fig. 3.21. E.g. the time-varying performance of the latter is drastically improved by moving the b0 gain to the input. The problem however is that this kind of xing quickly gets more complicated (if being possible at all) with larger lter structures. On the other hand, the TPT method explicitly aims at emulating the time-varying behavior of the analog prototype structure, which aspect is completely ignored by the classical transform approach. Besides, if the structure contains nonlinearities, preserving the topology becomes absolutely critical, because otherwise these nonlinearites can not be placed in the digital model.13 Also, the direct forms suer from precision loss issues, the problem growing bigger with the order of the system. For that reason in practice the direct forms of orders higher than 2 are rarely used,14 but even 2nd-order direct forms could already noticeably suer from precision losses.
3.12
The trapezoidal integrator replacement technique can be seen as a particular case of a more general set of replacement techniques. Suppose we have two lters, whose frequency response functions are F1 ( ) and F2 ( ) respectively. The lters do not need to have the same nature, particularly one can be an analog lter while the other can be a digital one. Suppose further, there is a frequency axis mapping function = ( ) such that F2 ( ) = F1 (( )) Typically ( ) should map the entire domain of F2 ( ) onto the entire domain of F1 ( ) (however the exceptions are possible). To make the subsequent discussion more intuitive, we will assume that ( ) is monotone, although this is absolutely not a must.15 In this case we could say that F2 ( ) is obtained from F1 ( ) by a frequency axis warping. Particularly,
12 Apparently, naive lter design techniques also preserve the topology, but they do a rather poor job on the transfer functions. Classical bilinear transform approach does a good job on the transfer function, but doesnt preserved the topology. The topology-preserving transform achieves both goals simultaneously. 13 This is related to the fact that transfer functions can be dened only for linear timeinvariant systems. Nonlinear cases are obviously not linear, thus some critical information can be lost, if the conversion is done solely based on the transfer functions. 14 A higher-order transfer function is typically decomposed into a product of transfer functions of 1st- and 2nd-order rational functions (with real coecients!). Then it can be implemented by a serial connection of the respective 1st- and 2nd-order direct form lters. 15 Strictly speaking, we dont even care whether ( ) is single-valued. We could have instead required that F2 (2 ( )) = F1 (1 ( )) for some 1 ( ) and 2 ( ).
51
this is exactly what happens in the bilinear transform case (the mapping ( ) is then dened by the equation (3.7)). One cool thing about the frequency axis warping is that it preserves the relationship between the amplitude and phase. Suppose that we have a structure built around lters of frequency response F1 ( ), and the rest of the structure doesnt contain any memory elements (such as integrators or unit delays). Then the frequency response F ( ) of this structure will be a function of F1 ( ): F ( ) = (F1 ( )) where the specics of the function (w) will be dened by the details of the container structure. E.g. if the building-block lters are analog integrators, then F1 ( ) = 1/j . For the lter in Fig. 2.2 we then have (w) = w w+1
which is the already familiar to us frequency response of the analog lowpass lter. Now, we can view the trapezoidal integrator replacement as a substitution of F2 instead of F1 , where ( ) is obtained from (3.7): a = (d ) = 2 d T tan T 2
The frequency response of the resulting lter is obviously equal to (F2 ( )), where F2 ( ) is the frequency response of the trapezoidal integrators (used in place of analog ones). But since F2 ( ) = F1 (( )). (F2 ( )) = (F1 (( ))) which means that the frequency response (F2 ()) of the structure with trapezoidal integrators is obtained from the frequency response (F1 ()) of the structure with analog integrators simply by warping the frequency axis. If the warping is not too strong, the frequency responses will be very close to each other. This is exactly what is happening in the trapezoidal integrator replacement and generally in the bilinear transform. Dierentiator-based lters We could have used some other two lters, with their respective frequency responses F1 and F2 . E.g. we could consider continuous-time systems built around dierentiators rather than integrators.16 The transfer function of a dierentiator is apparently simply H (s) = s, so we could use (3.5) to build a discrete-time trapezoidal dierentiator. Particularly, if we use the direct form II approach,
16 The real-world analog electronic circuits are built around integrators rather than dierentiators. Therefore the dierentiator-based lters have rather theoretical signicance in the VA lter design.
52
CHAPTER 3. TIME-DISCRETIZATION
it could look similarly to the integrator in Fig. 3.9. When embedding the cuto control into a dierentiator (in the form of a 1/c gain), its probably better to position it after the dierentiator, to avoid the unnecessary de-smoothing of the control modulation by the dierentiator. Replacing the analog dierentiators in a structure by such digital trapezoidal dierentiators we eectively perform a dierentiator-based TPT. E.g. if we replace the integrator in the highpass lter in Fig. 2.8 by a differentiator, we essentially perform a 1/s s substitution, thus we should have obtained a (dierentiator-based) lowpass lter. Remarkably, if we perform a dierentiator-based TPT on such lter, the obtained digital structure is fully equivalent to the previously obtained integrator-based TPT 1-pole lowpass lter. Allpass substitution One particularly interesting case occurs when F1 and F2 dene two dierent allpass frequency responses. That is |F1 ( )| 1 and |F2 ( )| 1. In this case the mapping ( ) is always possible. Especially since the allpass responses (dened by rational transfer functions of analog and digital systems) always cover the entire phase range from to .17 In intuitive terms it means: for a lter built of identical allpass elements, we can always replace those allpass elements with an arbitrary other type of allpass elements (provided all other elements are memoryless, that is there are only gains and summators). We will refer to this process as allpass substitution. Whereas in the trapezoidal integrator replacement we have replaced analog integrators by digital trapezoidal integrators, in the allpass substitution we replace allpass lters of one type by allpass lters of another type. We can even replace digital allpass lters with analog ones and vice versa. E.g., noticing that z 1 elements are allpass lters, we could replace them with analog allpass lters. One particularly interesting case arises out of the inverse bilinear transform (3.10). From (3.10) we obtain z 1 = 1 1+
sT 2 sT 2
(3.20)
The right-hand side of (3.20) obviously denes a stable 1-pole allpass lter, whose cuto is 2/T . We could take a digital lter and replace all z 1 elements with an analog allpass lter structure implementing (3.20). By doing this we would have performed a topology-preserving inverse bilinear transform. We could then apply the cuto parametrization to these underlying analog allpass elements: sT s 2 c so that we obtain z 1 = 1 s/c 1 + s/c
17 Actually, for < < +, they cover this range exactly N times, where N is the order of the lter.
53
The expression s/c can be also rewritten as sT /2, where is the cuto scaling factor: 1 sT /2 z 1 = (3.21) 1 + sT /2 Finally, we can apply the trapezoidal integrator replacement to the cuto-scaled analog lter, converting it back to the digital domain. By doing so, we have applied the cuto scaling in the digital domain! On the transfer function level this is equivalent to applying the bilinear transform to (3.21), resulting in z 1 = = 1 1 sT /2 1 + sT /2 1+
z 1 (z +1) z 1 (z +1)
(z + 1) (z 1) ( 1)z + ( + 1) = (z + 1) + (z 1) ( + 1)z + ( 1)
which applies cuto scaling in the digital domain.18 The allpass lter H (z ) = ( 1)z + ( + 1) ( + 1)z + ( 1)
should have been obtained, as described, by the trapezoidal integrator replacement in an analog implementation of (3.21), alternatively we could use a direct form implementation. Notice that this lter has a pole at z = ( 1)/( + 1). Since | 1| < | + 1| > 0, the pole is always located inside the unit circle, and the lter is always stable.
3.13
Writing the solution (3.13) for the zero-delay feedback equation (3.12) we in fact have slightly jumped the gun. Why? Lets consider once again the structure in Fig. 3.17 and suppose g gets negative and starts growing in magnitude further in the negative direction.19 When g becomes equal to 1, the denominator of (3.13) turns into zero. Something bad must be happening at this moment. In order to understand the meaning of this situation, lets consider the delayless feedback path as if it was an analog feedback. An analog signal value cant change instantly. It can change very quickly, but not instantly, its always a continuous function of time. We could imagine there is a smoother unit somewhere in the feedback path (Fig. 3.22). This smoother unit has a very very fast response time. We introduce the notation y for the output of the smoother.
18 Dierently from the analog domain, the digital cuto scaling doesnt exactly shift the response along the frequency axis in a logarithmic scale, as some frequency axis warping is involved. The resulting frequency response change however is pretty well approximated as shiting in the lower frequency range. 19 Of course, such lowpass lter formally has a negative cuto value. It is also unstable. However unstable circuits are very important as the linear basis for the analysis and implementation of e.g. nonlinear self-oscillating lters. Therefore we wish to be able to handle unstable circuits as well.
Figure 3.22: Digital 1-pole lowpass lter with a trapezoidal integrator in the instantaneous response form and a smoother unit in the delayless feedback path. So, suppose we wish to compute a new output sample y [n] for the new input sample x[n]. At the time x[n] arrives at the lters input, the smoother still holds the old output value y [n 1]. Lets freeze the discrete time at this point (which formally means we simply are not going to update the internal state of the z 1 element). At the same time we will let the continuous time t run, formally starting at t = 0 at the discrete time moment n. In this time-frozen setup we can choose arbitrary units for the continuous time t. The smoother equation can be written as (t) = sgn y (t) y sgn y (t) That is, we dont specify the details of the smoothing behavior, however the smoother output always changes in the direction from y towards y at some (not necessarily constant) speed.20 Particularly, we can simply dene a constant speed smoother: = sgn(y y y ) or we could use a 1-pole lowpass lter as a smoother: =yy y The initial value of the smoother is apparently y (0) = y [n 1]. Now consider that (t) = sgn y (t) y sgn y (t) = sgn g (x[n] y (t)) + s y (t) = = sgn (gx[n] + s) (1 + g ) y (t) = sgn a (1 + g ) y (t) where a = gx[n] + s is constant in respect to t. First, assume 1 + g > 0. Further, (0) > 0 and then the value of the expression suppose a (1 + g ) y (0) > 0. Then y a (1+ g ) y (t) will start decreasing until it turns to zero at some t, at which point the smoothing process converges. On the other hand, if a (1+ g ) y (0) < 0, then (0) < 0 and the value of the expression a (1 + g ) y y (t) will start increasing until it turns to zero at some t, at which point the smoothing process converges. If a (1 + g ) y (0) = 0 then the smoothing is already in a stable equilibrium state. So, in case 1 + g > 0 the instantaneous feedback smoothing process always converges. Now assume 1 + g 0. Further, suppose a (1 + g ) y (0) > 0. Then (0) > 0 and then the value of the expression a (1 + g ) y y (t) will start further increasing (or stay constant if 1 + g = 0). Thus, y (t) will grow indenitely.
20 We also assume that the smoothing speed is suciently large to ensure that the smoothing process will converge at all cases where it potentially can converge (this statement should become clearer as we discuss more details).
55
Respectively, if a (1 + g ) y (0) < 0, then y (t) will decrease indenitely. This indenite growth/decrease will occur within the frozen discrete time. Therefore we can say that y grows innitely in an instant. We can refer to this as to an instantaneously unstable zero-delay feedback loop. The analysis of the instantaneous stability can also be done using the analog lter stability analysis means. Let the smoother be an analog 1-pole lowpass 1 21 ) and notice that in lter with a unit cuto (whose transfer function is s+1 that case the structure in Fig. 3.22 can be redrawn as in Fig. 3.23. This lter has two formal inputs x[n] and s and one output y [n]. s G'! &% " + y #$ g MMM G q qq
1 s+1
x[n]
G y [n]
Figure 3.23: An instantaneous representation of a digital 1-pole lowpass lter with a trapezoidal integrator and an analog lowpass smoother. We can now e.g. obtain a transfer function from the x[n] input to the y [n] output. Ignoring the s input signal (assuming it to be zero), for a continuoustime complex exponential input signal arriving at the x[n] input, which we denote as x[n](t), we have a respective continuous-time complex exponential signal at the y [n] output, which we denote as y [n](t): y [n](t) = g x[n](t) from where y [n](t) = that is H (s) = 1 y [n](t) s+1
g 1 x[n](t) 1 + g s+1
g s+1 1 = g s + (1 + g ) 1 + g s+1
This transfer function has a pole at s = (1 + g ). Therefore, the structure is stable if 1 + g > 0 and not stable otherwise. The same transfer function analysis could have been done between the s input and the y [n] output, in which case we would have obtained H (s) = s+1 s + (1 + g )
21 Apparently, the variable s used in the transfer function 1 is a dierent s than the one s+1 used in the instantaneous response expression for the integrator. The author apologizes for the slight confusion.
56
CHAPTER 3. TIME-DISCRETIZATION
The poles of this transfer function however, are exactly the same, so it doesnt matter.22 Alright, so we have found out that the lter in Fig. 3.12 is instantaneously unstable if g 1, but what can we do about it? Firstly, the problem typically doesnt occur, as normally g > 0 (not only in the 1-pole lowpass lter case, but also in other cases). Even if it can occur in principle, one can consider, whether these extreme parameter settings are so necessary to support, and possibly simply clip the lter parameters in such a way that the instantaneous instability doesnt occur. Secondly, lets notice that g = c T /2. Therefore another solution could be to increase the sampling rate (and respectively reduce the sampling period T ).23 Unstable bilinear transform There is yet another idea, which hasnt been tried out in practice yet.24 We are going to discuss it anyway. The instantaneous instability is occurring at the moment when one of the analog lters poles hits the pole of the inverse bilinear transform (3.10), which is located at s = 2/T . On the other hand, recall that the bilinear transform is mapping the imaginary axis to the unit circle, thus kind-of preserving the frequency response. If the system is not stable, then the frequency response doesnt make sense. Formally, the reason for this is that the inverse Laplace transform of transfer functions only converges for > max {Re pn } where pn are the poles of the transfer function, and respectively, if max {Re pn } 0, it doesnt converge on the imaginary axis ( = 0). However, instead of the imaginary axis Re s = = 0, lets choose some other axis Re s = > max {Re pn } and use it instead of the imaginary axis to compute the frequency response. We also need to nd a discrete-time counterpart for Re s = . Considering that Re s denes the magnitude growth speed of the exponentials est we could choose a z -plane circle, on which the magnitude growth speed of z n is the same as for et . Apparently, this circle is |z | = eT . So, we need to map Re s = to |z | = eT . Considering the bilinear transform equation (3.5), we divide z by eT to make sure zeT has a unit magnitude and shift the s-plane result by : s=+ 2 zeT 1 T zeT + 1 (3.22)
22 This is a common rule: the poles of a system with multiple inputs and/or multiple outputs are always the same regardless of the particular input-output pair for which the transfer function is being considered (exceptions in signular cases, arising out of pole/zero cancellation are possible, though). 23 Actually, the instantaneous instability has to do with the fact that the trapezoidal integration is not capable of producing reasonable approximation of the continuous-time integration, due to too extreme parameter values. Increasing the sampling rate obviously increases the precision of the trapezoidal integration as well. The same idea can also be used to easily and reliably nd out, whether the positive value of the denominator of the feedback equations solution corresponds to the instantaneously stable case or vice versa. The sign which the denominator has for T 0 corresponds to the instantaneously stable case. 24 The author just got the idea while writing the book and didnt nd the time yet to properly experiment with it. Sucient theoretical analysis is not possible here due to the fact that practical applications of instantaneously unstable (or any unstable, for that matter) lters occur typically for nonlinear lters, and theres not much theoretical analysis means for the latter. Hopefully there are no mistakes in the theoretical transformations, but even if there are mistakes, at least the idea itself could maybe work.
57
We can refer to (3.22) as the unstable bilinear transform, where the word unstable refers not to the instability of the transform itself, but rather to the fact that it is designed to be applied to unstable lters.25 Notice that at = 0 the unstable bilinear transform turns into an ordinary bilinear transform. The inverse transform is obtained by (s )T (zeT + 1) = zeT 1 2 from where zeT and z = eT 1+ 1 1 (s )T 2 =1+
(s )T 2 (s )T 2
(s )T 2
(3.23)
2 Apparently the inverse unstable bilinear transform (3.23) has a pole at s = + T . In order to avoid hitting that pole by the poles of the lters transfer function (or maybe even generally avoid the real parts of the poles to go past that value) we could e.g. simply let = max {0, Re pn }
In order to construct an integrator dened by (3.22) we rst need to obtain the expression for 1/s from (3.22): 1 = s + =T 1
2 T
zeT 1 zeT +1
=T
That is
(3.24)
A discrete-time structure implementing (3.24) could be e.g. the one in Fig. 3.24. Yet another approach could be to convert the right-hand side of (3.24) to the analog domain by the inverse bilinear transform, construct an analog implementation of the resulting transfer function and apply the trapezoidal integrator replacement to convert back to the digital domain. It is questionable, whether this produces better (or even dierent) results than Fig. 3.24.
25 Apparently, the unstable bilinear transform denes the same relationship between Im s and arg z as the ordinary bilinear transform. Therefore the standard prewarping formula applies.
58
T 2+T
CHAPTER 3. TIME-DISCRETIZATION G'! &% + y"#$ z 1 y
11 eT
y 1 G'! &% "#$o + Figure 3.24: Transposed direct form II-style unstable trapezoidal integrator. 11
1
2T 2+T
x[n]
MMM G qq q
G y [n]
SUMMARY
We have considered three essentially dierent approaches to applying timediscretization to analog lter models: naive, TPT (by trapezoidal integrator replacement), and the classical bilinear transform (using direct forms). The TPT approach combines the best features of the naive implementation and the classical bilinear transform.
Chapter 4
Ladder lter
In this chapter we are going to discuss the most classical analog lter model: the transistor ladder lter. We will also discuss to an extent the diode ladder version.
4.1
The analog transistor ladder lter is an essentially nonlinear structure. However, as the rst approximation (and actually a quite good one) we will use its linearized model (Fig. 4.1). The LP1 blocks denote four identical (same cuto) 1-pole lowpass lters (Fig. 2.2). The k coecient controls the amount of negative feedback, which aects the lter resonance. Typically k 0, although k < 0 is also sometimes used.1 G'! &% " + y #$ G LP1 G LP1 k q qq M MMo Figure 4.1: Transistor ladder lter. Let G LP1 G LP1 G G y (t)
x(t)
1 1+s be the 1-pole lowpass transfer function. Assuming complex exponential x and y we write 4 y = H1 (s) (x ky ) H1 (s) =
1 The reason for the negative (rather than positive) feedback is actually quite intuitively simple. Considering the phase response of four serially connected 1-pole lowpass lters at the cuto: 4 1 1 1 = = 4 1+s (1 + j ) 4 s=j
we notice that the signal phase at the cuto is inverted. Therefore we have to invert it once again in the feedback to achieve the resonance eect.
59
60 from where
(4.1)
At k = 0 the lter behaves as 4 serially connected 1-pole lowpass lters. The poles of the lter are respectively s = 1 + (k )1/4 where the raising to the 1/4th power is understood in the complex sense, therefore giving 4 dierent values: s = 1 + 1 j 1/4 k 2 (k 0)
(this time k 1/4 is understood in the real sense). Therefore, at k = 0 all poles are located at s = 1, as k grows they move apart in 4 straight lines (all going at j 1/4 45 angles). As k grows from 0 to 4 the two of the poles (at s = 1+ 1 k ) 2 are moving towards the imaginary axis, producing a resonance peak in the amplitude response (Fig. 4.2). At k = 4 they hit the imaginary axis: 1j Re 1 + 41/4 2 and the lter becomes unstable. |H (j )|, dB 0 k=0 =0
-6
Figure 4.2: Amplitude response of the ladder lter for various k . In Fig. 4.2 one could notice that, as the resonance increases, the lter gain at low frequencies begins to drop. Indeed, substituting s = 0 into (4.1) we obtain H (0) = 1 1+k
61
4.2
A naive digital implementation of the ladder lter shouldnt pose any problems. We will therefore immediately skip to the TPT approach. Recalling the instantaneous response of a single 1-pole lowpass lter (3.14), we can construct the instantaneous response of a serial connection of four of such lters. Indeed, lets denote the instantaneous responses of the respective 1-poles as fn ( ) = g + sn (obviously, the coecient g is identical for all four, whereas sn depends on the lter state and therefore cannot be assumed identical). Combining two such lters in series we have f2 (f1 ( )) = g (g + s1 ) + s2 = g 2 + gs1 + s2 Adding the third one: f3 (f2 (f1 ( ))) = g (g 2 + gs1 + s2 ) + s3 = g 3 + g 2 s1 + gs2 + s3 and the fourth one: f4 (f3 (f2 (f1 ( )))) = g (g 3 + g 2 s1 + gs2 + s3 ) = = g 4 + g 3 s1 + g 2 s2 + gs3 + s4 = G + S where G = g4 S = g 3 s1 + g 2 s2 + gs3 + s4 Using the obtained instantaneous response G + S of the series of 4 1-poles, we can redraw the ladder lter structure as in Fig. 4.3. G'! &% "#$ + y u[n] G G + S k q qq M MMo Figure 4.3: TPT ladder lter in the instantaneous response form. Rather than solving for y , lets solve for the signal u at the feedback point. From Fig. 4.3 we obtain u = x ky = x k (Gu + S ) from where u= x kS 1 + kG (4.2) G G y [n]
x[n]
We can then use the obtained value of u to process the 1-pole lowpasses one after the other, updating their state, and computing y [n] as the output of the fourth lowpass.
62
Notice that for positive cuto values of the underlying 1-pole lowpasses we have g > 0. Respectively G = g 4 > 0. This means that for k 0 the denominator of (4.2) is always positive and never turns to zero, so we should be safe regarding the instantaneously unstable feedback.2 For k < 0 the situation is however dierent. Since 0 < g < 1 (for c > 0), it follows that 0 < G < 1. Thus 1 + kG > 0 k 1, however at k < 1 we can get into an instantaneously unstable feedback case.
4.3
Feedback shaping
We have observed that at high resonance settings the amplitude gain of the lter at low frequencies drops (Fig. 4.2). An obvious way to x this problem would be e.g. to boost the input signal by the (1 + k ) factor.3 However theres another way to address the same issue. We could kill the resonance at the low frequencies by introducing a highpass lter in the feedback (Fig. 4.4). In the simplest case this could be a 1-pole highpass. G'! &% "#$ + y G LP1 G LP1 k q qq M MMo G LP1 HP o G LP1 G y (t)
x(t)
Figure 4.4: Transistor ladder lter with a highpass in the feedback. The cuto of the highpass lter can be static or vary along with the cuto of the lowpasses. The static version has a nice feature that it kills the resonance eect at low frequencies regardless of the master cuto setting, which may be desirable if the resonance at low frequencies is considered rather unpleasant (Fig. 4.5). In principle one can also use other lter types in the feedback shaping. One has to be careful though, since this changes the positions of the lter poles. Particularly, inserting a lowpass into the feedback can easily destabilize an otherwise stable lter.
4.4
Warning! The multimode functionality of the ladder lter is a rather exotic feature. If youre looking for the bread-and-butter bandpass, highpass, notch etc.
2 Strictly speaking, we should have checked the instantaneous stability using the feedback smoother approach. However typically a positive denominator of the form 1 + g or 1 + kG implies that everything is ne. A quicker way to check for the instantaneous feedback would be to let the sampling rate innitely grow (T 0) and then check, whether the denominator changes its sign along the way. In our case G = g 4 = (c T /2)4 , which means the denominator is always larger than 1 (under the assumption k 0), regardless of T . 3 We boost the input rather than the output signal for the same reason as when preferring to place the cuto gains in front of the integrators.
63
-6
-12
-18 HP /8 HP 8HP
Figure 4.5: Amplitude response of the ladder lter with a staticcuto highpass in the feedback for various lowpass cutos.
lters, you should rst take a look at the multimode 2-pole state-variable lter discussed later in the book. By picking up intermediate signals of the ladder lter as in Fig. 4.6 we obtain the multimode version of this lter. We then can use linear combinations of signals yn to produce various kinds of ltered signal.4 yy0 G'! &% + y"#$ G G LP1 yy1 G G LP1 yy2 G k q qq M MMo Figure 4.6: Multimode ladder lter. Suppose k = 0. Apparently, in this case, the respective transfer functions associated with each of the yn outputs are Hn (s) = If k = 0 then from H4 (s) = 1 k + (1 + s)4 1 (1 + s)n (n = 0, . . . , 4) G LP1 yy3 G G LP1 G y4
4 Actually, instead of y we could have used the input signal x for these linear combinations. 0 However, it doesnt matter. Since y0 = x ky4 , we can express x via y0 or vice versa. Its just that some useful linear combinations have simpler (independent of k) coecients if y0 rather than x is being used.
64
using the obvious relationship Hn+1 (s) = Hn (s)/(s + 1) we obtain Hn (s) = (1 + s)4n k + (1 + s)4
Lets say we want to build a 4th-order highpass lter. Considering that the 4th order lowpass transfer function (under the assumption k = 0) is built as a product of four 1st order lowpass transfer functions 1/(1 + s) HLP (s) = 1 (1 + s)4
we might decide to build the 4th order highpass transfer function as a product of four 1st order highpass transfer functions s/(1 + s): HHP (s) = s4 (1 + s)4
Lets attempt to build HHP (s) as a linear combination of Hn (s). Apparently, a linear combination of Hn (s) must have the denominator k + (1 + s)4 , so lets instead construct s4 HHP (s) = (4.3) k + (1 + s)4 which at k = 0 will turn into s4 /(1 + s)4 : a0 (1 + s)4 + a1 (1 + s)3 + a2 (1 + s)2 + a3 (1 + s) + a4 s4 = = 4 k + (1 + s) k + (1 + s)4 1 a0 s4 + (a1 + 4a0 )s3 + (a2 + 3a1 + 6a0 )s2 + = (1 + s)4 + (a3 + 2a2 + 3a1 + 4a0 )s + (a0 + a1 + a2 + a3 + a4 ) from where a0 = 1 a1 + 4a0 = 0 a2 + 3a1 + 6a0 = 0 a3 + 2a2 + 3a1 + 4a0 = 0 a0 + a1 + a2 + a3 + a4 = 0 from where a0 = 1, a1 = 4, a2 = 6, a3 = 4, a4 = 1. The amplitude response corresponding to (4.3) is plotted in Fig. 4.7.5 A bandpass lter can be built as HBP (s) = s2 k + (1 + s)4 (4.4)
The two zeros at s = 0 will provide for a 12dB/oct rollo at low frequencies and will reduce the 24dB/oct rollo at high frequencies to the same 12dB/oct. Notice that the phase response at the cuto is zero: HBP (j ) = 1 1 = k + (1 + j )4 4k
5 We could have also constructed a true ladder highpass lter by using four 1-pole highpass lters instead of four 1-pole lowpass lters as the fundamental blocks of the ladder lter.
65
-6
-12
-18 c /8 c 8c
Figure 4.7: Amplitude response of the highpass mode of the ladder lter for various k . Finding the coecients is left as an exercise for the reader. The amplitude response corresponding to (4.3) is plotted in Fig. 4.8.6 |H (j )|, dB 0
-6
-12
-18 c /8 c 8c
Figure 4.8: Amplitude response of the bandpass mode of the ladder lter for various k . Further lter types can be built in a similar way.
4.5
At k 4 the ladder lter becomes unstable, as the resonance becomes too strong. We could however prevent the signal level from growing innitely by
could have also constructed a true ladder bandpass lter by using two 2-pole bandpass lters (e.g. SVF bandpasses, or serial combinations of a 1-pole lowpass and a 1-pole highpass at the same cuto) instead of four 1-pole lowpass lters as the fundamental blocks of the ladder lter. Notice that in the bandpass case, we need a positive rather than negative (inverted) feedback, since the phase shift at the bandpass cuto is zero.
6 We
66
putting a saturator into the feedback path. This will allow the lter to go into selfoscillation at k > 4. The best place for such saturator is probably at the feedback point, since then it will process both the input and the feedback signals simultaneously, applying a nice overdrive-like saturation to the input signal. A hyperbolic tangent function should provide a nice saturator (Fig. 4.9). It is transparent at low signal levels, therefore at low signal levels the lter behaves as a linear one. x(t) G'! &% "#$ + y G tanh G 4 LP k q q Mq MMo Figure 4.9: Transistor ladder lter with a saturator at the feedback point. The introduction of the nonlinearity in the feedback path poses no problems for a naive digital model. In the TPT case however this complicates the things quite a bit. Consider Fig. 4.3 redrawn to contain the feedback nonlinearity (Fig. 4.10). x[n] G'! &% + y"#$ u[n] G tanh k q q Mq MMo Figure 4.10: Nonlinear TPT ladder lter in the instantaneous response form. Writing the zero-delay feedback equation we obtain u = x k (G tanh u + s) (4.5) G G + s G G y [n] G G y (t)
Apparently, the equation (4.5) is a transcendental one. It can be solved only using numerical methods. Also, a linear zero-delay feedback equation had only one solution, but how many solutions does (4.5) have? In order to answer the latter question, lets rewrite (4.5) as (x ks) u = kG tanh u (4.6)
If k 0 and G > 0, then the right-hand side of (4.6) is a nonstrictly increasing function of u, while the left-hand side of (4.6) is a strictly decreasing function of u. Thus, (4.6) and respectively (4.5) have a single solution in this case. If k < 0, (4.5) can have multiple solutions (up to three). One could use the smoother paradigm introduced in the instantaneously unstable feedback discussion to nd out the applicable one. It is possible to avoid the need of solving the transcendental equation by using a saturator function which still allows analytic solution. This is particularly the case with second-order curves, such as hyperbolas. E.g. one could consider
67
a saturator function f (x) = x/(1 + |x|) consisting of two hyperbolic segments, which turns (4.6) into u (x ks) u = kG (4.7) 1 + |u| In order to solve (4.7) one rst has to check whether the solution occurs at u > 0 or u < 0, which can be done by simply evaluating the left-hand side of (4.7) at u = 0. Then one can replace |u| by u or u respectively and solve the resulting quadratic equation.7 Yet another approach (which also works for multiple nonlinearities!) is to rst solve the feedback equation for the linear case, and then apply the nonlinearites on top. E.g. we use the structure in Fig. 4.3 to obtain the value of u. However then we pretend that we have found the value of u for Fig. 4.10 (or Fig. 4.9, for that matter) and proceed accordingly, putting u through the hyperbolic tangent shaper and then further through the 1-pole lowpasses. We refer to this approach as the cheap method of applying the nonlinearities to the TPT structures. It is intuitively clear, that the cheap method is more likely to produce wrong results at high cuto values. No matter, which approach we chose to compute nonlinearities, one shouldnt forget that nonlinear shaping introduces overtones (usually an innite amount of those) into the signal, which in turn introduces aliasing. Meaning: the stronger are the nonlinearities in your structure, the more you might need to oversample. If the oversampling is extreme anyway, there might be little dierence in quality between the naive and the TPT approach.8
4.6
The nonlinearity introduced in Fig. 4.9 does a good job and sounds reasonably close to a hardware analog transistor ladder lter, however this is not how the nonlinearities in the hardware ladder lter really work. In order to describe a closer to reality nonlinear model of the ladder lter, we need to start by introducing nonlinearities into the underlying 1-pole lowpass lters (Fig. 4.11). So the equation (2.1) is transformed into
t
y = y (t0 ) +
t0
c tanh x( ) tanh y ( ) dt
7 The same kind of quadratic equation appears for any other second-order curve (hyperbola, ellipse, parabola, including their rotated versions). In solving the quadratic equation one has not only to choose the right one of the two roots of the equation. One also has to choose the right one of the two solution formulas for the quadratic equation Ax2 2Bx + C = 0: B B 2 AC C x= or x= A B B 2 AC
(where the choice of the formula is based on the sign of B and on the choice of the sign, corresponding to choosing between the two solutions). The reason to use these two dierent formulas is that they can become ill-conditioned, resulting in computation precision losses. More details on this approach are discussed in the authors article Computational optimization of nonlinear zero-delay feedback by second-order piecewise approximation. 8 Before making a nal decision, it might be worth asking a few musicians with an ear for analog sound to perform a listening test, whether the dierences between the naive and TPT models of a particular lter are inaudible and uncritical.
tanh o Figure 4.11: A nonlinear 1-pole lowpass element of the ladder lter.
Which eect does this have? Apparently, the presence of the tanh function reduces the absolute value of the dierence tanh x tanh y , if the level of one or both of the signals is suciently high. If both x and y have large values of the same sign, its possible that the dierence tanh x tanh y is close to zero, even though the dierence x y is very large. This means that the lter will update its state slower than in (2.1). Intuitively this feels a little bit like cuto reduction at large signal levels. We can then connect the 1-pole models from Fig. 4.11 into a series of four 1-poles and put a feedback around them, exactly the same way as in Fig. 4.1. Notice that when connecting Fig. 4.11 lters in series, one could use a common tanh module between each of them, thereby optimizing the computation (Fig. 4.12).9 One could further enhance the nonlinear behavior of the ladder lter model by putting another saturator (possibly of a dierent type, or simply dierently scaled) into the feedback path.
4.7
Diode ladder
In the diode ladder lter the serial connection of four 1-pole lowpass lters (implemented by the transistor ladder) is replaced by a more complicated structure of 1-pole lters (implemented by the diode ladder). The equations of the diode ladder itself (without the feedback path) are y 1 = c c y 2 = 2 c y 3 = 2 c y 4 = 2 tanh x tanh(y1 y2 ) tanh(y1 y2 ) tanh(y2 y3 ) tanh(y2 y3 ) tanh(y3 y4 ) tanh(y3 y4 ) tanh y4
which is implemented by the structure in Fig. 4.13 (compare to Fig. 4.12). The diode ladder itself is then built by providing the feedback path around the diode ladder, where the fourth output of the diode ladder is fed back into the diode ladders input (Fig. 4.14).
is an issue which may appear when using simple tanh approximations having a fully horizontal saturation curve. If both the input and the output signals of a 1-pole are having large values of the same sign, the tanh approximations will return two identical values and the dierence tanh x tanh y will be approximated by zero. This might result in the lter getting stuck (the cuto eectively reduced to zero).
9 There
4.7. DIODE LADDER x(t) G tanh G'! &% + y"#$ o "#$ '! &% + y o "#$ '! &% + y o "#$ '! &% + y G tanh o G tanh o G tanh o G tanh o Figure 4.12: Advanced nonlinear transistor ladder (the main feedback path of the ladder lter not shown). G G y4 (t) G G y3 (t) G G y2 (t) G G y1 (t)
69
The linearized form of the diode ladder equations is obtained by assuming tanh , resulting in y 1 = c (x + y2 ) y1 y 2 = c (y1 + y3 )/2 y2 y 3 = c (y2 + y4 )/2 y2 y 4 = c y3 / 2 y4 which apparently is representable as a serial connection of four identical 1-pole lowpass lters (all having the same cuto c ) with some extra gains and feedback paths (Fig. 4.15). These more complicated connections between the lowpasses destroy the frequency response of the ladder in a remarkable form, responsible for the characteristic diode ladder lter sound. Generally, the behavior of the diode ladder lter is less straightforward than the one of the transistor ladder lter. Transfer function In order to obtain the transfer function of the structure in Fig. 4.15 let G(s) = 1 1+s (4.8)
70 x(t) G tanh G'! &% + y"#$ o "#$ '! &% + y o "#$ '! &% + y o "#$ '! &% + y MMM G qq q
1/2
CHAPTER 4. LADDER FILTER G tanh o MMM G qq q G tanh o MMM G qq q G tanh o G tanh o Figure 4.13: Diode ladder. x(t) G'! &% "#$ + y G Diode ladder k q q Mq MMo Figure 4.14: Diode ladder lter. G G y (t)
1/2 1/2
G y1 (t)
G y2 (t)
G y3 (t)
G y4 (t)
G LP1
G y1 (t)
1/2
MM Gq q
G LP1
G y2 (t)
1/2
MM Gq q
G LP1
G y3 (t)
1/2
M GqM q
G LP1
G y4 (t)
x(t)
be the transfer function of the underlying lowpass lters. Let F (s) = 2G(s), or, simplifying the notation, F = 2G. Assuming complex exponential signals est ,
4.7. DIODE LADDER we obtain from Fig. 4.15 y4 = F y3 Respectively y3 = F (y2 + y4 ) Multiplying both sides by F we have F y3 = F 2 (y2 + y4 ) from where, recalling that y4 = F y3 : y4 = F 2 (y2 + y4 ) that is y4 = Further, y2 = F (y1 + y3 ) = F y1 + y4 Multiplying both sides by F 2 /(1 F 2 ): y4 = from where y4 1 and respectively y4 = And nally, y1 = 2F (x + y2 ) Multiplying both sides by F /(1 2F 2 ): y4 = F2 2F 4 1 F2 y2 x + 1 2F 2 F2 1 F2 2 2F 4 2 1F x + 2 F y4 = 1 2F 2 1 2F 2 y4 1 2F 2 from where y4 1 4F 2 + 2F 4 = 2F 4 x and y4 = 2F 4 G4 /8 x = x 1 4F 2 + 2F 4 1 G2 + G4 /8 1 F2 1 2F 2 = 2F 4 1 2F 2 x+ 1 F2 y4 F2 =
3
71
F2 y2 1 F2
F3 F2 y1 + y4 2 1F 1 F2 F2 1 F2 = F3 y1 1 F2
F3 y1 1 2F 2
from where
2F 4 x 1 2F 2
That is, the transfer function (s) of the diode ladder is (s) = G4 /8 G4 /8 G2 + 1 where G(s) = 1 1+s
8c
Figure 4.16: Amplitude response of the diode ladder lter for various k . from where we obtain the transfer function of the entire diode ladder lter as H (s) = 1 + k
The corresponding amplitude response is plotted in Fig. 4.16. Lets nd the positions of the poles of the diode ladder lter. Equating the denominator to zero: 1 + k = 0 we have 1 = k = that is k that is kG4 /8 G2 + 1
G4 /8
G4 G4 = G2 + 1 8 8
1+k 4 G G2 + 1 = 0 8 1 1
1+k 4 1+k 2
Solving for G2 : G =
2
1 =
1k 2 1+k 4
(4.9)
that is
1+k 4 1+k 4
1 1
(s + 1)2 =
1k 2
1k 2
1k 2
73 1
1+k 2 1k 2
1 = 2
= that is
1k 2
(s + 1)2 =
1 1 2 2
1k 2
(4.10)
The equation (4.10) denes the positions of the poles of the diode ladder lter. Apparently, its easily solvable. We would be interested to nd out, at which k does the selfoscillation start. If the poles are to be on the imaginary axis, then s = j . Substituting s = j into (4.10) we get (1 2 ) + 2j = 1 j 2 2 k1 2 (4.11)
1 Equating the real parts of (4.11) we obtain 1 2 = 1 2 and = 2 . Equating 1 the imaginary parts of (4.11) and substituting = we obtain 2
2 1 = 2 2 from where
k1 2
4 = k 1
and, since k R, we have k = 17. That is, given the unit cuto of the underlying one-pole lowpass lters, the selfoscillation starts at k = 17, where the resonance peak is located at = 1/ 2. TPT model Converting Fig. 4.15 to the instantaneous response form we obtain the structure in Fig. 4.17. From Fig. 4.17 we wish to obtain the instantaneous response of the entire diode ladder. Then we could use this response to solve the zero-delay feedback equation for the main feedback loop of Fig. 4.14.
G 2g+s1
G y1
G g+s2
G y2
G g+s3
G y3
G g+s4
G y4
Figure 4.17: Linearized diode ladder in the instantaneous response form. From Fig. 4.17 we have y4 = gy3 + s4 . We can rewrite it as y4 = G4 y3 + S4 where G4 = g, S4 = s4
y3 = g (y2 + y4 ) + s3 = g (y2 + G4 y3 + S4 ) + s3 from where y3 = gy2 + gS4 + s3 = G3 y2 + S3 1 gG4 where G3 = g gS4 + s3 , S3 = 1 gG4 1 gG4
Further, y2 = g (y1 + y3 ) + s2 = g (y1 + G3 y2 + S3 ) + s2 from where y2 = gy1 + gS3 + s2 = G2 y1 + S2 1 gG3 where G2 = g gS3 + s2 , S2 = 1 gG3 1 gG3
And ultimately y1 = 2g (x + y2 ) + s1 = 2g (x + G2 y1 + S2 ) + s1 from where y1 = 2gx + 2gS2 + s1 = G1 x + S1 1 2gG2 yn = Gn yn1 + Sn where G1 = 2g 2gS2 + s1 , S1 = 1 2gG2 1 2gG2
Thus, we have (where y0 = x) from where its easy to obtain the instantaneous response of the entire diode ladder as y4 = G4 y3 + S4 = G4 (G3 y2 + S3 ) + S4 = G4 G3 y2 + (G4 S3 + S4 ) = = G4 G3 (G2 y1 + S2 ) + (G4 S3 + S4 ) = = G4 G3 G2 y1 + (G4 G3 S2 + G4 S3 + S4 ) = = G4 G3 G2 (G1 x + S1 ) + (G4 G3 S2 + G4 S3 + S4 ) = = G4 G3 G2 G1 x + (G4 G3 G2 S1 + G4 G3 S2 + G4 S3 + S4 ) = Gx + S Notice, that we should have checked that we dont have instantaneously unstable feedback problems within the diode ladder. That is, we need to check that all denominators in the expressions for Gn and Sn dont turn to zero or to negative values. Considering that 0 < g < 1 2 and that G4 = g , we have 0 < G4 < and 1 gG4 = 1 g 2 > Respectively 0 < G3 = g/(1 gG4 ) < and 1 gG3 > 1 1/2 2 = 3/4 3 1 2 3 4
1 2 1 2 =1 = 2 3 3 3
75
1/2 3 = 2/3 4
13 3 1 =1 = 24 4 4 and thus all denominators are always positive. Also 2g 1 0 < G1 = < =4 1 2gG2 1/4 1 2gG2 > 1 2 Thus 0 < G = G4 G3 G2 G1 <
1 2 3 4=1 2 3 4 Using the obtained instantaneous response of the entire diode ladder we now can solve the main feedback equation for Fig. 4.14. For a nonlinear diode ladder model we could use the structure in Fig. 4.13. However, it might be too complicated to process. Even the application of the cheap TPT nonlinear processing approach is not fully trivial. One can therefore use simpler nonlinear structures instead, e.g. the one from Fig. 4.9. Also, the other ideas discussed for the transistor ladder can be applied. In regards to the multimode diode ladder lter, notice that the transfer functions corresponding to the yn (t) outputs are dierent from the ones of the transistor ladder, therefore the mixing coecients which worked for the modes of the transistor ladder lter, are not going to work the same for the diode ladder.
SUMMARY
The transistor ladder lter model is constructed by placing a negative feedback around a chain of four identical 1-pole lowpass lters. The feedback amount controls the resonance. A nonlinearity in the feedback path (e.g. at the feedback point) could be used to contain the signal level, so that selfoscillation becomes possible.
76
Chapter 5
2-pole lters
The other classical analog lter model is the 2-pole lter design commonly referred to in the music DSP eld as the state-variable lter (SVF). It can also serve as a generic analog model for building 2-pole lters, similarly to previously discussed 1-pole RC lter model.
5.1
The block diagram of the state-variable lter is shown in Fig. 5.1. The three outputs are the highpass, bandpass and lowpass signals.12 G yHP (t) G'! &% "#$ + y G G G 11
1
2R #$o '! &% " + Figure 5.1: 2-pole multimode state-variable lter. From Fig. 5.1 one can easily obtain the transfer functions for the respective signals. Assume complex exponential signals. Then, assuming unit cuto, yHP = x 2RyBP yLP
1 One can notice that the lter in Fig. 5.1 essentially implements an analog-domain canonical form, similar to the one in Fig. 3.20. Indeed lets substitute in Fig. 3.20 the z 1 elements by s1 elements (integrators) and let a1 = 2R, a2 = 1. Then the gains b0 , b1 and b2 are simply picking up the highpass, bandpass and lowpass signals respectively. 2 As usual, one can apply transposition to obtain a lter with highpass, bandpass and lowpass inputs.
x(t)
77
from where
and HHP (s) = Thus HHP (s) = s2 + 2Rs + 1 s HBP (s) = 2 s + 2Rs + 1 1 HLP (s) = 2 s + 2Rs + 1 s2 yHP 1 = R x 1 + 2s +
1 s2
s2
The respective amplitude responses are plotted in Fig. 5.2, Fig. 5.3 and Fig. 5.4. One could observe that the highpass response is a mirrored version of the lowpass response,3 while the bandpass response is symmetric by itself.4 It can also be easily shown mathematically. The slope rollo speed is apparently 12dB/oct for the low- and highpass, and 6dB/oct for the bandpass. Notice that yLP (t)+2RyBP (t)+ yHP (t) = x(t), that is, the input signal is split into lowpass, bandpass and highpass components. The same can be expressed in the transfer function form: HLP (s) + 2RHBP (s) + HHP (s) = 1 The resonance of the lter is controlled by the R parameter. Contrarily to the ladder lter, where the resonance increases with the feedback amount, in the state-variable lter the bandpass signal feedback serves as a damping means for the resonance. In the absence of the bandpass signal feedback the lter will get unstable. The R parameter therefore may be referred to as the damping parameter. Solving s2 + 2Rs + 1 = 0 we obtain the poles of the lter at s = R R2 1= R R 2 1 R j 1 R 2 if R 1 if 1 R 1
3 As with the 1-pole lters, the highpass transfer function can be obtained from the lowpass transfer function by the s 1/s (LP to HP) substitution. 4 The bandpass response can be obtained from the lowpass response by a so-called LP to BP (lowpass to bandpass) substitution:
1 2R
s+
1 s
79
+6
-6 R=1 -12
-18 c /8 c 8c
Figure 5.2: Amplitude response of a 2-pole lowpass lter. |H (j )|, dB +12 R = 0.1
+6
-6 R=1 -12
-18 c /8 c 8c
Without trying to give a precise denition of the resonance concept, we could say that at R = 1 there is no resonance (there are two real poles at s = 1). As R starts decreasing from 1 towards 0 there appear two mutually conjugate complex poles moving along the unit circle towards the imaginary axis, so the resonance slowly appears. At R = 0 the lter becomes unstable.
80 |H (j )|, dB
R = 0.1 +12
+6
0 R=1
-6
-12
-18 c / 8 c 8c
The amplitude response at the cuto ( = 1) is 1/2R for all three lter types. Except for the bandpass, the point = 1 is not exactly the peak location but its pretty close (the smaller the value of R, the closer is the true peak to = 1). The phase response at the cuto is 90 for lowpass, 0 for bandpass and +90 for highpass.
5.2
Skipping the naive implementation, which the readers should be perfectly capable of creating and analysing themselves by now, we proceed with the discussion of the TPT model. Assuming g + sn instantaneous responses for the two trapezoidal integrators one can redraw Fig. 5.1 to obtain the discrete-time model in Fig. 5.5. Picking yHP as the zero-delay feedback equations unknown5 we obtain from Fig. 5.5: yHP = x 2R(gyHP + s1 ) g (gyHP + s1 ) s2 from where 1 + 2Rg + g 2 yHP = x 2Rs1 gs1 s2 from where yHP = x 2Rs1 gs1 s2 1 + 2Rg + g 2 (5.1)
5 The state-variable lter has two feedback paths sharing a common path segment. In order to obtain a single feedback equation rather than an equation system we should pick a signal on this common path as the unknown variable.
81
x[n]
11 1 2R #$o '! &% " + Figure 5.5: TPT 2-pole multimode state-variable lter in the instantaneous response form. Using yHP we can proceed dening the remaining signals in the structure.6
5.3
By mixing the lowpass, bandpass and highpass outputs one can obtain further lter types. Unit gain bandpass 2Rs + 2Rs + 1 This version of the bandpass lter has a unit gain at the cuto (Fig. 5.6). Notice that the unit gain bandpass signal can be directly picked up at the output of the 2R gain element in Fig. 5.1. HBP1 (s) = 2RHBP (s) = s2 Band-shelving lter By adding/subtracting the unit gain bandpass signal to/from the input signal one obtains the band-shelving lter (Fig. 5.7):7 HBS (s) = 1 + 2RKHBP (s) = 1 + 2RKs s2 + 2Rs + 1
Similarly to the other shelving lter types we can specify the mid-slope requirement |HBS (j )| = 1 + K (for some ), from where we obtain R= 1 2/2 2/2 = 2 1+K 2 1+K
82 |H (j )|, dB R=5 0
R = 0.1 R=1
-6
-12
-18 c / 8 c 8c
Figure 5.6: Amplitude response of a 2-pole unit gain bandpass lter. |H (j )|, dB +12
+6
-6
-12
-18 c / 8 c 8c
Figure 5.7: Amplitude response of a 2-pole band-shelving lter for R = 1 and varying K .
Notch lter At K = 1 the band-shelving lter turns into a notch (or bandstop) lter (Fig. 5.8): HN (s) = 1 2RHBP (s) = s2 + 1 s2 + 2Rs + 1
83
0.5 R=5 0 c /8
R=1
8c
Figure 5.8: Amplitude response of a 2-pole notch lter. The amplitude scale is linear. Allpass lter At K = 2 the band-shelving lter turns into an allpass lter (Fig. 5.9): HAP (s) = 1 4RHBP (s) = s2 2Rs + 1 s2 + 2Rs + 1
Notice how the damping parameter aects the phase response slope. arg H (j ) 0 R = 0.2
R=5 2 c /8
R=1
8c
Peaking lter By subtracting the highpass signal from the lowpass signal (or also vice versa) we obtain the peaking lter (Fig. 5.10): HPK (s) = HLP (s) HHP (s) = 1 s2 s2 + 2Rs + 1
+6
R=1 0
-6
-12
R=5
-18 c / 8 c 8c
5.4
Nonlinear model
In the ladder lter the resonance was created as the result of the feedback. Therefore by limiting the feedback level (by a saturator) we could control the resonance amount and respectively prevent the lter from becoming unstable. The feedback in the SVF has a more complicated structure. Particularly, the bandpass path is responsible for damping the resonance. We could therefore apply an inverse idea: try boosting the bandpass signal in case the signal level becomes too strong. This can be achieved e.g. by placing an inverse hyperbolic tangent nonlinearity into the bandpass feedback path (Fig. 5.11).8 The idea is that at low signal levels tanh1 yBP + (R 1)yBP RyBP at higher signal levels the nonlinearity boosts the bandpass signal. Notice that the bandpass signal should be rather picked up at the output of the nonlinearity than at the yBP point to make sure it has a similar level as the lowpass and highpass. The nonlinear feedback equation can be obtained using yHP as the unknown: yHP = x 2 tanh1 (gyHP + s1 ) 2(R 1)(gyHP + s1 ) g (gyHP + s1 ) s2 Instead of using the hyperbolic tangent function, one could also use a hyperbolic shaper of a similar shape f (x) = x/(1 |x|) to allow analytical solution of the nonlinear zero-delay feedback equation.
8 Since the domain of the inverse hyperbolic tangent is restricted to (1, 1), the cheap TPT nonlinearity application method doesnt work in this case, at least not in a straightforward manner.
5.4. NONLINEAR MODEL G'! &% + y"#$ y yHP G G g + s1 yBP G G g + s2 tanh1 yBP o G'! &% "#$o + "#$o '! &% + 11 2 11 R1
85
yLP
A more straightforward possibility is to introduce saturation into the integrators, or at least into the rst of them. In the TPT approach one could use the direct form I-style integrator (Fig. 3.8) resulting in the integrator in Fig. 5.12. Or one could equivalently use the transposed direct form II-style integrator (Fig. 3.10) resulting in the integrator in Fig. 5.13.
x[n]
c T /2 MMM G q qq
G G z 1
G tanh z 1 o
G y [n]
x[n]
c T /2 MMM G qq q
G tanh
G y [n]
86
5.5
Cascading decomposition
First, recall that a 1-pole multimode lter can be used to implement any 1storder rational transfer function. Similarly, a multimode SVF can be used to implement practically any 2nd-order rational transfer function. Indeed, consider H (s) = b2 s2 + b1 s + b0 s2 + a1 s + a0
where we assume a0 > 0.9 Lets perform the cuto scaling substitution s s a0 , which turns H (s) into something of the form H (s) = b2 s2 + b1 s + b0 s2 + a1 s + 1
(the coecient values are now dierent in the above). Now simply let R = a1 /2 and use b2 , b1 and b0 as the mixing coecients for the highpass, bandpass and lowpass signals. This further allows to implement practically any given stable transfer function by a serial connection of a number of 2-pole (and possibly 1-pole) lters. Indeed, simply factor the numerator and the denominator into 2nd- and possibly 1st-order factors (where the 2nd-order real factors will necessarily appear for complex conjugate pairs of roots and optionally for pairs of real roots). Any pair of 2nd-order factors (one in the numerator, one in the denominator) can be implemented by a 2-pole multimode SVF. Any pair of 1st-order factors can be implemented by a 1-pole multimode. If there are not enough 2nd-order factors in the numerator or denominator, a pair of 1st order factors in the numerator or denominator can be combined into a 2nd-order factor.
SUMMARY
The state-variable lter has the structure shown in Fig. 5.1. Contrarily to the ladder lter, the resonance strength in the SVF is controlled by controlling the damping signal. The multimode outputs have the transfer functions HHP (s) = s2 + 2Rs + 1 s HBP (s) = 2 s + 2Rs + 1 1 HLP (s) = 2 s + 2Rs + 1 s2
9 If a < 0, this means that H (s) has two real poles of dierent signs. If a = 0 then at 0 0 least one of the poles is at s = 0. In either case, this lter is already unstable, which means, if we are practically interested in its implementation, most likely there is a nonlinear analog prototype, and we simply can apply TPT to this prototype to obtain a digital structure. If we insist on using the SVF structure (why would we?), we can also extend it to the canonical form by introducing a gain element into the lowpass feedback path.
Chapter 6
Allpass-based eects
Phasers are essentially ladder lters built around allpass lters instead of lowpass lters. Flangers can be obtained from phasers by an allpass substitution. For these reasons both types belong to the VA lter discussion.
6.1
Phasers
The simplest phaser is built by mixing the unmodied (dry) input signal with an allpass-ltered (wet) signal as in Fig. 6.1, where the allpass lters cuto is typically modulated by an LFO.1 The allpass lter can be rather arbitrary, except than it has to be a dierential lter.2 ydry (t) "#$ G'! &% + 1/2 MMM G qq q
x(t)
G AP
ywet (t)
G y (t)
Figure 6.1: The simplest phaser. At the points where the allpass lters phase response is 180 , the wet and the dry signals will cancel each other, producing a notch. At the points where the allpass lters phase response is 0 the wet and the dry signals will boost each other, producing a peak (Fig. 6.2). The phaser structure in Fig. 6.1 contains no feedback, therefore there is no dierence between naive and TPT digital implementations (except that the underlying allpass lters should be better constructed in a TPT way).
the absence of LFO modulation the structure should be rather referred to as a (multi-) notch lter. 2 Phasers typically use dierential allpass lters or their digital counterparts. If e.g. a delay (which is not a dierential lter, but is an allpass) is used as the allpass, the structure should be rather referred to as a anger.
1 In
87
88 |H (j )| 1
0.5
0 c /8
8c
Figure 6.2: Amplitude response of the simplest phaser from Fig. 6.1 (using four identical 1-pole allpass lters with the same cuto as the allpass core of the phaser). Mixing at arbitrary ratios Instead of mixing at the 50/50 ratio we can mix at any other ratio, where the sum of the dry and wet mixing gains should amount to unity. This will aect the depth of the notches and the height of the peaks. For the phaser in Fig. 6.1 the mixing ratio higher than 50/50 (where the wet signal amount is more than 50%) hardly makes sense. Wet signal inversion By inverting the wet signal, one swaps the peaks and the notches. Notice that the phase response of dierential allpasses at = 0 can be either 0 or 180 , the same holds for the phase response at = +. For that reason the possibility to swap the peaks and the notches might be handy. Notch spacing In the simplest case one uses a series of identical 1-pole allpasses inside a phaser. In order to control the notch spacing in an easy and nice way, one should rather use a series of identical 2-pole allpasses. As mentioned earlier, by changing the resonance amount of the 2-pole allpasses one controls the phase slope of the lters. This aects the spacing of the notches (Fig. 6.3). Feedback We can also introduce feedback into the phaser. Similarly to the case of the ladder lter modes, the dry signal is better picked up after the feedback point (Fig. 6.4) The feedback changes the shape of the peaks and notches (Fig. 6.5). With the introduction of feedback we have a zero-delay feedback loop in the phaser structure. It can be solved using typical TPT means.3
3 Inserting a unit delay in the feedback produces subtle but rather unpleasant artifacts in the phasing response, one should better use the TPT approach here.
89
0 c /8
8c
Figure 6.3: Eect of the allpass resonance on the notch spacing (using two 2-pole allpass lters as the allpass core of the phaser).
x(t)
G AP q qq M MMo k
1/2 MMM G qq q
G y (t)
k = 0.3 |H (j )| 1 k = 0.5
0.5 k=0 0 c /8 c 8c
Figure 6.5: Eect of the feedback amount in Fig. 6.4 on the notch and peak shapes.
90
6.2
Flangers
A delay is a linear time-invariant allpass. It even has a transfer function H (s) = esT where T is the delay time. Obviously |H (s)| = |esT | = 1. However it is not a dierential lter, for that reason the transfer function is not a rational function of s. Digital delay models are typically built using interpolation techniques, the details of which fall outside the scope of this book. Using the allpass substitution principle we can replace the allpass lter chain in a phaser by a delay. This produces a anger. The discussion of the phasers mostly didnt assume any details about the underlying allpass, therefore most of it is applicable to angers. The main dierence with using a delay is that the 0 and 180 phase response points are evenly spaced in the linear frequency scale (Fig. 6.6), whereas the spacing of the same points in responses of dierential allpasses is not that regular. Also, a delays phase response can easily have lots of 0 and 180 points (the larger the delay time is, the more of those points it has within the audible frequency range), while the number of those points in a dierential allpass lters phase response is limited by the lters order. |H (j )| 1
0.5
0 1 2T 3 2T 5 2T 7 2T 9 2T
Figure 6.6: Amplitude response of the simplest anger using the structure from Fig. 6.1. Rather than modulating the delay time linearly by an LFO, one should consider that a lters cuto should be typically modulated in the logarithmic frequency scale (a.k.a. the pitch scale), therefore one in principle should do the same for the delay in a anger. The delays cuto for that purpose can be simply dened as c = 2/T , where T is the delay time.
SUMMARY
A phaser is made of an allpass dierential lter connected in parallel with the dry signal path. This creates notches at the points of 180 phase dierence and peaks at 0 points. The allpass cuto should be modulated by an LFO. Using a delay instead of a dierential allpass creates a anger. Feedback can be used to change the shape of the peaks and notches in the amplitude response.
Index
1-pole lter, 7 2-pole lter, 77 4-pole lter, 59 allpass lter, 22, 83 allpass substitution, 52 amplitude response, 13, 33 bandpass lter, 77, 81 bilinear transform, 38 inverse, 40 topology-preserving, 50 unstable, 57 BLT, 38 BLT integrator, see trapezoidal integrator canonical form, 47 cheap TPT method, 67 complex exponential, 5 complex impedances, 12 complex sinusoid, 1 cuto, 8, 14 parametrization of, 15 damping in SVF, 78 DC oset, 2 delayless feedback, 42 DF1, 47 DF2, 47 dierentiator, 51 diode ladder lter, 68 Dirac delta, 4 direct form, 47 eigenfunction, 9 lter 1-pole, 7 2-pole, 77 4-pole, 59 91 allpass, 22, 83 bandpass, 77, 81 highpass, 17, 62, 77 ladder, 59 lowpass, 7, 59, 77 multimode, 19, 63, 77 notch, 82 peaking, 83 shelving, 20, 81 stable, 18 anger, 90 Fourier integral, 3 Fourier series, 2 Fourier transform, 3 frequency response, 13, 33 gain element, 8 harmonics, 2 Hermitian, 3 highpass lter, 17, 62, 77 instantaneous gain, 44 instantaneous oset, 44 instantaneous response, 44 instantaneously unstable feedback, 55 integrator, 8 BLT, see integrator, trapezoidal naive, 29 trapezoidal, 35 ladder lter, 59 diode, 68 Laplace integral, 5 Laplace transform, 5 linearity, 11 lowpass lter, 7, 14, 59, 77 multimode lter, 19, 63, 77 naive integrator, 29
92 nonstrictly proper, 11 notch lter, 82 partials, 2 peaking lter, 83 phase response, 13, 33 phaser, 87 pole, 18 prewarping, 41 rollo, 14 shelving lter, 20, 81 stable lter, 18 state-variable lter, 77 summator, 8 SVF, 77 time-invariant, 10 topology, 49 topology-preserving transform, 40, 50 TPBLT, 50 TPT, 40, 50 cheap, 67 transfer function, 11, 32 transposition, 24 trapezoidal integrator, 35 unit delay, 30 z -integral, 28 z -transform, 28 zero, 18 zero-delay feedback, 43
INDEX