| 
 .. .
  Introduction to dither ===   FGR. 23 Dither can be applied to a quantizer in one of two ways. In (a) the
  dither is linearly added to the analog input signal, whereas in (b) it’s added
  to the reference voltages of the quantizer.
 ===   At high signal levels, quantizing error is effectively noise. As the audio
  level falls, the quantizing error of an ideal quantizer becomes more strongly
  correlated with the signal and the result is distortion. If the quantizing
  error can be decorrelated from the input in some way, the system can remain
  linear but noisy. Dither performs the job of decorrelation by making the action
  of the quantizer unpredictable and gives the system a noise floor like an analog
  system.   The first documented use of dither was by Roberts in picture coding.  In this system, pseudo-random noise (see Section 3) with rectangular probability
  and a peak-to-peak amplitude of Q was added to the input signal prior to quantizing,
  but was subtracted after reconversion to analog. This is known as subtractive
  dither and was investigated by Schuchman and much later by Sherwood. Subtractive
  dither has the advantages that the dither amplitude is non-critical, the noise
  has full statistical independence from the signal and has the same level as
  the quantizing error in the large signal undithered case.   Unfortunately, it suffers from practical drawbacks, since the original noise
  waveform must accompany the samples or must be synchronously recreated at the
  DAC.  This is virtually impossible in a system where the audio may have been edited
  or where its level has been changed by processing, as the noise needs to remain
  synchronous and be processed in the same way. All practical digital audio systems
  use non-subtractive dither where the dither signal is added prior to quantization
  and no attempt is made to remove it at the DAC.   The introduction of dither prior to a conventional quantizer inevitably causes
  a slight reduction in the signal-to-noise ratio attainable, but this reduction
  is a small price to pay for the elimination of non-linearities. The technique
  of noise shaping in conjunction with dither will be seen to overcome this restriction
  and produce performance in excess of the subtractive dither example above.  The ideal (noiseless) quantizer of FGR. 19 has fixed quantizing intervals
  and must always produce the same quantizing error from the same signal. In
  FGR. 23 it can be seen that an ideal quantizer can be dithered by linearly
  adding a controlled level of noise either to the input signal or to the reference
  voltage which is used to derive the quantizing intervals. There are several
  ways of considering how dither works, all of which are equally valid.  The addition of dither means that successive samples effectively find the
  quantizing intervals in different places on the voltage scale. The quantizing
  error becomes a function of the dither, rather than a predictable function
  of the input signal. The quantizing error is not eliminated, but the subjectively
  unacceptable distortion is converted into a broadband noise which is more benign
  to the ear.  Some alternative ways of looking at dither are shown in FGR. 24.  Consider the situation where a low-level input signal is changing slowly within
  a quantizing interval. Without dither, the same numerical code is output for
  a number of sample periods, and the variations within the interval are lost.
  Dither has the effect of forcing the quantizer to switch between two or more
  states. The higher the voltage of the input signal within a given interval,
  the more probable it becomes that the output code will take on the next higher
  value. The lower the input voltage within the interval, the more probable it’s
  that the output code will take the next lower value. The dither has resulted
  in a form of duty cycle modulation, and the resolution of the system has been
  extended indefinitely instead of being limited by the size of the steps.  Dither can also be understood by considering what it does to the transfer
  function of the quantizer. This is normally a perfect staircase, but in the
  presence of dither it’s smeared horizontally until with a certain amplitude
  the average transfer function becomes straight.  In an extension of the application of dither, Blesser has suggested digitally
  generated dither which is converted to the analog domain and added to the input
  signal prior to quantizing. That same digital dither is then subtracted from
  the digital quantizer output. The effect is that the transfer function of the
  quantizer is smeared diagonally (FGR. 25). The significance of this diagonal
  smearing is that the amplitude of the dither is not critical. However much
  dither is employed, the noise amplitude will remain the same. If dither of
  several quantizing intervals is used, it has the effect of making all the quantizing
  intervals in an imperfect convertor appear to have the same size.  ===   FGR. 24 Wideband dither of the appropriate level linearizes the transfer function
  to produce noise instead of distortion. This can be confirmed by spectral analysis.
  In the voltage domain, dither causes frequent switching between codes and preserves
  resolution in the duty cycle of the switching.
 ===   FGR. 25 In this dither system, the dither added in the analog domain shifts
  the transfer function horizontally, but the same dither is subtracted in the
  digital domain, which shifts the transfer function vertically. The result is
  that the quantizer staircase is smeared diagonally as shown top left. There
  is thus no limit to dither amplitude, and excess dither can be used to improve
  differential linearity of the convertor.
 ===  Requantizing and digital dither The advanced ADC technology which is detailed later in this section allows
  as much as 24-bit resolution to be obtained, with perhaps more in the future.
  The situation then arises that an existing sixteen-bit device such as a digital
  recorder needs to be connected to the output of an ADC with greater wordlength.
  The words need to be shortened in some way.   Section 3 showed that when a sample value is attenuated, the extra low-order
  bits which come into existence below the radix point preserve the resolution
  of the signal and the dither in the least significant bit(s) which linearizes
  the system. The same word extension will occur in any process involving multiplication,
  such as digital filtering. It will subsequently be necessary to shorten the
  wordlength. Clearly the high-order bits cannot be discarded in two's complement
  as this would cause clipping of positive half-cycles and a level shift on negative
  half-cycles due to the loss of the sign bit. Low-order bits must be removed
  instead.  Even if the original conversion was correctly dithered, the random element
  in the low-order bits will now be some way below the end of the intended word.
  If the word is simply truncated by discarding the unwanted low-order bits or
  rounded to the nearest integer the linearizing effect of the original dither
  will be lost.  ===   FGR. 26 Shortening the wordlength of a sample reduces the number of codes
  which can describe the voltage of the waveform. This makes the quantizing steps
  bigger, hence the term requantizing. It can be seen that simple truncation
  or omission of the bits does not give analogous behavior. Rounding is necessary
  to give the same result as if the larger steps had been used in the original
  conversion.
 ===   FGR. 27 In a simple digital dithering system, two's complement values from
  a random number generator are added to low-order bits of the input. The dithered
  values are then rounded up or down according to the value of the bits to be
  removed. The dither linearizes the requantizing.
 ===  Shortening the wordlength of a sample reduces the number of quantizing intervals
  available without changing the signal amplitude. As FGR. 26 shows, the quantizing
  intervals become larger and the original signal is requantized with the new
  interval structure. This will introduce requantizing distortion having the
  same characteristics as quantizing distortion in an ADC. It then is obvious
  that when shortening the wordlength of a twenty-bit convertor to sixteen bits,
  the four low order bits must be removed in a way that displays the same overall
  quantizing structure as if the original convertor had been only of sixteen
  bit wordlength. It will be seen from FGR. 26 that truncation cannot be used
  because it does not meet the above requirement but results in signal dependent
  offsets because it always rounds in the same direction. Proper numerical rounding
  is essential in audio applications. Rounding in two's complement is a little
  more complex than in pure binary as can be seen in Section 3.   Requantizing by numerical rounding accurately simulates analog quantizing
  to the new interval size. Unfortunately the twenty-bit convertor will have
  a dither amplitude appropriate to quantizing intervals one sixteenth the size
  of a sixteen-bit unit and the result will be highly non-linear.  In practice, the wordlength of samples must be shortened in such a way that
  the requantizing error is converted to noise rather than distortion.  One technique which meets this requirement is to use digital dithering23 prior
  to rounding. This is directly equivalent to the analog dithering in an ADC.
  It will be shown later in this section that in more complex systems noise shaping
  can be used in requantizing just as well as it can in quantizing.  Digital dither is a pseudo-random sequence of numbers. If it’s required to
  simulate the analog dither signal of FGR. 23 and 24, then it’s obvious that
  the noise must be bipolar so that it can have an average voltage of zero. Two's
  complement coding must be used for the dither values as it’s for the audio
  samples.  FGR. 27 shows a simple digital dithering system (i.e. one without noise shaping)
  for shortening sample wordlength. The output of a two's complement pseudo-random
  sequence generator (see Section 3) of appropriate wordlength is added to input
  samples prior to rounding. The most significant of the bits to be discarded
  is examined in order to determine whether the bits to be removed sum to more
  or less than half a quantizing interval. The dithered sample is either rounded
  down, i.e. the unwanted bits are simply discarded, or rounded up, i.e. the
  unwanted bits are discarded but one is added to the value of the new short
  word.  The rounding process is no longer deterministic because of the added dither
  which provides a linearizing random component.  If this process is compared with that of FGR. 23 it will be seen that the
  principles of analog and digital dither are identical; the processes simply
  take place in different domains using two's complement numbers which are rounded
  or voltages which are quantized as appropriate. In fact quantization of an
  analog dithered waveform is identical to the hypothetical case of rounding
  after bipolar digital dither where the number of bits to be removed is infinite,
  and remains identical for practical purposes when as few as eight bits are
  to be removed. Analog dither may actually be generated from bipolar digital
  dither (which is no more than random numbers with certain properties) using
  a DAC.   Dither techniques The intention here is to treat the processes of analog and digital dither
  as identical except where differences need to be noted. The characteristics
  of the noise used are rather important for optimal performance, although many
  sub-optimal but nevertheless effective systems are in use. The main parameters
  of interest are the peak-to-peak amplitude, the amplitude probability distribution
  function (pdf) and the spectral content.  The most comprehensive ongoing study of non-subtractive dither has been that
  of Vanderkooy and Lipshitz. and the treatment here is based largely upon their
  work.    FGR. 28 (a) Use of rectangular probability dither can linearize, but noise
  modulation (b) results. Triangular pdf dither (c) linearizes, but noise modulation
  is eliminated as at (d). Gaussian dither (e) can also be used, almost eliminating
  noise modulation at (f).
 Rectangular pdf dither   Section 3 showed that the simplest form of dither (and therefore the easiest
  to generate) is a single sequence of random numbers which have uniform or rectangular
  probability. The amplitude of the dither is critical.  FGR. 28(a) shows the time-averaged transfer function of one quantizing interval
  in the presence of various amplitudes of rectangular dither.  The linearity is perfect at an amplitude of 1Q peak-to-peak and then deteriorates
  for larger or smaller amplitudes. The same will be true of all levels which
  are an integer multiple of Q. Thus there is no freedom in the choice of amplitude.  With the use of such dither, the quantizing noise is not constant. FGR. 28(b)
  shows that when the analog input is exactly centered in a quantizing interval
  (such that there is no quantizing error) the dither has no effect and the output
  code is steady. There is no switching between codes and thus no noise. On the
  other hand, when the analog input is exactly at a riser or boundary between
  intervals, there is the greatest switching between codes and the greatest noise
  is produced. Mathematically speaking, the first moment, or mean error is zero
  but the second moment, which in this case is equal to the variance, is not
  constant. From an engineering standpoint, the system is linear but suffers
  noise modulation: the noise floor rises and falls with the signal content and
  this is audible in the presence of low-frequency signals.  The dither adds an average noise amplitude of Q/___ 12 rms to the quantizing
  noise of the same level. In order to find the resultant noise level it’s necessary
  to add the powers as the signals are uncorrelated. The total power is given
  by:  [...] and the rms voltage is Q/__ 6. Another way of looking at the situation
  is to consider that the noise power doubles and so the rms noise voltage has
  increased by 3 dB in comparison with the undithered case. Thus for an n-bit
  wordlength, using the same derivation as expression (1) above, the signal to
  noise ratio for Q pk-pk rectangular dither will be given by: [...]  Unlike the undithered case, this is a true signal-to-noise ratio and linearity
  is maintained at all signal levels. By way of example, for a sixteen-bit system
  95.1 dB SNR is achieved. The 3 dB loss compared to the undithered case is a
  small price to pay for linearity.  Triangular pdf dither  The noise modulation due to the use of rectangular-probability dither is
  undesirable. It comes about because the process is too simple. The undithered
  quantizing error is signal dependent and the dither represents a single uniform-probability
  random process. This is only capable of decorrelating the quantizing error
  to the extent that its mean value is zero, rendering the system linear. The
  signal dependence is not eliminated, but is displaced to the next statistical
  moment. This is the variance and the result is noise modulation. If a further
  uniform probability random process is introduced into the system, the signal
  dependence is displaced to the next moment and the second moment or variance
  becomes constant.  Adding together two statistically independent rectangular probability functions
  produces a triangular probability function. A signal having this characteristic
  can be used as the dither source.  FGR. 28(c) shows the averaged transfer function for a number of dither amplitudes.
  Linearity is reached with a peak-to-peak amplitude of 2Q and at this level
  there is no noise modulation. The lack of noise modulation is another way of
  stating that the noise is constant. The triangular pdf of the dither matches
  the triangular shape of the quantizing error function.  The dither adds two noise signals with an amplitude of Q/___ 12 rms to the
  quantizing noise of the same level. In order to find the resultant noise level
  it’s necessary to add the powers as the signals are uncorrelated. The total
  power is given by:  ... and the rms voltage is Q/__ 4. Another way of looking at the situation
  is to consider that the noise power is increased by 50 percent in comparison
  to the rectangular dithered case and so the rms noise voltage has increased
  by 1.76 dB. Thus for an n-bit wordlength, using the same derivation as expressions
  (1) and (2) above, the signal to noise ratio for Q peak-to-peak rectangular
  dither will be given by:   Continuing the use of a sixteen-bit example, a SNR of 93.3 dB is available
  which is 4.8 dB worse than the SNR of an undithered quantizer in the large-signal
  case. It’s a small price to pay for perfect linearity and an unchanging noise
  floor.  Gaussian pdf dither  Adding more uniform probability sources to the dither makes the overall probability
  function progressively more like the Gaussian distribution of analog noise.
  FGR. 28(d) shows the averaged transfer function of a quantizer with various
  levels of Gaussian dither applied. Linearity is reached with 1/2Q rms and at
  this level noise modulation is negligible. The total noise power is given by:  ...and so the noise level will be Q__ 3 rms. The noise level of an undithered
  quantizer in the large signal case is Q___ 12 and so the noise is higher by
  a factor of:   Thus the SNR is given by 6.02(n - 1) + 1.76 dB. A sixteen-bit system with
  correct Gaussian dither has a SNR of 92.1 dB.  This is inferior to the figure in expression (3) by 1.1 dB. In digital dither
  applications, triangular probability dither of 2Q peak-to-peak is optimum because
  it gives the best possible combination of nil distortion, freedom from noise
  modulation and SNR. Using dither with more than two rectangular processes added
  is detrimental. Whilst this result is also true for analog dither, it’s not
  practicable to apply it to a real ADC as all real analog signals contain thermal
  noise which is Gaussian. If triangular dither is used on a signal containing
  Gaussian noise, the results derived above are not obtained. ADCs should therefore
  use Gaussian dither of Q/2 rms and the performance will be given by expression
  (4).  It should be stressed that all the results in this section are for conventional
  quantizing and requantizing. The use of techniques such as oversampling and/or
  noise shaping require an elaboration of the theory in order to give meaningful
  SNR figures.   FGR. 29 The components of a conventional convertor. A jitter-free clock drives
the voltage conversion, whose output may be re-sampled prior to reconstruction
 ===  |