Académique Documents
Professionnel Documents
Culture Documents
This article compares the two ways in which sound is 1 Overview of dierences
recorded and stored. Actual sound waves consist of continuous variations in air pressure. Representations of It is a subject of debate whether analog audio is supethese signals can be recorded using either digital or analog rior to digital audio or vice versa. The question is highly
techniques.
dependent on the quality of the systems (analog or digAn analog recording is one where a property or charac- ital) under review, and other factors which are not necteristic of a physical recording medium is made to vary essarily related to sound quality. Arguments for analog
in a manner analogous to the variations in air pressure systems include the absence of fundamental error mechof the original sound. Generally, the air pressure varia- anisms which are present in digital audio systems, includ[2]
tions are rst converted (by a transducer such as a mi- ing aliasing, quantization noise, and the absolute limcrophone) into an electrical analog signal in which ei- itation of dynamic range. Advocates of digital point to
ther the instantaneous voltage or current is directly pro- the high levels of performance possible with digital auportional to the instantaneous air pressure (or is a func- dio, including excellent linearity in the audible band and
[3]
tion of the pressure). The variations of the electrical sig- low levels of noise and distortion.
nal in turn are converted to variations in the recording Accurate, high quality sound reproduction is possible
medium by a recording machine such as a tape recorder with both analog and digital systems. Excellent, expenor record cutterthe variable property of the medium sive analog systems may outperform digital systems, and
is modulated by the signal. Examples of properties that vice versa; in theory any system of either type may be
are modied are the magnetization of magnetic tape surpassed by a better, more elaborate and costly system
or the deviation (or displacement) of the groove of a of the other type, but in general it tends to be less exgramophone disc from a smooth, at spiral track.
pensive to achieve any given standard of technical signal
quality with a digital system, except when the standard is
very low. One of the most limiting aspects of analog technology is the sensitivity of analog media to minor physical degradation; however, when the degradation is more
pronounced, analog systems usually perform better, often
still producing recognizable sound, while digital systems
will usually fail completely, unable to play back anything
from the medium (see digital cli). The principal advantages that digital systems have are a very uniform source
delity, inexpensive media duplication, and direct use of
the digital 'signal' in todays popular portable storage and
playback devices. Analog recordings by comparison require comparatively bulky, high-quality playback equipment to capture the signal from the media as accurately
as digital.
A digital recording is produced by converting the physical properties of the original sound into a sequence of
numbers, which can then be stored and read back for reproduction. Normally, the sound is transduced (as by a
microphone) to an analog signal in the same way as for
analog recording, and then the analog signal is digitized,
or converted to a digital signal, through an analog-todigital converter and then recorded onto a digital storage
medium such as a compact disc or hard disk.
Two prominent dierences in functionality are the
bandwidth and the signal-to-noise ratio (S/N); however,
both digital and analog systems have inherent strengths
and weaknesses. The bandwidth of the digital system is
determined, according to the Nyquist frequency, by the
sample rate used. The bandwidth of an analog system is
dependent on the physical capabilities of the analog circuits. The S/N of a digital system is rst limited by the
bit depth of the digitization process, but the electronic
implementation of the digital audio circuit introduces additional noise. In an analog system, other natural analog
noise sources exist, such as icker noise and imperfections in the recording medium. Some functions of the
two systems are also naturally exclusive to either one or
the other, such as the ability for more transparent ltering
algorithms[1] in digital systems and the harmonic saturation of analog systems.
1.2
Duplication
3.3
Aliasing
2.4
3
indicated by providing information on the level of the response relative to a reference frequency. For example, a
system component may have a response given as 20 Hz
to 20 kHz +/- 3 dB relative to 1 kHz. Some analog tape
manufacturers specify frequency responses up to 20 kHz,
but these measurements may have been made at lower signal levels.[4] Compact cassettes may have a response extending up to 15 kHz at full (0 dB) recording level (Stark
1989). At lower levels usually 10 dB, cassettes typically
rolls-o at around 20 kHz for most machines, due to the
nature of the tape media caused by self-erasure (which
worsens the linearity of the response).
The frequency response for a conventional LP player
might be 20 Hz - 20 kHz +/- 3 dB. Unlike the audio
CD, vinyl records and cassettes do not require a cut-o
in response above 20 kHz. The low frequency response
of vinyl records is restricted by rumble noise (described
above). The high frequency response of vinyl depends on
the cartridge. CD4 records contained frequencies up to
50 kHz, while some high-end turntable cartridges have
frequency responses of 120 kHz while having at frequency response over the audible band (e.g. 20 Hz to
15 kHz +/0.3 dB).[5] In addition, frequencies of up to
122 kHz have been experimentally cut on LP records.[6]
3
3.1
Frequency response
Digital mechanisms
3.3 Aliasing
3.2
Analog mechanisms
4 DIGITAL ERRORS
Hawksford (1991:18) highlighted the advantages of digital converters that oversample. Using an oversampling 4 Digital errors
design and a modulation scheme called sigma-delta modulation (SDM), analog anti-aliasing lters can eectively
4.1 Quantization
be replaced by a digital lter.[8] This approach has several advantages. The digital lter can be made to have a
near-ideal transfer function, with low in-band ripple, and
15
no aging or thermal drift.
14
3.4
CD quality audio is sampled at 44.1 kHz (Nyquist frequency = 22.05 kHz) and at 16 bits. Sampling the waveform at higher frequencies and allowing for a greater
number of bits per sample allows noise and distortion to
be reduced further. DAT can sample audio at up to 48
kHz, while DVD-Audio can be 96 or 192 kHz and up to
24 bits resolution. With any of these sampling rates, signal information is captured above what is generally considered to be the human hearing range.
Work done in 1981 by Muraoka et al.[9] showed that music signals with frequency components above 20 kHz were
only distinguished from those without by a few of the
176 test subjects (Kaoru & Shogo 2001). Later papers,
however, by a number of dierent authors, have led to
a greater discussion of the value of recording frequencies above 20 kHz. Such research led some to the belief
that capturing these ultrasonic sounds could have some
audible benet. Audible dierences were reported between recordings with and without ultrasonic responses.
Dunn (1998) examined the performance of digital converters to see if these dierences in performance could be
explained.[10] He did this by examining the band-limiting
lters used in converters and looking for the artifacts they
introduce.
13
12
11
10
9
8
7
6
5
4
3
2
1
0
A signal is recorded digitally by an analog-to-digital converter, which measures the amplitude of an analog signal at regular intervals, which are specied by the sample
rate, and then stores these sampled numbers in computer
hardware. The fundamental problem with numbers on
computers is that the range of values that can be represented is nite, which means that during sampling, the
amplitude of the audio signal must be rounded. This process is called quantization, and these small errors in the
measurements are manifested aurally as a form of low
level distortion.
4.3
Jitter
The range of possible values that can be represented numerically by a sample is dened by the number of binary
digits used. This is called the resolution, and is usually
referred to as the bit depth in the context of PCM audio. The quantization noise level is directly determined
by this number, decreasing exponentially as the resolution increases (or linearly in dB units), and with an adequate number of true bits of quantization, random noise
from other sources will dominate and completely mask
the quantization noise.
4.2
Dither as a solution
4.3 Jitter
The accuracy of a digital system is dependent on the sampled amplitude values, but it is also dependent on the
temporal regularity of these values. This temporal dependency is inherent to digital recording and playback
and has no analog equivalent, though analog systems have
their own temporal distortion eects (pitch error and
wow-and-utter).
It is possible to make quantization noise more audibly benign by applying dither. To do this, a noise-like signal is
added to the original signal before quantization. Dither
makes the digital system behave as if it has an analog
noise-oor. Optimal use of dither (triangular probability
density function dither in PCM systems) has the eect of
making the rms quantization error independent of signal
level (Dunn 2003:143), and allows signal information to
be retained below the least signicant bit of the digital
system (Stuart n.d.:3).
Dither algorithms also commonly have an option to employ some kind of noise shaping, which pushes the fre-
5
present study would really represent the limit
of auditory resolution or it would be limited
by resolution of equipment. Distortions due to
very small jitter may be smaller than distortions
due to non-linear characteristics of loudspeakers. Ashihara and Kiryu [8] evaluated linearity
of loudspeaker and headphones. According to
their observation, headphones seem to be more
preferable to produce sucient sound pressure
at the ear drums with smaller distortions than
loudspeakers.' [13]
DYNAMIC RANGE
5.2 Resolution
Dynamic range
6.2
Digital lters
other than a strict change in frequency response, this colThe benets of using digital recorders with greater than oration can sometimes have a positive eect on the per16 bit accuracy can be applied to the 16 bits of audio ception of the sound of the audio signal.
CD. Stuart (n.d.:3) stresses that with the correct dither,
the resolution of a digital system is theoretically innite,
6.2 Digital lters
and that it is possible, for example, to resolve sounds at
110 dB (below digital full-scale) in a well-designed 16
Digital lters can be made to objectively perform better
bit channel.
than analog components,[1][15] because the variables involved can be precisely specied in the calculations.
Signal processing
After initial recording, it is common for the audio signal to be altered in some way, such as with the use of
compression, equalization, delays and reverb. With analog, this comes in the form of outboard hardware components, and with digital, the same is accomplished with
A practical advantage of digital processing is the more
plug-ins that are utilized in the users DAW.
convenient recall of settings. Plug-in parameters can be
A comparison of analog and digital ltering shows tech- stored on the computer hard disk, whereas parameter denical advantages to both methods, and there are several tails on an analog unit must be written down or otherwise
points that are relevant to the recording process.
recorded if the unit needs to be reused. This can be cumbersome when entire mixes must be recalled manually using an analog console and outboard gear. When working
6.1 Analog hardware
digitally, all parameters can simply be stored in a DAW
project le and recalled instantly. Most modern professional DAWs also process plug-ins in real time, which
means that processing can be largely non-destructive until
nal mix-down.
Phase shift: the sinusoidal wave in red has been delayed in time
equal to the angle , shown as the sinusoidal wave in blue.
Sound quality
Subjective evaluation
SOUND QUALITY
clude whether the component has a 'bright' or 'dull' sound, precise amplitude for every sample cycle, a DSD recorder
or how well the component manages to present a 'spatial uses a technique called sigma-delta modulation. Using
image'.
this technique, the audio data is stored as a sequence of
Another type of subjective test is done under more con- xed amplitude (i.e. 1- bit) values at a sample rate of
trolled conditions and attempts to remove possible bias 2.884 MHz, which is 64 times the 44.1 kHz sample rate
from listening tests. These sorts of tests are done with the used by CD. At any point in time, the amplitude of the
component hidden from the listener, and are called blind original analog signal is represented by the relative pretests. To prevent possible bias from the person running ponderance of 1s over 0s in the data stream. This digital
data stream can therefore be converted to analog by the
the test, the blind test may be done so that this person is
also unaware of the component under test. This type of simple expedient of passing it through a relatively benign
analog low-pass lter. The competing DVD-Audio fortest is called a double-blind test. This sort of test is often
used to evaluate the performance of digital audio codecs. mat uses standard, linear PCM at variable sampling rates
and bit depths, which at the very least match and usually
There are critics of double-blind tests who see them as greatly surpass those of a standard CD Audio (16 bits,
not allowing the listener to feel fully relaxed when evalu- 44.1 kHz).
ating the system component, and can therefore not judge
dierences between dierent components as well as in In the popular Hi-Fi press, it had been suggested that
sighted (non-blind) tests. Those who employ the double- linear PCM creates [a] stress reaction in people, and
blind testing method may try to reduce listener stress by that DSD is the only digital recording system that does
allowing a certain amount of time for listener training not [...] have these eects (Hawksford 2001). This
claim appears to originate from a 1980 article by Dr John
(Borwick et al. 1994:481-488).
Diamond entitled Human Stress Provoked by Digitalized
Recordings.[17] The core of the claim that PCM (the only
digital recording technique available at the time) record7.2 Early digital recordings
ings created a stress reaction rested on tests carried out
Early digital audio machines had disappointing results, using the pseudoscientic technique of applied kinesiolConwith digital converters introducing errors that the ear ogy, for example by Dr Diamond at an AES 66th
[18]
vention
(1980)
presentation
with
the
same
title.
Diacould detect (Watkinson 1994). Record companies remond
had
previously
used
a
similar
technique
to
demonleased their rst LPs based on digital audio masters in the
late 1970s. CDs became available in the early 1980s. At strate that rock music (as opposed to classical) was bad
due to the presence of the stopped
this time analog sound reproduction was a mature tech- for your health[19]
anapestic
beat.
Dr Diamonds claims regarding dignology.
ital audio were taken up by Mark Levinson, who asserted
There was a mixed critical response to early digital that while PCM recordings resulted in a stress reaction,
recordings released on CD. Compared to vinyl record, DSD recordings did not.[20][21][22] A double-blind subit was noticed that CD was far more revealing of the jective test between high resolution linear PCM (DVDacoustics and ambient background noise of the recording Audio) and DSD did not reveal a statistically signicant
environment (Greeneld et al. 1986). For this reason, dierence.[23] Listeners involved in this test noted their
recording techniques developed for analog disc, e.g., mi- great diculty in hearing any dierence between the two
crophone placement, needed to be adapted to suit the new formats.
digital format (Greeneld et al. 1986).
Some analog recordings were remastered for digital formats. Analog recordings made in natural concert hall
acoustics tended to benet from remastering (Greeneld
et al. 1990). The remastering process was occasionally criticised for being poorly handled. When the original analog recording was fairly bright, remastering sometimes resulted in an unnatural treble emphasis (Greeneld
et al. 1990).
7.3
9
know. He also pioneered the unsuccessful Digital Compact Cassette and conducted the rst recording ever to
be commercially released on CD: Richard Strausss Eine
Alpensinfonie.
Complicating the discussion is that recording professionals often mix and match analog and digital techniques in
the process of producing a recording. Analog signals can
be subjected to digital signal processing or eects, and
inversely digital signals are converted back to analog in
equipment that can include analog steps such as vacuum
tube amplication.
For modern recordings, the controversy between analog
recording and digital recording is becoming moot. No
matter what format the user uses, the recording probably was digital at several stages in its life. In case of
video recordings it is moot for one other reason; whether
the format is analog or digital, digital signal processing is
likely to have been used in some stages of its life, such as
digital timebase correction on playback.
An additional complication arises when discussing human perception when comparing analog and digital audio
in that the human ear itself, is an analog-digital hybrid.
The human hearing mechanism begins with the tympanic
membrane transferring vibrational motion through the
middle-ears mechanical systemthree bones (malleus,
incus and stapes)into the cochlea where hair-like nerve
cells convert the vibrational motion stimulus into nerve
impulses. Auditory nerve impulses are discrete signalling
events which cause synapses to release neurotransmitters
to communicate to other neurons (see here.) The all-ornone quality of the impulse can lead to a misconception
that neural signalling is somehow 'digital' in nature, but
in fact the timing and rate of these signalling events is not
clocked or quantised in any way. Thus the transformation
of the acoustic wave is not a process of sampling, in the
sense of the word as it applies to digital audio. Instead it
is a transformation from one analog domain to another,
and this transformation is further processed by the neurons to which the signalling is connected. The brain then
processes the incoming information and perceptually reconstructs the original analog input to the ear canal.
10
13 BIBLIOGRAPHY
11
Notes
12
References
13 Bibliography
Ashihara, K. et al. (2005). Detection threshold for
distortions due to jitter on digital audio, Acoustical
Science and Technology, Vol. 26 (2005), No. 1 pp.
5054.
Blech, D. & Yang, M. (2004). Perceptual Discrimination of Digital Coding Formats, Audio Engineering Society Convention Paper 6086, May 2004.
Croll, M. (1970). Pulse Code Modulation for High
Quality Sound Distribution: Quantizing Distortion
at Very Low Signal Levels, Research Department
Report No. 1970/18, BBC.
Dunn, J. (1998). The benets of 96 kHz sampling
rate formats for those who cannot hear above 20
kHz, Preprint 4734, presented at the 104th AES
Convention, May 1998.
11
Dunn, J. (2003). Measurement Techniques for
Digital Audio, Audio Precision Application Note #5,
Audio Precision, Inc. USA. Retrieved March 9,
2008.
Ely, S. (1978). Idle-channel noise in p.c.m. soundsignal systems. BBC Research Department, Engineering Division.
Nishiguchi, T. et al. (2004). Perceptual Discrimination between Musical Sounds with and without
Very High Frequency Components, NHK Laboratories Note No. 486, NHK (Japan Broadcasting Corporation).
Pozzoli, G. DIGITabilis: crash course on digital audio interfaces. Part 1.4 - Enemy Interception. Effects of Jitter in Audio, TNT-Audio - online HiFi
review, 2005.
Pohlmann, K. (2005). Principles of Digital Audio
5th edn, McGraw-Hill Comp.
Rathmell, J. et al. (1997). TDFD-based Measurement of Analog-to-Digital Converter Nonlinearity,
Journal of the Audio Engineering Society, Volume
45, Number 10, pp. 832840; October 1997.
Rumsey, F. & Watkinson, J. (1995). The Digital
Interface Handbook, 2nd edition. Sections 2.5 and
6. Pages 37 and 154-160. Focal Press.
Stark, C. (1989). Encyclopdia Britannica, 15th
edition, Volume 27, Macropaedia article 'Sound',
section: 'High-delity concepts and systems, page
625.
Kaoru, A. & Shogo, K. (2001). Detection threshold for tones above 22 kHz, Audio Engineering
Society Convention Paper 5401. Presented at the
110th Convention, 2001.
Watkinson J. (1994). An Introduction to Digital Audio. Section 1.2 'What is digital audio?', page 3;
Section 2.1 'What can we hear?', page 26. Focal
Press. ISBN 0-240-51378-9.
14 External links
Skeptoid #303: Are Vinyl Recordings Better than
Digital? at Skeptoid
12
15
15
15.1
15.2
Images
File:Ambox_important.svg Source: https://upload.wikimedia.org/wikipedia/commons/b/b4/Ambox_important.svg License: Public domain Contributors: Own work, based o of Image:Ambox scales.svg Original artist: Dsmurat (talk contribs)
File:Dithering_example_dithered_16color.png Source: https://upload.wikimedia.org/wikipedia/commons/c/c3/Dithering_example_
dithered_16color.png License: CC-BY-SA-3.0 Contributors: ? Original artist: ?
File:Edit-clear.svg Source: https://upload.wikimedia.org/wikipedia/en/f/f2/Edit-clear.svg License: Public domain Contributors: The
Tango! Desktop Project. Original artist:
The people from the Tango! project. And according to the meta-data in the le, specically: Andreas Nilsson, and Jakub Steiner (although
minimally).
File:Pcm.svg Source: https://upload.wikimedia.org/wikipedia/commons/b/bf/Pcm.svg License: CC-BY-SA-3.0 Contributors: ? Original
artist: ?
File:Phase_shift.svg Source: https://upload.wikimedia.org/wikipedia/commons/5/55/Phase_shift.svg License: CC BY-SA 3.0 Contributors: Own work Original artist: Peppergrower
File:Question_book-new.svg Source: https://upload.wikimedia.org/wikipedia/en/9/99/Question_book-new.svg License: Cc-by-sa-3.0
Contributors:
Created from scratch in Adobe Illustrator. Based on Image:Question book.png created by User:Equazcion Original artist:
Tkgd2007
15.3
Content license