Digital Audio - Technical Writing Class Paper - OCR Reformat
-
Upload
paul-teich -
Category
Technology
-
view
317 -
download
0
Transcript of Digital Audio - Technical Writing Class Paper - OCR Reformat
AN ANALYSIS OF THE
ADVANTAGES, DISADVANTAGES, AND STANDARDIZATION
OF
DIGITAL AUDIO
for
Professor Eugene O. Young
Technical writing Instructor
Texas A&M University
College Station, Texas
by
Paul Robert Teich
Student
Computing Science
April 26, 1982
i
LETTER OF TRANSMITTAL
406 C Manuel Dr.
College Station, Texas 77840
April 26, 1982
Dr. Eugene O. Young
217-D Academic and Agency Building
Texas A&M University
College Station, Texas 77843
Dear Dr. Young:
I have completed the major report for English 301 and am submitting it for
grading. The title of my report is: “An Analysis of the Advantages,
Disadvantages, and Standardization of Digital Audio.” I am a computing science
major, and my hobby is high fidelity sound reproduction. The research I did for
this paper laid the groundwork for my research project this summer, which will be
on replicating human speech inflections and accent with a digital speech
synthesizer.
The field of digital audio is very young, even when compared with the age
of digital computer technology. I believe, though, that in the next decade,
digital audio will take over a major portion of the consumer audio market. This
report is written to audiophiles who have heard about digital audio products, but
have had no exposure to literature on them. There are no books devoted to digital
audio and most articles are very technical, requiring quite a bit of knowledge
about digital computers. To get a fairly extensive knowledge of digital audio
requires days of looking for sources and reading. Some of the information was not
directly available at all, but was the result of written inquiries to various
manufacturers. I have not yet found a more comprehensive report on digital audio
than mine.
Thank you for your advice on using note cards and on revision. This is the
largest paper I have ever completed, and I am going to include it, along with my
summer research on my resume. If you have any questions about my report, please
call me at 693-5560.
Sincerely,
Paul R. Teich
ii
Table of Contents
LETTER OF TRANSMITTAL ...................................................... i
LIST OF ILLUSTRATIONS ..................................................... iv
LIST OF TABLES ............................................................ iv
ABSTRACT .................................................................. v
INTRODUCTION............................................................... 1
THE DIGITAL AUDIO PROCESS .................................................. 2
Digital Audio Recording .................................................. 2
Filtering of analog input signal ....................................... 2
Analog-to-digital conversion of signal ................................. 3
Digital Audio Playback ................................................... 6
Pickup of digital information from storage media........................ 6
Digital-to-analog conversion of binary code ............................ 6
Filtering of output signal ............................................. 7
Summary .................................................................. 7
DISADVANTAGES OF THE DIGITAL AUDIO PROCESS ................................. 7
Aliasing Distortion ...................................................... 8
Quantization Noise ...................................................... 10
ADVANTAGES OF THE DIGITAL AUDIO PROCESS ................................... 13
Specifications .......................................................... 13
Frequency response .................................................... 13
Channel separation .................................................... 13
Total harmonic distortion ............................................. 14
Wow and flutter ....................................................... 14
Signal-to-noise ratio ................................................. 14
Dynamic range ......................................................... 14
Error Detection ......................................................... 16
Parity ................................................................ 16
Cyclic redundancy check code (CREE) ................................... 17
Error Concealment ....................................................... 17
Error Correction ........................................................ 18
Other Advantages ........................................................ 20
iii
PROBLEMS WITH STANDARDIZING THE CONSUMER DIGITAL AUDIO FIELD .............. 21
Recording Formats ....................................................... 21
Sampling rate ......................................................... 22
Quantizing bits ....................................................... 24
Non-audio information ................................................. 26
Total format .......................................................... 28
Storage/Playback Systems ................................................ 28
Tape .................................................................. 29
Disc .................................................................. 29
SELECTED DIGITAL AUDIO SYSTEMS ............................................ 32
Technics by Matsushita .................................................. 32
JVC ..................................................................... 33
Sony .................................................................... 34
PCM-F1 ................................................................ 34
DAD-1X ................................................................ 34
FUTURE OF DIGITAL AUDIO ................................................... 35
Digital Audio Discs ..................................................... 35
Advantages ............................................................ 35
Disadvantages ......................................................... 37
Support ............................................................... 37
Digital Audio Today ..................................................... 38
CONCLUSION................................................................ 39
APPENDIX ................................................................. 41
GLOSSARY OF TERMINOLOGY ................................................. 41
BIBLIOGRAPHY ............................................................ 43
iv
LIST OF ILLUSTRATIONS
Figure 1 Typical analog music input ........................................ 3
Figure 2 Music signal being sampled ........................................ 4
Figure 3 Square waveform of sampled voltage levels ......................... 5
Figure 4 Aliasing Distortion ............................................... 8
Figure 5 High Frequency Sampling ........................................... 9
Figure 6 Dynamic range of magnetic recording tape ......................... 15
Figure 7 Sony Crossword code error correction method ...................... 19
Figure 8 Interfacing between 16-bit and 14-bit machines ................... 25
Figure 9 The 3M format .................................................... 28
Figure 10 Laser scanning .................................................. 31
Figure 11 Block diagram of a digital tape machine ......................... 36
LIST OF TABLES
Table 1 A Sampling of the Current Digital Audio Market .................. 11
Table 2 Groups of Manufacturers with Compatible Formats ................. 21
Table 3 Preferred Sampling Rates ....................................... 23
Table 4 Number of Quantizing Bits Used in Existing and Proposed Systems . 26
Table 5 Error Correction Methods ....................................... 27
Table 6 Storage/Playback Systems: Comparison of Characteristics ......... 32
v
ABSTRACT
This report presents the audiophile with a comprehensive view of the
digital audio field, from a description of how digital audio works to current
controversies in the field. The process of digital audio is described at a level
that requires no prior understanding of digital computers. The advantages and
disadvantages of the process are discussed, along with possible solutions and
alternatives. The problem of standardization within the digital audio field is
examined, and possible solutions are suggested. Finally, the future of digital
audio and its possible repercussions in the audio field are presented.
1
INTRODUCTION
The “Computer Revolution,” started in the mid-1960s, has been gradually
edging into the audio field for years. It started with digital FM tuning
circuits and electronically controlled turntable motors, which used quartz phase-
locked loop (PLL) circuits. In the last three or four years, microprocessor
control has been an integral part of tape decks, pre-amplifiers, and such; and it
recently has been added to equalizers and remote control devices. However, the
nature of recorded sound had not changed. Microprocessors could help get the best
possible analog recordings, but could not improve on the actual method of
recording.
Analog recording, since its invention by Thomas Edison, translates music to
an electronic equivalent. The amplitude of a voltage or current is made
proportional to the loudness of the music, and the frequency matches that of the
music on a one-to-one ratio (4:225). Digital recording takes an analog music
signal and converts it into a computer readable code. The advantage of the
digital process is that the music can be measured for computer coding with much
greater precision than the magnetic field strength of a tape can be measured
(1:132). This means that the computer code can be recorded on tape or disk; and,
when it is played back, the music is not dependent on the physical
characteristics of the tape for its reproduction.
This report will present a detailed description of the digital audio
process, as well as its advantages and disadvantages. The problem of
standardization in the field of consumer digital audio equipment will be
discussed, and some sample systems will be examined. This report assumes that
the reader is well acquainted with most aspects of high fidelity sound systems,
including specifications, measurements and technical terminology. Some working
definitions will be included in the Appendix at the end of the report.
2
Most of the Sources for this paper are from the Journal of the Audio
Engineering Society and Stereo Review Magazine, due to the fact that digital
audio is a very young field with no books published as yet. The rest of the
references are from other trade journals, informational brochures, and a passing
reference to digital audio as a future development in a recent book. Few sources
are over five pages long.
THE DIGITAL AUDIO PROCESS
Digital audio can be divided into two separate processes: digital recording
and playback. The process of digital recording consists of: (1) Filtering of the
analog input signal, (2) analog-to-digital conversion of the signal, and (3)
transfer of the digital information to the recording media. The steps involved
in digital playback are: (1) Pickup of the digital information from the recording
media, (2) digital-to-analog conversion of the binary code, and (3) filtering of
the output signal.
Digital Audio Recording
Filtering of analog input signal. First, an analog input signal, usually
from a pre-amplifier (8:560), is routed through a very sharp-cutoff, high-
frequency-cut filter. This filter, called an anti-aliasing filter, removes all
frequencies above the highest frequency to be reproduced (20:64) (see Figure 1).
In most systems, this frequency is 20 kHz.
3
Figure 1 (a) Typical analog music input from a pre-amplifier, containing very
high frequency noise (20:63).
(b) Music signal after anti-alias filtering (20:64).
Analog-to-digital conversion of signal. The voltage of the filtered analog
signal is next measured at very small, constant intervals of time. The frequency
of this sampling is determined from a mathematical model, which states that the
signal must be sampled at least at twice the highest frequency to be reproduced.
Common sampling rates are between 44 kHz and 5l kHz (23:17). To measure the
voltage of the signal at each time interval, a "sample-and-hold” circuit is used.
This circuit samples the voltage of the signal at the beginning of a time
interval, and then holds its own voltage equal to the sampled value for the
4
duration of the interval (see Figure 2). It repeats this process for each
interval to be measured (20:64).
Figure 2 Music signal being sampled (20:64).
The output of the sample-and-hold circuit is a series of voltage levels in
the form of a square wave (see Figure 3a). The voltage levels are now
individually compared to a set of reference voltage levels and they are then
translated, using an analog-to-digital (A/D) converter, to binary numbers
(34:65). Binary is a number system that uses combinations of ones and zeroes,
called "bits,” to represent numbers. The output of the converter is a string of
bits, called “binary code”, which is an accurate representation of the sample-
and-hold circuit output (see Figure 3c). The process of changing the samples
from analog to binary form is known as "quantization" (2324).
5
Figure 3 (a) Square waveform of sampled voltage levels (20:64).
(b) Waveform after being rounded off to the reference levels (20:65).
(c) Binary code generated by analog-to-digital converter.
The last step in the analog-to-digital conversion sequence is the encoding
of the binary code. This procedure inserts timing and error correction
information, in binary form, into the code. It also can merge two separate
channels for stereo. The output of the encoding step is all of the digital
information necessary for the reconstruction of the analog input signal (1:132).
The final process is the actual recording of the digital information onto
tape. This is done by translating the bits of the binary code into
6
electromagnetic pulses, which are recorded on tape in the same way an analog
signal would be recorded. The tape may now be copied, transferred onto master
audio discs, or transferred to other storage media.
Digital Audio Playback
Pickup of digital information from storage media. First, the digital
information is translated from storage media back into the string of digital
information needed to reconstruct the original analog signal (20:65).
Digital-to-analog conversion of binary code. The first process that the
digital information encounters is the removal of the timing and error correction
information. The error correction information is used to restore the parts of
the binary code that were not picked up properly, due to flaws in the storage
media or pickup mechanism. The timing information is used to control how fast
the digital information is being picked up from the storage media. As the error
correcting is finished, the binary code is put into a waiting area, called a
buffer, for temporary storage. The amount of material the buffer can hold is
limited, however, so the timing information is used to control the pickup
mechanism. For example, if digital information is being picked up and filling
the buffer too fast or too slowly, the timing information corrects the speed
deviation (20:65).
As the binary code is put into the back of the buffer, the codes for the
individual voltage levels are taken from the front. Then, they are removed at a
rate equal to the original sampling frequency which is used in the recording
process. As they are removed, they are translated, using a digital-to-analog
(D/A) converter, back into their corresponding voltage levels. The output of the
digital-to-analog conversion is a series of voltages in the form of a square
waveform. This replicates the output of the sample-and-hold circuit, with the
7
small differences due to the rounding off process during quantization of the
original waveform (20:65).
Filtering of output signal. The final stage of this process is the
changing of the square waveform into a constantly varying analog signal. This is
accomplished by an "output-smoothing" filter, which, as the anti-aliasing filter
did, removes all frequencies above the highest frequency to be reproduced. The
effect is the smoothing of the square waveform into an almost exact replica of
the input to the digital audio recording process (20:65).
Summary
A music signal to be recorded first encounters the anti-aliasing filter.
Next, the signal is sampled, and the samples are rounded off. Then, the samples
are quantized, encoded and, finally, transferred to the recording media. To
restore the music to analog form, the digital information is picked up from the
recording. It is then decoded and sent to the digital-to-analog converter. The
signal from the D/A converter is sent through an output-smoothing filter, which
results in a replica of the original music signal.
DISADVANTAGES OF THE DIGITAL AUDIO PROCESS
There are three disadvantages associated with the process of digital audio.
Aliasing distortion and quantizing noise are the two main problems. Both result
in distortion in a digital audio system, and neither of these forms of distortion
are found in analog systems. While aliasing distortion is a correctable problem,
quantizing noise is rooted in the theoretical capabilities of digital audio
(20:65-6). The third disadvantage is the cost of a digital audio system. This is
a temporary condition, however, as technological advances are already lowering
the cost of digital audio equipment (14:57).
8
Aliasing Distortion
As described earlier, there is a mathematical relationship which states
that the sampling frequency of the music must be at least twice the highest
frequency to be sampled (see Figure 4a). This assures that both the positive and
the negative halves of even the highest frequency are sampled (see Figure 4b)
(16:57).
Figure 4 (a) Comparison of properly filtered system with improperly filtered
system (3:511).
(b) 20 kHz since wave (solid line) being sampled (dotted line) at 44
kHz (16:59).
9
If a frequency above half the sampling rate is sampled, it results in an
incorrect voltage for that sample. During playback, the incorrect voltage will
show up as a lower frequency distortion (see Figure 5). This distortion is
called "aliasing" distortion, because the low frequency signal generated is a
false representation, or "alias,” of the high frequency input (20:64). Aliasing
distortion products are not harmonically related to the music, and so they are
very noticeable (3:510). The effects of aliasing distortion also include the
downgrading of the signal-to-noise ratio of the recording (23:5).
Figure 5 (a) High frequency is sampled at low point.
(b) High frequency is sampled at high point.
(c) Filtered curve is sampled.
To minimize aliasing distortion, the sampling rate must be at least twice
the highest frequency sought. Equally as important is the filtering of the input
to remove any unsought higher frequencies. This is the purpose of the anti-
aliasing filter used in the digital recording process.
10
Quantization Noise
During the quantizing step of digital recording, the sampled voltage levels
are rounded off to the nearest reference level. The reference levels are a part
of the analog-to-digital converter, and are used in the same way that the
markings on a ruler are used to approximate a value that could have very many
decimal places (3 inches compared to 3.002 inches) (20:65). "There is always a
slight error between the original signal level and the quantized value.” (23:5).
These errors are heard as noise, and are called “quantizing noise.”
To minimize quantizing noise, more closely spaced reference levels must be
used. However, this introduces two other problems. First, the finer the spacing
between reference levels, the larger the binary number needed to represent the
levels; therefore, more space is used to record the number. The more space each
number takes up, the less music fits onto a tape or disc. The second problem is
that, as the spacing gets closer, it becomes harder for the analog-to-digital
converter to tell the difference between adjacent levels (20:65). This problem
is rooted in the present limits of integrated electronic circuitry technology.
To combat these problems, it is necessary to compromise accuracy for space and
electronic considerations. How much compromise depends on the whims of the
digital audio system designers. However, when better analog-to-digital
converters and faster, cheaper, computer memory become available, there will be
no need for a tradeoff.
There are two very audible subdivisions of quantizing noise. The first is
called “granulation noise.” As the signal gets weaker (near the bottom of the
system's dynamic range), the rounding off process alters a larger percent of the
input signal. This can add a harsh, gritty distortion to the music signal. "The
subjective effect of this noise is much more unpleasant than ordinary tape hiss.”
11
To correct granulation noise, many digital recorders add a low level hiss to the
music. This changes the granulation noise to a more benign form of hiss (16:58).
The other problem is that when a signal is recorded that is larger than the
highest reference level, it is rounded to the highest reference level. The
analog recording equivalent of this is clipping, but the effect is much more
noticeable in digital audio. The solution to digital clipping is, while
recording the music, to set the recording level a few decibels lower, although
this also decreases the dynamic range of the recording (16:58).
Cost
From the viewpoint of the audio consumer, the most serious disadvantage of
digital audio is the cost. Digital system components are very expensive, though,
as pointed out in The Complete Guide to Stereo Equipment, some digital components
are “less expensive than some of the esoteric models some folks favor” (4:184).
Table l gives a random sampling of the current digital audio market.
Table 1 (30, 33, 43) A Sampling of the Current Digital Audio Market
Model
Type
Cost
Sony PCM-F1 Portable VCR audio adaptor $2,500
Hitachi PCMV100 Audio/video recorder $1,900
Sanyo VCR audio adaptor $2,900
Sharp VCR audio adaptor $2,950
Technics SVP100 Audio recorder $3,000
Sony PCM10 VCR audio adaptor $3,500
Toshiba VCR audio adaptor $3,750
JVC
VCR audio adaptor $7,500
12
There are several reasons why digital audio reproduction is so expensive,
all are by-products of digital computer technology. First, though not an
equipment cost, is the price of video cassette recorder (VCR) tape. VCR tape is
needed, because the frequency of the electronic pulse representing the binary
code is typically measured in megahertz (MHz). In the case of the Sony PCM-10,
the sampling frequency (44.056 kHz) is multiplied by 14 bits per sample and by
two channels (for stereo reproduction) to obtain 1.234 MHz with error correction
and synchronization, the resulting frequency may be as high as 2.643 MHz.
Standard audio cassette tapes cannot record such high frequencies, which is why
VCR tape is used (23:l0). Even when ordered from a warehouse supplier, VCR tape
prices range from $12.00 to above $17.00 per tape (12:2).
The reasons why digital audio equipment, itself, is so expensive deal with
current computer and integrated circuitry technology. To reduce distortion
caused by the elements in this circuitry (as happens in all audio systems), high-
speed, high-precision circuit elements are needed (23:5). In digital audio
systems, the sample-and-hold, analog-to-digital, synchronizing, and error
correction circuitry are all implemented using integrated circuitry (see
Appendix).
Integrated circuits of the complexity needed for audio applications are
very expensive to produce. Only the larger audio electronics corporations, such
as Sony, have the resources needed to design and produce their own integrated
circuitry (14:57). Other electronics companies, outside of the audio field, have
produced chips to sell to audio manufacturers. An example of an integrated
circuit produced for audio applications is the Burr-Brown Corporation's PCM75
analog-to-digital converter. There are several models, ranging from $198 to $249,
depending on the speed and distortion of each model (19:1).
13
ADVANTAGES OF THE DIGITAL AUDIO PROCESS
There are many more advantages to the digital audio process than
disadvantages. The specifications of a digital system's ability to reproduce
sound are much better than analog system specifications. Digital systems make
use of signal error detection, concealment and correction, while analog systems
do not. In addition, copying digital recordings does not degrade the sound.
There is no print-through on digital tapes; and on audio discs (analogous to
records), any wear that may occur will not affect the music.
Specifications
The specifications used to describe how well a digital audio system
reproduces music are the same as used in analog audio. The major categories are:
(1) frequency response, (2) channel separation, (3) total harmonic distortion,
(4) wow and flutter, (5) signal-to-noise ratio, and (6) dynamic range.
Frequency response. The frequency response of digital systems is limited
only in the high-frequency response. This is because of the anti-aliasing filter.
However, if the proper filter and sampling rate are used, the high-frequency
response can be extended beyond the range of human hearing. The frequency
response of a digital recording is also exceptionally flat, displaying none of
the dips, peaks and roll offs associated with analog readings. It is not
uncommon for digital recordings to display a frequency response of 0 - 20 kHz ± 1
dB (17:623).
Channel separation. When recording music digitally, the two (stereo)
channels are sampled separately. After they are converted to binary, they are
merged, giving two values (for left and right channels) for each time interval,
and then are encoded. Upon playback, the binary code is decoded, and the two
14
channels are separated and sent to separate buffers (10:65). This process
insures that the two channels do not intermix, and it gives digital audio a
channel separation specification equal to its dynamic range. For most digital
audio systems, this is equal to or greater than 90 dB (17:623). Crosstalk is
also eliminated.
Total harmonic distortion. In a digital system, harmonic distortions are
produced "only by the electronic circuits for coding and decoding” (28:520).
Given the state of modern digital technology, in most digital systems, the total
harmonic distortion can be kept equal to or below 0.05% (17:623).
Wow and flutter. During the digital-to-analog translation of digital
playback, the individual voltage levels are released from the buffer at
essentially unvarying time intervals. A crystal oscillator timing mechanism,
similar to those used in electronic watches and phase-locked turntables,
implements this (8:66). Wow and flutter is virtually eliminated from digital
systems by this process. If it does occur in greater than “unmeasurably small”
quantities, something is drastically wrong (20:65).
Signal-to-noise ratio. The signal-to-noise ratio (S/N ratio) of a system is
basically attributable to the mechanical limitations of the recording media. The
noise level in a digital recording, likened to tape hiss and surface noise in
analog recordings, is at much lower level and consists mainly of granulation
noise. There are no mechanical sources of noise in a digital system; so, for a
typical digital system, the S/N ratio should be equal to or greater than 90 dB
(again, approximating the value of the dynamic range).
Dynamic range. The most noticeable difference in the sound quality of
digital systems is in the dynamic range captured by digital recordings. In
analog recordings, very soft sounds are often buried below the noise level of the
storage media (tape hiss or record surface noise), while very loud sounds cannot
15
be recorded faithfully. Loud sounds may saturate the tape or cause the wiggles
in record grooves to overrun adjacent grooves (10:63). As a result, music to be
stored on records or tape must be "compressed”. Compression makes soft sounds
louder, and loud sounds softer. While this enables the entire musical selection
to fit onto the tape or record, "It detracts from the lifelike quality of musical
reproduction during playback” (4:225). The music on records and tapes usually
has a dynamic range of under 65 dB, compared to orchestra music, which has a
dynamic range of 85 to 95 dB (10:63) (see Figure 6).
Figure 6 Dynamic range of magnetic recording tape is approximately 60 dB, while
orchestra music has a range of 85 to 95 dB (10:63).
Because the electromagnetic pulses representing the binary code on the
magnetic tape (or the dots on the audio discs) are just a representation of the
sound, the level that they are recorded at is not important. On tape, a level
well above the residual tape hiss and well below the saturation level can be
chosen. The level of the pulses remains constant and, thus, eliminates the
disadvantages of recording analog music on tape (24:10).
16
The accuracy of digital audio techniques has been summed up by E. Brad
Meyer, of Point One Audio: "Digital sound will contain none of the frequency-
response errors, distortion, noise, flutter, and so on that are inherent in even
the best analog recording and reproducing systems" (16:56).
Error Detection
Even in the best digital systems, there will be mechanical causes for
dropouts or errors in reading the recorded music during playback (2315). These
situations, such as a scratch in a digital disc or a tape dropout, can be
disastrous in a digital system. With some types of encoding, a dropout causes a
burst of noise at the maximum possible output level on the machine. Clearly,
some form of error detection, common in computer software packages, is needed.
In fact, computer data error detection is entirely applicable to digital audio
(16:58). There are numerous error detection systems; however, only two shall be
mentioned here.
Parity. As stated earlier, binary code is a string of ones and zeroes
(bits). The binary code can be divided into groups of bits, which are called
"words,” of arbitrary length. Each word represents the level of one sampling of
the sound. All ones in a word then are counted. If the total is odd, a one is
inserted into the binary code immediately after the last bit in the word; if the
total is even, a zero is inserted in the same position. This last number is
called the parity bit. When the playback device reads the binary code from tape
or disc, it re-counts the ones in each word, comparing the results (odd or even)
to the parity bit shown. If these do not match, then an error has been detected.
Unfortunately, the parity procedure does not detect an even number of
errors within the same word, because such an error leaves the count equal to the
parity bit (even + even = even; odd + even = odd). This gives parity a 50%
17
chance of detecting an error. The single-parity-bit method is, at best, a
rudimentary means of error detection (2316).
Cyclic redundancy check code (CREE). CRCC is a method used by Sony
Corporation to provide error detection in their digital systems. It can use any
number of parity bits for each word. If the number of parity bits is "n”, then
the probability of an error being detected is 1 minus 2-n
. If "n" is 16 (as in
the Sony PCM-1600), the probability of error detection is 1 minus 2-16
= .999985
(99.9985%). This is an almost perfect error detection capability. (This
equation also describes one bit parity 1 minus 2-1
=1 minus .5 = .5). (23:6).
Error detection is only the first step in managing errors in digital
systems. Once an error has been detected, something must be done about it.
Error Concealment
To prevent the errors from affecting the quality of the sound, various
methods of concealing the error exist. There are three typical methods of error
concealment:
1. Muting: The word with the error is set to zero (muted) (23:7).
18
2. Previous word holding: The value of the word before the error is
substituted for the error (23:7).
3. Linear interpolation: The average of the two words immediately before
and after the error is substituted for the error. This is the most
accurate method of error concealment, but it assumes that the words on
either side of the error are not themselves erroneous (23:7).
Though error concealment is a great improvement over playing music with
errors in it, it is possible to go even further in retaining the quality of the
music.
Error Correction
“An error correction code not only detects the code errors but also
perfectly corrects them.” To do this, it is necessary to find out where and how
the errors have occurred. The code required to do this is very complex. To
explain how it works, a simplified description of Sony's Crossword Code is
examined in Figure 7 (23:7).
19
Figure 7 This is a simplified description of the Sony Crossword code error
correction method (23:7). To correct code errors, it is essential to
find out where and how they have occurred. This is done as follows:
(a) Data values are given for four samples of music.
(b) Numbers are inserted which, when added, total 20 (an arbitrarily
chosen number) both horizontally and vertically in each
row.
(c) The tape, itself, on which is recorded the original data values,
plus the additional numbers from each horizontal and vertical
column, respectively.
(d) Playback data values. (Note that10 is shown, rather than the 12
originally assigned in the first position).
(e) Both the top horizontal row and the left vertical row add up to
only 18, rather than the 20 originally specified. Therefore, l0
must be an incorrect value.
(f) The correct word is reproduced for playback.
NOTE: All words and correction codes are expressed in ordinary decimal
figures instead of binary codes to facilitate their
understanding.
20
Another Sony method, the cross-interleave rectification code (CIRC) can
fully correct code errors of up to 4000 bits (for a length of 2.5 mm on a digital
audio disc (27:66). This is remarkable, for a scratch that large on a
conventional record would not only destroy the musical passage, it would make the
record unplayable.
Other Advantages
There are several more advantages to digital audio; however, all of them
are by-products of the digital audio technology behind the specifications and
error-resolving techniques. Some of these are:
1. The elimination of print-through on the recordings. This is related to
the code being tape-recorded at any chosen level above the tape hiss
and below the saturation level of the tape (28:520).
2. The possibility of adding any number of alternate channels. Each
channel would still be totally separate from the others. This
suggests the possibility of bringing back quadrophonic sound or of
recording different instruments or vocals on different channels. A
user could turn the vocals off and sing along with no disturbance.
Similarly, a musician could blank out his particular instrument on the
recording and play along (29:512). This also raises the possibility
of recording comments, engineering notes (as to equalization,
microphone and speaker placement, etc.), and other non-music
information on a separate channel (29:512).
3. The possibility of mass producing tapes and audio discs with
definitely superior musical reproduction (as compared to records and
tapes currently on the market) (4:184) for a much lower cost than
today's recordings (16:56).
21
4. ". . . copies can be made from one digital machine to another without
the loss in quality that occurs with conventional recorders . . . an
exact duplicate of the signal that emerged from the recording console
at the original session" (16:58). This is because the binary code is
being recorded -- not the music.
PROBLEMS WITH STANDARDIZING THE CONSUMER DIGITAL AUDIO FIELD
Recording Formats
The standardization of recording formats is the most important issue
deciding the future of digital audio. Without standardization, the equipment
produced by one manufacturer will not directly be able to play the recordings
produced by another. The consumer would have to buy a format converter to do so;
and these, currently comprised mostly of integrated circuitry, would cost
hundreds of dollars. Digital audio is already a costly addition to a sound
system. Having to buy a converter for each format to be played would convince
most consumers that digital audio is not worth the cost. Table 2 shows which
manufacturers have equipment compatible with other manufacturers.
Table 2 (27) Groups of Manufacturers with Compatible Formats
1 2 3 4
U.S. Pioneer Philips Compact JVC Mitsubishi
Magnavox Built MCA Sony Compact
Philips
Sony Long Play
22
The recording format is a combination of sampling rate, quantization, and
non-audio information (error correction and synchronization). The only standards
set by industry for digital audio are from the electronics industry association
of Japan (EIAJ). The EIAJ standard STC-007, labeled "Consumer Use PCM Encoder-
Decoder”, sets three standards. First, it sets the number of bits used to
represent one sampling (the number of “quantizing bits”) to 14 bits. Second, it
sets the sampling rate to 44,056 Hz. Third, though it does not define what type
of error detection is required or what the decoding stage of the playback device
should do if it encounters an uncorrectable error, the standard requires that
some form of error correction be used (21:20). Though these are the first
industry standards in digital audio, they apply only to consumer equipment (not
professional equipment), and they have little influence over U.S. or other
overseas manufacturers. Without standardization of recording formats, it is
difficult (at best) to transfer a recording from one format to another (28:520).
Each part of the digital format will be examined here to provide the reader with
an idea of the scope of the standardization problem.
Sampling rate. The sampling rate has been the most widely differing factor
of the digital industry. Because most high-fidelity applications require at
least a frequency of 20 kHz, most of the sampling frequencies are above 40 kHz.
However, because the European Broadcast Union (EBU) and satellite communications
networks operate at 32 kHz, some manufacturers are considering this or some other
compatible frequency (17:621). Table 3 shows a list of audio industry leaders
and their preferred sampling rates.
23
Table 3 (3, 9, 17) Preferred Sampling Rates
Organization
Sample
Rate
Remarks
Ampex 50 kHz Absolute synchronization with
standard video film frame rates
Audio Engineering
Society (AES)
50.4 kHz High quality studio work
44.1 kHz Other studio applications
32 kHz Broadcasting applications
European Broadcasting
Union (EBU) 48 kHz Easy conversion to 32 kHz
SMPTE Video Study
Group 60 kHz
Integer samples of audio data for
frame
JVC 47.25 kHz Audio High Density (AHD) Systems
Sony/Philips 44.1 kHz Compact Audio Disc (CAD) Systems
Telefunken 48.0 kHz Mini Disk (MD) Systems
The hardest part of converting music from one digital format to another is
taking "x" samples per second and converting them to "y" samples per second. “If
system A and system B are to be directly interfaced digitally, they must both use
the same sampling rates" (3:509). If they don't have the same sampling rates,
then there must be some simple ratio between the two rates, so a digital rate
changer can be used. However, if there is no simple ratio between the rates,
then the recording must be converted back to analog, and then sampled at the new
rate (3:510). In addition, "if the conversion is from a high sampling frequency
to a lower one, a digital anti-aliasing filter must be used to take care of this
new, lower, sampling rate.” That is, the higher frequencies that can be recorded
with a higher sampling rate may be above the anti-aliasing filter's cutoff for
the lower sampling rate, so the filtering must be done digitally (17:620). Every
24
time a switch is made from one sampling frequency to another, it is accompanied
by a 4 to 6 dB loss in the S/N ratio (17:620).
While differences in sampling rates are having little impact on the
consumer digital recording market, they will have a profound effect on the pre-
recorded music market, creating a need for expensive translating devices (since
all of their circuitry is integrated). While the STC-007 sampling standard is
needed, it is not compatible with other rates. A sampling rate of 50 kHz would
satisfy all high fidelity needs, as well as mesh with standard video film frame
rates.
Quantizing bits. Digressing for a moment, a bit can be either a zero or a
one, as explained previously. Inside a computer, bits form numbers of base two
(binary). In the human number system of base l0 (decimal), each digit has the
possibility of being one of ten numbers (0 through 9). A decimal number of "n"
digits can represent any one of l0n numbers. (If "n" equals 3, then 1000 (103
)
can be represented by three digits -- 0 through 999). In binary, each digit has
two possibilities, so a binary number of "n" bits could represent any one of Zn
numbers. For a binary number of 14 bits, there would be 16,384 possible values;
and for 16 bits, there would be 65,536 possible values. If a group of 14 to 16
bits is taken as one sample of a music signal, called a word, then the possible
values of the binary number represented by the word could be equated to a set of
reference voltage levels. This is the analog-to-digital converter's task in a
digital recording system and is known as "linear quantization”.
The number of bits in a word determines the dynamic range of a digital
recording. The dynamic range equals roughly six times the number of bits, giving
a theoretical upper limit of about 84 dB dynamic range for 14-bit words and 96 dB
dynamic range for 16-bit words (10:64). Other kinds of quantization are
available, but none are as easy to implement. A type of quantization called
25
“floating point quantization" can be used to make words smaller, but it decreases
a system's S/N ratio (32508). All types of quantization use binary.
To convert from a 14-bit word to a 16-bit word, two zeroes are added to the
end of the 14-bit number. They are in the least significant places, because that
is where the difference in accuracy is. To convert from 16 bits to 14, the last
two bits are ignored, because they affect the music the least (see Figure 8)
(3:509). No dynamic range is gained from converting 14 bits to 16; however, some
dynamic range is lost converting from 16 to 14 bits.
Figure 8 Interfacing between 16-bit and 14-bit machines (7:62).
Top: Missing bits seen as zeroes.
Bottom: Two extra bits ignored.
26
The STC-007 standard calls for 14 bit linear encoding to be used in all
digital audio consumer use applications (21:20). However, most of the systems on
the market now and planned for the immediate future, use the more accurate 16-bit
words. The word size recommended by Sony and Studer is 16 bits (9:624); and the
Audio Engineering Society recently recommended 20 to 24 bits (17:621), although
the technology available for 20 or more bits is not currently feasible. For the
immediate future, 16 bits, with its 90 dB of dynamic range, is the best choice
and the one more companies are going to in their products (see Table 4).
Table 4 (8, 11, 17, 19, 25, 26) Number of Quantizing Bits Used in Existing
and Proposed Systems
Manufacturer Model Quantizing Bits Dynamic Range
*Technics SV-P100 14 84.3
*Sony PCM-F1 14/16
(switchable)
86/90
Sony DAD-1X 16 >95
Sony/Philips CAD 16 90
JVC AHD 16 90
Hitachi PCMVl00 14 85
Burr-Brown PCM75 (A/D chip) 16 90
*On the market now.
Non-audio information. The information encoded into the string of binary
code after the analog-to-digital conversion is the non-audio information. It
includes the error correction and the synchronization coder. The EIAJ STC-007
standards "provide for various levels of error correction and detection, with
27
each manufacturer able to decide and select the degree and sophistication of the
error collection used" (10:87). The standard, however, really doesn't
standardize anything; and it does not define what is to happen when an
uncorrectable error is encountered (81:20). Table 5 gives a survey of the
different error correction methods used by manufacturers in their various models.
Table 5 (23) Error Correction Methods
Manufacturer Error Correction Methods
Ampex Interleave (IL), parity word (PN), erasure
(E)
Columbia Parity (P), duplicated recording (DP)
Hitachi P, Interpolation (IP)
Matsushita P, DP
Mitsubishi Adjacent code (AC), cyclic redundancy check
code (CRCC)
Sony DP, IL, crossword code (CC), AC, modified CC,
E, P, IP
Soundstream CRCC
3M, BBC IL, CRCC, PW, E
Teac DP, IL, P
Toshiba P, previous word holding
Each manufacturer has its own synchronization, but synchronization
information, as with error correction information, is relatively easy to; remove
from the binary code and is really not a large concern in standardization. The
binary code can be run through a short computer program that strips both of these
from the words containing the audio samples.
28
Total format. The last item in format standardization is that each
manufacturer takes all of the audio and non-audio information and puts it
together in the binary code string as he sees fit. As an example, Figure 9 shows
the 3M digital format. Conversion from one format to another requires stripping
the non-audio information -- an easy job -- and then matching the number of
quantizing bits and the sampling rates.
Figure 9 The 3M format. 3M uses T6 bit linear quantizing, 50 kHz samp1ing, and
20 kHz anti-aliasing and smoother filters. Data is organized into
frames of 400 bits. There are, arranged as shown above:
16 16-bit data words (D)
8 16-bit parity words (P)
1 12-bit frame good/bad check word (CRC)
1 4-bit synchronization pulse (SYNC)
Storage/Playback Systems
In digital audio, as with analog audio, there are two general methods of
storing the music. The first is magnetic tape. As discussed earlier, the binary
bits are translated into electromagnetic pulses and recorded in the material of
the tape. The second is storing the music on a rotating disc. There are many
methods of implementing disc storage in a digital audio system, unlike analog
audio, in which records all operate on essentially the same format.
29
Tape. Standardization of digital tape recording formats is not very
important, due to the fact that tape recording systems are more commonly used for
home recording than for playing back pre-recorded material (4:226). This is
supported by the recent addition of digital tape recorders to several companies‘
lines of consumer products. (The recordings from each format are not compatible
with the other formats).
Disc. Unlike tape, a disc-- whether analog or digital -- cannot be
recorded on at home. Once a disc is mastered, nothing can be done to it to add
more music. Different disc formats mean that several discs and playback units
will not be compatible. Therefore, it is extremely important that storage and
playback equipment be standardized.
There are three classes of disc playback devices, each with its own type of
disc. They are classed as to how the information is transferred from disc. The
first type, and closest in principle of operation to the analog record uses
piezoelectric detection. The digital information is recorded on a polyvinyl
chloride (PVC) disc -- the same material used currently in analog records. The
pickup device looks like an analog cartridge. However, the stylus is blunter,
for better wear, and the cartridge contains a piezoelectric transducer instead of
a magnetic transducer. The digital information is recorded as very small ridges
in a groove, much like a conventional record. The ridges cause the stylus to
vibrate; and the piezoelectric transducer in the cartridge translates the
vibration to an electric signal, which can then be processed digitally. This
method, like in analog records, is susceptible to wear, both to the disc and to
the stylus (29:5l2).
The second class of digital playback devices uses electrostatic detection.
This type of device is a spin-off of computer data storage technology. The
digitized music is stored by inducing small amounts of static electricity into
30
the disc. The discs are made of a mixture of PVC and carbon. The carbon holds
the electrostatic charge. Two types of detection schemes are currently
implemented. In the first, the electric charges are aligned within a groove. A
blunt device rides in the groove, detecting the charges. 0f course, with the
mechanical contact, there is wear to the disc and to the stylus. In the second
type of detection, a detector rides over the surface of an ungrooved disc. The
information is still recorded in a line that spirals toward the center of the
disc but, when the disc is rotating, the detector is kept over the line by an
electromechanical tracking device. The detector never touches the surface of the
disc, so no wear is induced. More data can be stored on the second type, because
the static charges can be confined to a width smaller than any groove needed to
guide a pickup device. Computers use this method to access information from
their disc drives (29:512).
The third class of playback devices uses photoelectric detection.
Photoelectric detection essentially consists of shining the light of a small,
low-intensity laser onto the disc. The light from the laser is reflected from
pits in the disc. These reflections can be decoded into the digital information
necessary to rebuild the music. There are two methods of storing the information
on a "laser disc.” The first method, now is use for video laser disc
applications, uses small pits of varying depth etched into a PVC disc with a
high-intensity laser. The playback laser is reflected, with varying degrees of
intensity, from the pits. The digital information is decoded from the intensity
of the reflections. The second method uses a PVC disc coated with a thin layer
of metal. A high-intensity laser etches holes in the layer of metal. The holes
vary in length, not depth. When the disc is being read, a low-intensity laser
scans the disc, looking for the non-reflecting holes in it (24:12) (see Figure
10). The lengths of the non-reflecting portions can be translated directly into
31
bits. Both of the laser discs are coated in a clear plastic in order to prevent
damage to the stored information (29:512).
Figure 10 Laser scanning. The surface of the audio disc:
(a) Is scanned by a beam of laser light;
(b) Focused by a lens;
(c) And the focused beam of light
(d) Is reflected from the PVC surface and measured by sensors
(7:62).
There are no current industry standards, or even preferences, for anyone
storage/playback system. For a comparison of characteristics favored by industry
leaders, see Table 6.
32
Table 6 (5, 7, 11, 29) Storage/Playback Systems: Comparison of
Characteristics Favored by Industry Leaders
Item
AES Digital
Audio
Technical
Committee
Telefunken/
Teledec Sony/Philips JVC
Channels 2/4 4 2 4
Revolutions /
Minute (R/M) Not given 278-695
Not given in
R/M 900
Diameter 300 mm max. 135 mm 120 mm 260 mm
Playing time 80 minutes
(two sides)
120 minutes
(two sides)
60 minutes
(one side) 120 min
Type Not given Piezoelectric Photoelectric Electrostatic
SELECTED DIGITAL AUDIO SYSTEMS
The following systems represent equipment manufactured by, or to be
manufactured by, leading corporations in the digital audio field. Availability
of information was the deciding factor in choosing the systems to be reviewed.
No information is available on the organization of their digital formats.
Technics by Matsushita
The Technics SV-P100 Digital Cassette Recorder is the first piece of
digital equipment to appear on the consumer market. Introduced in early 1982, it
was the first to be reviewed by a major trade magazine (Stereo Review, March
1982). The cost of the SV-P100 is $3,000 (26:46). Craig Start, testing the
cassette deck for Stereo Review, commented, ". . . It was the cleanest sound I
have ever recorded, and I've recorded a lot. To me, that says a lot about the
SV-P100‘s capabilities” (26:46).
33
Technics records on standard video tapes, and its specifications are
(26:42):
Sample rate: 44,045 Hz
Quantization: 14 bit linear
Dynamic range: 84.3 dB
Distortion: <.01%
Wow and flutter: Unmeasurable
Signal—to-noise: >80.3 dB
Frequency response: 2 Hz — 20 kHz + 0/-2.5 dB
JVC
JVC's audio high density (AHD) digital audio disc system is not yet on the
market, but it is expected within two years. It uses a grooveless electro-
tracking capacitance pickup system (11:70). The specifications of the disc and
playback system are shown below. The JVC AHD system is unique among the
forthcoming audio disc systems in that it has the capacity to playback visual
information in the form of still pictures or slow motion. The disc can also be
played on JVC s video high density (VHD) video playback system, which is now on
the market. Some of the non-audio information will be used as a guide for a
music search capability, locating the beginning of each music track. The AHD
system provides four channels of information -- three for audio purposes, and one
for video. Two of the audio channels will be used for stereo, leaving one free
for special effects, album notes, engineering notes, or for any other as-yet-
unthought-of purposes (11:68)
JVC's ADH disc specifications are (11:70):
Disc size: 260 mm (10 inches)
Revolutions: 900 (R/M)
Playing time: 2 Hours (one per side)
Channels: 3 audio (2 stereo; 1 extra); 1 video
Quantization: 16 bit linear
Sample rate: 47.25 kHz
34
Sony
Sony Corporation, being one of the front-runners in the digital audio
"revolution" (to turn a phrase), has an entrant both in the tape recorder and the
audio disc categories. The PCM-F1 Portable Digital-Audio Adaptor was introduced
to the market in early 1982, while the Digital Audio Disc (DAD)-1X is awaiting
further standardizations in the digital audio field.
PCM-F1. The Sony PCM-F1 Portable Digital-Audio Adaptor turns any NTSC-
standard video recorder into a two channel digital audio recorder costs $1,900,
assuming it has a video recorder to plug into. If not, a video recorder will
cost $800 to $1,200 more (12:2). The PCM-F1 has a switchable quantizer for either
14 or 16 bit linear quantizing, so that the consumer can decide the accuracy he
needs, compared with how much music he wants to fit on a tape (25:13).
The Sony PCM-F1 Portable Digital-Audio Adaptor specifications are (25:13):
Quantization: 14 bit and 16 bit (both linear)
Dynamic range: 86 dB/14 bit and 90 dB/16 bit
Harmonic distortion: 0.01%/14 bit and 0.007%/16 bit
Sampling rate: 44.056 kHz
Frequency response: 10-20 kHz ± 0.5 dB
Wow and flutter: Unmeasurable
Weight: 8.75 lbs.
DAD-1X. The DAD-1X Long-Play Digital Audio Disc System uses a
photoelectric detection system on PVC discs coated with a reflective layer of
metal. The specifications for this system are as follows (8:977):
Disc size: 303 mm
Playing time: 2.5 hours (single side)
Revolutions: 450 R/M
Channels: 2
Sampling rate: 44.056 kHz
Quantization: 16 bit linear
Dynamic range: >95 dB
Harmonic distortion: 0.03%
Frequency response: 2-20 kHz ± .25 dB
Wow and flutter: Undetectable
Error detection: Cross interleave
35
Note that both of the tape decks reviewed (the only two currently on the
market for consumer use) abide by the EIAJ standards, having been manufactured in
Japan. Sony's digital disc system chose the more accurate 16-bit quantization,
being strictly a decoding system (only able to play the discs). For an
illustration of the working of digital tape decks, see Figure 11.
FUTURE OF DIGITAL AUDIO
Digital audio is still a young science. The first digital cassette
recorders have been on the market for only a few months (26:40), but their effect
has been enormous. The digital recorder is now the most accurate component of
consumer stereo systems -- so accurate that it has exposed handicaps in the
response of most amplifiers and speakers that were not previously a matter of
concern. The coming of the digital audio disc, delayed because of technological
and standardization problems, is heralded by many as the end of the analog
record. But, while the digital disc is superior to conventional records, some
argue that the manufacturers have too much money invested in the production of
the latter to make them obsolete overnight. Along with this argument, there is
the possibility that a selection of non-compatible digital disc systems might
turn the consumer away from digital audio.
Digital Audio Discs
Advantages. The digital audio disc is a vastly superior method of re-
producing sound than on conventional records. All of the shortcomings of
conventional turntables will be circumvented. There will be no inner-groove
distortion, tracking error, tone-arm resonances and record and stylus wear.
Digital discs will not suffer nearly as much as analog records from the effects
37
of poor vinyl, bad pressings, dust or any of the other problems associated with
the manufacturing and use of the discs (16:56). The digital discs should last
indefinitely, as the only wear that they encounter is being taken out of their
cover to be put in the player. Another advantage is that digital discs may
become cheaper to produce than conventional records (4:184).
Disadvantages. When home digital disc playback systems appear on the
market, possibly in early 1983 (16:56), they will be handicapped in two areas.
One is that the record companies "have invested hundreds of millions of dollars
in conventional record-making equipment.” They will not render their equipment
obsolete without hard evidence that the digital discs will sell. Therefore, they
will not issue a large selection of the digital discs until they are sure that "a
large listening audience has the equipment to play them.” The second problem is
that "the consumer is not likely to buy a digital disc player until there is a
large selection of digital discs to play on it (4:184).” Therein lies the
possibility of a stalemate, as when quadraphonic records were introduced. The
quadraphonic records could be played on existing equipment, but few records were
marketed, so the public became disenchanted with "quad" sound (4:226). There is
little chance of this happening, however. Audio companies, such as Sony, have
spent millions of dollars on digital 14:54). The situation is more like that of
the audio manufacturers abandoning the 78 RPM record for the longer playing 45
and 33 RPM records. The better method -- in this case, the digital disc -- will
win out, but there will be a long transition period (4:184). Analog recordings
could survive up to twenty years in the presence of digital discs.
Support. At the l98l audio fair in Tokyo, about twenty companies dis-
played prototype compact digital-audio-disc (CDAD) players, although only a half-
dozen or so actually had working models (14:57). Toshiba, Denon and Yamaha have
38
adopted the joint Sony/Philips "compact disc" (CD) format (12:53), perhaps
signaling the beginning of informal standardization of the audio disc field.
Digital Audio Today
With digital recorders already on the market, audiophiles are re-examining
previously acceptable performance standards for other components in the chain of
audio reproduction. Because of digital audio‘s wide dynamic range, sharp peaks
(transients from percussion and other sources) to which analog systems cannot
react fast enough for them to record fully, can be recorded. These peaks,
requiring up to 100 times the average power of the music to reproduce, will
result in clipping by most amplifiers-and, sometimes, may even activate speaker
protection circuitry (22:63). Some of the newest amplifiers on the market can
handle these very sharp transients, but they are very expensive. However, safe
headroom is vital to digital recording. Instantaneous true peak reading meters
are necessary, therefore, on digital equipment in order to avoid amplifier
overload (2:244). But, though existing equipment may not be ready for digital
audio, Robert Berkovitz, director of research for Teledyne Acoustic Research
(manufacturers of the AR speaker series) said, "Most of the analog recordings
made during the past decade will be perfectly suited to reproduction on home
digital playback equipment (once they are converted to a digital format)"
(22.61).
Though digital accuracy is a fact, there has been concern by some hard core
audiophiles that digitally recorded music is unwholesome - that somewhere in the
process, detrimental factors creep into the music. At the 1980 Spring Audio
Engineering Society Convention in Los Angeles, Dr. John Diamond, a psychiatrist,
attempted to prove that digitally recorded music produces stress and fatigue in
listeners. Though his experiments were loosely controlled, if controlled at all,
39
they started some serious debate as to what kinds of affects digitizing adds to
the music, even if any changes imparted to the music have no audible effects on
the human ear (16:59). However, “A comparison test between all the major digital
recorders and a professional half-inch-tape, two-track analog machine running at
30 inches per second that was conducted at the Sound Emporium in Nashville last
summer confirmed that there do not seem to be any major degradations ascribable
to the digital process" (16:59).
CONCLUSION
Digital audio reproduction is the most significant development in the audio
field to occur in at least the last two decades. Toshit Doi, manager of Sony's
Digital Audio Project, says, "Before digital audio was introduced, the quality
bottleneck of the whole audio chain was in analog tape recorders and analog
records.” The weakest points are now the transducers (microphones and
loudspeakers) (22:63).
Remember, however, that “the accuracy of the digital process depends on the
sampling and quantization processes" (1:132). State-of-the-art integrated
circuitry can be improved upon, and advances in technology can make digital audio
even more accurate than it is now. If the field of digital audio is rigidly
standardized, the benefits of these technological advantages could be severely
delayed, and there would be no way to get more information out of earlier digital
recordings (16:59).
Of the possible technological advances, there is one that could make
digital tape recorders and audio discs obsolete before they have a chance to gain
a large acceptance. Static Systems Corporation, New York, has developed a system
called the Sonadisc. It is a microprocessor based audio player that uses a coin-
40
sized disc, containing magnetic bubble memory (a state-of-the-art computer memory
technology), to store music. The device has no moving parts, because all of the
music is accessed electronically from the bubble memory disc. The memory disc is
inserted into a receptacle in the player and currently stores only about a
second's worth of music (one megabit's worth). A radical advance in magnetic
bubble memory devices would definitely increase the amount of storage available
(6:6). If, and when, this happens, it will certainly be a more efficient way to
store music.
I believe that the EIAJ STC-007 standard is already obsolete. Sixteen- bit
systems, sampling at a higher rate than 41 kHz, have dramatically better
specifications. Manufacturers realize this, so when the digital audio disc
enters the market, it will not be in accordance with the EIAJ standards.
I favor using an industrial coalition -- the type Sony and Philips have
formed -- to informally standardize the field. This approach assures that
technological advances would reach the marketplace faster than under stringent
governmental industrial standards. I do believe that a standardization of
digital tape deck formats is necessary to accommodate music exchanges between
systems without the need for a converter. This will eventually happen; but, for
now, just having digital tape decks available for consumer use is a big enough
plus.
Digital audio recording and playback are the future of audio reproduction,
and the sooner they reach maturity, the sooner we can stop losing our heritage of
music to the degradations of analog storage media.
41
APPENDIX
GLOSSARY OF TERMINOLOGY
(Taken from The Complete Guide to Stereo Equipment)
Channel separation: Electrical or acoustical difference between left and right
channels in stereo.
Crosstalk: Signal leakage between two channels.
Decibel (dB): Numerical expression of acoustic or electrical ratios, such as
relative intensity of sound or relative strength of signal.
Distortion: Unwanted noise, or sounds that did not exist in studio when original
recording was made. All distortion is undesirable.
Dropout: Momentary loss of signal during tape recording or playback caused by
tape defects.
Dynamic range: Span in volume, expressed in decibels, between loudest and
softest sounds.
Filter: A circuit which attenuates signals above, below or at a particular
frequency.
Frequency: Rule of repetition, in cycles per second (cps), or musical pitch and
of electrical signals, expressed in Hertz (Hz). Low frequencies refer to
bass tones; high frequencies to treble tones.
Frequency range: Range from highest to lowest pitch sounds at usable volume
level.
Frequency response: Ability to reproduce given range of frequencies.
Harmonic distortion: Disturbs original relationship between naturally related
tones and is expressed in percentages.
Head room: Margin between maximum levels indicated on recorder's volume
indicator and actual level of severe tape overload.
Hertz: Name given to number of vibrations or cycles per second in electrical
signal of alternating current. Abbreviated Hz, with capital "H"; but when
spelled out, only lower-case letters are employed. The name derives from
Heinrich Hertz, early electrical scientist.
Integrated circuit (IC): Combination of transistors, diodes and resistors
assembled in pre-packaged circuit capable of providing high gain, low
distortion, and easily controllable performance in extremely miniaturized
form.
Kilohertz (kHz): 1,000 hertz.
42
Megahertz (MHZ): 1,000,000 hertz
Noise: Extraneous sound or signal that intrudes into original as result of
environmental noise, distortion, hum, or defective parts and tubes in
equipment.
Saturation: Condition whereby tape has reached its maximum degree of
magnetization.
Separation: Degree to which stereo signals are kept apart in passing through
reproducing systems.
Signal: Electrical replica of actual sound; in broadcasting, carrier wave
itself.
Signal-to-noise ratio: Often abbreviated “S/N ratio"; relative amount of signal
to undesired and extraneous noises in any device or its output.
Transducer: Device for changing one form of energy into another. Microphone and
phone pickup are sound-to-electrical energy transducers.
Transient: Heavy burst of music, causing the amplitude of the music signal to
increase sharply and then subside.
Wow and flutter: Fluctuations in speed of turntable or tape transport that cause
pitch distortions; wow refers to slow, repeated fluctuations; flutter to
short, rapid fluctuations.
43
BIBLIOGRAPHY
1. Bayliff, R. W. "Digital techniques and the gramophone record.” Gramophone.
vo1. 57, no. 673, June 1979, pp. 131-2, 135.
2. Burkowitz, P. K. "User‘s note on digital audio standard code and procedures,
operational requirements.” Journal of the Audio Engineering Society. Apri1
1978, PP. 242, 244, 246.
3. Busby, E. S. "Digital audio recording on videotape: Some choices." SMPTE
Journal. vol. 89, no. 7, July 1980, pp. 508-12.
4. The Complete Guide to Stereo Equipment. New York: Simon and Schuster
Publishers, 1979.
5. Digital Audio Technical Committee. “Digital audio disk standardization
Conference.” Journal of the Audio Engineering Society. July/August 1980,
pp. 525-27.
6. "Digital music at home.” Computer Music Journal. vol. 5, no. 4, 1981, p.
6.
7. Doi, T. T. “General information on a compact digital audio disk.” Journal
of the Audio Engineering Society. January/February 1981, pp. 60, 62.
8. Doi, T. T., et al. "A long play digital audio disk system.” Journal of the
Audio Engineering Society. December 1979, pp. 975-81.
9. Doi, T. T. and Roger Lagadex. “An outline of the format jointly supported by
Sony and Studer for stationary-head digital audio recording.” Journal of the
Audio Engineering Society. September 1980, p. 624.
10. Feldman, L. "Digital audio for the 1980's." Radio Electron. June 1980, pp.
63-5, 87.
11. Hidaka, T. "JVC AHD digital audio disk system." Journal of the Audio
Engineering Society. January/February 1981, pp. 68, 70.
12. Illinois Audio Catalog. Chicago, Illinois Audio, 1982.
13. “JVC digital audio products.” Computer Music Journal. vol. 5, no. 4, 1981,
pp. 89-90.
14. Klein, Larry. “Audio fair in Tokyo (Oct. ‘81).” Stereo Review. February
1982, pp. 56-60.
15. McCracken, J. A. “A high-performance digital audio recorder.” Journal of
the Audio Engineering Society. July/August 1981, p. 560.
16. Meyer, E. Brad. "Digital.” Stereo Review. Apri1 1982, pp. 56-9.
44
17. "Minutes of the meeting of the Digital Audio Technical Committee." Journal
of the Audio Engineering Society. September 1981, pp. 620-24.
18. "New audio products at Las Vegas CES.” Stereo Review. May 1982, pp. 53-6.
19. PCM75: Designed for Audio. Tucson, Burr-Brown Research Corporation, July
1981.
20. Ranada, David. “Digital audio: A primer.” Stereo Review. February 1981, pp.
63-5.
21. Ranada, David. “Good news: Digital recording standards." Stereo Review.
March 1980, p. 20.
22. Ranada, David. "Superdiscs.” Stereo Review. December 1980, pp. 61-4, 67.
23. Sony‘s Digital Audio Technology - QA. Japan, Sony Corporation, nd.
24. Sony's Digital Audio - The Next Step. Japan, Sony Corporation, May 1981.
25. “Sony‘s small, portable digital audio adaptor." Stereo Review. May 1982, p.
13.
26. Stark, Craig. "Equipment test reports: Technics SV-P100 digital cassette
recorder." Stereo Review. March 1982, pp .40, 42, 46.
27. Traiman, Steve. “Digital decade: The audio/visual future.” Stereo Review.
January 1980, pp. 69-72.
28. Weisser, A. “The digital recording of sound in broadcasting.” SMPTE
Journal. July 1980, pp. 520-4.
29. Welland, K. and H. Redlich. “The MD (Mini Disk) system: A contribution to
the digital audio disk standard.” Journal of the Audio Engineering Society.
July/August 1980, pp. 510-14.