Digital Audio


Transcript of Digital Audio

Page 1: Digital Audio

School of Computer Science & Information Technology

Digital Audio

G6DPMM

Page 2: Digital Audio

G6DPMM - Lecture 14

Digital Audio

Page 3: Digital Audio

Properties of Sound

Acoustics
- Pressure waves in air
- Pitch - frequency - time (kHz)
- Amplitude - level - energy (dB)
- Subjective volume - frequency and energy

Page 4: Digital Audio

Analogue vs Digital

Analogue technologies
- Physical
- Magnetic

Digital technologies
- Sample sound
- Approximation of analogue
- Must be captured (a2d conversion)

Page 5: Digital Audio

Sound in Multimedia

MIDI
- Defines the synthesis of sound
- Analogous to vector graphics

Digital Audio
- Captured waveform
- Analogous to bitmap graphics

Page 6: Digital Audio

MIDI

- Musical Instrument Digital Interface.
- Standard for synthesisers and electronic instruments and devices.
- A MIDI “score” describes a time-stamped sequence of notes.
- Reproduction is dependent upon the hardware (MIDI device).
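To illustrate the idea of a score as a time-stamped sequence of notes, here is a minimal sketch. The `NoteEvent` record is a hypothetical simplification for illustration and is not the actual MIDI message format. The pitch conversion assumes the usual equal-temperament convention in which MIDI note 69 is A4 at 440 Hz.

```python
from typing import NamedTuple


class NoteEvent(NamedTuple):
    """Hypothetical, simplified note record (not the real MIDI wire format)."""
    start_s: float     # time stamp in seconds from the start of the score
    note: int          # MIDI note number (69 = A4)
    velocity: int      # how hard the note is struck, 0-127
    duration_s: float  # how long the note is held


def note_to_hz(note: int) -> float:
    """Convert a MIDI note number to a pitch in Hz (A4 = note 69 = 440 Hz)."""
    return 440.0 * 2.0 ** ((note - 69) / 12)


# A three-note score: the file stores instructions, not a waveform.
score = [
    NoteEvent(0.0, 60, 100, 0.5),
    NoteEvent(0.5, 64, 100, 0.5),
    NoteEvent(1.0, 67, 100, 1.0),
]

for ev in score:
    print(f"{ev.start_s:4.1f} s  note {ev.note}  {note_to_hz(ev.note):7.2f} Hz")
```

How these notes actually sound is left to whatever device plays the score, which is why reproduction depends on the hardware.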

Page 7: Digital Audio

Advantages of MIDI

- Very compact (up to 1,000 times smaller than CD audio)
- Sounds produced by hardware - therefore low system overhead.
- Low bandwidth requirements.
- High quality - dependent upon hardware.
- Scalable, editable & cross-platform.

Page 8: Digital Audio

Disadvantages of MIDI

- Hardware dependent - only a faithful reproduction if the equipment is constant.
- Cannot record “real world” sounds.
- Difficult to represent speech.

Creation of MIDI
- Via editing software (sequencer)
- Via instruments / peripherals
- Usually requires musicians or sound effects specialists.

Page 9: Digital Audio

Digital Audio

Sampled Sound
- At regular time intervals a sample is taken, and the information is stored digitally.

Applications
- CD
- Digital tape (DAT)
- Digital broadcast (radio / TV)
- Most multimedia sound

Page 10: Digital Audio

Sampling (capturing)

- Sampling hardware - from any analogue source (usually line in).

Sampling rate
- number of samples taken per second
- often called “frequency”

Sample size
- amount of information stored per sample
- often called “resolution”

Page 11: Digital Audio

Sampling Rate

Page 12: Digital Audio

Sample Size

Page 13: Digital Audio

Sampling Parameters

Common Sample Rates (frequency)
- 44.1 kHz (CD audio)
- 22.05 kHz
- 11.025 kHz

Common Sample Sizes (resolution)
- 8 bit (256 amplitude states)
- 16 bit (65,536 amplitude states)
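The two parameters can be sketched in a few lines: the sampling rate decides when the waveform is measured, and the sample size decides how many amplitude states each measurement can take. This is an illustrative sketch only. The function names are made up for the example, and the symmetric rounding scheme is an assumption rather than the behaviour of any particular converter.

```python
import math


def amplitude_states(bits: int) -> int:
    """Number of distinct amplitude values a sample of this size can hold."""
    return 2 ** bits


def sample_sine(freq_hz: float, rate_hz: float, n_samples: int) -> list[float]:
    """Measure a sine wave at regular intervals of 1 / rate_hz seconds."""
    return [math.sin(2 * math.pi * freq_hz * n / rate_hz) for n in range(n_samples)]


def quantize(x: float, bits: int) -> int:
    """Map x in [-1.0, 1.0] to a signed integer code (assumed symmetric scheme)."""
    max_code = 2 ** (bits - 1) - 1
    return round(x * max_code)


samples = sample_sine(freq_hz=440.0, rate_hz=44_100, n_samples=8)
codes_8 = [quantize(s, 8) for s in samples]    # coarse: codes in -127..127
codes_16 = [quantize(s, 16) for s in samples]  # fine: codes in -32767..32767
print(amplitude_states(8), amplitude_states(16))
print(codes_8)
print(codes_16)
```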

Page 14: Digital Audio

Examples

Frequency (kHz)  Resolution (bits)  Tracks  Storage (1 min)  Quality
44.1             16                 2       10.5 MB          “CD quality”
44.1             16                 1       5.25 MB          High quality mono
22               16                 1       2.5 MB           Reasonable speech
22               8                  2       2.6 MB           Medium quality stereo
22               8                  1       1.3 MB           “TV quality”
11               8                  1       650 KB           “Telephone quality”
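The storage figures above follow from multiplying sample rate, bytes per sample, number of tracks and duration. The sketch below recomputes them for uncompressed audio. The helper name is hypothetical, and the slide's values are rounded, so the results agree only approximately.

```python
def storage_bytes(rate_hz: float, bits: int, tracks: int, seconds: float = 60) -> int:
    """Uncompressed size: samples per second x bytes per sample x tracks x time."""
    bytes_per_sample = bits // 8
    return int(rate_hz * bytes_per_sample * tracks * seconds)


examples = [
    (44_100, 16, 2),  # "CD quality"
    (44_100, 16, 1),
    (22_050, 16, 1),
    (22_050, 8, 2),
    (22_050, 8, 1),
    (11_025, 8, 1),   # "Telephone quality"
]

for rate, bits, tracks in examples:
    size = storage_bytes(rate, bits, tracks)
    print(f"{rate / 1000:7.3f} kHz  {bits:2d} bit  {tracks} track(s)  {size / 1e6:6.2f} MB per minute")
```

Note that 16-bit mono and 8-bit stereo at the same rate occupy exactly the same space, since both store two bytes per sampling instant.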

Page 15: Digital Audio

Distortion

- Distortion occurs when the reconstructed waveform differs unacceptably from the original.
- Usually arises from incorrect settings of disparate equipment (amplitude / levels).
- Also caused by insufficient sampling frequency (Nyquist theory).
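One common form of level-related distortion is clipping: if the input level is set too high, peaks fall outside the range the sample size can represent and are flattened. The sketch below assumes 16-bit signed samples and uses hypothetical helper names. It illustrates the effect and does not model any specific equipment.

```python
INT16_MIN, INT16_MAX = -32768, 32767


def clip_16bit(sample: int) -> int:
    """Force a value into the range a 16-bit signed sample can represent."""
    return max(INT16_MIN, min(INT16_MAX, sample))


def apply_gain(samples: list[int], gain: float) -> list[int]:
    """Scale every sample by gain; peaks beyond the 16-bit range are clipped."""
    return [clip_16bit(round(s * gain)) for s in samples]


quiet = [10_000, -20_000, 30_000]
too_loud = apply_gain(quiet, 2.0)
print(too_loud)  # the last two peaks no longer keep their 2:3 ratio
```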

Page 16: Digital Audio

The Nyquist Rule

Page 17: Digital Audio

Implications of Nyquist

- Each half of the waveform must be recorded
  - there must be at least 2 samples per period
  - the sampling frequency must be at least twice the highest signal frequency.

For example: if the highest frequency is 14,080 Hz, then a sampling rate of at least 2 x 14.08 = 28.16 kHz must be used. Thus 22.05 kHz would not suffice - 44.1 kHz would be needed.
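The worked example can be checked with a short sketch. The function names are illustrative only. `alias_frequency` uses the standard folding relation for a pure tone to show what a frequency above the limit would be heard as after sampling.

```python
COMMON_RATES_HZ = [11_025, 22_050, 44_100]


def nyquist_min_rate(highest_hz: float) -> float:
    """Minimum sampling rate: twice the highest signal frequency."""
    return 2 * highest_hz


def lowest_sufficient_rate(highest_hz: float, rates=COMMON_RATES_HZ):
    """Smallest common rate that satisfies the Nyquist rule, or None."""
    for rate in sorted(rates):
        if rate >= nyquist_min_rate(highest_hz):
            return rate
    return None


def alias_frequency(signal_hz: float, rate_hz: float) -> float:
    """Apparent frequency of a pure tone after sampling at rate_hz."""
    folded = signal_hz % rate_hz
    return min(folded, rate_hz - folded)


print(nyquist_min_rate(14_080))         # 28160
print(lowest_sufficient_rate(14_080))   # 44100
print(alias_frequency(14_080, 22_050))  # 7970: the tone is misrepresented
print(alias_frequency(14_080, 44_100))  # 14080: the tone is preserved
```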

Page 18: Digital Audio

Editing Digital Sound

Wide range of software
- Commercial (eg Sonic Foundry Sound Forge)
- Shareware/Freeware
- Bundled with hardware

Common Operations
- Trimming
- Splicing and assembly
- Volume adjustments
- Downsampling
- Fades & other effects
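Because sampled sound is just a sequence of numbers, several of these operations reduce to simple list manipulation. The following is a rough sketch with hypothetical function names, not how any particular editor implements them. In particular, the downsampling shown is naive: real tools low-pass filter first so that the Nyquist rule is not violated at the new rate.

```python
def trim(samples: list, rate_hz: int, start_s: float, end_s: float) -> list:
    """Keep only the samples between start_s and end_s (in seconds)."""
    return samples[int(start_s * rate_hz):int(end_s * rate_hz)]


def splice(first: list, second: list) -> list:
    """Assemble two clips end to end."""
    return list(first) + list(second)


def fade_in(samples: list, n_fade: int) -> list:
    """Ramp the level linearly from silence over the first n_fade samples."""
    n_fade = min(n_fade, len(samples))
    ramp = [s * i / n_fade for i, s in enumerate(samples[:n_fade])]
    return ramp + list(samples[n_fade:])


def downsample_by_2(samples: list) -> list:
    """Naive halving of the sample rate: keep every other sample (no filtering)."""
    return samples[::2]


clip = list(range(10))
print(trim(clip, rate_hz=2, start_s=1, end_s=3))
print(splice([1, 2], [3, 4]))
print(fade_in([10, 10, 10, 10], n_fade=2))
print(downsample_by_2([1, 2, 3, 4, 5]))
```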

Page 19: Digital Audio

Sound Compression

- In principle very similar to image or movie compression
- Usually (not always) lossy
- Requires a codec for compression and playback
- Streaming is commonplace and well developed (eg RealAudio, QuickTime, MS Media Player).

Page 20: Digital Audio

Common Codecs

- GSM - mostly for voice (3.5-28 MB per hour)
- ADPCM (Microsoft or IMA) - high quality, but low compression (14-152 MB per hour)
- Lernout & Hauspie - voice only (3.5-7 MB per hour)
- CCITT A-Law - European TAPI devices, very high quality (24-302 MB per hour)
- CCITT μ-Law - American TAPI devices, very high quality (24-302 MB per hour)
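As an illustration of how one of these codecs trades precision for size, the sketch below implements the continuous mu-law (μ-law) companding curve with μ = 255. It gives quiet sounds more of the available code range before the value is stored in fewer bits. This is only the textbook curve, assumed here for illustration. It is not a bit-exact telephony encoder.

```python
import math

MU = 255  # companding parameter conventionally used for mu-law


def mu_law_compress(x: float, mu: int = MU) -> float:
    """Compress x in [-1, 1]: small amplitudes are expanded, large ones squeezed."""
    return math.copysign(math.log1p(mu * abs(x)) / math.log1p(mu), x)


def mu_law_expand(y: float, mu: int = MU) -> float:
    """Inverse of mu_law_compress."""
    return math.copysign(((1 + mu) ** abs(y) - 1) / mu, y)


for x in (0.01, 0.1, 0.5, 1.0):
    y = mu_law_compress(x)
    print(f"x = {x:4.2f}  compressed = {y:.3f}  restored = {mu_law_expand(y):.4f}")
```

A quiet input of 0.01 is mapped to more than 0.2 of full scale, so it survives coarse quantisation far better than it would on a linear scale.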

Page 21: Digital Audio

Common File Formats

- Very large number of formats
  - Audio IFF (AIFF) - developed by Apple for Macintosh, also used by SGI and various software.
  - AU Audio - uncompressed format developed by Sun, widely used on the Internet.
  - WAV - developed by Microsoft for Windows.
  - MP3 - highly compressed, high quality spin-off of the MPEG project
  - RealAudio RAM - developed for streaming by RealNetworks.

- Differ in terms of:
  - Compression algorithms
  - Metadata
  - Security & Encryption
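To make the idea of a file format concrete, here is a small sketch that writes a short tone as an uncompressed 16-bit mono WAV using Python's standard `wave` module. The function name, tone frequency and duration are arbitrary choices for the example, and the file is built in memory rather than on disk.

```python
import io
import math
import struct
import wave


def make_wav_bytes(freq_hz: float = 440.0, rate_hz: int = 22_050, seconds: float = 0.1) -> bytes:
    """Return the bytes of a mono, 16-bit WAV file containing a sine tone."""
    n = int(rate_hz * seconds)
    # Each sample is packed as a little-endian signed 16-bit integer.
    frames = b"".join(
        struct.pack("<h", round(32767 * math.sin(2 * math.pi * freq_hz * i / rate_hz)))
        for i in range(n)
    )
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)        # tracks
        w.setsampwidth(2)        # sample size in bytes (16 bit)
        w.setframerate(rate_hz)  # sampling rate
        w.writeframes(frames)
    return buf.getvalue()


data = make_wav_bytes()
print(len(data), data[:4], data[8:12])
```

The header written by `wave` records the sampling rate, sample size and number of tracks, which is the kind of metadata that distinguishes one format from another.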