Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word...

37
Technical Report: Technology Review Document ID WP1-T1-TR Document Version 1 Local Path Corpus2/socis/work_packages/wp1/ technical_review.doc Authors Lee Gillam Function To explore available data capture technologies Distribution Surrey, Sheffield, Police Forces Comments

Transcript of Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word...

Page 1: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

Technical Report:

Technology Review

Document ID WP1-T1-TR

Document Version

1

Local Path Corpus2/socis/work_packages/wp1/technical_review.doc

Authors Lee Gillam

Function To explore available data capture technologies

Distribution Surrey, Sheffield, Police Forces

Comments

Page 2: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

Table of Contents

Page 3: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

Digital Camera Guide to Technologies 3

Nikon Coolpix-950 4

Canon Powershot Pro70 - £625 5

Sony DSC F505 - £505 6

Comments from an imaging consultant 6

Comments from Hayden B. Baldwin, Executive Director, International Crime Scene Investigator's

Association 7

Associated Technologies 8

Conclusion 9

Digital Tape - Sony NT-2 - $1695.00 12

Olympus D1000 Digital Voice Recorder PACKAGE With ViaVoice Transcription Software - approx

$289 14

Digital Voice Recorder DS-150 - $179 22

Page 4: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

D E S I G N C U S T O M I S A T I O N

Generic TechnologyIdeal technology versus the available reality

he idea of a Scene of Crime Officer performing mostly hands-free data collection is not yet a realistic task due to the various limitations with current technologies.

Digital cameras, for example, are rather cumbersome and have limited data transfer capabilities - infra-red communications between cameras and computers are in the early stages. Speech capture systems, specifically radio microphones for limited power remote usage, interfacing directly with speech recognition software are still a while off, while actual speech recognition technologies still need to be evaluated within this particular domain. If we combine these needs, with the possibility for electronic capture of other 'media' such as scene of crime sketches perhaps on a PDA, the number of possible individual technologies increases along with the number of possible, though immature technologies that can be used in integration. A further consideration to be made is that all these devices need a variety of power sources, with each having an individual operational period. While we are some distance from the ideal, there are a number of these individual technologies that will be investigated, with a view to their use in immediate data capture. This document will also serve as a reference point for evaluating technologies to satisfy these needs at other points in the future.

T

3

Chapter

1

Page 5: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

D I G I T A L C A M E R A S

4

Chapter

2

Page 6: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

P E R S O N A L D I G I T A L A S S I S T A N T S

Digital Camera TechnologiesThis chapter deals with the state-of-the-art in digital camera technologies.

igital cameras are a fairly recent technology at the time of this document. As each manufacturer attempts to become the market leader in this non-homogenised

digital area, a number of non-standard features are being introduced into these devices. Key issues that are of concern for SOC images are, for example, the possibility to 'watermark' images and the clarity of the images at higher magnifications. From a data oriented point of view, these concerns are less critical and issues such as the ability to store, capture and transfer these images are key.

D

Digital Camera Guide to TechnologiesThe non-standard components of various cameras have a number of features, strengths and drawbacks that are worth considering. These can be summarised as:

Data transfer - mechanisms, rates, active storage

Image magnification, clarity

Image 'security'

Peripherals

Cost

Nikon Coolpix-950 - £650

Page 7: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

D I G I T A L C A M E R A S

Pixels 2.11 Megapixel CCDPicture size 1600x1200, 1024x768 or 648x480, 4 quality SettingsLens 7-21mm (38-115mm equivalent), 3x power zoom, f/2.6Focusing TTL Auto or manual focus, 2cm macroFlash Auto, forced, off, redeye reduction. Range: 8cm - 3mSensitivity ISO 80Shutter 8s - 1/750 (mechanical)Modes Shutter priority, aperture priority, programmed AEViewfinder Real image optical or via screen.Screen 2" Polysilicon

Storage Removable: CompactFlash (8MB supplied, holding 1 pictures at highest resolution/quality or 32 pictures at lowest resolution/quality)

Interfaces Serial, Video out (Local)Batteries 4xAACompatibility Windows 95 or later & Mac OS7.5.1 or laterOther features

Uncompressed mode; Quick start; Best Shot Selector; Manual Gain; Tone compensation; External TTL speedlight control; Lens conversion options.

Dimensions 143(w) X 36.5(d) X 76.5(h)Weight 350g excluding batteries

In-the-box Camera; Soft case; Wrist strap; 4xAA batteries; 8MB CF card; Serial cables; Video cable; User guide; Software kit (CD-ROM)

Options Various CompactFlash cards; AC adapter; Wideangle & Fisheye conversion Lenses; External flash bracket; PCMCIA adapter

Source: Digital Camera Company

PC Pro's verdict was that "High image quality and a truckload of features are undermined by fiddly controls and unreliable PC link software."

Furthermore, the PC Pro review in issue 58 says:

Nikon's CoolPix 950 boasts a similarly high resolution of 2.11 million pixels, giving pictures of 1,600 ´ 1,200 resolution. More importantly, it manages to maintain a high level of quality while keeping file sizes small. In fact, at its highest quality setting, the CoolPix 950 manages to compress its full resolution images by a ratio of around seven to one without sacrificing quality. This allows you to fit an average of ten shots onto the supplied 8Mb CompactFlash card. Unlike the MX-2700, the CoolPix 950 can save and store images in uncompressed TIFF file format. However, at around 5.5Mb per image, you'll only be able to fit a single shot at a time on the card

Even with the compression turned on, the CoolPix 950 is capable of extremely impressive results. In terms of sheer detail captured and image quality, it easily outstrips not only the MX-2700 but also the Minolta Dimage EX Zoom 1500 (reviewed issue 53, p177) and the Olympus C-900 Zoom (reviewed issue 54, p174).

In fact, the only cameras that come close are the Sony Cyber-Shot DSC-55E (reviewed p167), which is slightly below it in terms of quality, and the Canon PowerShot Pro 70 (reviewed issue 54, p165), which is slightly higher up the rankings thanks to its fantastic colour, crisp focus and overall image quality.

Nikon has also managed to pack a more bewildering array of features into the CoolPix 950 than we've ever seen in a digital camera. Like many 35mm cameras, the 950 features two main shooting modes: A-Rec (automatic) and M-Rec (manual). The first of these is simply a point-and-shoot mode allowing you to select your flash mode -

6

Page 8: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

D I G I T A L C A M E R A S

automatic, fill, red-eye reduction and off are available - and choose between normal, infinity and macro focus modes. Aperture size, shutter speed, white balance and sensitivity settings are all dealt with automatically.

In M Rec mode, things become a little more complicated. Here you can choose between aperture or shutter priority exposure modes, matrix, spot or centre-weighted light metering, and set the sensitivity level. You get a total of three aperture settings to choose from, in the range f-3.9 to f-11, and the shutter speed can be adjusted from a 500th of a second right up to eight seconds. There's also the option to pick from a list of preset white balance settings or you can manually set your own.

Even more clever than this, however, is the CoolPix 950's best-shot selection feature. With this mode turned on, the camera takes a series of shots, compares them and then picks the one with the most detail. Unfortunately, the 950's extensive range of adjustments and features can make things confusing. Some functions can only be accessed by the fiddly process of holding one button down while turning a small dial at the front underneath the shutter button. This means important adjustments to shutter speed or aperture size are more awkward to make than they should be.

Fortunately, the rest of the CoolPix 950's ergonomics are up to scratch. It feels solidly built and, like its predecessor the CoolPix 900 (reviewed issue 48, p164), the 950 is split into two halves with a joint in the centre. This allows the 3x zoom lens (38-115mm equivalent), optical viewfinder and flash to be rotated independently of the main camera body, which houses a decent 2in TFT screen. It may sound gimmicky but the design means you don't need to crouch, contort your body or crane your neck in order to take pictures from different angles, and it makes overhead shots particularly straightforward.

When it comes to transferring the images to your PC using the 115Kbits/sec serial link, the CoolPix 950 does its best to thwart you. Quite apart from the fact that it doesn't support TWAIN, I found that the supplied NikonView software, which works by mounting the camera as a virtual drive in the Windows Explorer, worked intermittently at best. I only managed to download one image at a time before having to restart the software and re-establish the connection, which was really annoying. At least the inclusion of a copy of Adobe Photoshop 5 LE goes some way to redeeming the situation.

This problem, and the camera's over-fiddly controls mar what is otherwise a high-quality digital camera. Image quality is only bettered by Canon's Powershot Pro 70, which costs considerably more than the £681 Nikon. It's also well built and boasts more features and adjustments than you can shake a stick at.

Canon Powershot Pro70 - £625

Pixels 1.68 Megapixel CCDPicture size 1536x1024 or 768x512, 3 quality Settings

Lens 6-15mm (28-70mm equivalent), 2.5x power zoom, f/2Focusing TTL Auto focusFlash HotshoeSensitivity ISO 100-400Shutter 1/2 - 1/8000Modes Aperture priority, programmed AEViewfinder Real image optical or via screen.Screen 2" LCD

Storage Removable: CompactFlash (15MB supplied, holding 5 pictures at highest resolution/quality or 180 pictures at lowest resolution/quality)

Interfaces Serial, Video out (Local)

7

Page 9: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

D I G I T A L C A M E R A S

Batteries Lithium Ion rechargeableCompatibility Windows 3.1/95/98/NT and Mac

Other features Twin CF slots, remote release connector, dedicated flash hotshoe.

Dimensions 145(w) X 132(d) X 85(h)

Weight 650g excluding batteries

In-the-box Camera; 15Mb CF card; Nickel-Metal-Hydride battery; Charger; Cables; User guide; Software kit.

Options Various CF cards; EOS Speedlite EX series flashSource: Digital Camera Company

PC Pro's verdict was that " A classy piece of kit with image quality and a price tag to match. Worthy of an Honourable mention this month."

Furthermore, the PC Pro review in issue 63 says:

Both the price and form factor make it clear that the Pro70 is aimed at the advanced amateur photographer wanting to go digital. Its hefty, no-compromise build quality is more reminiscent of mid-range SLR (single lens reflex) cameras than the more common point-and-shoot variety. Features such as the hot shoe external flash connection, the lens housing-mounted zoom control, remote shutter control socket and twin CompactFlash slots - one of which will allow you to insert a Type II Card such as the 340Mb IBM microdrive (reviewed p179) - further the professional feel.

It has fewer controls than we'd expect from a camera with such high-end pretensions, however. Although you get a Program mode to accompany the point-and-shoot automatic mode, it only gives you aperture priority and exposure compensation adjustments, with no control over shutter speed or focus. There's clearly some high-quality optics inside, though, as you can see from the close-up above and the results pages (see p120). It may not boast as ludicrously high a resolution (just 1,536 x 1,024) as many of the other cameras, but this doesn't prevent it from producing the very best in image quality. It produces balanced, detailed and colour-rich images that only the Sony DSC-F505 and Olympus C-2500L can match.

The only problem with the Canon PowerShot Pro70 is the cost. At a street price of £769, it's the second most expensive camera on test, and adding a flash will set you back another £130. It does have an edge over the DSC-F505, thanks to superb image quality and a proper optical viewfinder, but doesn't come close to it for value.

Sony DSC F505 - £505

Pixels 2.11 Megapixel CCDPicture size 1600x1200, 1024x768 & 640x480, 2 quality Settings

Lens 7.1-35.5mm (38-190mm equivalent), 5x zoom, f/2.8 - f/8Focusing TTL Auto or manual focus, 8cm macroFlash Auto, forced, off. Range: 3mSensitivity ISO 100Shutter 1/6 - 1/600Modes Shutter priority, aperture priority, programmed AE, sound recordingViewfinder Via screenScreen 2" LCD

Storage Removable: Memory Stick (4MB supplied, holding 8 pictures at highest resolution/quality or 63 pictures at lowest resolution/quality)

Interfaces Serial & USB, Video out (Local)Batteries Lithium Ion rechargeableCompatibility Windows 95/98

Other features

Rotating lens; Solarize, B&W, Negative and Sepia effects; In-camera movie playback;

8

Page 10: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

D I G I T A L C A M E R A S

Dimensions 107(w) X 136(d) X 62(h)

Weight 435g excluding batteries

In-the-box Camera; NP-F10 battery; Charger/AC adapter; 4Mb memory stick; Cables (Serial, Video & USB); hand strap; Lens cap; User guide; Software kit.

Options Various Memory sticks; Fast charger; PCMCIA adapter.

Source: Digital Camera Company

PC Pro's verdict was that " Fantastic quality, a stunning macro mode and great value for money. The best digital camera buy we've seen yet."

Furthermore, the PC Pro review in issue 63 says:

Despite its strong branding in virtually every sector of consumer technological goods, Sony has never been big in cameras. The company's only offerings at the consumer end of the digital market - the Mavicas - couldn't challenge the likes of Nikon, Kodak and Olympus when it came to quality.

Its latest models have changed all that. At the top end of the range sits the DSC-F505, a 1,600 x 1,200 resolution camera with a 5x optical zoom lens - the biggest of any camera on test - and a digital zoom mode. It boasts all of the features you'd expect of a camera in its price bracket: manual focus, shutter and aperture priority settings, variable light metering modes and a high-quality macro mode. The latter allows you to shoot subject matter from a range of 8cm while keeping everything in the crispest focus.

The camera even allows you to shoot short MPEG movie clips, up to 15 seconds in length, just like its smaller cousin the DSC-F55E. Strangely enough, though, there's no optical viewfinder and you have to rely on the 2in LCD screen to frame your shots. While this is fine in most circumstances, in dimly-lit environments it makes it difficult to see what you're shooting.

As you can see from the results pages (see p120) and the close-up above, all of this goes together to produce some remarkable image quality. The level of detail and focus is impressive in all conditions, and the correct colour balance, using the presets and manual mode, is easy to achieve in all lighting conditions. The DSC-F505 was almost inseparable from the Canon PowerShot Pro70 and Olympus C-2500L in terms of quality, but just lost out in the end to the Pro70's slightly superior colour accuracy.

The DSC-F505 does, however, beat these two cameras in terms of value for money. In fact, its extremely reasonable price tag of £596 was enough to scoop PC Pro's new overall Winner award.

Comments from an imaging consultant"This sounds like a fascinating project! I have visited your web site, but haven't thoroughly read the complete proposal. However, in what I have read so far, this looks like a project that can benefit many fields.

As to which digital cameras to use, my initial thought is to utilize those with built-in microphones - thus eliminating the need for seperate image and audio recording devices.

"The second criteria would relate to the type of crime/quality of image. For instance, low-end crimes (car burglarlies, vandalism, minor assault) can be documented with low-end, point-and-shoot type cameras. However, major crimes may

9

Page 11: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

D I G I T A L C A M E R A S

require a higher quality digital camera such as thowe utilizing 35mm SLR camera bodies and relatively large chips. I am uncertain which category you are considering.

"Most US agencies are utilizing Kodak, Nikon, Epson and Olympus brand digital cameras. Some are using the Sony Mavica cameras - though I can't imagine this lasting too long due to the undependability of the floppy media and the low resolution images.

"On a seperate issue, I have an interesting idea on the extraction of the data itself. Since many in forensics (as well as most fields) use Adobe Photoshop for some of their imaging. And, since Photoshop supports the photojournalism NAA/IPTC records, perhaps the audio could be extracted into the IPTC record in Photoshop, then extracted from there to create the database. By placing the data in the IPTC record, it essentially becomes a part of the image file itself (the IPTC record is included in the header).

"Also, I am curious as to what kind of response you are getting to th concept in general. Although I have been using computers for imaging applications for a number of years, I still feel a bit distrustful of voice recognition, and even a bit odd in speaking to my computer or camera. I wonder if speech will be a more difficult transition than imaging in computers.

"If I can help with your project in any way, please don't hesitate to contact me. And, please keep me appraised of the progress of this project."

George Reis,

Ph: 714-775-3120

Free Radical Enterprises Fx: 714-775-2212

[email protected] http://www.free-radical.com

Digital Imaging, Consulting and Training

Specializing in Forensic Applications

Comments from Hayden B. Baldwin, Executive Director, International Crime Scene Investigator's Association"Your question on what kind of digital camera to use in a crime scene is difficult to answer. The reason I state this is because of the way cameras are used in crime scenes in general. Crime scene photography requires a camera that can be used for general scene photos, medium shots and close-

10

Page 12: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

D I G I T A L C A M E R A S

ups. The camera also must be able to take timed exposures and 1:1 photos of evidence for comparisons, i.e., tool marks, fingerprints, blood stain pattern droplets, etc.

"So far no digital camera that I know of is capable of doing all of that. However it can be accomplished by using the digital camera for the general and medium scene photos and use a regular 35 mm for the macro and timed exposures.

"I have a Nikon 950 digital camera and it is capable of taking extremely sharp macro photographs. The problem is that the adjustment for aperture settings is limited, thereby limiting the depth of field focusing which is of primary importance in photographing a fingerprint on a curved surface at the crime scene.

"I also have a Sony TRV 900 Digital Video camcorder that has the ability >to take outstanding professional video at the crime scene AND great still photography with the same camera. Digital imaging is a great way for law enforcement to use but it does have its limitations, but it also has great potential!

"To match what a 35 mm negative can do the digital camera must be able to photograph with a resolution in the 500 line range and over 4000 pixels. None are available at this time to my knowledge. The newest digital cameras are just over 3 meg in pixels, so we are getting closer. However the file sizes as TIF files are enormous and pose a storage problem in the field. (my Nikon 950 will only store 6 high resolution TIF images with a 40 meg memory card)

"The other problem in crime scene photography is the ability to enlarge the image for printing. For example, take the same photo with a 35 mm and a digital camera, enlarge both images by 10 times. There is no comparison, the 35 mm is a sharp photo and the digital image is pixelized.

"The solution from my viewpoint is to use a digital camera to replace where a "Polaroid" camera would be used or severely limit it's use to general and medium photos or to simply continue to use the 35 mm or medium format cameras with "film" and digitize the negatives. This last choice will give more flexibility to the printing an image.

"I have rambled on here a bit but hopefully I have provided you with some useful information. I agree with your project

11

Page 13: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

D I G I T A L C A M E R A S

and the potential of the digitized voice and photo system. I would be happy to help in any manner that I can."

Associated TechnologiesAside from the cameras themselves, there are a number of optional extras that have to be considered within the purchase. While there are only a few cameras that can make use of it, IBM provide a 340MB Compact Flash card which, at about 6MB per detailed image would store about 56 images in a single go. For Compact Flash, the current limit is around 128MB and 64MB for other cards.

IrTranP is a technology in the early stages. This allows for data transfer to be made over infra-red. The principle of the technology is of interest, but it is currently in the inception phases and should be considered only once matured.

ConclusionHaving considered the best options available within the price range, our conclusion is that for functionality and ease of use, the Nikon 950 was going to be the best current option. Following discussions with the Digital Camera Company, we also came to an agreement regarding a discounted price for this unit and accessories such as a flash-mounting bracket and the best value 96MB memory card, capable of storing about 191 low-resolution images and around 15-20 very high resolution images.

12

Page 14: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

P E R S O N A L D I G I T A L A S S I S T A N T S

Personal Digital AssistantThis chapter deals with the state-of-the-art in PDA technologies.

t present, PDAs exist in a variety of forms with a variety of input devices, storage capabilities and methods of synchronisation. We have already

discussed the quantity of storage required for digital images and currently PDAs are too restricted in this area to allow instant and automatic download and access for these images. Where these devices could be of assistance is in the SOC sketches. Being able to synchronise a sketch of a crime scene drawn by the SOCO at the scene, along with the camera images and the associated spoken description adds a further dimension to the association between language and vision. PDA technology will be considered if there is shown to be a need for it.

A

Chapter

3

Page 15: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

S P E E C H R E C O G N I T I O N T E C H N O L O G I E S

Speech Recognition

TechnologiesThis chapter deals with the state-of-the-art in Speech Recognition technologies.

Highly visible SR technologies have been on the market for a few years, with Dragon Dictate and IBM's ViaVoice amongst the best known. The intention will be to have both of these products evaluated, as well as technology available at the University of Sheffield. Only domain-specific training with these products will determine the most appropriate technology in this case.

Voice Recording TechnologiesThis chapter deals with the state-of-the-art in Speech Recording technologies.

n the field of voice recording, there have been a number of innovations made since expert commentary was captured on audio tape. These innovations have been

mainly in the digital world and it is these digital voice capture technologies that we seek to deal with here.

IMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes

Chapter

4Chapter

5

Page 16: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

V O I C E R E C O R D I N G T E C H N O L O G I E S

and sizes, along with the more recent MP3 recorders that claim to be able to record upto 60 minutes of spoken material and can then be plugged into the parallel port of a laptop or desktop computer.

With recorded sound, the main issue is sound quality. Digital sound tends to be less sensitive to background noise than would normal analog sound. The issue with these systems is going to be whether hands-free recording is going to be possible. MP3 recorders have the added bonus over CD, MD and tapes in that there is no separate recording media that needs to be considered. If separate recorded media is required, the MP3's can easily be archived off onto CD or tape for posterity. As with all such digital media, the link between the desktop machine and the recorder will be the essential element.

Digital Tape - Sony NT-2 - $1695.00From - http://www.voice-recognition.com/digitalrecorders.html

A Sony digital recorder can assist users who need mobility and convenience. Sony digital voice recorders allow busy executives, physicians, attorney's, journalist's, college students, or law enforcement to take notes, dictate reports, or memos on the go.

Connect the recorder to the sound card of the PC, play the recordings into your favorite voice recognition software program, the words are transcribed as if you had been right there dictating directly into the PC .

The Quality of your recordings count, that is why VRS uses the Sony NT-2 digital recorder. By using a recorder that is able to record at CD quality and a high quality noise canceling microphone, we are able to achieve crystal clear recordings for unmatched recognition accuracy.

The Sony NT-2 is small, lightweight, and uses a single AA battery. The NT-2 is our top pick for use in speech applications. Easy to use controls for frequent recording sessions. Ideally suited for busy professionals who are not

15

Page 17: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

V O I C E R E C O R D I N G T E C H N O L O G I E S

going to sit in front of the computer. This tiny digital recording wonder squeezes up to two hours of high-quality audio onto a postage stamp sized cassette. The NT-2 is the perfect recording solution when high quality, ultra-compact size and long recording times are all important.

Finally something that works the way you do. Busy executives, will appreciate the convenience. Have your memos and reports typed automatically into your PC. Just connect the recorder to the sound card of the PC, hit play on the digital voice recorder and watch the words appear. Instead of being tied to the PC the Sony digital voice recorder gives users freedom and mobility. The buttons on the Sony NT-2 are easy to use, the digital voice recorder is also very small, and lightweight.

Unmatched recording quality assures you of getting the highest recognition from your software. We recommend that our customers use NaturallySpeaking software from Dragon Systems. NaturallySpeaking voice recognition software supports multiple users, making the digital voice recorder an ideal solution for offices that rely on transcription services. Why waste valuable time and money generating endless lines of text? The digital voice recorder simplifies the entire process entering data into the PC.

Once the recording has been played into the NaturallySpeaking document proofreading the document is simple.

NaturallySpeaking has recorded speech capabilities that enable you to hear the words you dictated. Highlight any portion of the document to listen to the words being played back.

Just click on the incorrect word to open the correction dialogue box. Type in the correct word, your document, and NaturallySpeaking will be updated automatically. It's just that simple.

Specification of the Sony NT-2 digital voice recorder.

Digital Stereo with wide dynamic range provides low-noise low-distortion and playback with superior speech clarity.

Digital stereo recording/playback

Seamless auto-reverse

16

Page 18: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

V O I C E R E C O R D I N G T E C H N O L O G I E S

Feather-touch operation

Large size multi-function LCD display

Compact size and lightweight

5 3/4 oz. (160g)

4 3/8 " x 15/16 " x 2 1/2 "

Number of Channels: 2-Channel Stereo

Sampling frequency: 32kHz

Playback frequency range: 10 -14,500Hz

Playback Dynamic Range: Better Than 8db

Total Harmonic Distortion:< 0.08%

Sony NT-2 digital voice recorder $1695.00

Call 410-268-3200 to order. Click here to send email.

3105 Riverwalk Drive Annapolis, MD 21403 Ph: (410) 268-3200 Fax: (410) 268-9229

© Copyright 1999 VRS LLC

Olympus D1000 Digital Voice Recorder PACKAGE With ViaVoice Transcription Software - approx $289

From - http://www.zap1.com/oly_d1000home.html

With Miniature Card Removable Media

Olympus' new Digital Voice Recorder, the D1000, packaged with IBM's ViaVoice software, creates a whole new category of voice recorder. The D1000combined with IBM's ViaVoice, and Intel's Flash Memory

17

Page 19: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

V O I C E R E C O R D I N G T E C H N O L O G I E S

Miniature Card delivers a powerful solution for the mobile worker.

Users can dictate memos, reports, and other types of correspondences into theOlympus D1000 digital voice recorder where it is stored on the Flash MemoryMiniature Card. Once the audio is transferred to the PC, IBM's ViaVoicedictation product converts the recorded voice into text that can be edited,formatted and printed, just like any word processor file.

The D1000 will be shipped with a cable, carrying case, a customized versionViaVoice and a 2MB Intel Flash Memory Miniature Card and a PC AdapterCard. The removable miniature card and the Olympus PC Adapter Card (forPCMCIA slots) allows users to easily move audio files to a PC and convenientlyshare information with co-workers. The cable is for installation and enrollmentof a customized version of ViaVoice. An optional Miniature CardReader/Writer is available for desktop computer users.

User friendly, the D1000 comes loaded with many features including aremovable flash memory card, over-write recording, insert recording and LCDinformation display.

Ultra Compact Design - measures a mere 4.7" (H) x 1.81" (W) x .91" (D) andweighs only 6.0 oz. with batteries.

Recording Time - Record in Standard or Long, by setting the REC TIMEswitch. STANDARD recording is 16 minutes with a 2MB card and 33 minuteswith a 4MB card. LONG recording is 34 minutes with a 2MB card and 72minutes with a 4MB card. If using ViaVoice, you must be recording in theStandard mode. The D1000 comes standard with one 2MB card. Additional

18

Page 20: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

V O I C E R E C O R D I N G T E C H N O L O G I E S

2MB and 4MB cards can be purchased for increased storage. An 8MB card willbe available in the Spring of '98.

Silent Compression Voice Activation (SCVA) - A unique silence compressionalgorithm is incorporated dynamically selecting one of two bit-rates, dependingupon the incoming speech activity. This new SCVA will allow you to saveadditional memory space by coding silence at a lower bit rate.

Large LCD Panel for message and operating mode display.

Message editing made easy with three edit features; Over-Write, Insert, andErase. Digital recording accommodates an Insert Recording feature not possiblein analog recording. The Insert Recording feature is comparable to the abilityto add text in word-processing software, creating a natural and efficient way toedit voice recordings.

The Erase feature allows you to erase each file, part of a document or thewhole card. The Write Protection feature can be used for the entire card or foreach file to prevent accidental erasure.

Interface with a PC with a PCMCIA card adapter. Files can be transferred toany laptop or desktop PC which has a PCMCIA slot. Makes e-mailing voicefiles a reality. An optional Miniature Card Reader/Writer is available fordesktop computer users.

Fast-forward control will gradually speed up playback to 96 times normalspeed, but will stop automatically at the end of the file.

ViaVoice Transcription

A customized version of the IBM ViaVoice using the speech engine of ViaVoiceGold.

19

Page 21: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

V O I C E R E C O R D I N G T E C H N O L O G I E S

With a click of the button on the Olympus DSS Player, dictation begins.

Talk Naturally. With ViaVoice, there is no longer a need to leave a briefpause between words.

Improve Personal Productivity. High accuracy, speech recognitionsoftware. You talk naturally, it types!

Type Eyes-free and Hands-free. With ViaVoice, there is no need to stareat the screen or pound away at the keyboard. Just talk, it types! Whenyou're finished with your text, then go back and make corrections.

Dictate into SpeakPad. SpeakPad is ViaVoice's optimized speechrecognition word processing environment. From SpeakPad, text may betransferred to any application, on any operating system, which supportsthe Cut & Paste facility.

Dictate into MS Word. With IBM ViaVoice a user may dictate directlyinto the Microsoft Word, word processing environment.

Playback Your Dictation. The system offers complete audio playback ofa word, sentence or the entire text. This unique feature allows you tomake sure that sounds and words match.

Train Incrementally. ViaVoice allows users to train in small, easy steps.Try the first 50 sentences in the enrollment (training) process; it willtake less than 30 minutes and will significantly increase your dictationaccuracy.

Add Your Own Words. ViaVoice ships with a 22,000 word basevocabulary and the ability for you to add 42,000 of your own words.

20

Page 22: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

V O I C E R E C O R D I N G T E C H N O L O G I E S

Learn Continuously. ViaVoice adapts to you as you adapt to using it. Thesystem continuously learns how you say and use words. If you dictate,correct and update, your dictation accuracy will just keep on gettingbetter and better!

System Requirements:

Windows 95

Pentium 133 MHz or higher

32 MB RAM

CD ROM drive

125 MB hard disk space

Sounder Blaster 100% compatible

Line-in input support

From - http://www.olympus-europa.com/voice_processing/dvr/d1000/detail.stm

What functions can the D1000 perform?

- SCVA (Silent Compression Voice Activation) - voice-controlled recording with adjustable microphone sensitivity (like VCVA with analog devices)

- lndexing - automatic date time recording, automatic dictation numbering (= E lndex) and additional indexing possible

- 2 recording times, Standard and Long mode, which influence the length of the storage capacity

- Dictation/conference switch-over adjustable microphone sensitivity

- Transport lock (Hold function)

Digital Speech Standard

21

Page 23: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

V O I C E R E C O R D I N G T E C H N O L O G I E S

What storage media are available for DSS voice files?

At present, there is the PCMClA card (also called PC card) which is used in Grundig and Philips devices and the miniature card used in the Olympus D1000.

What is the advantage of DSS?

DSS voice files require much less memory than, for example, the .WAV files previously used for speech recording, while retaining the same sound quality. This is particularly advantageous when sending data by e-mail. DSS files also contain additional information, e.g. date and time of recording, as well as an lD number, i.e. who made the recording.

How do you rate the future of DSS?

DSS will establish itself as the universal standard as there has not previously been a uniform standard for dictations, and other major manufacturers in this sector have also expressed an interest in working with this standard.

Downloading voice files to a PC

How are the voice files transferred to the PC?

- This requires various accessories and special software for downloading the .dss files from the miniature card to

- a laptop there is the CA1 PC card adapter (supplied with the Laptop PC Kit)

- a PC there is the reader/writer device for connection to the parallel port (supplied with the Desktop PC Kit).

- The software is called "DSS Player" and is included in each accessories kit.

lt is also possible to read in voice data analogously using a cable (3.5 mm jack plug) connected to the "line-in" socket of the PC, for voice recognition only.

Please note - only for PCs with a line-in socket. This will not work with the microphone socket.

What are the benefits of downloading digital DSS voice files as compared to downloading by cable connection?

22

Page 24: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

V O I C E R E C O R D I N G T E C H N O L O G I E S

1. lt is much faster, because the miniature card downloads the voice files via the PC card adapter within seconds. With the cable using the line-in option a voice file is converted into text in the PC direct from the miniature card. This requires the real time equivalent to the duration of the text.

2. The voice files downloaded in DSS format can be archived as voice files in the DSS Player. The speech, which has been converted directly into text via cable using the ViaVoice Transcription software, does not exist as a voice file in the PC, but remains stored on the miniature card only.

3. The recognition accuracy of digital voice files is higher.

DSS-Player Software

What is DSS Player software and what is it used for?

lt is software developed by Olympus especially for DSS voice files recorded using a Digital Voice Recorder.

- lt is used mainly to load these DSS voice files on to the PC.

- It serves to archive voice files.

- lt can be used purely as playback software with the usual functions such as Play, FF, REW, Erase, Store, etc.

- lt can read additional information of voice files recorded with the D1000, e.g. user lD or indexing.

- Using the OLE function, voice files can be moved to other applications, e.g. to send a text document with a voice appendix by e-mail.

Can DSS Player also convert and process other voice files?

DSS Player can convert DSS files into .WAV files, but .WAV files cannot be converted into DSS.

Can DSS Player software also be used for voice files from other manufacturers?

Yes, but only if the voice files have been recorded in DSS Standard and the PC can read the storage medium, such as the PCMClA card S-RAM from Grundig and Philips.

What else can DSS Player software do?

23

Page 25: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

V O I C E R E C O R D I N G T E C H N O L O G I E S

- The new Version 2.0 is designed so that a voice recognition function can be called up in combination with lBM ViaVoice technology. This means that every voice file recorded in Standard mode can automatically be converted into text.

ViaVoice Transcription Software

What is ViaVoice Transcription software?

ViaVoice Transcription software is powerful voice recognition software based on lBM ViaVoice technology. The software has been adapted so that it can recognise DSS voice files. lt works with context checking and is speaker-adaptive, i.e. it learns from and adapts to the user. Texts can be corrected later, allowing the user to concentrate fully on dictating.

How does the D1000 ViaVoice Transcription System work?

The user speaks into the D1000 Digital Voice Recorder. The voice files stored on miniature card are transferred to the PC. Using the ViaVoice Transcription voice recognition software, voice files can be converted directly into written texts.

For which users is ViaVoice Transcription most suitable?

a) For anyone who has to write a lot of text documents but is not very often at his or her workplace. This applies in particular to people who travel a lot and would like to dictate while on the move.

b) For anyone who does not like typing very much.

Can the ViaVoice Transcription software be used by different users?

Yes, the voice files of any number of users can be converted into text using ViaVoice Transcription. However, a profile should be created for each individual user, i.e. his or her enrolment must be stored on the PC.

ls it possible to transfer the user data to a different device?

Yes, the data can be stored as "back up" in the Enrolment part of the program. With the command "Restore" the data can be transferred onto another PC.

How much vocabulary does ViaVoice Transcription use?

24

Page 26: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

V O I C E R E C O R D I N G T E C H N O L O G I E S

ViaVoice Transcription uses approx. 30,000 words in its basic vocabulary. This can be built up to 64,000 words by the user. ln addition, ViaVoice Transcription has a dictionary with approx. 320,000 words - back-up vocabulary that includes spell-checking and pronunciation.

Other Functions

What does Vocabulary Expander do?

With the "Vocabulary Expander" auxiliary program the user can quickly incorporate words and complex sentences in the activated basic vocabulary and thereby improve recognition accuracy. A loaded text is analysed and words added to the user's personal vocabulary. At the same time, the sentence structures are incorporated in the speech model. For words which have no speech pattern in the back-up vocabulary the pronunciation must be added in order for them to be learnt correctly.

What is the Vocabulary Manager?

The "Vocabulary Manager" auxiliary program calls up the list words added by the user. lndividual words, such as proper nouns, can be added here. lt is also possible to delete words which are misspelt or no longer to be used.

What are dictation macros and what are they for?

A dictation macro is a stored voice command with which a text element can be added into the dictation. A lot of time can be saved by using a macro as an abbreviation for a frequently occurring text. For example, it is possible to call up addresses and salutations, or even an entire letter of thanks by dictating "Thank you".

What does text -to-speech mean?

lt is possible to have a text, regardless of whether it has been dictated or loaded on to the PC by other means, read out by a computer voice. This can be of help if, for example, the user is too tired to read.

What is the SpeakPad?

SpeakPad is a word-processing program which is opened automatically whenever the voice recognition function is selected in DSS Player. The written text is displayed in this

25

Page 27: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

V O I C E R E C O R D I N G T E C H N O L O G I E S

window and can be formatted, stored, printed or transferred to other applications. Another possibility is to open first the SpeakPad and then select the voice file which is to be converted into text.

What help functions are there?

Help functions installed with ViaVoice Transcription are:

- Please read first - important introductory notes

- Dictation trainer

- Frequently used commands

- Online help

For Dictating

ls a particular way of speaking required when dictating?

Basically, no. The user can dictate continuously, i.e. it is not necessary to pause between words. However, clear enunciation is important and the running on of word endings should be avoided.

At what speed can dictation take place?

More than 140 words per minute can be dictated.

What must the user remember when dictating with the D1000?

Voice recognition

- works only if recording is done in "Standard Mode".

- is possible only with recordings made in quiet surroundings. Conference rooms, lectures, conversations or interviews are not suitable for direct conversion into text.

- The microphone of the D1000 should be held about 5 cm away from the mouth, but make sure that breathing noises are not recorded.

Languagesln what languages is ViaVoice Transcription available?

26

Page 28: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

V O I C E R E C O R D I N G T E C H N O L O G I E S

ViaVoice Transcription for the D1000 is available in British and American English, as well as French and German.

PC-RequirementsWhat are the minimum PC requirements?

We recommend Pentium 166 MHz or 150 MHz, 32 MB RAM, 115 to 130 MB free hard disk memo (depending on the language version) and Creative Labs Sound Blaster 16 or 100% compatible. PC with "line-in" socket and CD-ROM drive for installation. A 133 MHz Pentium is sufficient if DSS voice files only are used for voice recognition.

Available addons - http://www.zap1.com/oly_d1000home.html

Mic Model 77712(3.5mm jack)Koss clip-on external mic with 5' cable. Good Quality.

For models: E.W. Bridge SPE-25J, SPE-50J, Dragon Recorders (VoiceIt VTR Recorders), Olympus D1000 & other devices that support 3.5mm jacks.

Mic Model 77716(3.5mm jack)Tiny hideable clip-on external mic, 4' cable. Best Quality

For models: E.W. Bridge SPE-25J, SPE-50J, Dragon Recorders (VoiceIt VTR Recorders), Olympus D1000 & other devices that support 3.5mm jacks.

Digital Voice Recorder DS-150 - $179

From - http://www.aaacamera.com/olympus_digital_voice_recorder_ds150.html

One of the slimmest, most compact recorders ever produced, the DS-150 Digital Voice Recorder has 8MB of built-in memory that offers up to 160 minutes of recording time. Variable Control Voice Actuator (VCVA) technology optimizes built-in memory by ceasing record functions when the microphone senses silence. Two folders store up to 99 messages each. Messages can be moved from one folder to another for easy organization. Downloading speech files to the PC is a breeze. Simply connect the recorder to your computer,

27

Page 29: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

V O I C E R E C O R D I N G T E C H N O L O G I E S

with the optional USB data transfer kit, and select files to download. You can download the entire contents of the recorder, one specific folder or even one specific recording.

Features:

160-minute recording time

Two folders for organizing notes under separate subjects with 99 messages per folder

LCD display shows message number, real-time counter, time remaining, current time and date, and operating modes

Two recording modes (standard mode-75 minutes; long mode-160 minutes)

Voice activation - will only record when the microphone senses sound

External earphone jack lets you use an earphone or headphone for private review of recordings

External microphone jack for use with external noise-cancellation microphones

Fast playback: 30% faster than real time

Partial erase function erases one message at a time or an entire folder at once

Specifications:

Recording format:Digital recording in accordance with IVA digital speech standard

Recording media: Built-in flash memory

Specified input level:

-70 dBv

Overall frequency response

SP mode: 300 to 5,000 Hz. LP mode: 300 to 3,000 Hz.

Sampling frequency

SP mode: 12 kHz. LP mode: Approx. 160 min.

28

Page 30: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

V O I C E R E C O R D I N G T E C H N O L O G I E S

(1 message continuous recording)

Recording time: SP mode: Approx. 75 min. LP mode: Approx. 160 min.(1 message continuous recording)

Folders: 2 (99 messages per folder)

Speakers: Built-in 28 mm round dynamic speaker

Microphone jack: 3.5 mm mini-jack, impedence 2kW

Earphone jack: 3.5 mm mini-jack, impedence 8W

Maximum output (3V DC):

100 mW or more (8W speaker

Power supply: Rated voltage: 3v DCBatteries: Two "AAA"-size batteries (LR03 or R03) (not included)External power supply: AC adapter

Battery life (recording):

Approx. 10 hours (according to Olympus tests using alkiline batteries)

Size: 4.6 (H) x 1.7 (W) x 0.6 (D) inches (without protrusions)

Weight: 2.6 oz. (/4 grams) (including batteries)

Based on the information available - and the limited information currently regarding MP3 recorders - the decision to purchase an Olympus D1000 with an extra 8MB memory card along with the laptop package (PCMCIA interface) and IBM ViaVoice was made.

29

Page 31: Elegant Manual - University of Surrey  · Web viewMechanisms for capturing the spoken word include: Compact Disc writers, Mini-Disc writers, digital tapes of various shapes and sizes,

30