helping

tranfree issue 25 - 5th February 2001

 

What's All The Talk About?

Where To Spend Your Money In Speech Recognition

by Michael Benis

Why Bother?

The why of speech recognition is very simple...

most of us can talk faster than we can type

...so it increases our productivity. In addition, the fact that you can move around because you're not tied to your keyboard helps keep you feeling fresher even when you're under pressure.

As a result, you can achieve higher productivity for more hours in the day without falling to pieces. Perhaps most important of all...

it's great protection against (RSI) repetitive strain injury

...and one of the few sure-fire ways of getting over it.


Does It Work?

Yes, but you have to get everything right...

  • your computer
  • sound card
  • microphone
  • speech recognition product
  • how you use the system

Go through every step methodically and you'll end up...

...grinning from ear to ear as you lean back in your chair, feet up on your desk merrily chatting away to your computer.

But don't rush things. It will take you around a month before you're using your system with complete success and confidence.

Although you'll find more than enough useful pointers in my old ITI Bulletin articles on the translatortips.com site, there really is no substitute for the certainty, speed and peace of mind you'll enjoy if you can actually find someone to give you first-hand advice and one-on-one training. But be careful...


All Experts Aren't Alike

Translators aren't typical of most speech recognition users. Not only do we generate large amounts of text, but the vocabulary and style of the texts we dictate vary greatly.

As a result, choices that may be acceptable to less demanding users are very much less acceptable where we are concerned. So ignore what you read in the high street magazines. If you opt for training, go for someone who has been recommended by a fellow translator. Above all, follow this advice...


The Basics

To be really happy, you want at least...

  • 500 MHz processor,
  • 128 MB RAM
  • Turtle Beach Montego or Sound Blaster Live! soundcard

If you're buying new, get as close as you can to doubling those specifications (processor speed and RAM) before the price jump per MHz starts getting silly.


Dragon NaturallySpeaking (Version 5)

NaturallySpeaking was the first continuous speech recognition system on the market and still leads the field. It's the easiest to use, gives you the highest recognition accuracy and is the only system that allows you to correct entire sentences at a time.

What's more, it offers a whole range of commands that make dictation, correction and navigation faster than the competition. Unfortunately, the latest version is not an improvement in every respect, although Dragon is working to sort things out.


IBM ViaVoice (Release 8)

ViaVoice is close on the heels of NaturallySpeaking for recognition accuracy. But for ease of use, stability and productivity when dictating long texts, it definitely trails in second place.

That said, it remains NaturallySpeaking's only serious competitor and unquestionably offers better value for money in its Professional version.


Lernout & Hauspie Voice Xpress Pro (Version 5)

Although a considerable improvement on its predecessor, Voice Xpress quite simply isn't a serious contender for translators. The main reason is that it won't play back what you said. (All the other products do this like a dictaphone).

Since speech recognition errors can be very different from what you actually said, you'll spend a lot of time scratching your head trying to remember with this product. That's wasted time in which you could have simply got on with the job.


Philips FreeSpeech 2000

An attractively-priced product, offering a choice of languages, the ease of use and recognition accuracy of FreeSpeech 2000 simply don't compare with NaturallySpeaking and ViaVoice.

It also suffers the disadvantage that you can only play back and correct what you said in a series of separate operations, although you can do this very quickly if you invest in the splendid SpeechMike Pro.


Where To Put Your Money

If you're already using IBM ViaVoice and are happy with it, then you'll be delighted with the upgrade to Release 8.

If you're using Dragon NaturallySpeaking Version 4, hang onto it and wait for Dragon to iron out the bugs in Version 5.

Likewise, if you're looking to buy your first system, search out one of the remaining copies of Version 4. You'll save yourself a tidy sum and will be able to upgrade to Version 5 when it actually represents an improvement in every respect. But don't worry if your dealer can only lay their hands on Version 5, it's still the best choice for translators.

Don't forget that you'll need the much more expensive "Professional" version of NaturallySpeaking if you want to integrate speech recognition with Translation Memory.

That's particularly true of Trados, for which you have to write macros that drive the menus if you want to avoid damaging its tags. With any of the other Translation Memory programs on the market, you can obtain the results you want by just dictating their keystroke shortcuts in NaturallySpeaking Preferred.


Good Luck And Let Us Know How You Get On

So you've decided to treat your hands to a lazy 2001. What next?

  1. Go through the enrolment procedure slowly and methodically, but don't do much more than the bare minimum
  2. Get used to dictating for several days, noting which style works best
  3. Then enrol again

This time, read several texts and feed as many old translations as possible into the "Vocabulary Builder" or its equivalent.

After a month or so, let us know how you got on.

mb@tranfree.com

The more feedback we receive the better the information we can provide.


Michael Benis - Freelance Copywriter, Interpreter, Journalist
Translator of French / Italian / US -> UK English
EC Freelance Translator - Approved Police Interpreter


 

Click here to return to tranfree 25 main page