Masthead.jpeg (68003 bytes)

    Number 69, December 2007


TO OUR READERS, CUSTOMERS, EMPLOYEES, AND RESELLERS . . . .


*****
HAPPY HOLIDAYS AND BEST WISHES FOR THE NEW YEAR
*****


USE SPEECHPROFESSIONAL WITH OFF-THE-SHELF USB FOOT CONTROL

No need for a high-cost "certified" foot control" with this system.  Why pay more . . . .

SpeechProfessional™ software suite includes workflow manager, server-based speech processing, and dictation and transcription desktop utilities.  The included SpeechMax™ session file editor for dictation, transcription, and speech recognition supports dictation audio playback while highlighting text to verify accuracy.    It can be used with
an off-the-shelf USB Infinity transcriptionist footpedal or other HID compliant USB device.  The company also offers a "2 for 1" package that includes two session file editor licenses plus USB foot control for only $289.  Call Deb at 219-662-3800 by January 11, 2008 and get this two-for-one package for just $189.  Offer valid for single purchase only.  Not available to employees, resellers, or with other discount packages.

PURCHASE DRAGON MEDICAL AND OTHER EDITIONS AND SAVE

Dragon NaturallySpeaking 9

Best buys for health care, law, business, schools, and government

Medical $1,199 (reg. $1,599)
Legal $899 (reg. $1,199)
Professional $699 (reg. $899)
Preferred $179 (reg. $199)
Medical Small Practice Edition $899 (reg. $1,199)
Medical Small Practice Edition 5 user Value Pack $3,899 (reg. $5,499)
Runtimes Medical $699 Legal $699 Professional $499*


* Dragon speech recognition dictation-only runtimes may be purchased from Custom Speech USA only for use with the company's products.  Electronic Help manual is included with the runtime, but not voice commands, printed manual, or headset microphone.  See other products for pricing and more information.

We also offer upgrades, value packs, bundles, and open license programs.  Call 219-662-3800 TODAY for a FREE quote, or use the Online Information Form to get more information about Dragon and other third-party solutions.

Say I Can teaching CDs for Dragon NaturallySpeaking also available.

ECONOMICS OF SPEECH RECOGNITION EDITING FOR RADIOLOGY


Radiologists are among the most common users of speech recognition.  However, William R. Reinus, MD, MBA,
Temple University Medical Center (Philadelphia, PA) warned that radiologist productivity may be reduced by up to 25% if hospitals force radiologists to correct their own speech recognition reports.  Dr. Reinus also referred to potential lengthy review time by transcriptionist to listen to the dictation and locate words misrecognized by the speech engine.  In dealing with alternatives, he further cautioned that there may be limitations with use of structured dictation and templates, such as "modification of templates for studies with near-normal results."  Many clinicians, he said, "expect customized reports for their patients' studies, particularly more sophisticated studies." Economics of Radiology Report Editing Using Voice Recognition Technology (Journal of American College of Radiology December 2007)

Howard P. Forman, MD, MBA, a radiologist from
Yale University Medical Center responded in a companion article that his institution's "transcription services, whether homegrown or outsourced, would take anywhere from 2 hours to 2 weeks to deliver a transcribed report. Even in efficient settings, reports might not be edited and signed on the same day as they were dictated." Re: "Economics of Radiology Report Editing Using Voice Recognition Technology" (Journal of American College of Radiology December 2007) He further indicated that he would prefer to edit his own reports, and send them off immediately to a referring clinician without waiting for correction by an
assistant.

SpeechProfessional™ software suite add-on for Dragon, IBM, and Microsoft speech recognition can make a radiology practice more efficient by dealing with the issues raised by the Dr Reinus and Dr. Forman.

1. Server-based speech recognition with editor correction.  With server-based systems and offline correction, there is no significant change in radiologist workflow.  The dictation audio is routed to a server-based system.  A preliminary report is generated by "computerized transcriptionist."   Speech recognition systems are up to 99% accurate.   Reports are edited by a transcriptionist editor and returned to the radiologist for approval and digital signature, as with manual transcription.  Use of foot control and WordCheck™, advanced text comparison techniques, described below, improve editor efficiency and reduce turnaround time.  The company's unique "dual-engine" and "multi-engine" approach has been reviewed in health care, legal, and general technology journals.

2. Real-time speech recognition available also.  In some settings, such as requests for a STAT reading, that rapid turnaround is desirable.  With our system a radiologist can use real- time or server-based technology, depending upon the situation and personal preference. 

3. Easy implementation of customizable, study-specific and doctor-specific structured dictation and templates. Structured dictation may be used with "fill-in-the-blank" electronic forms. Template reports may include preset reports that apply to common clinical situations or imaging findings, such as a "normal CT Brain".  With both structured dictation and templates speaker generation time, speaker or transcriptionist editing, and basic errors such as dropped words (such as "no") can decrease.  SpeechProfessional™supports both structured dictation and templates using both server-based and real-time speech recognition, as well as for manual transcription. These normals and templates may be customized for particular imaging studies or referring clinicians.

4. RIS/HIS and EMR integration. From the outset, the Custom Speech  workflow system with SpeechProfessional™ was designed to support easy integration with third-party solutions. Most recently, the company has integrated Dragon server-based speech recognition with a JAVA-based medical query system at
Columbia University Medical Center in New York City. Similarly, a large multispecialty medical clinic in Billings, Montana has added Dragon server-based speech recognition as a back-end add-on to a DVI dictation and transcription workflow. A
gastroenterology test site in
Queensland, Australia integrated back-end Dragon server-based
workflow to a third-party dictation system. Other projects have included speech recognition for a
Larchmont, NY radiology imaging center, creation of custom templates for use with real-time speech recognition and preexisting database structure at the Los Angeles County Coroner's Office, and HL-7 integration with speech recognition system for a large Hagerstown, MD community hospital.  In each of these cases, Custom Speech expertise and experience, scripting tools, software development kit, and open database engine architecture facilitated rapid and successful integration with a third-party system. For more information see Case Studies.

5. Unique features to improve efficiency and privacy. Custom Speech provides state of the art technology AND novel features not available with other systems.

****  
WordCheck™ using dual or multiple speech engines for faster review of dictation audio. U.S. and foreign patented SpeechMax™ WordCheck feature can reduce review time by identifying probable errors and highlighting these for review by the transcriptionist. It does this by comparing synchronized text output by two or more speech recognition engines. Differences indicate possible errors. Agreement suggests accuracy. If there is 100% speech engine accuracy, there are no differences between the speech engines. With no differences, audio review time of a 1 minute or 1 hour file could be reduced to essentially zero before sending the text for physician approval. Where accuracy is less than 100%, differences will require review. But by tabbing through the document listening primarily to differences where the speech recognition engines are relatively accurate, the correctionist can shorten audio review time and more quickly produce an accurate document for review.  

**** 
Speaker-specific speech recognition for high accuracy.   Conventional speaker-dependent speech recognition systems use speech from many different speakers with different accents to create a speaker-independent speech user profile. Enrollment training (reading a 15 minute or half hour script) adapts the speaker-independent speech user profile to make it more speaker dependent.  The SweetSpeech™ speaker-specific model is based upon the speech, accent, background and channel noise, and vocabulary of only a single speaker. It reflects actual accent, speaking habits, and actual noise of the system, not a mathematical approximation of it based upon adaptive training. As a result, the system may be highly accurate. "Initial results show that by changing the focus to MSSR, word error rates can drop very significantly. In comparison with speaker-adaptive speech recognition system, massively speaker-specific speech recognition also performs better since model parameters can be tuned to be suitable to one particular individual." Y. Shi & E. Chang, "Studies in Massively Speaker-Specific Speech Recognition" (IEEE 2004)

****
ScrambledSpeech™ to protect patient confidentiality. Department of Veterans Affairs Office of Inspector General, Audit of Veteran's Health Administration's Acquisition of Medical Transcription Services (VA Office of Inspector General June 14, 2006) audit showed that confidential VHA information was being transcribed overseas and that terrorists and others potentially had access to medical data identifying U.S. service members and their families.  Using with SpeechMax™ session file editor, to protect privacy, a user selects a speech recognition or manually-transcribed session file or dictation audio. The user divides the parent session file into shorter two or more child session files. Order of audio or text of each child session file is scrambled. The operator sends each child session file to a different transcriptionist or editor for processing. The operator then merges and unscrambles completed child session files to generate final document or report. The major benefit is that it limits transcriptionist or editor access to confidential information.
This can benefit any HMO, hospital, or health system that outsources transcription.  View the video demo. And the session file editor software
supports lock/unlock of session files to prevent unauthorized editing of the final document (document protection).

HOW DOES TEXT TO SPEECH WORK?

Best-selling acWAVE™ audio conversion software includes FREE text to speech.  Use with AT&T Natural Voices Text to Speech and other Microsoft SAPI 5.x compatible voice fonts.  Using this tool, you can generate voice messages, telephone prompts, and other life-like sounding records.  A June 2007 SmartComputing article explains how the technology works.

"Speech synthesis, or the act of generating artificial speech using machines or software, is the fundamental technology behind TTS.  Research on the subject predates modern computing.  Homer Dudley’s speech synthesis machine, named Voder, could generate human-like sounds, and with the help of a trained operator, serenaded attendees at the 1939 World’s Fair with a warbled rendition of “Auld Lang Syne.” I n the 1950s, scientists used electronics designed to emit sounds at frequencies similar to those produced by the human vocal tract, but the results still sounded thoroughly inhuman.

Several decades later, Bell Labs researchers took another approach, called concatenative synthesis.  This method employs a limited amount of prerecorded speech split up into its fundamental vowel and consonant sound units, or phonemes, which are then mixed and matched to produce unique words and sentences.  The result was a much-improved ability to produce intelligible speech on the fly, but because the number of phonemes being used was rather limited, these systems sounded less than convincing.

As computing hardware improves, TTS algorithms are becoming better able to search for and splice together a greater number of phonemes, which has led researchers to focus more attention on sounds as they occur in context with adjacent sounds, varying pitch in a given sentence, and the stressing of sounds in certain words.  As a result, current TTS software can mimic human speech in a much more believable way than its predecessors."

Click here to review the complete article.


Custom Speech USA, Inc.
3 North Court Street, Suite B365Crown Point, IN 46307
219-662-3800 (v)
219-662-3877 (f)
www.customspeechusa.com
info@customspeechusa.com
We teach computers how to listen®

Online Information Request Form
_____________________________________________________

You are receiving this copy of the Custom Speech USA, Inc. newsletter because you have expressed interest in our products and services.  If you have been sent a newsletter in error, or would like to be removed from the list, please click here

Price, terms, specifications, and availability are subject to change without notice.  Custom Speech USA, Inc. trademarks are indicated. Other marks are the property of their respective owners.  Click here for
Online Privacy Policy

Dragon and NaturallySpeaking® are licensed trademarks of Nuance® (Nuance Communications, Inc.)