Unlock the Power of Your Voice™

Speaker-specific speech recognition can be more accurate . . .

--Create accurate speech user profiles from day-to-day dictation
--Profile created "behind the scenes," no speaker enrollment required
--Unlimited speaker license available 


The primary emphasis in research on speech recognition, text-to-speech, and other speech and language processing has been on working with data from large numbers of speakers or authors.  In speech recognition, the focus has been on speaker-independent models, or on speaker-adaptive models that use speaker-independent data and are adapted to a speaker's voice through enrollment and corrective training.  Some have proposed focusing instead on the individual speaker and using "massive amounts of speaker-specific training data recorded in one’s daily life. We call this Massively Speaker-Specific Recognition (MSSR) . . . . Initial results show that by changing the focus to MSSR, word error rates can drop very significantly.  In comparison with speaker-adaptive speech recognition system, MSSR also performs better since model parameters can be tuned to be suitable to one particular individual."
Y. Shi & E. Chang, "Studies in Massively Speaker-Specific Speech Recognition" (IEEE 2004)

Here is your toolkit for speaker-specific processing!

Editions:

Full -- includes full speech and language toolkit
Audio Segmentation (SpeechSplitter™) -- audio/voice/music segmentation
Lexicon/Language Model -- (WordPronounce™/WordContext™)

The Windows-based SweetSpeech™ toolkit (patents pending) includes Model Builder to create an acoustic model, language model, and lexicon for speaker-specific speech recognition.  The SAPI 5.x-compatible toolkit also includes a phonetic pronunciation generator and a regular-expressions application for forward and reverse formatting of text.  Using day-to-day dictation audio and transcription, create one or more speech user profiles from audio and text data that were formerly discarded as a useless byproduct of the dictation process.  The system supports Unicode and end-user adjustment of processing parameters.  SpeechServers™ SAPI 5.x is included in the same install kit as SweetSpeech™, but is separately licensed.

The speech engine may be used with the SpeechServers™ user and file management for back-end, server-based speech recognition.  Plugins for the SpeechMax™ HTML session file editor support real-time interactive speech recognition with the system and local, client-based transcription of an audio file.  The session file editor may also be used in the preautomation (training) phase to generate training data for the speech user profile, and in the automation phase to edit the results of server-based or real-time speech recognition.  Tools for audio segmentation, language model, and lexicon may be ordered separately.  Use the toolkit with the SpeechProfessional™ software suite and purchase at a discount as an EnterpriseSpeech™ add-on.

The SweetSpeech™ toolkit is a "best-buy" value.  Compare with conventional off-the-shelf, large-vocabulary continuous speech recognition systems.

 

Price, terms, specifications, and availability are subject to change without notice. Custom Speech USA, Inc. trademarks are indicated.   Other marks are the property of their respective owners.