Last updated: August 2, 2011
Suggestions for additional resources
The EMMA Standard
EMMA (Extensible MultiModal Annotation) is a standard published by the W3C for representing multimodal user inputs, including speech, text, and handwriting.
NL Workbench: simple tools for exploring statistical language models and tagged grammars, including an open source implementation of EMMA developed by Conversational Technologies.
Speech Technology Resources
A directory of speech technology related websites.
Open Directory for Speech Technology
Standards and Information about Standards
World Wide Web Consortium: http://www.w3.org
VoiceXML tutorials and training
Quick Guide to the XML SRGS Grammar format (download, 48k)
IETF SpeechSC: The SpeechSC Working Group is developing protocols (Media Resource Control Protocol, MRCP) to support distributed media processing of audio streams: http://www.ietf.org/html.charters/speechsc-charter.html
Books
Abbott, K. R. (2001). Voice Enabling Web Applications: VoiceXML and Beyond, APress.
Andersson, E. A., S. Breitenbach, et al. (2001). Early Adopter VoiceXML. Birmingham, UK, Wrox Press.
Balentine, B. and D. Morgan (1999). How to Build a Speech Recognition Application. San Ramon, California, Enterprise Integration Group.
Balentine, B. (2007). It's Better to Be a Good Machine Than a Bad Person. Annapolis, MD, ICMI Press.
Beasley, R., K. M. Farley, et al. (2002). Voice Application Development with VoiceXML, Sams.
Dahl, D., Ed. (2005). Practical Spoken Dialog Systems. Springer-Verlag.
Gardner-Bonneau, D. (1999). Human Factors and Voice Interactive Systems. Boston, Kluwer Academic Publishers.
Harris, R. A. (2005). Voice Interaction Design, Morgan Kaufmann.
Hocek, A. and D. Cuddihy (2002). Definitive VoiceXML, Prentice-Hall.
Kotelly, B. (2003). The Art and Business of Speech Recognition. Addison-Wesley.
Larson, J. A. (2002). VoiceXML: Introduction to Developing Speech Applications. Upper Saddle River, New Jersey, Prentice Hall.
Meisel, W. (2006). VUI Visions: Expert Views on Effective Voice User Interface Design, Trafford Publishing.
Miller, M. VoiceXML: 10 projects to voice-enable your web site, John Wiley and Sons.
Reeves, B. and C. Nass (1996). The Media Equation, Cambridge University Press.
Sharma, C. and J. Kunin (2002). VoiceXML. New York, John Wiley and Sons, Inc.
Shukla, C., A. Dass, et al. (2002). VoiceXML 2.0 Developer's Guide: Building Professional Voice-enabled Applications with JSP, ASP & Coldfusion. New York, McGraw-Hill Osborne Media.
Speech Recognition Technology
Text to Speech Technology
Speech Analytics
Speech analytics refers to software that analyzes recorded or live speech to extract useful information from it. Examples include keyword spotting in broadcast speech and analysis of calls between customers and agents in a call center. Some companies that work in this area include:
Speaker Verification
A biometric technology for verifying that people are who they claim to be, based on characteristics of their voice.
VoiceXML Information
University Research Centers