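Answer: I need to work with speech, so I looked up some material.
Java Speech API: http://java.sun.com/products/java-media/speech/
The Java Speech API lets developers integrate speech technology into the user interfaces of Java applets and applications. The API defines a cross-platform interface to support command-and-control recognizers, dictation systems, and speech synthesizers.
The overall API is divided into several specifications:
Java Speech API Specification (JSAPI)
Java Speech API Programmer's Guide
Java Speech API Grammar Format Specification (JSGF)
Java Speech API Markup Language Specification (JSML)
Sun provides no reference implementation of this API, but there are many third-party implementations, including Speech for Java (available from IBM alphaWorks), which uses ViaVoice to support voice command recognition, dictation, and text-to-speech synthesis.
Official material from java.sun.com: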
Question Index
Where can I get the Java Speech API (JSAPI)?
The Java Speech API (JSAPI) is not part of the JDK and Sun does not ship an implementation of JSAPI. Instead, we work with third party speech companies to encourage the availability of multiple implementations.
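Since JSAPI is not bundled with the JDK, an application may want to check at startup whether a third-party implementation has put the javax.speech classes on the classpath. The following is a minimal sketch using only the standard library; the class name JsapiCheck and the helper isClassAvailable are made up for illustration, and javax.speech.Central is assumed to be the JSAPI 1.0 entry-point class.

```java
public class JsapiCheck {

    /** Returns true if the named class can be located on the current classpath. */
    public static boolean isClassAvailable(String className) {
        try {
            // initialize=false: only locate the class, do not run its static initializers
            Class.forName(className, false, JsapiCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Assumption: javax.speech.Central is the JSAPI 1.0 entry point for locating engines
        boolean present = isClassAvailable("javax.speech.Central");
        System.out.println(present
                ? "JSAPI classes found on the classpath."
                : "JSAPI classes not found; a third-party implementation is needed.");
    }
}
```

Looking the class up by name rather than importing it keeps the program compilable on a plain JDK, which matters here because the JSAPI classes only arrive with a separately installed implementation.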
What is the Java Speech API (JSAPI)?
The Java Speech API allows Java applications to incorporate speech technology into their user interfaces. It defines a cross-platform API to support command and control recognizers, dictation systems and speech synthesizers.
When will the JSAPI specification be available?
The Java Speech 1.0 API specification was released on October 26, 1998, and is available at http://java.sun.com/products/java-media/speech/
What does the Java Speech API specification include?
The Java Speech API specification includes the Javadoc-style API documentation for the approximately 70 classes and interfaces in the API. The specification also includes a detailed Programmer's Guide which explains both introductory and advanced speech application programming with JSAPI. Two companion specifications are available: JSML and JSGF.
The specification is not yet provided with the .class files needed to compile applications with JSAPI.
What are JSML and JSGF?
The Java Speech API Markup Language (JSML) and the Java Speech API Grammar Format (JSGF) are companion specifications to the Java Speech API. JSML (currently in beta) defines a standard text format for marking up text for input to a speech synthesizer. JSGF version 1.0 defines a standard text format for providing a grammar to a speech recognizer. Both specifications are available at http://java.sun.com/products/java-media/speech/
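To make the "standard text format" for grammars more concrete, here is a small sketch that assembles a JSGF V1.0 grammar as plain text. The class JsgfSketch, the method buildGrammar, the grammar name com.example.commands and the rule name command are all hypothetical; only the header line, the grammar declaration, and the "|" alternative syntax are taken from JSGF itself. How the resulting text is handed to a recognizer depends on the JSAPI implementation and is not shown.

```java
import java.util.Arrays;
import java.util.List;

public class JsgfSketch {

    /** Builds a JSGF V1.0 grammar with a single public rule listing the given alternatives. */
    public static String buildGrammar(String grammarName, String ruleName, List<String> alternatives) {
        StringBuilder sb = new StringBuilder();
        sb.append("#JSGF V1.0;\n");                               // self-identifying header
        sb.append("grammar ").append(grammarName).append(";\n");  // grammar name declaration
        sb.append("public <").append(ruleName).append("> = ")
          .append(String.join(" | ", alternatives))               // "|" separates alternatives
          .append(";\n");
        return sb.toString();
    }

    public static void main(String[] args) {
        // Hypothetical command vocabulary for a command-and-control recognizer
        System.out.print(buildGrammar("com.example.commands", "command",
                Arrays.asList("open", "close", "save")));
    }
}
```

Running main prints a three-line grammar: the #JSGF V1.0; header, the grammar declaration, and a public rule <command> that accepts any one of the three words.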
How was the JSAPI specification developed?
Sun Microsystems, Inc. worked in partnership with leading speech technology companies to define the initial specification of the Java Speech API, JSML and JSGF. Sun is grateful for the contributions of:
Apple Computer, Inc.
AT&T
Dragon Systems, Inc.
IBM Corporation
Novell, Inc.
Philips Speech Processing
Texas Instruments Incorporated
The JSAPI specification and JSGF and JSML specifications were developed as open specifications with input from a public review process.
How does JSAPI relate to other Java APIs?
The Java Speech API is part of a family of APIs that work together as a suite to provide customers with enhanced graphics and extended communications capabilities. These APIs include the
Java 2D API
Java 3D API
Java Advanced Imaging API
Java Media Framework API
Java Sound API
Java Telephony API
What JSAPI implementations are now available?
The Java Speech API is a freely available specification and therefore anyone is welcome to develop an implementation. The following implementations are known to exist.
Note: Sun Microsystems, Inc. makes no representations or warranties about the suitability of the software listed here, either express or implied, including but not limited to the implied warranties of merchantability, fitness for a particular purpose, or non-infringement. The implementations listed here have not been tested with regard to compliance to the JSAPI specification, nor does their appearance on this page imply any form of endorsement of compliance on the part of Sun.
FreeTTS on
Description: Open source speech synthesizer written entirely in the Java programming language.
Requirements: JDK 1.4. Read about more requirements on the FreeTTS web site.
IBM's "Speech for Java"
Description: Implementation based on IBM's ViaVoice product, which supports continuous dictation, command and control and speech synthesis. It supports all the European language versions of ViaVoice -- US & UK English, French, German, Italian and Spanish -- plus Japanese.
Requirements: JDK 1.1.7 or later or JDK 1.2 on Windows 95 with 32MB, or Windows NT with 48MB. Both platforms also require an installation of ViaVoice 98.
IBM's "Speech for Java" on Linux
Description: Beta version of "Speech for Java" on Linux. Currently only supports speech recognition.
Requirements: RedHat Linux 6.0 with 32MB, and Blackdown JDK 1.1.7 with native thread support.
The Cloud Garden
Description: Implementation for use with any recognition/TTS speech engine compliant with Microsoft's SAPI5 (with SAPI4 support for TTS engines only). An additional package allows redirection of audio data to/from Files, Lines and remote clients (using the javax.sound.sampled package). Some examples demonstrate its use in applets in Netscape and IE browsers.
Requirements: JDK 1.1 or better, Windows 98, Me, 2000 or NT, and any SAPI 5.1, 5.0 or 4.0 compliant speech engine (some of which can be downloaded from Microsoft's web site).
Lernout & Hauspie's TTS for Java Speech API
Description: Implementations based upon ASR1600 and TTS3000 engines, which support command and control and speech synthesis. Supports 10 different voices and associated whispering voices for the English language. Provides control for pitch, pitch range, speaking rate, and volume.
Requirements: Sun Solaris OS version 2.4 or later, JDK 1.1.5. Sun Swing package (free download) for graphical Type-n-Talk demo.
More information: Contact Edmund Kwan, Director of Sales, Western Region Speech and Language Technologies and Solutions (ekwan@lhs.com)
Conversa Web 3.0
Description: Conversa Web is a voice-enabled Web browser that provides a range of facilities for voice-navigation of the web by speech recognition and text-to-speech. The developers of Conversa Web chose to write a JSAPI implementation for the speech support.
Requirements: Windows 95/98 or NT 4.0 running on Intel Pentium 166 MHz processor or faster (or equivalent). Minimum of 32 MB RAM (64 MB recommended). Multimedia system: sound card and speakers. Microsoft Internet Explorer 4.0 or higher.
Festival
Description: Festival is a general multi-lingual speech synthesis system developed by the Centre for Speech Technology Research at the University of Edinburgh. It offers a full text-to-speech system with various APIs, as well as an environment for development and research of speech synthesis techniques. It is written in C++ with a Scheme-based command interpreter for general control and provides a binding to the Java Speech API. Supports the English (British and American), Spanish and Welsh languages.
Requirements: Festival runs on Suns (SunOS and Solaris), FreeBSD, Linux, SGIs, HPs and DEC Alphas and is portable to other Unix machines. Preliminary support is available for Windows 95 and NT. For details and requirements see the Festival download page.
Elan Speech Cube
Description: Elan Speech Cube is a multilingual, multichannel, cross-operating system text-to-speech software component for client-server architect