Audio Mode Output for ER Design Tool


Sheng Huang

Oral Defence Date: 

Friday, August 4, 2006 - 13:00


TH 331


1:00 - 3:00 pm


Prof. Marguerite Murphy, Prof. Dragutin Petkovic


Audio is a very natural means of communication for human beings, and people have long worked to find ways to simulate the human voice. Great progress has now been made in using computers to convert written English text into speech, so-called "audio mode output". Audio mode output is becoming more popular in computer applications because it improves the functionality and usability of software. This project uses speech synthesis, or text-to-speech technology, to add audio mode output to the ER Design Tool. Our extensions allow the software to speak detailed and complete textual information describing Entity-Relationship schemas. Although this information is already stored internally as part of the application's schema entry, it is invisible, incomplete, or hidden in ER schema graphs because it cannot all be displayed on size-limited computer screens. We believe that audio mode output increases the usability of the ER Design Tool. An open source Java speech synthesis API, the FreeTTS API, is used to implement this feature. This API is portable, efficient, and simple. The project design integrates the FreeTTS API with the ER Design Tool to dynamically synthesize written text into the Microsoft WAVE sound format in real time. Testing with professional commercial speech applications on different computers verified that the audio mode output of the ER Design Tool is accurate and efficient. The technology used to extend the ER Design Tool could be applied in many other software applications.
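To make the idea of speaking schema details concrete, the following is a minimal sketch, not the project's actual code. It assumes a simplified, hypothetical Entity class and a hypothetical Speaker interface standing in for the speech engine; in a FreeTTS-backed version, the Speaker would presumably hand the text to a synthesizer voice. Only the step of turning stored schema information into a sentence suitable for speech is shown.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class ErSchemaNarrator {

    /** Hypothetical stand-in for a speech engine; not part of FreeTTS. */
    interface Speaker {
        void speak(String text);
    }

    /** Hypothetical, simplified entity of an ER schema. */
    static final class Entity {
        final String name;
        final List<String> attributes;
        final String keyAttribute;

        Entity(String name, List<String> attributes, String keyAttribute) {
            this.name = name;
            this.attributes = attributes;
            this.keyAttribute = keyAttribute;
        }
    }

    /** Builds the sentence that would be passed to the speech engine. */
    static String describe(Entity e) {
        int count = e.attributes.size();
        StringBuilder sb = new StringBuilder();
        sb.append("Entity ").append(e.name).append(" has ").append(count)
          .append(count == 1 ? " attribute: " : " attributes: ")
          .append(String.join(", ", e.attributes)).append(". ");
        sb.append("Its key attribute is ").append(e.keyAttribute).append(".");
        return sb.toString();
    }

    public static void main(String[] args) {
        Entity student = new Entity("Student", Arrays.asList("id", "name", "major"), "id");

        // A recording Speaker that stores the text instead of producing audio.
        List<String> spoken = new ArrayList<>();
        Speaker recorder = spoken::add;

        recorder.speak(describe(student));
        System.out.println(spoken.get(0));
    }
}
```

Keeping text generation separate from the Speaker means the description logic can be checked without audio hardware, and the speech back end can be swapped without touching the schema code.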


Java Speech API, FreeTTS, API, Voice, sound wave, speech synthesis, speech engine, phoneme, rhythm, prosody, Entity-Relationship schema, syllable

Date Report of Completion Filed: 08/08/2006