CE-06.02

Title: 

Audio Mode Output for ER Design Tool

Author(s): 

Sheng Huang

Oral Defence Date: 

08/04/2006

Location: 

TH 331

Committee: 

Prof. Marguerite Murphy, Prof. Dragutin Petkovic

Abstract: 

Audio is a natural means of human communication, and people have long sought ways to simulate the human voice. Considerable progress has now been made in using computers to convert written English text to speech, so-called “audio mode output”. Audio mode output is becoming more popular in computer applications because it improves the functionality and usability of software. This project uses speech synthesis, or text-to-speech, technology to add audio mode output to the ER Design Tool. Our extensions allow the software to speak detailed and complete textual information describing Entity-Relationship schemas. Although this information is already stored internally as part of the application schema entry, it is invisible, incomplete, or hidden in ER schema graphs because it cannot all be displayed on size-limited computer screens. We believe that audio mode output increases the usability of the ER Design Tool. An open source Java speech synthesis API, FreeTTS, is used to implement this feature. This API is portable, efficient, and simple. The project design integrates the FreeTTS API with the ER Design Tool to dynamically synthesize written text into the Microsoft wave sound format in real time. After testing with professional commercial speech applications on different computers, the audio mode output of the ER Design Tool was verified to be accurate and efficient. The technology used to extend the ER Design Tool could be applied to many other software applications.
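
The idea of speaking schema details that do not fit on screen can be illustrated with a minimal Java sketch. It is not the project's actual code: the `Entity` class, the `Speaker` interface, and the wording of the generated sentence are all hypothetical, and the console speaker merely stands in for a speech engine such as a FreeTTS voice.

```java
import java.util.Arrays;
import java.util.List;

public class ErSpeechText {

    /** Hypothetical stand-in for a speech engine; a FreeTTS voice could sit behind it. */
    public interface Speaker {
        void speak(String text);
    }

    /** Hypothetical, simplified ER entity: a name plus its attribute names. */
    public static final class Entity {
        final String name;
        final List<String> attributes;

        public Entity(String name, List<String> attributes) {
            this.name = name;
            this.attributes = attributes;
        }
    }

    /** Builds the sentence that would be handed to the speech engine. */
    public static String describe(Entity e) {
        if (e.attributes.isEmpty()) {
            return "Entity " + e.name + " has no attributes.";
        }
        return "Entity " + e.name + " has attributes: "
                + String.join(", ", e.attributes) + ".";
    }

    public static void main(String[] args) {
        Entity student = new Entity("Student", Arrays.asList("id", "name", "major"));
        // Assumption: printing replaces real audio output in this sketch.
        Speaker console = text -> System.out.println(text);
        console.speak(describe(student));
    }
}
```

Keeping text generation separate from the `Speaker` means the description logic can be checked without any audio hardware or speech library installed.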

Keywords: 

Java Speech API, FreeTTS, API, voice, sound wave, speech synthesis, speech engine, phoneme, rhythm, prosody, Entity-Relationship schema, syllable

Copyright: 

Sheng Huang