The EMU Speech Database SystemEMU is a collection of software tools for the creation, manipulation and analysis of speech databases. At the core of EMU is a database search engine which allows the researcher to find various speech segments based on the sequential and hierarchical structure of the utterances in which they occur. EMU includes an interactive labeller which can display spectrograms and other speech waveforms, and which allows the creation of hierarchical, as well as sequential, labels for a speech utterance. Testrelease 2.2.3, March 2009The Emu developers would appreciate, if you test this upcoming release. See section.
Release 2.0, May 2008Emu 2.0 is released. See section. | |||||||||||||||||||||
DownloadsTest new EMU 2.2.3 and EMU-R 4.1Major changes:
Please report any Bug to our Bug Tracking System.
The test versions here will be updated each time newly reported bugs are fixed until an EMU Version is released officially. Thus, if you find any bug, please report it and/or check if there is a new version 2.2.X in this place and try it again. Installing a new version does not take too much time any more due to the ability to store database files anywhere, because the emu-conf file that is in charge of the template storage is unaffected by intalling or unnstalling. Binary packages are available for:
EMU 2.0 Release, May 2008We are very proud to present the newest Release of Emu which includes binaries not only for Windows but also for Linux and now even for Mac OS X. A lot of code in this new release has been rewritten or reorganized. This enables us to build Emu in basically the same way on most platforms where Tcl/Tk is supported. Short summary of changes in Emu 2.0
All Emu downloads are available via the . TroubleshootingThis software is likely to contain bugs. If you think you have encountered a bug, please don't hesitate to consult our Bug Tracking System and eventually report your problem. PatchesDownload patches from: Emu's Sourceforge patch collection A patch fixes problems of the current version of the software. All patches will be included in the next software release. Please make sure that you install patches in the right chronological order. It might be, that some files are part of more than one patch. In this case, the lastest version includes earlier fixes already. Source DistributionThe source distribution is a snapshot of the Emu sources at the release date. The source should build easily on Linux with and should build on Solaris as long as a recent gcc and Gnu make are available. It may also build on other flavours of Unix but has only been tested on these two platforms. Instructions for compilation are in the README file within the distribution. In addition to the above released sources, the latest version of the source via anonymous CVS is available from the Please note that the CVS version may not compile in all configurations. A number of packages needed to build a full EMU system are version Version 8.5 of and the latest versions (CVS snapshots) of libassp and tclassp, both found at libassp Emu also requires the Tcl standard library (tcllib), tklib the Bwidgets Tcl library, both available via Sourceforge as well as the memchan, tclvfs, trf and tkimg packages. All these packages are also available via Active Tcl. | |||||||||||||||||||||
Library Emu in the Software package RThe Emu-R library is a collection of programs running within the R programming language for the analysis of any speech database that can be read by the Emu Speech Database System. Installing Emu/RInstallation of the Emu functions for R is simple as the package is now hosted by the The Comprehensive R Archive Network. After you have installed R (see The R Project for Statistical Computing for details of how to do this) start up R and then install the the Emu-R library with: install.packages("emu")
Alternatively, you can dowload the Emu-R package from SourceForge. The actual R-CRAN package requires >= EMU2.0 (no prev). If you use the EMU2.0prev, download the EmuR Package emu_3.0.zip from SourceForge. | |||||||||||||||||||||
Emu and ToBIToBI is a system for transcribing the intonation patterns of spoken language. ToBI defines a number of annotation levels and the criteria for placing labels on each. These labels include a segmentation into words along with tone labels which mark prosodic events such as prosodic tones and phrase boundary events. ToBI annotations have largely been made using the Unix based Waves+ toolkit from Entropic and the example materials are made available in the ESPS format which is readable only by Waves+ (and by Emu when an ESPS licence is available -- ie. on a Unix platform). This page describes some tools for using ToBI annotation in the Emu system. The ToBI training materials available from the Ohio State web site are in the ESPS format. We have converted these files to the SSFF format read by Emu on both Unix and Windows. These files are now available on our server:
The smaller files consist of only 12 utterances from the database (those beginning with `a'). These packages contain the original label files (augmented with a dummy label at the start of the word level so that Emu can treat the words as segments rather than events). Two Emu template files are provided, one mimics the traditional ToBI annotation scheme which presents four independant tiers, the other adds domination relations and two additional levels for intonational and intermediate phrases. A script is provided to convert traditional flat ToBI annotations into hierarchical ones. An example hierarchical annotation is shown below.
In this scheme, the Tone level is preserved, non-phrasal tones are linked to the word in which they occur and words are grouped into Intonational and Intermediate phrases based on the position of phrase boundary events. Pitch Tracking on WindowsResearchers investigating prosody have relied on ESPS/Waves+ for both labelling and pitch tracking. Emu can manage the labelling role but does not provide a pitch tracker. There are a number of possible pitch trackers that might be of use. Fortunately, the ESPS codebase has now been donated to the KTH speech group and is now being integrated into the Snack toolkit. The most recent release of Snack includes the ESPS pitch and formant tracker code and the most recent Emu release contains a simple tool to run these over speech data and produce SSFF formatted data files for your corpus. | |||||||||||||||||||||
Publications
| |||||||||||||||||||||
Emu and MySQL | |||||||||||||||||||||
Frequently Asked QuestionsHow do I install the EMU speech database system?
How do I start the EMU speech database system?EMU version 2.0 comes up with the application dbemu. The complete new interface provides a lucid view of all available databases, quick access to all tools that come along with Emu and variety of new ways to work on the utterances of a database.
How can I get access to my existing databases?
Are there some patches?Download patches from: Emu's Sourceforge patch collection A patch fixes problems of the current version of the software. All patches will be included in the next software release. Please make sure that you install patches in the right chronological order. It might be, that some files are part of more than one patch. In this case, the lastest version includes earlier fixes already. Convert Labels freezes converting Praat TextGrid filesThis problem occurs when the TextGrid file is UTF-16 encoded. Until EMU support UTF-16, change the encoding to UTF-8. Therefore change Praat preferences (open and save TextGrid again) or use a text editor to convert.
| |||||||||||||||||||||
Miscellaneous Information
We are in the process of transferring additional information from the old
website to the new design. Meanwhile please visit the old site: | |||||||||||||||||||||
DocumentationsDocumentations of the Speech Tools of EMU Version 2 are included in the tools by a file menu entry Help or by the context menu below the emu picture. For other help see Documentation of EMU 1.7 Video documentation |
Copyright ©
2009, Institute of Phonetics and Speech Processing, LMU Munich
Copyright ©
2006, Institute of Phonetics and Digital Speech Processing, CAU Kiel
Copyright ©
2001, Department of Linguistics, Macquarie
University.