The EMU Speech Database System

EMU is a collection of software tools for the creation, manipulation and analysis of speech databases. At the core of EMU is a database search engine which allows the researcher to find various speech segments based on the sequential and hierarchical structure of the utterances in which they occur. EMU includes an interactive labeller which can display spectrograms and other speech waveforms, and which allows the creation of hierarchical, as well as sequential, labels for a speech utterance.

Release 2.2, September 2009

Emu 2.2 is released. See

Testrelease 2.2.3, March 2009

The Emu developers would appreciate, if you test this upcoming release. See section.

Release 2.1, Octobre 2008

Emu 2.1 is released on demand for windows user only.

Release 2.0, May 2008

Emu 2.0 is released. See section.

Downloads

Binary packages of Emu 2.2 are available for:

Linux: x86 (2.2.3) v. 2.2.3 (2.2.4 coming very soon)
Mac OS X: 10.4 and above v. 2.2.4
Windows: XP, Vista, 7 v. 2.2.4

Note: this release works best with .


EMU 2.2 Release, September 2009

This release is the result of the comments given to us by the users that tested the Emu 2.2.X versions. We like to thank all the users that helped us by reporting bugs and sending feature requests. Thus Emu 2.2 contains bug fixes and asked features. For a complete list, see Emus Bug Tracking System for closed bugs and Emus Feature Request Tracking System for closed feature requests.
Some selected modifications are listed below:
  • Database Installer (autoDBinstaller)
    • Automatic selection of steps from chosen database from server/zip/directory
    • Template modification for paths improved
    • Adpats User Defined Paths
    • Adpats User Defined Module Paths
  • Emu (dbemu)
    • Play utterance from Emu database interface
    • (select from right mouse button click menu)
    • Use new quick search/selection feature in utterance and database box
    • (click inside the box and write the initial letters of database or utterance)
    • Press <Return> to open labeller
    • Left mouse button on template reports the template location in the help window below the Emu
    • Set Extension to the extension of the first track in database template
  • Emu-R tcl library (EmuR)
    • Faster emu.query and emu.track
    • Emu.requery for same-level target queries prepared
  • EMU Query Tool (querytool)
    • Open utterance from querytool inside the labeller
    • on right clicks in the result segment lists, a popup menu appears
    • offering to open the selected segment's utterance in the labeller
  • ExtractHierarchy
    • Now works for event levels also
  • Convert Labels (labConvert)
    • Ariculate Assistant label file conversion added
    • Converting from KielCorpus 2 Emu for the whole database or parts of it added
    • QuickSelect added
  • Graphical Template Editor (GTed)
    • Legal labels for labellinks added
    • Shortcut Ctrl-s for saving added
  • EMU Labeller (labeller2)
    • Hierarchy and Signal view display and functions improved
    • higher time precision for label marks
    • set mark to zero crossing added (middle mouse button on mark (segement) or label (event))
    • display and modification of more than 4 formants enabled
    • better alignment of signals and approved info bar
    • bindings added and documented -> help -> controls
    • sample number, duration of segment added to info bar
    • Extra menu -> write emu/tcl script added - for writing small scripts like the line LinkFromTimes etc.
    • hierarchy beta view improved very much - Emu crashes while preparing the display are not regocnized anymore
    • legal labels in hierarchy beta view
    • legal labels for event levels
    • automatic redraw after Build Hierarchy
    • extended label lines scrolled, resized and zoomed
    • Ad hoc scripting via menu-> extra -> new emu script
  • Signal Processing Tool (tkassp)
    • reset_all function added
    • help is activated


Test new EMU 2.2.2 and EMU-R 4.1

Major changes: labeller: right track alignment; labeller modules: Hier2SigView available again, emadisplay added; tkassp: fixed output handling; dbemu: use of relative paths enabled;

Please report any Bug to our Bug Tracking System.

Windows: XP, Vista EMU2.2.2 Emu_R4.1
Mac OS X: 10.4 and above EMU2.2.2 Emu_R4.1
Linux: x86 EMU2.2.2 Emu_R4.1

The test versions here will be updated each time newly reported bugs are fixed until an EMU Version is released officially. Thus, if you find any bug, please report it and/or check if there is a new version 2.2.X in this place and try it again. Installing a new version does not take too much time anymore due to the ability to store database files anywhere, because the emu-conf file that is in charge of the template storage is anaffected by intalling or unnstalling.

EMU 2.0 Release, May 2008

We are very proud to present the newest Release of Emu which includes binaries not only for Windows but also for Linux and now even for Mac OS X. A lot of code in this new release has been rewritten or reorganized. This enables us to build Emu in basically the same way on most platforms where Tcl/Tk is supported.

Short summary of changes in Emu 2.0

  • The new interface dbemu provides a lucid view of all available databases, quick access to all tools that come along with Emu and variety of new ways to work on the utterances of a database
  • The Emu Labeller has been completely rewritten - finally making Emu entirely available for Mac OS X. This includes the abandoning of the padgraph and trackdata packages used in 1.x releases and replacing them using Snack and and the extension snackssff. Also included is the beta version of a new hierarchy view which will improve the display of hierarchy trees
  • Most graphical interfaces have been overhauled using themeable widgets from the tile / ttk package
  • A number of new applications can now be found in Emu
    • Emu-Conf enables the quick access to new databases templates in non-standard directories
    • DB Installer can download and unpack databases from the internet and then adapt and configure them for the user's environment
    • EPG2SSFF can convert Raw EPG data (Reading) to the SSFF format used in Emu
  • tkassp, Emu's signal processing tool has adapted to the launch and the ongoing changes in the new libassp project
  • All non-tcl code has been reorganized to conform with the Tcl Extension Architecture ( TEA)
  • Emu can "speak" with Praat and WaveSurfer

All Emu downloads are available via the .

Troubleshooting

This software is likely to contain bugs. If you think you have encountered a bug, please don't hesitate to consult our Bug Tracking System and eventually report your problem.

Patches

Download patches from: Emu's Sourceforge patch collection

A patch fixes problems of the current version of the software. All patches will be included in the next software release. Please make sure that you install patches in the right chronological order. It might be, that some files are part of more than one patch. In this case, the lastest version includes earlier fixes already.

Source Distribution

The source distribution is a snapshot of the Emu sources at the release date. The source should build easily on Linux with and should build on Solaris as long as a recent gcc and Gnu make are available. It may also build on other flavours of Unix but has only been tested on these two platforms. Instructions for compilation are in the README file within the distribution.

In addition to the above released sources, the latest version of the source via anonymous CVS is available from the . Please note that the CVS version may not compile in all configurations.

A number of packages needed to build a full EMU system are version Version 8.5 of and the latest versions (CVS snapshots) of libassp and tclassp, both found at libassp Emu also requires the Tcl standard library (tcllib), tklib the Bwidgets Tcl library, both available via Sourceforge as well as the memchan, tclvfs, trf and tkimg packages. All these packages are also available via Active Tcl.

Library Emu in the Software package R

The Emu-R library is a collection of programs running within the R programming language for the analysis of any speech database that can be read by the Emu Speech Database System.

Emu/R 4.1 released (might not be updated at R-CRAN yet)

This release works best with .
It includes some bug fixes and minor changes to functions as well as improved error messages. Some major changes are listed below:

  • Installation and configuration
    • The Emu package can be installed successfully even there is no emu installation on the system.
    • An emu installation is linked to R using the new function emulink(). This is tried during installation but you can link it on aour own again using emulink().
    • Better information about the tcl and Emu libary paths.
  • emu.track()
    • optimized for faster processing. Some kind of progress bar is added also.
    • takes the segment list argument as character also. For larger segment lists the processing is faster taht way.
    • "cut" argument works now
    • broken for event lists : FIXED
  • trackinfo()
    • updated to use emuR::gettracks from Emu library
  • write.trackdata()
    • writes entries Original_Freq and Trackname to file for spectral data

Installing Emu/R

Installation of the Emu functions for R is simple as the package is now hosted by the The Comprehensive R Archive Network. After you have installed R (see The R Project for Statistical Computing for details of how to do this) start up R and then install the the Emu-R library with:

install.packages("emu")

Alternatively, you can dowload the Emu-R package 4.1 from SourceForge:

source (Linux, Mac): x86, 10.4 and above  v. 4.1
Windows: XP, Vista, 7 v. 4.1

The actual R-CRAN package (4.1) requires >= EMU2.2.4

Emu and ToBI

ToBI is a system for transcribing the intonation patterns of spoken language. ToBI defines a number of annotation levels and the criteria for placing labels on each. These labels include a segmentation into words along with tone labels which mark prosodic events such as prosodic tones and phrase boundary events. ToBI annotations have largely been made using the Unix based Waves+ toolkit from Entropic and the example materials are made available in the ESPS format which is readable only by Waves+ (and by Emu when an ESPS licence is available -- ie. on a Unix platform). This page describes some tools for using ToBI annotation in the Emu system.

The ToBI training materials available from the Ohio State web site are in the ESPS format. We have converted these files to the SSFF format read by Emu on both Unix and Windows. These files are now available on our server:

The smaller files consist of only 12 utterances from the database (those beginning with `a').

These packages contain the original label files (augmented with a dummy label at the start of the word level so that Emu can treat the words as segments rather than events). Two Emu template files are provided, one mimics the traditional ToBI annotation scheme which presents four independant tiers, the other adds domination relations and two additional levels for intonational and intermediate phrases. A script is provided to convert traditional flat ToBI annotations into hierarchical ones. An example hierarchical annotation is shown below.

emu tobi

In this scheme, the Tone level is preserved, non-phrasal tones are linked to the word in which they occur and words are grouped into Intonational and Intermediate phrases based on the position of phrase boundary events.

Pitch Tracking on Windows

Researchers investigating prosody have relied on ESPS/Waves+ for both labelling and pitch tracking. Emu can manage the labelling role but does not provide a pitch tracker. There are a number of possible pitch trackers that might be of use.

Fortunately, the ESPS codebase has now been donated to the KTH speech group and is now being integrated into the Snack toolkit. The most recent release of Snack includes the ESPS pitch and formant tracker code and the most recent Emu release contains a simple tool to run these over speech data and produce SSFF formatted data files for your corpus.

Publications

In progress
Harrington, J. (in preparation). The Phonetic Analysis of Speech Corpora. Blackwell.
Harrington, J. (in press). Acoustic Phonetics. In the revised Edition of Hardcastle W. & Laver J. (Eds.), The Handbook of Phonetic Sciences. Blackwell.
2006
Bombien, L., Cassidy, S., Harrington, J., John, T., Palethorpe, S. (2006). Recent Developments in the Emu Speech Database System. Proceedings of the Australian Speech Science and Technology Conference, Auckland, December 2006.
2003
Harrington, J., Cassidy, S., John, T. and Scheffers, M. (2003). Building an interface between EMU and Praat: a modular approach to speech database analysis. International Congress of Phonetic Sciences, Barcelona, August 2003. (4 pages).
2002
Steve Cassidy, XQuery as an Annotation Query Language: a Use Case Analysis, Proceedings of LREC 2002, Las Palmas, Spain, May 2002.
2001
Bird, S. and Harrington, J. (2001). Speech annotation and corpus tools. Speech Communication, 33, 1-4.
Bird, S. and Harrington, J. (Editors) (2001). Guest Editors for Special Issue on Speech Annotation and Corpora. Speech Communication. Vol. 33, 1-174.
Cassidy, S. and Harrington, J. (2001). Multi-level annotation in the Emu speech database management system. Speech Communication, 33, 61-77
2000
Cassidy, S. and Bird, S. (2000) Querying Databases of Annotated Speech, Proceedings of the Australian Database Conference, Canberra, January 2000.
Cassidy, S., Welby, P., McGory, J., and Beckman, M. (2000) Testing the Adequacy of Query Languages Against Annotated Spoken Dialog. Proceedings of the Speech Science and Technology Conference, Canberra, December 2000.
1999
Cassidy, S. (1999) Compiling Multi-Tiered Speech Databases into the Relational Model: Experiments with the Emu System. In Proceedings of Eurospeech '99, Budapest, September 1999.
Harrington, J. & Cassidy, S. (1999). Techniques in Speech Acoustics. Kluwer Academic Publishers: Foris, Dordrecht. ISBN: 0-7923-5731-0.
1997
Millar, J., Dermody, P., Harrington, J., and Vonwiller, J. (1997). Spoken Language Resources for Australian Speech Technology. Journal Of Electrical and Electronic Engineers Australia, 1, 13-23.
1996
Cassidy, S. and Harrington, J. (1996). EMU: an enhanced hierarchical speech database management system. Proceedings of the 6th Australian International Conference on Speech Science and Technology (p. 361-366).
1994
Harrington, J. & So, L. (1994). Some design criteria in the annotation of a spoken database of Cantonese. Fifth International Conference on Speech Science and Technology, Perth, Australia. (4 pages).
Millar, J.B., Vonwiller, J.P., Harrington, J. and Dermody, P.J. (1994). The Australian National Database Of Spoken Language, Proceedings International Conference on Acoustics, Speech and Signal Processing, ICASSP-94, Adelaide, (pp.I97-I100).
1993
Harrington, J., Cassidy, S., Fletcher, J. and McVeigh, A. (1993). The mu+ system for corpus-based speech research. Computer Speech & Language, 7, 305-331.
1992
Croot, K., Fletcher, J. and Harrington, J. (1992). Phonetic segmentation on the Australian National Database of Spoken Language. Proceedings of the Fourth International Conference on Speech Science and Technology. (p.86-90). Brisbane, Australia.
McVeigh, A. and Harrington, J. (1992). The mu+ system for speech database analysis. Proceedings of the Fourth International Conference on Speech Science and Technology. (p.548-553). Brisbane, Australia.
Harrington, J. and McVeigh (1992). Analysing acoustic and articulatory speech data using the mu+ speech database interrogation system. Journal of the Acoustical Society of America, vol. 91, supplement 1. (A).
Millar, J., Dermody, P., Harrington, J. & Vonwiller, J. (1992). A national cluster of spoken language databases for Australia. Proc. of the Third Australian Int. Conference on Speech Science and Technology. Melbourne, Australia. (4 pages).
1990
Millar, J., Dermody, P., Harrington, J. & Vonwiller, J. (1990). A national database of spoken language; concept, design, and implementation. Proc. of the International Conference on Spoken Language Processing (ICSLP-90). Kobe, Japan. (4 pages).
1988
Harrington, J. & Watson, G. (1988). Acoustic Phonetics in S. Unpublished m/s Centre for Speech Technology Research, University of Edinburgh.

Emu and MySQL

Frequently Asked Questions

This is a very short list of FAQ. If you have any problem, see the Emu forums especially the Help forum. Your problem might be a software bug. In this case see Emus Bug tracking system.

How do I install the EMU speech database system?

  • Windows
    • Execute (double-left-mouse button) the setup file and follow the instructions on the screen.
  • Unix
    • Change to the directory where you saved the installer.
    • Make the file executable
      • $ chmod +x Emu-2.2.1-Linux-x86-Install
    • To install
      • $ ./Emu-2.2.1-Linux-x86-Install
    • To install systemwide
      • $ sudo ./Emu-2.2.1-Linux-x86-Install
    • Follow the instructions on the screen
  • MAC
    • Execute (double-left-mouse button) the setup file and follow the instructions on the screen.

How do I start the EMU speech database system?

EMU version 2.0 comes up with the application dbemu. The complete new interface provides a lucid view of all available databases, quick access to all tools that come along with Emu and variety of new ways to work on the utterances of a database.

  • Windows
    • from Windows start menu, start Dbemu from the EMU Speech Tools menu
  • Unix
    • enter the command dbemu
  • MAC
    • start Application Emu.app

How can I get access to my existing databases?

  • Define the path(s) to the template file(s) with the Emu-Conf Editor.
    • execute Dbemu
    • open the emu-conf editor from the dbemu file menu
    • add the path to your existing template files
    • save the modifications

Are there some patches?

Download patches from: Emu's Sourceforge patch collection

A patch fixes problems of the current version of the software. All patches will be included in the next software release. Please make sure that you install patches in the right chronological order. It might be, that some files are part of more than one patch. In this case, the lastest version includes earlier fixes already.

Convert Labels freezes converting Praat TextGrid files

This problem occurs when the TextGrid file is UTF-16 encoded. Until EMU support UTF-16, change the encoding to UTF-8. Therefore change Praat preferences (open and save TextGrid again) or use a text editor to convert.

Miscellaneous Information

We are in the process of transferring additional information from the old website to the new design. Meanwhile please visit the old site:
Old Emu Site

Documentations

Documentations of the Speech Tools of EMU Version 2 are included in the tools by a file menu entry Help or by the context menu below the emu picture.

For other help see Documentation of EMU 1.7

Video documentation