This section gives a brief description of the executable programs
available with the speech tools. Most of these programs are simple
wrap-around main()
functions to library routines.
Many of these programs have man pages. Please consult the man pages for
more detailed information. Most programs print a summary of their
command line options when given the -help
flag. Some programs are
"finished", while others are still "in progress". The finished programs
should be well documented and stable. The "in progress" programs are
near completion but typically still require some work regarding user
interfaces and documentation.
Changes waveform file formats, performs re-sampling and scaling, prints information on waveform headers etc.
Changes track file formats, converts track files into label files, smoothes tracks, re-samples tracks. Tracks are for F0, LPC coefficients, ceptra and such like.
Changes label file formats, converts label files into track files, performs one-to-one mapping of labels from one set to another, performs context sensitive label re-writing.
Plays arbitrary waveform files on a variety of hardware audio devices. Can perform re-sampling to match audio device capability. `na_play' has support for a number of audio devices. Compile time options specify which devices are supported. Note you must actually have these devices on your machine before `na_play' can play any waveform.
`na_play', depending on compile-time options, supports the following audio devices, specified by the `-p' command.
netaudio
is not supported.
na_play
as
you could just use the command directly but is necessary with some
systems using the speech tools.
The default audio is netaudio if it is supported. If not the platform
specific auido mode is the default (e.g. sun16audio, linux16audio,
freebsd16audio or mplayeraudio). If none of these is supported,
sunaudio is the default. The Audio_Command
method is always an
option.
Pitch tracker based on super resolution pitch determination (srpd). Takes waveforms (of any type) as input and produces F0 contours.
Pitch tracker with smoothing based on super resolution pitch determination (srpd). Takes waveforms (of any type) as input and produces F0 contours. Smoothing involes median smoothing of the pda output and interpolation through unvoiced regions.
Basic signal processing functions allowing generation of LPC coefficents, cepstra, mel cepstra etc at pitch synchronous and fixed intervals. Also allows generation of delta and delta coefficients.
Signal filter, used for generating LPC residuals amongst others.
A straightforward Viterbi decoder, using an ngram language model (which
can be estimated using build_ngram
, and a sequence of observation
probability vectors.
Build ngram language models.
Test an ngram on text data.
A general dynamic program aligner.
Modify ngrams, e.g. interpolation of two ngram models.
A classification and regression tree building program following the techniques described in breiman84. See section Wagon
Program from testin CART trees, and predicting from CART trees.
Ordinary least squares analysis (linear regression).
Testing for OLS output.
Suite of programs to build, train, test and parse stochastic context free grammars.
Suite of programs for building and running weighted finite state machines.
Generate tilt descriptions of F0 contours.
Generate F0 contours from tilt descriptions.
Go to the first, previous, next, last section, table of contents.