File: README

		EDINBURGH SPEECH TOOLS LIBRARY
               VERSION 1.0.95 31st July 1998

Principal authors: Paul Taylor, Alan W Black, Richard Caley and Simon King

Centre for Speech Technology
University of Edinburgh 
80 South Bridge
Edinburgh EH1 1HN, UK
http://www.cstr.ed.ac.uk/projects/speechtools.html

This is a BETA release, and hence has a number of known parts
missing or incomplete.

------------------------------------------------------------------------

The Edinburgh speech tools system is a library of C++ classes, functions
and utility programs that are frequently used in speech software. The
system compiles to a single Unix static library (.a file) which can be
linked into other software. At present, C++ classes for several useful
speech and linguistic data types have been written, along with audio
software and some basic signal processing software. We hope to add to
the system (especially the signal processing) in the near future.

========================================================================


            Edinburgh Speech Tools Library version 1.0.95
              Centre for Speech Technology Research                
                   University of Edinburgh, UK                     
                    Copyright (c) 1994-1998
                      All Rights Reserved.                         
                                                                   
Permission to use, copy, modify, distribute this software and its    
documentation for research, educational and individual use only, is  
hereby granted without fee, subject to the following conditions:     
 1. The code must retain the above copyright notice, this list of    
    conditions and the following disclaimer.                         
 2. Any modifications must be clearly marked as such.                
 3. Original authors' names are not deleted.                         
This software may not be used for commercial purposes without        
specific prior written permission from the authors.                  
                                                                     
THE UNIVERSITY OF EDINBURGH AND THE CONTRIBUTORS TO THIS WORK        
DISCLAIM ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING      
ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT   
SHALL THE UNIVERSITY OF EDINBURGH NOR THE CONTRIBUTORS BE LIABLE     
FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES    
WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN   
AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION,          
ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF       
THIS SOFTWARE.                                                       

========================================================================

The following C++ programs are available:

	na_play: generic playback program for use with net_audio and CSTR ao.

	ch_wave: waveform file conversion program.

	ch_lab: label file conversion program.

	ch_track: track file conversion program.

	wagon: a CART tree build and test program.

The following C++ sub-libraries are available:

	audio:	C++ audio functions for the Network Audio System, Suns,
		Linux and FreeBSD.

	speech_class:	C++ speech classes, including waveform and track.

	ling_class:	C++ linguistic classes.

	sigpr:	signal processing.

	utils:	various utilities.


ACKNOWLEDGEMENTS
------------------------------------------------------------------------

Most of the software was written by Paul Taylor, Alan Black, Simon King
and Richard Caley. Additional contributions have been made by the
people listed below. Slightly different licence agreements accompany
these pieces of code, so it is important that the various agreements
be properly understood.

Tony Robinson: some signal processing code.
Markus Mummert: sample rate converter (rateconv.cc).
Craig Reese and Joe Campbell: ulaw conversion code.
Paul Bagshaw: pitch tracker.
Theo Veenker: IRIX audio support.
Paradigm Assoc. and George Carrette: Scheme In One Defun (SIOD).

INSTALLATION
------------------------------------------------------------------------

See the INSTALL file for details.

FILE/DIRECTORY STRUCTURE
------------------------------------------------------------------------

The main directory contains minimal wrap-around main() functions
which compile to form executables that use the algorithms in the
speech tools. All these main() functions really do is read
command line options and read and write files; the algorithms
themselves are completely contained within the libraries.
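
As a rough illustration of this split, the sketch below keeps the
algorithm in a "library" function and leaves option parsing and file
I/O to a thin wrapper. Every name in it (scale_samples, parse_options,
run_tool, the -gain and -o options, the raw 16-bit sample format) is
hypothetical and chosen only for this example; none of it is taken
from the speech tools sources or their actual interfaces.

```cpp
// Hypothetical sketch only: these names are NOT part of the speech tools.
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <fstream>
#include <string>
#include <vector>

// "Library" side: pure algorithm, no knowledge of files or options.
// Multiplies each sample by gain and clips to the 16-bit range.
std::vector<short> scale_samples(const std::vector<short> &in, double gain)
{
    std::vector<short> out;
    out.reserve(in.size());
    for (std::size_t i = 0; i < in.size(); ++i) {
        double v = in[i] * gain;
        if (v > 32767.0) v = 32767.0;
        if (v < -32768.0) v = -32768.0;
        out.push_back(static_cast<short>(v));
    }
    return out;
}

// Wrapper side: command line options (assumed syntax: -gain G -o OUT IN).
struct Options {
    double gain = 1.0;
    std::string input;
    std::string output;
    bool ok = false;
};

Options parse_options(const std::vector<std::string> &args)
{
    Options opt;
    for (std::size_t i = 0; i < args.size(); ++i) {
        if (args[i] == "-gain" && i + 1 < args.size())
            opt.gain = std::atof(args[++i].c_str());
        else if (args[i] == "-o" && i + 1 < args.size())
            opt.output = args[++i];
        else
            opt.input = args[i];
    }
    opt.ok = !opt.input.empty() && !opt.output.empty();
    return opt;
}

// Wrapper side: headerless native-endian 16-bit sample files (assumed format).
bool read_raw(const std::string &path, std::vector<short> &samples)
{
    std::ifstream in(path.c_str(), std::ios::binary);
    if (!in) return false;
    short s;
    while (in.read(reinterpret_cast<char *>(&s), sizeof s))
        samples.push_back(s);
    return true;
}

bool write_raw(const std::string &path, const std::vector<short> &samples)
{
    std::ofstream out(path.c_str(), std::ios::binary);
    if (!out) return false;
    if (!samples.empty())
        out.write(reinterpret_cast<const char *>(samples.data()),
                  samples.size() * sizeof(short));
    return static_cast<bool>(out);
}

// What a wrap-around main() would do: parse, read, call the library, write.
int run_tool(const std::vector<std::string> &args)
{
    Options opt = parse_options(args);
    if (!opt.ok) return 1;
    std::vector<short> samples;
    if (!read_raw(opt.input, samples)) return 1;
    return write_raw(opt.output, scale_samples(samples, opt.gain)) ? 0 : 1;
}
```

In this arrangement an executable's main() would only need to copy
argv[1..argc-1] into a vector and return run_tool(args), while
scale_samples() could be called directly by any other program linked
against the same library.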

The directory structure is as follows:

config: 
        shared make rules and configuration make rules

speech_class:
	general speech class files, including waveforms, contours etc.

ling_class:
	general linguistic class files, including a stream-based architecture.

sigpr:
	signal processing code.

doc:
	basic documentation; generates HTML, info and PostScript.

include:
	class and general header files.

lib:	
	where the .a files are kept

main:
	Directory containing main files which compile to executables.


DOCUMENTATION
------------------------------------------------------------------------

There are two sources of documentation.

Man pages exist for the more important stand-alone programs. These are
found in the main directory.

The library documentation is in texinfo format, which can be converted
into GNU info, HTML and PostScript; see the Makefile in doc/.

Note that the documentation is not up to date.

BUGS/COMMENTS ETC
------------------------------------------------------------------------

This software hasn't been subjected to any rigorous testing as yet,
and there are likely to be lots of bugs. If you find one, please send
a bug report to speechtools@cstr.ed.ac.uk. Any useful comments will also be
appreciated. See also the bugs file in the doc directory.

If you have some software that you think could be integrated into the
speech tools system, please contact us at
speechtools@cstr.ed.ac.uk. The whole point of writing and releasing
code such as this is to prevent duplication of effort, and therefore
we would greatly appreciate any code that adds to the usefulness
of the system. Signal processing routines are particularly welcome.


======================================================================
ADDITIONAL SOFTWARE
======================================================================

CSTR supplies a number of different speech technology systems,
including speech synthesis systems and intonation analysis software.
For more information, see http://www.cstr.ed.ac.uk/