File: README.txt

package info (click to toggle)
mbrola-de7 0.0.20030404%2Brepack2-5
  • links: PTS
  • area: non-free
  • in suites: bookworm, bullseye, buster, sid, trixie
  • size: 55,400 kB
  • sloc: makefile: 4
file content (208 lines) | stat: -rw-r--r-- 6,320 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
DE7 -- a female German voice with three levels of vocal effort
==============================================================

1. General description
2. Diphone inventory
3. Examples
4. Contact

1. General description
======================

DE7 is a female German diphone voice created in the context of the EU project
NECA (IST-2000-28580), with the aim of being particularly suitable for the
flexible synthesis of expressive speech.

The main feature of DE7 is therefore that it contains a complete diphone
set for three different voice qualities defined by their vocal effort, low
effort ("soft voice"), medium effort ("normal or modal voice"), and high
effort ("loud voice").

2. Diphone inventory
====================

DE7 uses the SAMPA phonetic alphabet as defined under
http://www.phon.ucl.ac.uk/home/sampa/home.htm. In addition to the German
phonemes, some English and French phonemes are included in order to allow
for the synthesis of foreign words. These additions do not claim
completeness and are not intended for the synthesis of full-sentence
English or French speech, but solely for the generation of a typical German
speaker's rendition of English or French words.

German phonotactics do not allow all possible diphones. Therefore, only
those diphones considered relevant were recorded. However, as some users
may intend to use the database for different tasks than expected, a
translation table maps *all* unexisting diphones to similar existing
diphones. As a consequence, for all combinations of the phone symbols
listed below into diphones, at least an approximate realisation is
provided.

As an example, final devoicing in German usually makes it impossible to
have voiced consonants at the end of a word or a morpheme. Therefore,
diphones such as "d-f" were not recorded. If this diphone is requested,
"t-f" will be realised instead.

Phone inventory:
----------------

Silence
        _               Silence before and after phrases

German phones:
--------------
(examples taken from http://www.phon.ucl.ac.uk/home/sampa/german.htm,
slightly adapted)

Vowels:

	I		Sitz		zIts
	E		Gesetz		g@zEts
	a		Satz		zats
	O		Trotz		trOts
	U		Schutz		SUts
	Y		hübsch		hYpS
	9		plötzlich	pl9tslIC

	i:		Lied		li:t
	e:		Beet		be:t
	E:		spät		SpE:t
	a:		Tat		ta:t
	o:		rot		ro:t
	u:		Blut		blu:t
	y:		süß		zy:s
	2:		blöd		bl2:t

	aI		Eis		aIs
	aU		Haus		haUs
	OY		Kreuz		krOYts

	@		bitte		bIt@
	6		besser		bEs6

Glottal stop:
	?		Verein			fE6?aIn

The glottal stop is treated as a normal phone with which diphones can be
built (in the example: "6-?" and "?-aI"). It was recorded as a
word-internal juncture which is smaller than real silence "_".

Consonants:
	p		Pein			paIn
	b		Bein			baIn
	t		Teich			taIC
	d		Deich			daIC
	k		Kunst			kUnst
	g		Gunst			gUnst

	pf		Pfahl			pfa:l
	ts		Zahl			tsa:l
	tS		deutsch			dOYtS

The "dZ" is not included as a separate phone due to its rarity in
German. It can be approximated as "d"+"Z".

	f		fast			fast
	v		was			vas
	s		Tasse			tas@
	z		Hase			ha:z@
	S		waschen			vaS@n
	Z		Genie			Zeni:
	C		sicher			zIC6
	j		Jahr			ja:6
	x		Buch			bu:x
	h		Hand			hant

	m		mein			maIn
	n		nein			naIn
	N		Ding			dIN
	l		Leim			laIm
	R		Reim			RaIm

English phones added:
---------------------
(examples taken from http://www.phon.ucl.ac.uk/home/sampa/english.htm,
slightly adapted)

	EI		raise			rEIz
Note that "EI" is used instead of English sampa "eI".

	T		thin			TIn
	D		this			DIs

	r		wrong			rON
	w		wasp			wOsp


French phones added:
--------------------
(examples taken from http://www.phon.ucl.ac.uk/home/sampa/french.htm,
slightly adapted)

	e~		vin			ve~
	a~		vent			va~


Vocal effort
------------

Each of the above-mentioned phone symbols also exists in a variant recorded
with higher vocal effort and in a variant recorded with lower vocal
effort. These variants are identified by appending a suffix to the phone
symbol, "_loud" for the high-vocal-effort variant and "_soft" for the
low-vocal-effort variant. The default variant, with no suffix, corresponds
to a medium, "normal" level of vocal effort.

For each vocal effort setting, a full diphone set was recorded. In order to
increase naturalness and ease of use, the low-vocal-effort diphone set is
normalised at -1.5dB relative to the default, medium-vocal-effort diphone
set, and the high-vocal-effort diphone set is normalised at +1.5dB relative
to the default. For ease of reference, the simplifying integrated
descriptions "loud" and "soft" were preferred over the more accurate, but
less intuitive descriptions "higher vocal effort" and "lower vocal effort".

While not yet explored scientifically, it is technically possible to
combine phones from different vocal efforts in an utterance. The diphones
at the joints between different vocal effort stretches are taken from the
"loud" diphone set for medium-loud joints, from the "soft" diphone set for
medium-soft joints, and from the default diphone set for soft-loud joints.
That means, in 

_	10
z	91	(0,106)
a	72	(0,116)
6       40
b_loud	59
R_loud	51
Y_loud	77	(49,80)
k	108
@	97
n	66	(100,75)
_	400

the diphones used would be "_-z", "z-a", "a-6", "6_loud-b_loud",
"b_loud-R_loud", "R_loud-Y_loud", "Y_loud-"k_loud", "k-@", "@-n", and
"n-_".

Note there is only one symbol for silence, "_", and only one symbol for a
glottal stop, "?". No vocal-effort-specific versions of these phone symbols
exist.

3. Examples
===========

The TEST/ directory contains a number of .pho files, some of which are
taken from an early version of the "Socialite" demonstrator developed in
the NECA project. See http://www.ai.univie.ac.at/NECA for more information
about the project in general and the role of emotional speech synthesis in
the project. Some foreign language files are meant to demonstrate the
degree to which the voice can be used for speaking other languages with a
German accent.

4. Contact
==========

This diphone database was recorded jointly by the Institute of Phonetics at
Saarland University and by the Language Technology Department at DFKI,
Saarbrücken.

For any enquiries, contact Marc Schröder, schroed@dfki.de.