5th International Conference on Spoken Language Processing
30th November - 4th December, 1998
Sydney, Australia

ICSLP'98 Proceedings


This Web page contains information that might be useful to people reading and citing papers from the CD-ROM version of the proceedings of ICSLP'98 and who do not have access to Volume one of the printed proceedings.


Proceedings Editors

Robert H. Mannell and Jordi Robert-Ribes

Proceedings Publisher

Australian Speech Science and Technology Association, Incorporated (ASSTA)

Proceedings Production

Volume and Page Numbering

Included below with the author, title and first-author affiliation is also information on the Volume and page number of each paper as it appeared in the seven Volume printed proceedings.

Volume 1 of the proceedings is the "Conference Program and Abstract Book".

Volumes 2 to 7 are the Conference Proceedings Volumes.

Page numbering for Volumes 2 to 7 starts with page 1 aligned with the first page of the first paper in Volume 2 and continues with contiguous numbering to the final page of the final paper in Volume 7. As a consequence, the first paper in Volume 3, for example, commences on page 551 rather than on Volume 3, page 1.

Only the first page of each paper is indicated on this Web page. All papers, with the exception of the two plenary papers, are 4 pages in length and so the reference to the first paper in Volume 3 (paper Tu5P1) which is indicated as "Volume 3, p.551" can be interpreted as pp551-554.

The two plenary papers, which are the very first and the very last papers in the printed proceedings (but are found together on the CD-ROM), are greater than four pages in length and their start and end pages are indicated explicitely in the listing below.

Paper Numbering Schemes

The papers are listed here using the paper numbers that appear in the printed ICSLP'98 Proceedings and in the Conference Program. These paper numbers refer to the day, session and paper number within that session.

For example, paper "Tu5P11" refers to:-

In the CD-ROM Proceedings the paper numbers are the original numbers allocated upon initial submission of the paper. At the time that the CD-ROM went into production the program had not been completely finalised and so only these original paper numbers were available for inclusion in the CD-ROM Proceedings. The names of the sessions that the papers appear in are consistent between the printed and CD-ROM Proceedings.


Post-Conference Sale of CD-ROM Proceedings

A limited number of the ICSLP'98 CDROM are available from the ASSTA treasurer David Grayden.
ICSLP'98 CDROMS are also available from Causal Productions.

Post-Conference Sale of Printed Proceedings

A small number of printed proceedings may become available for purchase from ASSTA soon.
Contact Robert Mannell for further information.


Contents

Volume 2
Tu2A - Plenary Session 1
Tu3A - Text-To-Speech Synthesis 1
Tu3B - Spoken Language Models and Dialog 1
Tu3C - Prosody and Emotion 1
Tu3D - Hidden Markov Model Techniques 1
Tu3P - Speaker and Language Recognition 1
Tu3Q - Multimodal Spoken Language Processing 1
Tu3R - Isolated Word Recognition
Tu4A - Robust Speech Processing in Adverse Environments 1
Tu4B - Spoken Language Models and Dialog 2
Tu4C - Articulatory Modelling 1
Tu4D - Talking To Infants, Pets and Lovers
Tu5A - Robust Speech Processing in Adverse Environments 2
Tu5B - Spoken Language Models and Dialog 3
Tu5C - Speech Coding 1
Tu5D - Articulatory Modelling 2
Volume 3
Tu5P - Prosody and Emotion 2
Tu5Q - Neural Networks, Fuzzy and Evolutionary Methods 1
Tu5R - Utterance Verification and Word Spotting 1 / Speaker Adaptation 1
Tu6A - Text-To-Speech Synthesis 2
Tu6B - Spoken Language Models and Dialog 4
Tu6D - Human Speech Perception 1
We1A - Robust Speech Processing in Adverse Environments 3
We1B - Speech and Hearing Disorders 1
We1C - Prosody and Emotion 3
We1D - Spoken Language Understanding Systems 1
We1P - Signal Processing and Speech Analysis 1
Volume 4
We1Q - Spoken Language Generation and Translation 1
We1R - Spoken Language Models and Dialog 5
We2A - Segmentation, Labelling and Speech Corpora 1
We2B - Multimodal Spoken Language Processing 2
We2C - Prosody and Emotion 4
We2D - Neural Networks, Fuzzy and Evolutionary Methods 2
We4A - Large Vocabulary Continuous Speech Recognition 1
We4B - Speaker and Language Recognition 2
We4C - Signal Processing and Speech Analysis 2
We4D - Prosody and Emotion 5
We4P - Robust Speech Processing in Adverse Environments 4
We4Q - Segmentation, Labelling and Speech Corpora 2
We4R - Speech Technology Applications and Human-Machine Interface 1
Volume 5
We5A - Large Vocabulary Continuous Speech Recognition 2
We5B - Text-To-Speech Synthesis 3
We5C - Language Acquisition 1
We5D - Acoustic Phonetics 1
We6A - Speaker Adaptation 2
We6B - Speech Coding 2
We6C - Hidden Markov Model Techniques 2
We6D - Multilingual Perception and Recognition 1
Th1A - Large Vocabulary Continuous Speech Recognition 3
Th1B - Articulatory Modelling 3
Th1C - Language Acquisition 2
Th1D - Speaker and Language Recognition 3
Th1P - Text-To-Speech Synthesis 4
Th1Q - Spoken Language Understanding Systems 4
Th1R - Human Speech Perception 2
Th2A - Large Vocabulary Continuous Speech Recognition 4
Th2B - Spoken Language Understanding Systems 2
Volume 6
Th2C - Signal Processing and Speech Analysis 3
Th2D - Human Speech Perception 3
Th4A - Speaker Adaptation 3
Th4B - Spoken Language Understanding Systems 3
Th4C - Multimodal Spoken Language Processing 3
Th4D - Acoustic Phonetics 2
Th4P - Large Vocabulary Continuous Speech Recognition 5
Th4Q - Speech Coding 3
Th4R - Language Acquisition 3 / Multilingual Perception and Recognition 2
Th5A - Segmentation, Labelling and Speech Corpora 3
Th5B - Text-To-Speech Synthesis 5
Th5C - Spoken Language Generation and Translation 2
Th5D - Human Speech Perception 4
Volume 7
Fr1A - Robust Speech Processing in Adverse Environments 5
Fr1B - Text-To-Speech Synthesis 6
Fr1C - Speech Technology Applications and Human-Machine Interface 2
Fr1D - Prosody and Emotion 6
Fr1P - Hidden Markov Model Techniques 3
Fr1Q - Speech & Hearing Disorders 2 / Speech Processing for the Speech Impaired and Hearing Impaired 1
Fr1R - Human Speech Production
Fr2A - Segmentation, Labelling and Speech Corpora 4
Fr2B - Speaker and Language Recognition 4
Fr2C - Speech Technology Applications and Human-Machine Interface 3
Fr2D - Utterance Verification and Word Spotting 2
Fr4A - Large Vocabulary Continuous Speech Recognition 6
Fr4B - Neural Networks, Fuzzy and Evolutionary Methods 3
Fr4C - Speech Processing for the Speech-Impaired and Hearing-Impaired 2
Fr4D - Prosody and Emotion 7
Fr5A - Plenary Session 2


Tuesday 1 December, 1998

Tu2A - Plenary Session 1

Tu2A1     Volume 2, pp. 1-6
Cochlear Implants in the Second and Third Millennia
Clark, G
University of Melbourne, AUSTRALIA

Tu3A - Text-To-Speech Synthesis 1

Tu3A1     Volume 2, p. 17
Unsupervised training of phone duration and energy models for
text-to-speech synthesis

Bagshaw, P C
France Telecom, CNET, FRANCE

Tu3A2     Volume 2, p. 21
Improved Duration Modeling of English Phonemes Using a Root Sinusoidal Transformation
Bellegarda, J Silverman, K
Apple Computer, USA

Tu3A3     Volume 2, p. 25
Efficient Adaptation of TTS Duration Model to New Speakers
Shih, C Gu, W van Santen, J P H
Bell Labs - Lucent Technologies, USA

Tu3A4     Volume 2, p. 29
Duration Modeling for HMM-Based Speech Synthesis
Yoshimura, T Tokuda, K Masuko, T Kobayashi, T Kitamura, T
Nagoya Institute of Technology, JAPAN

Tu3B - Spoken Language Models and Dialog 1

Tu3B1     Volume 2, p. 33
An educational dialogue system with a user controllable dialogue manager
Gustafson, J Elmberg, P Carlson, R Jönsson, A
Center for Speech Technology, TMH-KTH, SWEDEN

Tu3B2     Volume 2, p. 37
End-user driven dialogue system design: The REWARD experience
Failenschmid, K Thornton, J H S
Vocalis Ltd, UNITED KINGDOM

Tu3B3     Volume 2, p. 41
The Design of a Multi-domain Mandarin Chinese Spoken Dialogue System
Lin, Y-C Chiang, T-H Wang, H-M Peng, C-M Chang, C-H
Industrial Technology Research Institute, TAIWAN R.O.C.

Tu3B4     Volume 2, p. 45
An Integrated Dialogue System for the Automation of Call Centre Services
Georgila, K Tsopanoglou, A Fakotakis, N Kokkinakis, G
University of Patras, GREECE

Tu3C - Prosody and Emotion 1

Tu3C1     Volume 2, p. 49
Tones of a Tridialectal: Acoustic and Perceptual data on Ten Linguistic Tonetic contrasts between Lao, Nyo and Standard Thai
Rose, P
Australian National University, AUSTRALIA

Tu3C2     Volume 2, p. 53
Tone Sandhi between Complex Tones in a Seven-Tone Southern Thai Dialect
Thompson, N G I
Australian National University, AUSTRALIA

Tu3C3     Volume 2, p. 57
The Acoustic and Perceptual Features of Tone in the Tibeto-Burman Language Ao Naga
Coupe, A R
The Australian National University, AUSTRALIA

Tu3C4     Volume 2, p. 61
The Differential Status of Semivowels in the Acoustic Phonetic
Realisation of Tone

Rose, P
Australian National University, AUSTRALIA

Tu3D - Hidden Markov Model Techniques 1

Tu3D1     Volume 2, p. 65
Nonreciprocal Data Sharing in Estimating HMM parameters
Luo, X Jelinek, F
Johns Hopkins University, USA

Tu3D2     Volume 2, p. 69
Data-Driven Extensions to HMM Statistical Dependencies
Bilmes, J A
U.C. Berkeley / ICSI, USA

Tu3D3     Volume 2, p. 73
Use of High-Level Linguistic Constraints for constructing Feature Based Phonological Model in Speech Recognition
Sun, J Deng, L
University of Waterloo, CANADA

Tu3P - Speaker and Language Recognition 1

Tu3P1     Volume 2, p. 77
Sub-Band Based Speaker Verification Using Dynamic Recombination Weights
Sivakumaran, P Ariyaeeinia, A M Hewitt, J
University of Hertfordshire, UNITED KINGDOM

Tu3P2     Volume 2, p. 81
Measuring the Dynamic Encoding of Speaker Identity and Dialect in Prosodic Parameters
Barlow, M Wagner, M
Australian Defence Force Academy, AUSTRALIA

Tu3P3     Volume 2, p. 85
German Regional Variants - A Problem for Automatic Speech Recognition?
Beringer, N Schiel, F Regel-Brietzmann, P
University of Munich, GERMANY

Tu3P4     Volume 2, p. 89
Improving Accent Identification through Knowledge of English Syllable Structure
Berkling, K Vonwiller, J Cleirigh, C Zissman, M
MIT Lincoln Laboratory, USA

Tu3P5     Volume 2, p. 93
Multi-dimensional Scaling of Listener Responses to Complex Auditory stimuli
Bond, Z S Fucci, D Stockmal, V McColl, D
Ohio University, USA

Tu3P6     Volume 2, p. 97
Same Talker, Different Language
Stockmal, V Moates, D Bond, Z
Ohio University, USA

Tu3P7     Volume 2, p. 101
The Impact of Regional Variety upon Specific Word Categories in Spontaneous German
Burger, S Oppermann, D
University of Munich, GERMANY

Tu3P8     Volume 2, p. 105
Speech pre-processing against intentional imposture in speaker recognition
Genoud, D Chollet, G
IDIAP, SWITZERLAND

Tu3P9     Volume 2, p. 109
A Comparison of Two Unsupervised Approaches to Accent Identification
Lincoln, M Cox, S Ringland, S
University of East Anglia, UNITED KINGDOM

Tu3P10     Volume 2, p. 113
The Influence of Accents in Australian English Vowels and their Relation to Articulatory Tract Parameters
Dersch, D Cleirigh, C Vonwiller, J
Sydney University / APPEN Speech Technology, AUSTRALIA

Tu3P11     Volume 2, p. 117
Automatic language recognition using high-order HMMs
du Preez, J A Weber, D M
University of Stellenbosch, SOUTH AFRICA

Tu3P12     Volume 2, p. 121
Speaker Recognition Using Residual Signal of Linear and Nonlinear Prediction Models
Faundez-Zanuy, M Rodriguez-Porcheron, D
Escola Universitaria Politecnica De Mataro, SPAIN

Tu3P13     Volume 2, p. 125
An Implementation and Evaluation of an On-line speaker Verification System for Field Trials
Gu, Y Thomas, T
Vocalis Ltd, UNITED KINGDOM

Tu3P14     Volume 2, p. 129
Speaker Verification on the POLYCOST database using frequency filtered spectral energies
Hernando, J Nadeu, C
Polytechnical University of Catalonia, SPAIN

Tu3P15     Volume 2, p. 133
A High-Performance Text-Independent Speaker Identification System Based On BCDM
Jin, Q Si, L Hu, Q
Tsinghua University, P.R. CHINA

Tu3P16     Volume 2, p. 137
Representation of Voice Quality Features Associated with Talker Individuality
Kido, H Kasuya, H
Utsunomiya University, JAPAN

Tu3P17     Volume 2, p. 141
Candidate Selection Based on Significance Testing and its Use in Normalisation and Scoring
Kim, J Jang, G Yun, S Oh, Y
Advanced Institute of Science and Technology, KOREA

Tu3P18     Volume 2, p. 145
Japanese Forensic Phonetics:Non-contemporaneous within-speaker variation in Natural and Read-out Speech
Kinoshita, Y
The Australian National University, AUSTRALIA

Tu3P19     Volume 2, p. 149
Statistical Modeling of Pronunciation and Production Variations for Speech Recognition
Korkmazskiy, F E Juang B-H
Lucent Technologies Bell Laboratories, USA

Tu3P20     Volume 2, p. 153
Dialect Maps and Dialect Research; Useful Tools for Automatic Speech Recognition?
Foldvik, A K Kvale, K
Telenor R&D, NORWAY

Tu3P21     Volume 2, p. 157
Text Independent Speaker Recognition Using Micro-Prosody
Kyung, Y J Lee, H S
Korea Advanced Institute of Science and Technology, KOREA

Tu3P22     Volume 2, p. 161
Speaker Verification Using Fundamental Frequency
Cheng, Y Leung, H C
Chinese University of Hong Kong, HONG KONG

Tu3P23     Volume 2, p. 165
On Optimum Normalization Method Used for Speaker Verification
Liu, W Isobe, T Mukawa, N
NTT Data Corporation, JAPAN

Tu3P24     Volume 2, p. 169
Recurrent Substrings and Data Fusion for Language Recognition
Lloyd-Thomas, H Parris, E S Wright, J H
Ensigma Ltd, UNITED KINGDOM

Tu3P25     Volume 2, p. 173
Text-Independent Speaker Recognition using Multiple Information Sources
Markov, K Nakagawa, S
Toyohashi University of Technology, JAPAN

Tu3P26     Volume 2, p. 177
Discriminative Training of GMM using a Modified EM Algorithm for Speaker Recognition
Markov, K Nakagawa, S
Toyohashi University of Technology, JAPAN

Tu3P27     Volume 2, p. 181
Language Identification Incorporating Lexical Information
Matrouf, D Adda-Decker, M Lamel, L Gauvain, J-L
LIMSI-CNRS, FRANCE

Tu3P28     Volume 2, p. 185
A VQ based Speaker Recognition System Based in Histogram Distances. Text Independent and for Noisy Environements
Monte, E Arqué, R Miró, X
Polytechnical University of Catalonia, SPAIN

Tu3P29     Volume 2, p. 189
Spanish Dialects: Phonetic Transcription
Moreno, A Marino, J B
Polytechnical University of Catalonia, SPAIN

Tu3P30     Volume 2, p. 193
Acoustic analysis of Japanese English prosody: Comparison between Fukushima dialect speakers and Tokyo dialect speakers in declarative sentences and yes-no questions
Muramatsu, M
Fukushima Medical University, JAPAN

Tu3P31     Volume 2, p. 197
A Context-Dependent Approach for Speaker Verification Using Sequential Decision
Noda, H Harada, K Kawaguchi, E Sawai, H
Kyushu Institute of Technology, JAPAN

Tu3P32     Volume 2, p. 201
Quantitative Influence of Speech Variability Factors for Automatic Speaker Verification in Forensic Tasks
Ortega-Garcia, J Cruz-Llanas, S Gonzalez-Rodriguez, J
Universidad Politecnica Madrid, SPAIN

Tu3P33     Volume 2, p. 205
Creating Hidden Markov Models for Fast Speech
Pfau, T Ruske, G
Technical University of Munich, GERMANY

Tu3P34     Volume 2, p. 209
Speaker Identification using Relaxation Labeling
Pham, T D Wagner, M
University of Canberra, AUSTRALIA

Tu3P35     Volume 2, p. 213
A Novel Technique for the Combination of Utterance and Speaker Verification Systems in a Text-dependent Speaker Verification Task
Rodriguez-Linares, L Garcia-Mateo, C
University of Vigo, SPAIN

Tu3P36     Volume 2, p. 217
A Forensic Phonetic Investigation into Non-contemporaneous Variation in the F-pattern of Similar-sounding Speakers
Rose, P
Australian National University, AUSTRALIA

Tu3P37     Volume 2, p. 221
Human vs. Machine Speaker Identification with Telephone Speech
Schmidt-Nielsen, A Crystal, T H
U.S. Naval Research Laboratory, USA

Tu3P38     Volume 2, p. 225
A Comparison of Fusion Techniques in Mel-Cepstral based Speaker Identification
Slomka, S Sridharan, S Chandran, V
Queensland University of Technology, AUSTRALIA

Tu3P39     Volume 2, p. 229
On the Influence of Hyperarticulated Speech on Recognition Performance
Soltau, H Waibel, A
Carnegie Mellon University, USA

Tu3P40     Volume 2, p. 233
Text-independent speaker identification and verification using the TIMIT Database
Ward, N C Dersch, D R
Alcatel Australia, AUSTRALIA

Tu3P41     Volume 2, p. 237
Incorporating Linguistic Knowledge into Automatic Dialect Identification of Spanish
Yanguas, L O'Leary, G Zissman, M
Mit Lincoln Laboratory, USA

Tu3P42     Volume 2, p. 241
A Novel Text-Independent Speaker Verification Method Using the Global Speaker Model
Zhang, Y Zhu, X-Y
Tsinghua University, P.R. CHINA

Tu3Q - Multimodal Spoken Language Processing 1

Tu3Q1     Volume 2, p. 245
A fast method of producing talking head mouth shapes from real speech
Breen, A P Gloaguen, O Stern, P
BT Labs, UNITED KINGDOM

Tu3Q2     Volume 2, p. 249
The Efficiency of Multimodal Interaction: A Case Study
Cohen, P R Johnston, M McGee, D Oviatt, S L Clow, J Smith, I
Oregon Graduate Institute, USA

Tu3Q3     Volume 2, p. 253
Audio and Audio-visual Perception of Consonants Disturbed by White Noise and 'Cocktail Party'
Czap, L
University of Miskolc, HUNGARY

Tu3Q4     Volume 2, p. 257
Overview of the Maya Spoken Language System
Downey, S Breen, A P Fernandez, M Kaneen, E
BT Labs, UNITED KINGDOM

Tu3Q5     Volume 2, p. 261
Automatic Recognition of Spontaneous Speech Dialogues
Cettolo, M Falavigna, D
ITC-IRST, ITALY

Tu3Q6     Volume 2, p. 265
Using an Animated Talking Character in a web-based City Guide Demonstrator
Fries, G Feldes, S Corbet, A
Deutsche Telekom Berkom, GERMANY

Tu3Q7     Volume 2, p. 269
Influence of facial views on the McGurk effect in auditory noise
Kanzaki, R Kato, T
ATR Information Processing Research Laboratories, JAPAN

Tu3Q8     Volume 2, p. 273
The Intellimedia WorkBench - a generic environment for multimodal systems
Brondsted, T Larsen, L B Manthey, M Moeslund, T B McKevitt, P Olesen, K G
Aalborg University, DENMARK

Tu3Q9     Volume 2, p. 277
Stamp: A Suite of Tools for Analyzing Multimodal System Processing
Clow, J Oviatt, S L
Oregon Graduate Institute of Science & Technology, USA

Tu3Q10     Volume 2, p. 281
Cultural similarities and differences in the recognition of audio-visual speech stimuli
Shigeno, S
Kitasato University, JAPAN

Tu3Q11     Volume 2, p. 285
A Multimodal-Input Multimedia-Output Guidance System: MMGS
Takezawa, T Morimoto, T
ATR Interpreting Telecommunications Research Laboratories, JAPAN

Tu3Q12     Volume 2, p. 289
HMM-based Visual Speech Recognition using Intensity and Location Normalization
Vanegas, O Tanaka, A Tokuda, K Kitamura, T
Nagoya Institute of Technology, JAPAN

Tu3Q13     Volume 2, p. 293
A Hierarchy Probability-based Visual Features Extraction Method for Speechreading
Xu, Y Du, L Li, G Hou, Z
Chinese Academy of Science, P.R. CHINA

Tu3Q14     Volume 2, p. 297
Integration of Talking Heads and Text-to-Speech Synthesizers for Visual TTS
Ostermann, J Beutnagel, M Fischer, A Wang, Y
AT&T Labs Research

Tu3R - Isolated Word Recognition

Tu3R1     Volume 2, p. 301
Improving Accuracy of telephony-based, speaker-independent speech recognition
Azzopardi D Semnani S Milner B Wiseman R
BT Labs, UNITED KINGDOM

Tu3R2     Volume 2, p. 305
Rejection in Speech Recognition Systems with Limited Training
Bayya, A
Rockwell Semiconductor Systems, USA

Tu3R3     Volume 2, p. 309
A Four Layer Sharing HMM System for Very Large Vocabulary Isolated Word Recognition
Chen, R Tanaka, M Wu, D Olorenshaw, L Amador, M
SONY Research Labs, USA

Tu3R4     Volume 2, p. 313
A Comparative Study of Hybrid Modelling Techniques for Improved Telephone Speech Recognition
Chengalvarayan, R
Lucent Technologies, USA

Tu3R5     Volume 2, p. 317
Smoothing and Tying for Korean Flexible Vocabulary Isolated Word Recognition
Choi, J-S Lee, J-S Lee, H-Y
LG Corporate Institute of Technology, KOREA

Tu3R6     Volume 2, p. 321
Recent Work on a Preselection Module for a Flexible Large Vocabulary Speech Recognition System in Telephone Environment
Ferreiros, J Macias-Guarasa, J Gallardo, A Colas, J Cordoba, R Pardo, J
Villarrubia, L
GTH-IEL-UPM, SPAIN

Tu3R7     Volume 2, p. 325
A Study of Noise Robustness for Speaker Independent Speech Recognition
Method Using Phoneme Similarity Vector

Hoshimi, M Yamada, M Niyada, K Makino, S
Matsushita Research Institute Tokyo, Inc., JAPAN

Tu3R8     Volume 2, p. 329
Classification of Taiwanese tones based on pitch and energy movements
Jian, F
University of Reading, UNITED KINGDOM

Tu3R9     Volume 2, p. 333
Phoneme-based recognition for the Norwegian SpeechDat(II) database
Johansen, F T
Telenor R&D, NORWAY

Tu3R10     Volume 2, p. 337
Robust Feature Extraction for Alphabet Recognition
Karnjanadecha, M Zahorian, S A
Old Dominion University, USA

Tu3R11     Volume 2, p. 341
Recognition of Connected Digit Speech in Japanese Collected over the Telephone Network
Kawai, H Higuchi, N
KDD R&D Laboratories Inc., JAPAN

Tu3R12     Volume 2, p. 345
Improving the Speaker-Dependency of Subword-Unit-Based Isolated Word Recognition
Koizumi, T Taniguchi, S Kohtoh, K
Fukui University, JAPAN

Tu3R13     Volume 2, p. 349
Speaker Independent Speech Recognition Method using Constrained Time
Alignment near Phoneme Discriminative Frame

Konuma, T Suzuki, T Yamada, M Ono, Y Hoshimi, M Niyada, K
Matsushita Research Institute Tokyo, Inc, JAPAN

Tu3R14     Volume 2, p. 353
A Nonatationary Autoregressive HMM with Gain Adaptation for Speech Recognition
Lee, K Y Lee, J
Soongsil University, KOREA

Tu3R15     Volume 2, p. 357
A Large-Vocabulary Taiwanese (Min-nan) Multi-syllabic Word Recognition System Based upon Right-Context-Dependent Phones with State Clustering by Acoustic Decision Tree
Lyu, R-Y Chiang, Y-C Hsieh, W-P
Chang Gung University, TAIWAN R.O.C.

Tu3R16     Volume 2, p. 361
Speech Recognition Based on the Distance Calculation Between Intermediate Phonetic Code Sequences in Symbolic Domain
Tanaka, K Kojima, H
Electrotechnical Laboratory, JAPAN

Tu3R17     Volume 2, p. 365
High Accuracy Chinese Speech Recognition Approach with Chinese Input Technology for Telecommunication Use
Yang, Y C-H Kuo, J-J
Matsushita Institute of Technology, Taipei, TAIWAN R.O.C.

Tu4A - Robust Speech Processing in Adverse Environments 1

Tu4A1     Volume 2, p. 369
Robust Speech Recognition using HMM's With Toeplitz State Covariance Matrices
Roberts, W Ephraim, Y
Defence Science & Technology Organisation, AUSTRALIA

Tu4A2     Volume 2, p. 373
Modeling of Output Probability Distributions to improve Small Vocabulary Speech Recognition in Adverse Environments
Thambiratnam, D Sridharan, S
Queensland University of Technology, AUSTRALIA

Tu4A3     Volume 2, p. 377
Robust and compact multilingual word recognizers using features extracted from a phoneme similarity front-end
Morin, P Applebaum, T H Boman, R Zhao, Y Junqua, J-C
Panasonic Technologies Inc., USA

Tu4A4     Volume 2, p. 381
An Effect of Adaptive Beamforming on Hands-free Speech Recognition Based on 3-D Viterbi Search
Yamada, T Nakamura, S Shikano, K
Nara Institute of Science and Technology, JAPAN

Tu4A5     Volume 2, p. 385
Coherence-based Subband Decomposition for Robust Speech and Speaker Recognition in Noisy and Reverberant Rooms
Gonzalez-Rodriguez, J Cruz-Llanas, S Ortega-Garcia, J
Universidad Politecnica Madrid, SPAIN

Tu4A6     Volume 2, p. 389
A Minimax Search Algorithm for CDHMM based Robust Continuous Speech Recognition
Jiang, H Hirose, K Huo, Q
The University of Tokyo, JAPAN

Tu4B - Spoken Language Models and Dialog 2

Tu4B1     Volume 2, p. 393
An event driven model for dialogue systems
Wang, K
Microsoft Research, USA

Tu4B2     Volume 2, p. 397
Automatic Classification of Dialogue Contexts for Dialogue Predictions
Popovici, C Baggia, P Laface, P Moisa, L
Politecnico Di Torino, ITALY

Tu4B3     Volume 2, p. 401
Automatic Identification of Command Boundaries in a Conversational Natural Language User Interface
Ramaswamy, G N Kleindienst, J
I.B.M. Research Center, USA

Tu4B4     Volume 2, p. 405
The Predictive Power of Game Structure in Dialogue Act Recognition: Experimental Results Using Maximum Entropy Estimation
Poesio, M Mikheev, A
University of Edinburgh, HCRC, UNITED KINGDOM

Tu4B5     Volume 2, p. 409
A schema based approach to dialog control
Constantinides, P Hansma, S Tchou, C Rudnicky, A
Carnegie Mellon University, USA

Tu4B6     Volume 2, p. 413
Expanding a time-sensitive conversational architecture for turn-taking to handle content-driven interruption
Aist, G
Carnegie Mellon University, USA

Tu4C - Articulatory Modelling 1

Tu4C1     Volume 2, p. 417
A Three-Dimensional Linear Articulatory Model Based on MRI Data
Badin, P Bailly, G Raybaudi, M Segebarth, C
Institut Communication Parlee, FRANCE

Tu4C2     Volume 2, p. 421
On Loops and Articulatory Biomechanics
Perrier, P Payan, Y Perkell, J Jolly, F Zandipour, M Matthies, M
Institut De La Communication Parlée - INPG, FRANCE

Tu4C3     Volume 2, p. 425
Magnetic Resonance Measurements of the Velum port opening
Demolin, D Lecuit, V Metens, T Nazarian, B Soquet, A
Universite Libre De Bruxelles, BELGIUM

Tu4C4     Volume 2, p. 429
Cantilever-type force-sensor-mounted palatal plate for measuring palatolingual contact stress and pattern during speech phonation.
Matsumura, M Niikawa, T Tanabe, T Tachimura, T Wada, T
Osaka Electro-Communication University, JAPAN

Tu4C5     Volume 2, p. 433
Determination of the vocal tract spectrum from the articulatory movements based on the search of an articulatory-acoustic database
Kaburagi, T Honda, M
NTT Basic Research Laboratories, JAPAN

Tu4C6     Volume 2, p. 437
An MRI study on the relationship between oral cavity shape and larynx position
Honda, K Tiede, M
ATR Human Information Processing Research Laboratories, JAPAN

Tu4D - Talking To Infants, Pets and Lovers

Tu4D1     Volume 2, p. 441
Acoustic and Affective Qualities of IDS in English
Kitamura, C Burnham, D
University of New South Wales, AUSTRALIA

Tu4D2     Volume 2, p. 445
Acoustic Qualities of IDS and ADS in Thai
Thanavisuth, C Luksaneeyanawin, S
Chulalongkorn University, THAILAND

Tu4D3     Volume 2, p. 449
Pragmatic Characteristics of Infant Directed Speech
Luksaneeyanawin, S Thanavisuth, C Sittigasorn, S Rukkarangsarit, O
Chulalongkorn University, THAILAND

Tu4D4     Volume 2, p. 453
Are You My Little Pussy-Cat? Acoustic, Phonetic and Affective Qualities of Infant- and Pet-Directed Speech
Burnham, D Francis, E Vollmer-Conna, U Kitamura, C Averkiou, V Olley, A Nguyen, M Paterson, C
University of New South Wales, AUSTRALIA

Tu4D5     Volume 2, p. 457
Special Speech Registers: Talking To Australian and Thai Infants, and To Pets.
Burnham, D
University of New South Wales, AUSTRALIA

Tu5A - Robust Speech Processing in Adverse Environments 2

Tu5A1     Volume 2, p. 459
Performance improvements through combining phone- and syllable-scale information in automatic speech recognition
Wu, S-L Kingsbury, B E D Morgan, N Greenberg, S
Nuance Communications, USA

Tu5A2     Volume 2, p. 463
Predictive Adaptation and Compensation for Robust Speech Recognition
Surendran, A C Lee, C-H
Bell Labs, Lucent Technologies, USA

Tu5A3     Volume 2, p. 467
Influence of the speaking style and the noise spectral tilt on the Lombard reflex and automatic speech recognition
Junqua, J-C Fincke, S Field, K
Speech Technology Laboratory, USA

Tu5A4     Volume 2, p. 471
Data-Driven PMC and Bayesian Learning Integration for Fast Model Adaptation in Noisy Conditions
Crafa, S Fissore, L Vair, C
CSELT, ITALY

Tu5A5     Volume 2, p. 475
Improving the noise and spectral robustness of an isolated-word recognizer using an auditory-model front end
Hunke, M Hyun, M Love, S Holton, T
San Francisco State University, USA

Tu5A6     Volume 2, p. 479
A Model for Speech Reverberation and Intelligibility Restoring Filters
Kenny, O P Nelson, D J
USA

Tu5B - Spoken Language Models and Dialog 3

Tu5B1     Volume 2, p. 483
On Different Functions of Repetitive Utterances
Swerts, M Koiso, H Shimojima, A Katagiri, Y
IPO, Center for Research On User-System Interaction, THE NETHERLANDS

Tu5B2     Volume 2, p. 487
Prosody based detection of the context of backchannel responses
Noguchi, H Den, Y
NAIST, JAPAN

Tu5B3     Volume 2, p. 491
Robust Interpretation for Spoken Dialogue Systems
Strömbäck, L Jonsson, A
Linkoping University, SWEDEN

Tu5B4     Volume 2, p. 495
System-User Interaction and Response Strategy in Spoken Dialog System
Okato, Y Kato, K Yamamoto, M Itahashi, S
University of Tsukuba, JAPAN

Tu5B5     Volume 2, p. 499
Organizing Self-Motivated Dialogue with Autonomous Creatures
Suzuki, N Ishii, K Okada, M
ATR Media Integration & Communications Research Labs, JAPAN

Tu5B6     Volume 2, p. 503
Fly with the EAGLES: Evaluation of the ACCeSS Spoken Language Dialogue System
Hanrieder, G Heisterkamp, P Brey, T
Daimler-Benz AG, GERMANY

Tu5C - Speech Coding 1

Tu5C1     Volume 2, p. 507
A very low bit rate speech coder using HMM with speaker adaptation
Masuko, T Tokuda, K Kobayashi, T
Tokyo Institute of Technology, JAPAN

Tu5C2     Volume 2, p. 511
ITU-T G.729 extension at 6.4 kbps
Ekudden, E Hagen, R Johansson, B Hayashi, S Kataoka, A Kurihara, S
Ericsson Radio Systems AB, SWEDEN

Tu5C3     Volume 2, p. 515
Adaptive Transformation for Segmented Parametric Speech Coding
Mudugamuwa D J Bradley A B
RMIT, AUSTRALIA

Tu5C4     Volume 2, p. 519
Speech enhancement using STC-based bandwidth extension
Epps, J Holmes, W H
University of NSW, AUSTRALIA

Tu5C5     Volume 2, p. 523
Performance and Optimization of the SEEVOC Algorithm
Zhang, W Holmes, WH
The University of New South Wales, AUSTRALIA

Tu5D - Articulatory Modelling 2

Tu5D1     Volume 2, p. 527
Acoustic-articulatory evaluation of the upper vowel-formant region and its presumed speaker-specific potency
Clermont, F Mokhtari, P
Australian Defence Force Academy, AUSTRALIA

Tu5D2     Volume 2, p. 531
Control of larynx height in vowel production
Hoole, P Kroos, C
Munich University, GERMANY

Tu5D3     Volume 2, p. 535
Analyzing the effect of secondary excitations of the vocal tract on vocal intensity in different loudness conditions
Alku, P Vintturi, J Vilkman, E
University of Turku, FINLAND

Tu5D4     Volume 2, p. 539
An Analysis of Modal Coupling Effects During the Glottal Cycle : Formant Synthesizers From Time-Domain Finite-Difference Simulations
Ramsay, G
Institut De La Communication Parlee, FRANCE

Tu5D5     Volume 2, p. 543
Laryngoscopic Analysis of Pharyngeal Articulations and Larynx-Height Voice Quality Settings
Esling, J
University of Victoria, CANADA

Tu5D6     Volume 2, p. 547
Effects of shapes of radiational aperture on radiation characteristics
Matsuzaki, H Motoki, K Miki, N
Hokkai-Gakuen University, JAPAN

Tu5P - Prosody and Emotion 2

Tu5P1     Volume 3, p. 551
De-accentuation: Linguistic Environments and Prosodic Realizations
Alter, K Steinhauer, K Friederici, A D
Max-Planck-Institute of Cognitive Neuroscience, GERMANY

Tu5P2     Volume 3, p. 555
Towards an automatic classification of emotions in speech
Amir, N Ron, S
Center for Technological Education Holon, ISRAEL

Tu5P3     Volume 3, p. 559
Can we hear smile?
Schröder, M Auberge, V Cathiard, M
ICP - ESA 5009 CNRS, FRANCE

Tu5P4     Volume 3, p. 563
The Automatic Marking of Prominence in Spontaneous Speech Using Duration and Part of Speech Information
Aylett, M P Bull, M
University of Edinburgh, UNITED KINGDOM

Tu5P5     Volume 3, p. 567
On a Pitch Alteration Technique in Excited Cepstral Spectrum for High Quality TTS
Kim, J D Baek, S J Bae, M J
Soongsil University, KOREA

Tu5P6     Volume 3, p. 571
Dovetailing of acoustics and prosody in spontaneous speech recognition
Buckow, J Batliner, A Huber, R Noeth, E Warnke, V Niemann, H
University of Erlangen, GERMANY

Tu5P7     Volume 3, p. 575
A Computational Memory and Processing Model for Prosody
Cahn, J E
M.I.T. Media Laboratory, USA

Tu5P8     Volume 3, p. 579
Convergence of Fundamental Frequencies in Conversation: If it Happens, Does it Matter?
Collins, B M
Australian National University, AUSTRALIA

Tu5P9     Volume 3, p. 583
Analysis and Interpretation of Fundamental Frequency Contours of British English in Terms of a Command-Response Model
Fujisaki, H Ohno, S Yagi, T Ono, T
Science University of Tokyo, JAPAN

Tu5P10     Volume 3, p. 587
Common Patterns in Word Level Prosody
Holm, F Hata, K
Speech Technology Laboratory, USA

Tu5P11     Volume 3, p. 591
Prosodic Structure in Japanese Spontaneous Speech
Horiuchi, Y Ichikawa, A
Chiba University, JAPAN

Tu5P12     Volume 3, p. 595
An acoustic-phonetic description of word tone in Kagoshima Japanese
Ishihara, S
The Australian National University, AUSTRALIA

Tu5P13     Volume 3, p. 599
Representing Prosodic Words Using Statistical Models of Moraic Transition of Fundamental Frequency Contours of Japanese
Iwano, K Hirose, K
University of Tokyo, JAPAN

Tu5P14     Volume 3, p. 603
Disambiguation of Korean Utterances Using Automatic Intonation Recognition
Jang, T Song, M Lee, K
University of Edinburgh, UNITED KINGDOM

Tu5P15     Volume 3, p. 607
Multi-level rhythm control for speech synthesis using hybrid data driven and rule-based approaches
Jokisch, O Hirschfeld, D Eichner, M Hoffmann, R
Dresden University of Technology, GERMANY

Tu5P16     Volume 3, p. 611
EGG Model of Ditoneme in Mandarin
Kong, J
City University of Hong Kong, HONG KONG

Tu5P17     Volume 3, p. 615
Temporal Organization of Speech for normal and fast rates
Krishnan, G Ward, W
Carnegie Mellon University, USA

Tu5P18     Volume 3, p. 619
A syllable-based generalization of Japanese accentuation
Kubozono, H
Kobe University, JAPAN

Tu5P19     Volume 3, p. 623
Non-adjacent Segmental Effects in Tonal Realization of Accentual Phrase in Seoul Korean
Lee, H-J
UCLA, USA

Tu5P20     Volume 3, p. 627
Improvement on Connected Numbers Recognition Using Prosodic Information
Lopez-Gonzalo, E Caminero, J Cortazar, I Hernandez-Gomez, L
ETSIT-UPM, SPAIN

Tu5P21     Volume 3, p. 631
Phonetic investigation of boundary pitch movements in Japanese
Maeda, K Venditti, J J
University of Pennsylvania, USA

Tu5P22     Volume 3, p. 635
Phonetic and phonological characteristics of paralinguistic information in spoken Japanese
Maekawa, K
The National Language Research Institute, JAPAN

Tu5P23     Volume 3, p. 639
ToBI Accent Type Recognition
Maghbouleh, A
Stanford University, USA

Tu5P24     Volume 3, p. 643
The Influence of Syllable Structure on the Timing of Intonational Events in German
Mixdorff, H Fujisaki, H
Dresden University of Technology, GERMANY

Tu5P25     Volume 3, p. 647
New Prosodic Control Rules for Expressive Synthetic Speech
Mizuno, O Nakajima, S
NTT Human Interface Labs, JAPAN

Tu5P26     Volume 3, p. 651
The Use of F0 Reliability Function for Prosodic Command Analysis on F0 Contour Generation Model
Nakai, M Shimodaira, H
JAIST, JAPAN

Tu5P27     Volume 3, p. 655
Analysis of Effects of Lexical Accent, Syntax and Global Speech Rate upon the Local Speech Rate
Ohno, S Fujisaki, H Taguchi, H
Department of Applied Electronics, Science University of Tokyo, JAPAN

Tu5P28     Volume 3, p. 659
On the Effects of Speech Rate upon Parameters of the Command-Response Model for the Fundamental Frequency Contours of Speech
Ohno, S Fujisaki, H Hara, Y
Science University of Tokyo, JAPAN

Tu5P29     Volume 3, p. 663
The maximum-based description of F0 contours and its application to English
Portele, T Heuft, B
IKP, University of Bonn, GERMANY

Tu5P30     Volume 3, p. 667
Perceived prominence and acoustic parameters in American English
Portele, T
IKP, University of Bonn, GERMANY

Tu5P31     Volume 3, p. 671
Generating Emotional Speech with a Concatenative Synthesizer
Rank, E Pirker, H
OFAI, AUSTRIA

Tu5P32     Volume 3, p. 675
A Perceptive Measure of Pure Prosody Linguistic Functions with Reiterant Sentences
Rilliard, A Auberge, V
Institut De La Communication Parlee, FRANCE

Tu5P33     Volume 3, p. 679
Prosodic Parameters in Emotional Speech
Koike, K Suzuki, H Saito, H
Keio University, JAPAN

Tu5P34     Volume 3, p. 683
Automatic detection of prominence (as defined by listenersí judgments) in read aloud Dutch sentences
Streefkerk, B M Pols, L C W ten Bosch, L F M
Institute of Phonetic Science Amsterdam, THE NETHERLANDS

Tu5P35     Volume 3, p. 687
A Schema for Illocutionary Act Identification with Prosodic Feature
Tamoto, M Kawabata, T
NTT Basic Research Laboratory, JAPAN

Tu5P36     Volume 3, p. 691
An Algorithm for Choosing Japanese Acknowledgments using Prosodic Cues and Context
Tsukahara, W
University of Tokyo, JAPAN

Tu5P37     Volume 3, p. 695
A Study of Tones and Tempo in Continuous Mandarin Digit Strings and their Application in Telephone Quality Speech Recognition
Wang, C Seneff, S
MIT Laboratory for Computer Science, USA

Tu5P38     Volume 3, p. 699
Simulated emotions: an acoustic study of voice and perturbation measures
Whiteside, S P
University of Sheffield, UNITED KINGDOM

Tu5P39     Volume 3, p. 703
A Robust Tone Recognition Method of Chinese Based on Sub-syllabic F0 Contours
Zhang, J-S Hirose, K
The University of Tokyo, JAPAN

Tu5P40     Volume 3, p. 707
The Microprosodics of Tone Sandhi in Shanghai Disyllabic Compounds
Zhu, X S
Australian National University, AUSTRALIA

Tu5P41     Volume 3, p. 711
Jitter and Shimmer Differences Between Pathological Voices of School Children
Bolfan-Stosic, N Prizl, T
University of Zagreb, CROATIA

Tu5Q - Neural Networks, Fuzzy and Evolutionary Methods 1

Tu5Q1     Volume 3, p. 715
A Comparison of Thai Speech Recognition Systems using Hidden Markov Model, Neural Network, and Fuzzy-Neural Network
Ahkuputra, V Jitapunkul, S Jittiwarangkul, N Maneenoi, E Kasuriya, S
Chulalongkorn University, THAILAND

Tu5Q2     Volume 3, p. 719
Phoneme recognition with statistical modeling of the prediction error of neural networks
Freitag, F Monte, E
Polytechnical University of Catalonia, SPAIN

Tu5Q3     Volume 3, p. 723
Neural network based pronunciation modeling with applications to speech recognition
Fukada, T Yoshimura, T Sagisaka, Y
ATR Interpreting Telecommunications Research Laboratories, JAPAN

Tu5Q4     Volume 3, p. 727
A Comparative Study of OCON and MLP Architectures for Phoneme Recognition
Haskey, S Datta, S
Loughborough University, UNITED KINGDOM

Tu5Q5     Volume 3, p. 731
Evaluation and Integration of Neural-Network Training Techniques for Continuous Digit Recognition
Hosom, J P Cole, R A Cosi, P
Oregon Graduate Institute, USA

Tu5Q6     Volume 3, p. 735
Hierarchical Neural Networks (HNN) for Chinese Continuous Speech Recognition
Jia, Y Du, L Hou, Z
Chinese Academy of Sciences, P.R. CHINA

Tu5Q7     Volume 3, p. 739
Neural Network Motivation for Segmental Distribution
Keller, E
University of Lausanne, SWITZERLAND

Tu5Q8     Volume 3, p. 743
Combining Connectionist Multi-Band and Full-Band Probability Streams for Speech Recognition of Natural Numbers
Mirghafori, N Morgan, N
International Computer Science Institute, USA

Tu5Q9     Volume 3, p. 747
Initial Speech Recognition Results using the Multinet Architecture
Pizzolato, E B Reynolds, TJ
University of Essex, UNITED KINGDOM

Tu5Q10     Volume 3, p. 751
Selection of the Optimal Structure of the Continuous HMM Using the Genetic Algorithm
Takara, T Iha, Y Nagayama, I
University of the Ryukyus, JAPAN

Tu5Q11     Volume 3, p. 755
A Proposed Decision Rule for Speaker Recognition Based on Fuzzy
C-Means Clustering

Tran, D Wagner, M Le, T V
University of Canberra, AUSTRALIA

Tu5Q12     Volume 3, p. 759
Fuzzy Gaussian Mixture Models for Speaker Recognition
Tran, D Le, T V Wagner, M
University of Canberra, AUSTRALIA

Tu5Q13     Volume 3, p. 763
A New Strategy of Fuzzy-Neural Network for Thai Numeral Speech Recognition
Wutiwiwatchai, C Jitapunkul, S Ahkuputra, V Maneenoi, E
Luksaneeyanawin, S
Chulalongkorn University, THAILAND

Tu5Q14     Volume 3, p. 767
Thai Polysyllabic Word Recognition Using Fuzzy-Neural Network
Wutiwiwatchai, C Jitapunkul, S Ahkuputra, V Maneenoi, E
Luksaneeyanawin, S
Chulalongkorn University, THAILAND

Tu5R - Utterance Verification and Word Spotting 1 / Speaker Adaptation 1

Tu5R1     Volume 3, p. 771
Word Verification Using Confidence Measures in Speech Recognition
Benitez, C Rubio Ayuso, A Garcia Teodoro, P Diaz Verdejo, J E
Universidad De Granada, SPAIN

Tu5R2     Volume 3, p. 775
Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems
Bernardis, G Bourlard, H
IDIAP, SWITZERLAND

Tu5R3     Volume 3, p. 779
Two-pass Utterance Verification Algorithm for Long Natural Numbers Recognition
Caminero, J Lopez, E Hernandez, L
ETSIT-UPM, SPAIN

Tu5R4     Volume 3, p. 783
A*-Admissible Key-Phrase Spotting with Sub-Syllable Level Utterance Verification
Chen, B Wang, H-M Shien, L-F Lee, L-S
Academica Sinica, Taiwan, TAIWAN R.O.C.

Tu5R5     Volume 3, p. 787
Speaker-Independent Upfront Dialect Adaptation in a Large Vocabulary Continuous Speech Recognizer
Fischer, V Gao, Y Janke, E
IBM Speech Systems, GERMANY

Tu5R6     Volume 3, p. 791
Word-Based Acoustic Confidence Measures for Large-Vocabulary Speech Recognition
Gunawardana, A Hon, H-W Jiang, L
Microsoft Research, USA

Tu5R7     Volume 3, p. 795
Improved Utterance Rejection using Length Dependent Thresholds
Gupta, S K Soong, F
Bell Laboratories - Lucent Technologies, USA

Tu5R8     Volume 3, p. 799
Bayesian Constrained Frequency Warping HMMs for Speaker Normalization
Ho, C H Vaseghi, S Chen, A
Queen's University of Belfast, UNITED KINGDOM

Tu5R9     Volume 3, p. 803
An Evaluation of Keyword Spotting Performance utilizing False Alarm Rejection based on Prosodic Information
Ida, M Yamasaki, R
OMRON Corporation, JAPAN

Tu5R10     Volume 3, p. 807
Predictive Speaker Adaptation and Its Prior Training
Tran, D Iso, K-I
NEC, JAPAN

Tu5R11     Volume 3, p. 811
Powerful Syllabic Fillers for General-Task Keyword-Spotting and Unlimited-Vocabulary Continuous-Speech Recognition
El Meliani, R O'Shaughnessy, D
INRS-Telecommunications, CANADA

Tu5R12     Volume 3, p. 815
Confidence Scoring for Speech Understanding Systems
Pao, C Schmid, P Glass, J
MIT Laboratory for Computer Science, USA

Tu5R13     Volume 3, p. 819
Phonological Rules for Enhancing Acoustic Enrollment of Unknown Words
Ramabhadran, B Ittycheriah, A
IBM T.J. Watson Research Center, USA

Tu5R14     Volume 3, p. 823
Recognition-Based Word Counting for Reliable Barge-in and Early Endpoint Detection in Continuous Speech Recognition
Setlur, A R Sukkar, R A
Lucent Technologies, USA

Tu5R15     Volume 3, p. 827
Linear Discriminant - A New Criterion for Speaker Normalization
Westphal, M Schultz, T Waibel, A
Carnegie Mellon University, USA

Tu5R16     Volume 3, p. 831
Confidence Measures Derived from an Acceptor HMM
Williams, G Renals, S
Sheffield University, UNITED KINGDOM

Tu5R17     Volume 3, p. 835
Telephone Speech Multi-Keyword Spotting Using Fuzzy Search Algorithm and Prosodic Verification
Wu, C-H Chen, Y-J Hung, Y-C
National Cheng Kung University, TAIWAN R.O.C.

Tu5R18     Volume 3, p. 839
Topic Recognition for News Speech Based on Keyword Spotting
Yamashita, Y Tsunekawa, T Mizoguchi, R
Ritsumeikan University, JAPAN

Tu6A - Text-To-Speech Synthesis 2

Tu6A1     Volume 3, p. 843
Prosody Prediction for Speech Synthesis using Transformational
Rule-based Learning

Fordyce, C Ostendorf, M
Lernout & Hauspie Speech Products, USA

Tu6A2     Volume 3, p. 847
Representing the environments for phonological processes in an accent-independent lexicon for synthesis of English
Fitt, S Isard, S
University of Edinburgh, UNITED KINGDOM

Tu6A3     Volume 3, p. 851
Efficient Lexical Retrieval for English Text-to-Speech Synthesis
Faulkner, D Bryant, C
Aculab Plc, UNITED KINGDOM

Tu6B - Spoken Language Models and Dialog 4

Tu6B1     Volume 3, p. 855
SQEL: A Multilingual and Multifunctional Dialog System
Aretoulaki, M Harbeck, S Gallwitz, F Noeth, E Niemann, H Ivanecky, J Ipsic, I Pavesic, N Matousek, V
University of Erlangen, GERMANY

Tu6B2     Volume 3, p. 859
Semi-automated incremental prototyping of spoken dialog systems
Kaspar, S Hoffmann, A
University of New South Wales, AUSTRALIA

Tu6B3     Volume 3, p. 863
Beyond Structured Dialogues: Factoring out Grounding
Heeman, P Johnston, M Denney, J Kaiser, E
Oregon Graduate Institute, USA

Tu6D - Human Speech Perception 1

Tu6D1     Volume 3, p. 867
Heads and tails in word perception: Evidence for 'Early-to-Late' Processing in listening and reading
Nooteboom, S G Van Dijk, M
Utrecht University, THE NETHERLANDS

Tu6D2     Volume 3, p. 871
Evidence for Early Effects of Sentence Context on Word Segmentation
te Riele, S M M Quené, H
Utrecht University, THE NETHERLANDS

Tu6D3     Volume 3, p. 875
Assimilation and anticipation in word perception
Quené, H van Rossum, M van Wijck, M
Utrecht University, THE NETHERLANDS

Tu6D4     Volume 3, p. 879
Lexical activation by assimilated and reduced tokens
Kelly, M L Bard, E G Sotillo, C
University of Edinburgh, UNITED KINGDOM

Wednesday 2 December, 1998

We1A - Robust Speech Processing in Adverse Environments 3

We1A1     Volume 3, p. 883
Linear and Nonlinear Speech Feature Analysis for Stress Classification
Zhou, G Hansen, J H L Kaiser, J F
Robust Speech Processing Laboratory, USA

We1A2     Volume 3, p. 887
Speech Feature Modeling for Robust Stressed Speech Recognition
Bou-Ghazale, S E Hansen, J H L
Robust Speech Processing Laboratory, USA

We1A3     Volume 3, p. 891
Combining Articulatory and Acoustic Information for Speech Recognition in Noisy and Reverberant Environments

Kirchhoff, K
University of Bielefeld, GERMANY

We1A4     Volume 3, p. 895
Improving Speaker Identification Performance in Reverberant Conditions using Lip Information
Wark, T Sridharan, S
Queensland University of Technology, AUSTRALIA

We1B - Speech and Hearing Disorders 1

We1B1     Volume 3, p. 899
Adults with a severe-to-profound hearing impairment. Investigating the effects of linguistic context on speech perception.
Flynn, M Dowell, R Clark, G
University of Melbourne, AUSTRALIA

We1B2     Volume 3, p. 903
Speech Perception in Dyslexia: Measurements from Birth Onwards
Koopmans-van Beinum, F J Schwippert, C Kuijpers, C T L
University of Amsterdam, THE NETHERLANDS

We1B3     Volume 3, p. 907
An Acoustic Analysis of Vowel Production Across Tasks in a Case of
Non-fluent Progressive Aphasia

Croot, K
University of Sydney / Macquarie University, AUSTRALIA

We1B4     Volume 3, p. 911
Speech technology in clinical environments:
van Doorn, J McLeod, S Baker, E Purcell, A Thorpe, W
The University of Sydney (Cumberland Campus), AUSTRALIA

We1C - Prosody and Emotion 3

We1C1     Volume 3, p. 915
What Spreads, and How? Tonal Rightward Spreading On Shanghai Disyllabic Compounds
Zhu, X S
Australian National University, AUSTRALIA

We1C2     Volume 3, p. 919
Tonal Complexity as a Dialectal Feature: 25 Different Citation Tones from
Four Zhejiang Wu Dialects

Zhu, S X Rose, P
Australian National University, AUSTRALIA

We1C3     Volume 3, p. 923
Emotional Speech Synthesis: from speech database to TTS
Montero, J M Gutierrez-Arriola, J Palazuelos, S Enriquez, E Aguilera, S Pardo, J M
GTH-IEL-UPM, SPAIN

We1C4     Volume 3, p. 927
Some Acoustic Characteristics of Emotion
Pereira, C Watson, C
SHLRC, Macquarie University, AUSTRALIA

We1D - Spoken Language Understanding Systems 1

We1D1     Volume 3, p. 931
Galaxy-II: A Reference Architecture for Conversational System Development
Seneff, S Hurley, E Lau, R Pao, C Schmid, P Zue, V
MIT, USA

We1D2     Volume 3, p. 935
Improvements in Speech Understanding Accuracy Through the Integration of Hierarchical Linguistic, Prosodic, and Phonological Constraints in the Jupiter Domain
Chung, G Seneff, S
MIT Laboratory for Computer Science, USA

We1D3     Volume 3, p. 939
Towards Robust Methods for Spoken Document Retrieval
Ng, K
MIT, USA

We1P - Signal Processing and Speech Analysis 1

We1P1     Volume 3, p. 943
Maximum a Posteriori Pitch Tracking
Droppo, J Acero, A
Microsoft Corporation, USA

We1P2     Volume 3, p. 947
Vowel Separation Using the Reassigned Amplitude-Modulation Spectrum
Yang, D Meyer, G F Ainsworth, W A
Keele University, UNITED KINGDOM

We1P3     Volume 3, p. 951
Feature Decorrelation Methods in Speech Recognition. A Comparative Study
Batlle, E Nadeu, C Fonollosa, J A R
Polytechnical University of Catalonia, SPAIN

We1P4     Volume 3, p. 955
Multi-resolution for Speech Analysis
Caraty, M-J Montacie, C
Universite Pierre Et Marie Curie, FRANCE

We1P5     Volume 3, p. 959
Dynamic features in Children's Vowels
Cassidy, S Watson, C
Macquarie University, AUSTRALIA

We1P6     Volume 3, p. 963
Effectiveness of phase-corrected RASTA for continuous speech recognition
de Veth, J Boves, L
University of Nijmegen, THE NETHERLANDS

We1P7     Volume 3, p. 967
Techniques for capturing temporal variations in speech signals with fixed-rate processing
Dharanipragada, S Gopinath, R Rao, B
IBM T. J. Watson Research Center, USA

We1P8     Volume 3, p. 971
Automatic Detection of Landmark for Nasal Consonants from Speech Waveform
Du, L Stevens, K N
Chinese Academy of Sciences, P.R. CHINA

We1P9     Volume 3, p. 975
Plug and Play Software for Designing High-Level Speech Processing Systems
Dutoit, T Schroeter, J
Faculte Polytechnique De Mons, BELGIUM

We1P10     Volume 3, p. 979
Creating speaker independent HMM models for restricted database using STRAIGHT-TEMPO morphing
Girardi, A Shikano, K Nakamura, S
Nara Institute of Science and Technology, JAPAN

We1P11     Volume 3, p. 983
Restoration of hyperbaric speech by correction of the formants and the pitch
Charonnat, L Guitton, M Crestel, J Allee, J
ENSSAT, FRANCE

We1P12     Volume 3, p. 987
Voice Conversion based on parameter transformation
Gutiérrez-Arriola, J M Hsiao, Y S Montero, J M Pardo, J M Childers, D G
GTH-IEL-UPM, SPAIN

We1P13     Volume 3, p. 991
Noise Robust Two-Stream Auditory Feature Extraction Method for Speech Recognition
Tian, J Hariharan, R Laurila, K
Nokia Research Center, FINLAND

We1P14     Volume 3, p. 995
Heterogeneous Measurements and Multiple Classifiers for Speech Recognition
Halberstadt, A K Glass, J R
MIT, USA

We1P15     Volume 3, p. 999
Joint Recognition and Segmentation using Phonetically Derived Features and a Hybrid Phoneme Model
Harte, N Vaseghi, S Milner, B
Queen's University of Belfast, UNITED KINGDOM

We1P16     Volume 3, p. 1003
TRAPS - Classifiers of Temporal Patterns
Hermansky, H Sharma, S
Oregon Graduate Institute, USA

We1P17     Volume 3, p. 1007
Robust measurement of fundamental frequency and degree of voicing
Holmes, J N
UNITED KINGDOM

We1P18     Volume 3, p. 1011
Micropower Electro-Magnetic Sensors for Speech Characterization, Recognition, Verification, and Other Applications
Holzrichter, J F Burnett, G C Gable, T J Ng, L C
Lawrence Livermore National Laboratory, USA

We1P19     Volume 3, p. 1015
Robust Entropy-based Endpoint Detection for Speech Recognition in Noisy Environments
Shen, J-L Hung, J-W Lee, L-S
Academia Sinica, TAIWAN R.O.C.

We1P20     Volume 3, p. 1019
Statistical Integration of Temporal Filter Banks for Robust Speech Recognition Using Linear Discriminant Analysis (LDA)
Shen, J-L Hwang, W-L
Academia Sinica, TAIWAN R.O.C.

We1P21     Volume 3, p. 1023
Feature-Based Approach to Speech Recognition
Iskra, D J Edmondson, W H
University of Birmingham, UNITED KINGDOM

We1P22     Volume 3, p. 1027
Periodicity Emphasis of Voice Wave using Nonlinear IIR Digital Filters and Its Applications
Kamata, H Kaneko, A Ishida, Y
Meiji University, JAPAN

We1P23     Volume 3, p. 1031
Speech recognition via phonetically featured syllables
King, S Stephenson, T Isard, S Taylor, P Strachan, A
CSTR, University of Edinburgh, UNITED KINGDOM

We1P24     Volume 3, p. 1035
Do Phonetic Features Help to Improve Consonant Identification in ASR?
Koreman, J Andreeva, B Barry, W J
University of the Saarland, GERMANY

We1P25     Volume 3, p. 1039
Perceptual and acoustic properties of phonemes in continuous speech for different speaking rate
Kuwabara, H
Teikyo University of Science & Technology, JAPAN

We1P26     Volume 3, p. 1043
On Robust Sequential Estimator Based on t-distribution with Forgetting Factor for Speech Analysis
Lee, J Lee, K Y
Dong-Ah Broadcasting College, KOREA

We1P27     Volume 3, p. 1047
Discriminant Wavelet Basis Construction for Speech Recognition
Long, C Datta, S
Loughborough University, UNITED KINGDOM

We1P28     Volume 3, p. 1051
An Efficient Mel-LPC Analysis Method for Speech Recognition
Matsumoto, H Nakatoh, Y Furuhata, Y
Shinshu University, JAPAN

We1P29     Volume 3, p. 1055
Discriminative Weighting of Multi-resolution Sub-band Cepstral Features for Speech Recognition
McMahon, P McCourt, P Vaseghi, S
Queens University of Belfast, NORTHERN IRELAND

We1P30     Volume 3, p. 1059
Separation of Singing and Piano Sounds
Meron, Y Hirose, K
Tokyo University, JAPAN

We1P31     Volume 3, p. 1063
Modeling of Variations in Cepstral Coefficients Caused by F0 Changes and its Application to Speech Processing
Minematsu, N Nakagawa, S
Toyohashi University of Technology, JAPAN

We1P32     Volume 3, p. 1067
A Detection Framework for Locating Phonetic Events
Niyogi, P Mitra, P P Sondhi, M M
Bell Laboratories, Lucent Technologies, USA

We1P33     Volume 3, p. 1071
On Frequency Averaging for Spectral Analysis in Speech Recognition
Nadeu, C Galindo, F Padrell, J
Polytechnical University of Catalonia, SPAIN

We1P34     Volume 3, p. 1075
Wavelet transform domain blind equalization and its application to speech analysis
Namba, M Ishida, Y
Meiji University, JAPAN

We1P35     Volume 3, p. 1079
A Novel Method of Formant Analysis and Glottal Inverse Filtering
Pearson, S
Speech Technology Lab/Panasonic Technologies Inc., USA

We1P36     Volume 3, p. 1083
Vector Quantizer Acceleration for an Automatic Speech Recognition Application
Araújo, A Pera, V Souza, M
FEUP, PORTUGAL

We1P37     Volume 3, p. 1087
Local Speech Rate as a Combination of Syllable and Phone Rate
Pfitzinger, H R
University of Munich, GERMANY

We1P38     Volume 3, p. 1091
Recovering gestures from speech signals: a preliminary study for nasal vowels
Rossato, S Feng, G Laboissière, R
Institut De La Communication Parlee, FRANCE

We1P39     Volume 3, p. 1095
Extended Linear Discriminant Analysis (ELDA) for Speech Recognition
Ruske, G Faltlhauser, R Pfau, T
Technical University of Munich, GERMANY

We1P40     Volume 3, p. 1099
Speech, Silence, Music and Noise Classification of TV Broadcast Material
Samouelian, A Robert-Ribes, J Plumpe, M
University of Wollongong, AUSTRALIA

We1P41     Volume 3, p. 1103
The relation between vocal tract shape and formant frequencies can be described by means of a system of coupled differential equations
Schoentgen, J Soquet, A Lecuit, V Ciocea, S
Universite Libre De Bruxelles, BELGIUM

We1P42     Volume 3, p. 1107
Improving speech recognizer by broader acoustic-phonetic group classification
Suh, Y Hwang, K Kwon, O Park, J
ETRI, KOREA

We1P43     Volume 3, p. 1111
Separation of speech source and filter by time-domain deconvolution
Thorpe, C W
National Voice Centre, Australia, AUSTRALIA

We1P44     Volume 3, p. 1115
On the Application of the AM-FM Model for the Recovery of Missing Frequency Bands of Telephone Speech
Tolba, H O'Shaughnessy, D
INRS-Telecommunications, CANADA

We1P45     Volume 3, p. 1119
Estimation of Voice Source and Vocal Tract Parameters Using Combined Subspace-based and Amplitude Spectrum-based Algorithm
Yang, C-S Kasuya, H
Utsunomiya University, JAPAN

We1P46     Volume 3, p. 1123
The Distance Measure for Line Spectrum Pairs Applied to Speech Recognition
Zheng, F Song, Z-J Li, L Yu, W-J Zheng, F-Z Wu, W-H
Tsinghua University, P.R. CHINA

We1Q - Spoken Language Generation and Translation 1

We1Q1     Volume 4, p. 1127
The Modeling and Realization of Natural Speech Generation System
Chen, F Yuan, B
Northern Jiaotong University, P.R. CHINA

We1Q2     Volume 4, p. 1131
'Ko Tok Ples Ensin bilong Tok Pisin' or the TP-CLE: A first report from a pilot speech-to-speech translation project from Swedish to Tok Pisin
Eklund, R
Telia Research AB, SWEDEN

We1Q3     Volume 4, p. 1135
An Iterative, DP-based Search Algorithm for Statistical Machine Translation
Garcia-Varea, I Casacuberta, F Ney, H
Universidad Politecnic De Valencia, SPAIN

We1Q4     Volume 4, p. 1139
Information extraction and text generation of news reports for a Swedish-English bilingual spoken dialogue system
Gawronska, B House, D
University of Skovde, SWEDEN

We1Q5     Volume 4, p. 1143
Utterance Generation for Transaction Dialogues
Hulstijn, J Van Hessen, A
University of Twente, THE NETHERLANDS

We1Q6     Volume 4, p. 1147
Example-Based Error Recovery Method for Speech Translation: Repairing Sub-Trees According to the Semantic Distance
Ishikawa, K Sumita, E Iida, H
ATR Interpreting Telecommunications Research Laboratories, JAPAN

We1Q7     Volume 4, p. 1151
Context Sensitive Generation of Descriptions
Krahmer, E Theune, M
IPO Center for Research On User-System Interaction, THE NETHERLANDS

We1Q8     Volume 4, p. 1155
An Interlingua Based on Domain Actions for Machine Translation of Task-Oriented Dialogues
Levin, L Gates, D Lavie, A Waibel, A
Carnegie Mellon University, USA

We1Q9     Volume 4, p. 1159
Generating Pitch Accents in a Concept-To-Speech System using a Knowledge Base
Williams, S H
Microsoft Research Institute, Macquarie University, AUSTRALIA

We1Q10     Volume 4, p. 1163
Making the most of multiplicity: a multi-parser, multi-strategy architecture for the robust processing of spoken language
Ruland, T Rupp, C J Spilker, J Weber, H Worm, K L
Siemens AG / University of the Saarland, GERMANY

We1Q11     Volume 4, p. 1167
Natural-Sounding Speech Synthesis Using Variable-Length Units
Yi, J R W Glass, J R
MIT, USA

We1R - Spoken Language Models and Dialog 5

We1R1     Volume 4, p. 1171
A robust dialogue model for spoken dialogue processing
Araki, M Doshita, S
Kyoto University, JAPAN

We1R2     Volume 4, p. 1175
The REWARD Service Creation Environment. An Overview
Brøndsted, T Bai, B Olsen, J
Aalborg University, DENMARK

We1R3     Volume 4, p. 1179
An Analysis of the Timing of Turn-Taking in a Corpus of Goal-Oriented Dialogue
Bull, M C Aylett, M
University of Edinburgh, UNITED KINGDOM

We1R4     Volume 4, p. 1183
The provision of corrective feedback in a spoken dialogue CALL system
Davies, S Poesio, M
University of Edinburgh, HCRC, UNITED KINGDOM

We1R5     Volume 4, p. 1187
Evaluation of dialog strategies for a tourist information retrieval system
Devillers, L Bonneau-Maynard, H
LIMSI/CNRS, FRANCE

We1R6     Volume 4, p. 1191
Designing a Multimodal Dialogue System for Information Retrieval
Furui, S Yamaguchi, K
Tokyo Institute of Technology, JAPAN

We1R7     Volume 4, p. 1195
The Research Project of Man-Computer dialogue System in Chinese
Guan, D Chu, M Zhang, Q Liu, J Zhang, H
Chinese Academy of Sciences, P.R. CHINA

We1R8     Volume 4, p. 1199
Interfaces for Speech Recognition Systems: The Impact of Vocabulary Constraints and Syntax on Performance
Hone, K Golightly, D
ICL Institute of Information Technology, UNITED KINGDOM

We1R9     Volume 4, p. 1203
Pacing Spoken Directions to Suit the Listener
Iwase, T Ward, N
University of Tokyo, JAPAN

We1R10     Volume 4, p. 1207
A Spoken Dialogue System Utilizing Spatial Information
Flycht-Eriksson, A Jonsson, A
Linkoping University, SWEDEN

We1R11     Volume 4, p. 1211
From Novice to Expert: The Effect of Tutorials on User Expertise with Spoken Language Dialogue Systems - An Empirical Analysis of the Efficacy of Tutorial Dialogues
Kamm, C A Litman, D J Walker, M A
AT&T Labs Research, USA

We1R12     Volume 4, p. 1215
Emergent Computational Dialogue Management Architecture for Task-Oriented Spoken Dialogue Systems
Kawabata, T
NTT Basic Research Labs, JAPAN

We1R13     Volume 4, p. 1219
An Analysis of Dialogues with Our Dialogue System through a WWW page
Kumamoto, T Ito, A
Communications Research Laboratory, JAPAN

We1R14     Volume 4, p. 1223
Modelling spoken dialogues with state transition diagrams: experiences of the CSLU toolkit
McTear, M
University of Ulster, UNITED KINGDOM

We1R15     Volume 4, p. 1227
Situated Dialogue Coordination for Spoken Dialogue Systems
Okada, M Suzuki, N Terken, J
ATR MI & C Research Laboratories, JAPAN

We1R16     Volume 4, p. 1231
Robust spoken dialogue systems for consumer products: a concrete application
Pouteau, X Arevalo, L
IPO, Center for Research On User-System Interaction, THE NETHERLANDS

We1R17     Volume 4, p. 1235
A German Dialogue System for Scheduling Dates and Meetings by Naturally Spoken Continuous Speech
Willett, D Roemer, A Rottland, J Rigoll, G
Duisburg University, GERMANY

We1R18     Volume 4, p. 1239
Spoken Dialogue System Using Corpus-Based Hidden Markov Model
Wu, C-H Yan, G-L Lin, C-L
National Cheng Kung University, TAIWAN R.O.C.

We1R19     Volume 4, p. 1243
A Realistic Wizard of Oz simulation of a Multimodal Spoken Language System
Wyard, P Churcher, G
BT Laboratories, UNITED KINGDOM

We1R20     Volume 4, p. 1247
A Syllable-Based Chinese Spoken Dialogue System for Telephone Directory Services Primarily Trained with a Corpus
Yang, Y-J Lee, L-S
National Taiwan University, TAIWAN R.O.C.

We1R21     Volume 4, p. 1251
How disagreement expressions are used in cooperative tasks
Yano, H Ito, A
Communications Research Laboratory, JAPAN

We2A - Segmentation, Labelling and Speech Corpora 1

We2A1     Volume 4, p. 1255
Acoustic Indicators of Topic Segmentation
Hirschberg, J Nakatani, C
AT & T Labs - Research, USA

We2A2     Volume 4, p. 1259
IViE - A Comparative Transcription system for Intonational Variation in English
Grabe, E Nolan, F Farrar, K J
University of Cambridge, UNITED KINGDOM

We2A3     Volume 4, p. 1263
Automatic Segmental and Prosodic Labeling of Mandarin Speech Database
Chou, F-C Tseng, C-Y Lee, L-S
National Taiwan University, TAIWAN R.O.C.

We2A4     Volume 4, p. 1267
Automatic Labelling of German Prosody
Rapp, S
Sony International (Europe) GmbH, GERMANY

We2B - Multimodal Spoken Language Processing 2

We2B1     Volume 4, p. 1271
Speech Driven 3-D Face Point Trajectory Synthesis Algorithm
Arslan, L M Talkin, D
Entropic Research Lab, USA

We2B2     Volume 4, p. 1275
Speech-to-Lip Movement Synthesis based on the EM Algorithm using Audio-Visual HMMs
Yamamoto, E Nakamura, S Shikano, K
Nara Institute of Science and Technology, JAPAN

We2B3     Volume 4, p. 1279
Learning Words from Natural Audio-Visual Input
Roy, D Pentland, A
MIT Media Laboratory, USA

We2B4     Volume 4, p. 1283
Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition: Experiments on the M2VTS Database
Dupont, S Luettin, J
Faculte Polytechnique De Mons, BELGIUM

We2C - Prosody and Emotion 4

We2C1     Volume 4, p. 1287
Intonative structure as a determinant of word order variation in Dutch verbal endgroups
Swerts, M
IPO, Center for Research On User-System Interaction, THE NETHERLANDS

We2C2     Volume 4, p. 1291
Experiments on the Meaning of Two Pitch Accent Types: The 'Pointed Hat' Versus the Accent-Lending Fall in Dutch
Caspers, J
Holland Institute of Generative Linguistics, NETHERLANDS

We2C3     Volume 4, p. 1295
Phonetic and Phonological Markers of Contrastive Focus in Korean
Jun, S-A Lee, H-J
UCLA, USA

We2C4     Volume 4, p. 1299
Reconciling two competing views on contrastiveness
Krahmer, E Swerts, M
IPO, Center for Research On User-System Interaction, THE NETHERLANDS

We2D - Neural Networks, Fuzzy and Evolutionary Methods 2

We2D1     Volume 4, p. 1303
Modular Neural Networks for Low-Complex Phoneme Recognition
Glaeser, A
Ascom Systec Ltd, SWITZERLAND

We2D2     Volume 4, p. 1307
Global Optimisation of Neural Network Models Via Sequential Sampling-Importance Resampling
de Freitas, N Johnson, S Niranjan, M Gee, A
Cambridge University, UNITED KINGDOM

We2D3     Volume 4, p. 1311
Efficient computation of MMI neural networks for large vocabulary speech recognition systems
Rottland, J Luedecke, A Rigoll, G
Duisburg University, GERMANY

We2D4     Volume 4, p. 1315
Modular Connectionist Systems for Identifying Complex Arabic Phonetic Features
Selouani, S A Caelen, J
HB University, ALGERIA

We4A - Large Vocabulary Continuous Speech Recognition 1

We4A1     Volume 4, p. 1319
Real-Time Recognition of Broadcast News
Cook, G Robinson, T Christie, J
Cambridge University, UNITED KINGDOM

We4A2     Volume 4, p. 1323
Automatic Recognition of Korean Broadcast News Speech
Yu, H-J Kim, H Choi, J-S Hong, J-M Park, K-S Lee, J-S Lee, Y-G Lee, H-Y
LG Corporate Institute of Technology, KOREA

We4A3     Volume 4, p. 1327
Telephone-Based Conversational Speech Recognition in the JUPITER Domain
Glass, J R Hazen, T J
MIT, USA

We4A4     Volume 4, p. 1331
Japanese Large-Vocabulary Continuous Speech Recognition System Based on Microsoft Whisper
Hon, H Ju, Y Otani, K
Microsoft Research, USA

We4A5     Volume 4, p. 1335
Partitioning and Transcription of Broadcast News Data
Gauvain, J-L Lamel, L Adda, G
LIMSI-CNRS, FRANCE

We4B - Speaker and Language Recognition 2

We4B1     Volume 4, p. 1339
Speaker Detection in Broadcast Speech Databases
Rosenberg, A E Magrin-Chagnolleau, I Parthsarathy, S Huang, Q
AT&T Labs-Research, USA

We4B2     Volume 4, p. 1343
Multilateral Techniques for Speaker Recognition
Parris, E S Carey, M J
Ensigma, UNITED KINGDOM

We4B3     Volume 4, p. 1347
Real Time Speaker Indexing based on subspace method - Application to TV News Articles and Debate -
Nishida, M Ariki, Y
Ryukoku University, JAPAN

We4B4     Volume 4, p. 1351
SHEEP, GOATS, LAMBS and WOLVES: A Statistical Analysis of Speaker Performance in the NIST 1998 Speaker Recognition Evaluation
Doddington, G Liggett, W Martin, A Przybocki, M Reynolds, D
MIT Lincoln Laboratory, USA

We4B5     Volume 4, p. 1355
Progress in Speaker Recognition at Dragon Systems
Corrada-Emmanuel, A Newman, M Peskin, B Gillick, L Roth, R
Dragon Systems, USA

We4B6     Volume 4, p. 1359
A Comparative Study of Speaker Verification Systems using the Polycost Database
Nordström, T Melin, H Lindberg, J
KTH, SWEDEN

We4C - Signal Processing and Speech Analysis 2

We4C1     Volume 4, p. 1363
Improving Pitch Estimation with Short Duration Speech Samples
Ainsworth, W A Day, C R Meyer, G F
Keele University, UNITED KINGDOM

We4C2     Volume 4, p. 1367
An instantaneous-frequency-based pitch extraction method for high-quality speech transformation: revised TEMPO in the STRAIGHT-suite

Kawahara, H de Cheveigne, A Patterson, R
Wakayama University, JAPAN

We4C3     Volume 4, p. 1371
Speaker-Independent Speech Recognition using Micro Segment Spectrum Integration
Aikawa, K
NTT Human Interface Laboratories, JAPAN

We4C4     Volume 4, p. 1375
On Robust Speech Analysis Based on Time-Varying Complex AR Model
Funaki, K Miyanaga, Y Tochinai, K
Hokkaido University, JAPAN

We4C5     Volume 4, p. 1379
Spectral Basis Functions from Discriminant Analysis
Hermansky, H Malayath, N
Oregon Graduate Institute, USA

We4D - Prosody and Emotion 5

We4D1     Volume 4, p. 1383
The Tilt Intonation Model
Taylor, P
University of Edinburgh, UNITED KINGDOM

We4D2     Volume 4, p. 1387
Analysis of Occurrences of Pauses and Their Durations in Japanese Text Reading
Fujisaki, H Ohno, S Yamada, S
Science University of Tokyo, JAPAN

We4D3     Volume 4, p. 1391
A statistical study of pitch target points in five languages
Campione, E Véronis, J
Université De Provence, FRANCE

We4D4     Volume 4, p. 1395
Fully Automatic Prosody Generator for Text-to-Speech
Malfrere, F Dutoit, T Mertens, P
Faculte Polytechnique De Mons, BELGIUM

We4D5     Volume 4, p. 1399
Automatic Prosodic Labeling of 6 Languages
Vereecken, H Martens, J-P Grover, C Fackrell, J Van Coile, B
Elis-University of Ghent, BELGIUM

We4D6     Volume 4, p. 1403
Automatic Utterance Type Detection Using Suprasegmental Features
Wright, H
The University of Edinburgh, UNITED KINGDOM

We4P - Robust Speech Processing in Adverse Environments 4

We4P1     Volume 4, p. 1407
Spectral Sequence Compensation based on Continuity of Spectral Sequence
Akagi, M Iwaki, M Sakaguchi, N
Japan Advanced Institute of Science and Technology, JAPAN

We4P2     Volume 4, p. 1411
Robust Features for Speech Recognition Systems
Bayya, A Yegnanarayana, B
Rockwell Semiconductor Systems, USA

We4P3     Volume 4, p. 1415
Interfacing of CASA and Partial Recognition Based on a Multistream Technique
Berthommier, F Glotin, H Tessier, E Bourlard, H
Institut De La Communication Parlee, FRANCE

We4P4     Volume 4, p. 1419
An RNN-based compensation method for Mandarin telephone speech recognition
Chang, S-C Chien, S-C Kuo, C-C
Industrial Technology Research Institute, TAIWAN R.O.C.

We4P5     Volume 4, p. 1423
Robust Speech Recognition Using Discriminative Stream Weighting and Parameter Interpolation
Chu, S Zhao, Y
University of Illinois, USA

We4P6     Volume 4, p. 1427
Acoustic backing-off in the local distance computation for robust automatic speech recognition
de Veth, J Cranen, B Boves, L
University of Nijmegen, THE NETHERLANDS

We4P7     Volume 4, p. 1431
Noise Model Selection for Robust Speech Recognition
Docio-Fernandez,L Garcia-Mateo,C
University of Vigo, SPAIN

We4P8     Volume 4, p. 1435
A novel iterative signal enhancement algorithm for noise reduction in speech
Doclo, S Dologlou, I Moonen, M
KU Leuven - ESAT/SISTA, BELGIUM

We4P9     Volume 4, p. 1439
Missing Data Reconstruction for Robust Automatic Speech Recognition in the Framework of Hybrid HMM/ANN Systems
Dupont, S
Faculte Polytechnique De Mons, BELGIUM

We4P10     Volume 4, p. 1443
Recognition from GSM Digital Speech
Gallardo-Antolin, A Diaz-de-Maria, F Valverde-Albacete, F
Universidad Carlos III De Madrid, SPAIN

We4P11     Volume 4, p. 1447
Conversational Speech Systems for On-Board Car Navigation and Assistance
Geutner, P Denecke, M Meier, U Westphal, M Waibel, A
Universitaet Karlsruhe, GERMANY

We4P12     Volume 4, p. 1451
A signal processing system for having the sound "pop-out" in noise thanks to the image of the speakerís lips: New advances using Multi-Layer Perceptrons

Girin, L Varin, L Feng, G Schwartz, J L
Institut De La Communication Parlée, FRANCE

We4P13     Volume 4, p. 1455
Robust Speech Activity Detection in the Presence of Noise
Sarikaya, R Hansen, J H L
Robust Speech Processing Laboratory, USA

We4P14     Volume 4, p. 1459
Robust Automatic Speech Recognition by the Application of a
Temporal-Correlation-Based Recurrent Multilayer Neural Network to the Mel-based Cepstral Coefficients

Heon, M Tolba, H O'Shaughnessy, D
INRS-Telecommunication, CANADA

We4P15     Volume 4, p. 1463
Speech Recognition from GSM Codec Parameters
Huerta, J M Stern, R M
Carnegie Mellon University, USA

We4P16     Volume 4, p. 1467
Improved Parallel Model Combination Based on Better Domain Transformation for Speech Recognition Under Noisy Environments
Hung, J W Shen, J L Lee, L S
Academia Sinica, TAIWAN R.O.C.

We4P17     Volume 4, p. 1471
Robust Speech/Non-Speech Detection in Adverse Conditions Based on Noise and Speech Statistics
Karray, L Monne, J
France Telecom - CNET, FRANCE

We4P18     Volume 4, p. 1475
Speech Recognition in Car Noise Environments using Multiple Models According to Noise Masking Levels
Song, M G Jung, H I Shim, K-J Kim, H S
Pusan National University, KOREA

We4P19     Volume 4, p. 1479
Spectral Noise Subtraction with Recursive Gain Curves
Linhard, K Haulick, T
Daimler Benz AG, GERMANY

We4P20     Volume 4, p. 1483
A Novel Robust Speech Recognition Algorithm Based on Multi-models and Integrated Decision Method
Pan, S Liu, J Jiang, J-T Wang, Z-Y Lu, D-J
Tsinghua University, P.R. CHINA

We4P21     Volume 4, p. 1487
On the Interaction Between Time and Frequency Filtering of Speech Parameters for Robust Speech Recognition
Macho, D Nadeu, C
Polytechnical University of Catalonia, SPAIN

We4P22     Volume 4, p. 1491
Inference of Missing Spectrographic Features for Robust Speech Recognition
Raj, B Singh, R Stern, R M
Carnegie Mellon University, USA

We4P23     Volume 4, p. 1495
SNR-dependent flooring and noise overestimation for joint application of spectral subtraction and model combination
Schless, V Class, F
Daimler-Benz AG, GERMANY

We4P24     Volume 4, p. 1499
Improved Robust Speech Recognition Considering Signal Correlation Approximated by Taylor Series
Shen, J-L Hung, J-W Lee, L-S
Academia Sinica, TAIWAN R.O.C.

We4P25     Volume 4, p. 1503
Speech Recognition in Noisy Environment Using Weighted
Projection-Based Likelihood Measure

Shin, W H Kim, W G Lee, C Cha, I W
Yonsei University, KOREA

We4P26     Volume 4, p. 1507
Evaluation of Model Adaptation by HMM Decomposition on Telephone Speech Recognition
Takiguchi, T Nakamura, S Shikano, K Morishima, M Isobe, T
Nara Institute of Science and Technology, JAPAN

We4P27     Volume 4, p. 1511
Comparative Experiments to Evaluate a Voiced-Unvoiced-Based
Pre-Processing Approach to Robust Automatic Speech Recognition in Low-SNR Environments

Tolba, H O'Shaughnessy, D
INRS-Telecommunications, CANADA

We4P28     Volume 4, p. 1515
Signal Extraction from Noisy Signal based on Auditory Scene Analysis
Unoki, M Akagi, M
Japan Advanced Institute of Science and Technology, JAPAN

We4P29     Volume 4, p. 1519
Frequency domain binaural model as the front end of speech recognition system
Usagawa, T Sakai, K Ebata, M
Kumamoto University, JAPAN

We4P30     Volume 4, p. 1523
A Study on the Recognition of Low Bit-Rate Encoded Speech
Yu, A-T Wang, H-C
National Tsing Hua University, TAIWAN R.O.C.

We4P31     Volume 4, p. 1527
Weighted Parallel Model Combination for Noisy Speech Recognition
Hwang, T-H Wang, H-C
National Tsing-Hua University, TAIWAN R.O.C.

We4P32     Volume 4, p. 1531
Favourable and Unfavourable Short Duration Segments of Speech in Noise
Woo, D
The University of New South Wales, AUSTRALIA

We4Q - Segmentation, Labelling and Speech Corpora 2

We4Q1     Volume 4, p. 1535
An Efficient Labeling Tool for the QuickSig Speech Database
Karjalainen, M Altosaar, T Huttunen, M
Helsinki University of Technology, FINLAND

We4Q2     Volume 4, p. 1539
Collection and Detailed Transcription of a Speech Database for Development of Language Learning Technologies
Bratt, H Neumeyer, L Shriberg, E Franco, H
SRI International, USA

We4Q3     Volume 4, p. 1543
Resegmentation of SWITCHBOARD
Deshmukh, N Ganapathiraju, A Gleeson, A Hamaker, J Picone, J
Mississippi State University, USA

We4Q4     Volume 4, p. 1547
Automatic Generation of Visual Scenarios for Spoken Corpora Acquisition
Aiello, D Delogu, C De Mori, R Di Carlo, A Nisi, M Tummeacciu, S
Fondazione Ugo Bordoni, ITALY

We4Q5     Volume 4, p. 1551
Automatic Detection of Semantic Boundaries based on Acoustic and Lexical Knowledge
Cettolo, M Falavigna, D
ITC-IRST, ITALY

We4Q6     Volume 4, p. 1555
A New Fast Algorithm for Automatic Segmentation of Continuous Speech
Gholampour, I Nayebi, K
Sharif University of Technology, IRAN

We4Q7     Volume 4, p. 1559
Acoustic Nature and Perceptual Testing of Corpora of Emotional Speech
Iida, A Campbell, W N Iga, S Higuchi, F Yasumura, M
Keio University, JAPAN

We4Q8     Volume 4, p. 1563
Korean Prosodic Break Index Labelling by a new Mixed Method of LDA and VQ
Kang, P S Kang, J Y Kim, J Y
Chonnam National University, KOREA

We4Q9     Volume 4, p. 1567
MOOSE: Management of Otago Speech Environment
Laws, M R Kilgour, R I
University of Otago, NEW ZEALAND

We4Q10     Volume 4, p. 1571
Phonetic Alignment: Speech Synthesis Based vs. Hybrid HMM/ANN
Malfrere, F Deroo, O Dutoit, T
Faculte Polytechnique De Mons, BELGIUM

We4Q11     Volume 4, p. 1575
Customisation and Quality Assessment of Spoken Language Description
Millar, J B
Australian National University, AUSTRALIA

We4Q12     Volume 4, p. 1579
A Silence/Noise/Music/Speech Splitting Algorithm
Montacie, C Caraty, M-J
Universite Pierre Et Marie Curie, FRANCE

We4Q13     Volume 4, p. 1583
Audio-Visual Segmentation for Content-based Retrieval
Pye, D Hollinghurst, N Mills, T Wood, K
Oracle & Olivetti Research Laboratory, UNITED KINGDOM

We4Q14     Volume 4, p. 1587
Same News is Good News: Automatically Collecting Reoccuring Radio News Stories
Rapp, S Dogil, G
Sony International (Europe) GmbH, GERMANY

We4Q15     Volume 4, p. 1591
An annotation system for melodic aspects of German spontaneous speech
Brindoepke, C Schaffranietz, B
University Bielefeld, GERMANY

We4Q16     Volume 4, p. 1595
Additional use of phoneme duration hypotheses in automatic speech segmentation
Stöber, K Hess, W
Bonn University, GERMANY

We4Q17     Volume 4, p. 1599
Towards a minimal standard for dialogue transcripts: A new SGML architecture for the HCRC Map Task Corpus
Isard, A McKelvie, D Thompson, H S
University of Edinburgh, UNITED KINGDOM

We4R - Speech Technology Applications and Human- Machine Interface 1

We4R1     Volume 4, p. 1603
Steps toward the integration of speaker recognition in real-world telecom applications
Glaeser, A Bimbot, F
Ascom Systec Ltd, SWITZERLAND

We4R2     Volume 4, p. 1607
A Bimodal Korean Address Entry/Retrieval System
Chung, H Y Hwang, C J Lee, S W
Yeungnam University, KOREA

We4R3     Volume 4, p. 1611
Usability Evaluation of IVR systems with DTMF and ASR
Delogu, C Di Carlo, A Rotundi, P Sartori, D
Fondazione Ugo Bordoni, ITALY

We4R4     Volume 4, p. 1615
SALSA Version 1.0: A Speech-based Web Browser for Hong Kong English
Fung, P Cheung, C S Lam, K L Liu, W K Lo, Y Y
University of Science & Technology, HONG KONG

We4R5     Volume 4, p. 1619
A Language for Creating Speech Applications
Pargellis, A Zhou, Q Saad, A Lee, C-H
Bell Labs, Lucent Technologies

We4R6     Volume 4, p. 1623
The use of automatic speech recognition to reduce the interference between concurrent tasks of driving and phoning
Graham, R Carter, C Mellor, B
Loughborough University, UNITED KINGDOM

We4R7     Volume 4, p. 1627
Interactive Listening to Structured Speech Content on the Internet
Hirayama, M J Sugahara, T Peng, Z Yamazaki, J
Hewlett Packard Laboratories Japan, JAPAN

We4R8     Volume 4, p. 1631
MSF Format for the Representation of Speech Synchronized Moving Image
Jo, C W
Changwon National University, KOREA

We4R9     Volume 4, p. 1635
Effects of Using Speech in Timetable Information Systems for WWW
Qvarfordt, P Jönsson, A
Linkoping University, SWEDEN

We4R10     Volume 4, p. 1639
The Interactive Systems Labs View4You video indexing system
Kemp, T Geutner, P Schmidt, M Tomaz, B Weber, M Westphal, M Waibel, A
University of Karlsruhe, GERMANY

We4R11     Volume 4, p. 1643
SEMOLE: A Robust Framework for Gathering Information from the World Wide Web
Kim, H J Hetherington, L
MIT Spoken Language Systems, USA

We4R12     Volume 4, p. 1647
Enhancing a WIMP Based Interface with Speech, Gaze Tracking and Agents
Bakman, L Blidegn, M Wittrup, M Larsen, L B Moeslund, T B
Aalborg University, DENMARK

We4R13     Volume 4, p. 1651
Now You Hear It, Now You Don't: Empirical Studies of Audio Browsing Behavior
Nakatani, C Whittaker, S Hirschberg, J
AT & T Labs - Research, USA

We4R14     Volume 4, p. 1655
A Voice Verifier for Face/Voice Based Person Verification System
Qiao, R-Y Choi, Y Agbinya, J I
CSIRO, Australia, AUSTRALIA

We4R15     Volume 4, p. 1659
On the use of automatic speech recognition for TV captioning
Robert-Ribes, J
CSIRO-MIS, AUSTRALIA

We4R16     Volume 4, p. 1663
An Undergraduate Course on Speech Recognition Based on the CSLU Toolkit
Serridge, B
Universidad De Las Americas, MEXICO

We4R17     Volume 4, p. 1667
Real time voice alteration based on Linear Prediction
Yang, P-F Stylianou, Y
AT&T Labs - Research, USA

We4R18     Volume 4, p. 1671
Evaluation and Implementation of a Voice-Activated Dialing System with Utterance Verification
Tan, B T Gu, Y Thomas, T
Vocalis Ltd, UNITED KINGDOM

We4R19     Volume 4, p. 1675
Towards A Mandarin Voice Memo System
Wang, H-M Lin, B-S Chen, B Bai, B-R
Academia Sinica, TAIWAN R.O.C.

We5A - Large Vocabulary Continuous Speech Recognition 2

We5A1     Volume 5, p. 1679
Grammatical word graph re-generation for spontaneous speech recognition
Tsukada, H Yamamoto, H Takezawa, T Sagisaka, Y
ATR Interpreting Telecommunications Research Laboratories, JAPAN

We5A2     Volume 5, p. 1683
Compression Algorithm of Trigram Language Models based on Maximum Likelihood Estimation
Yodo, N Shikano, K Nakamura, S
Nara Institute of Science and Technology, JAPAN

We5A3     Volume 5, p. 1687
Morphological Modeling of Word Classes for Langauge Models
Uebler, U Niemann, H
Bavarian Research Center for Knowledge Based Systems (FORWISS), GERMANY

We5A4     Volume 5, p. 1691
A Comparative Study Between Polyclass and Multiclass Language Models
Zitouni, I Smaili, K Haton, J-P Deligne, S Bimbot, F
LORIA/INRIA Lorraine, FRANCE

We5A5     Volume 5, p. 1695
Log-Linear Interpolation of Language Models
Klakow, D
Philips Research Laboratories, GERMANY

We5A6     Volume 5, p. 1699
The Applicability of Adaptive Language Modelling for the Broadcast News Task

Clarkson, P R Robinson, A J
Cambridge University, UNITED KINGDOM

We5B - Text-To-Speech Synthesis 3

We5B1     Volume 5, p. 1703
The IBM Trainable Speech Synthesis System
Donovan, R E Eide, E M
IBM, USA

We5B2     Volume 5, p. 1707
ProSynth: An Integrated Prosodic Approach to Device-Independent, Natural-Sounding Speech Synthesis
Hawkins, S House, J Huckvale, M Local, J K Ogden, R
University of Cambridge, UNITED KINGDOM

We5B3     Volume 5, p. 1711
Total Quality Evaluation of Speech Synthesis Systems
Zhang, J Dong, S Yu, G
Institute of Acoustics, P.R. CHINA

We5B4     Volume 5, p. 1715
Comparative Evaluation of Synthetic Prosody with the PURR Method
Sonntag, G P Portele, T
IKP, University of Bonn, GERMANY

We5B5     Volume 5, p. 1719
SABLE: A Standard for TTS Markup
Sproat, R Hunt, A Ostendorf, M Taylor, P Black, A Lenzo, K
Bell Laboratories / Lucent Technologies, USA

We5B6     Volume 5, p. 1723
Prosodic vs. segmental contributions to naturalness in a diphone synthesizer
Bunnell, H T Hoskins, S R Yarrington, D M
DuPont Hospital for Children / University of Delaware, USA

We5C - Language Acquisition 1

We5C1     Volume 5, p. 1727
Non-native productions of Japanese single stops that are too long for one mora unit
Minagawa-Kawai, Y Kiritani, S
University of Tokyo, JAPAN

We5C2     Volume 5, p. 1731
The Process of Generation and Development of Second Language Japanese Accentuation
Yamada, N
Ibaraki University, JAPAN

We5C3     Volume 5, p. 1735
Perceptual properties of Russians with Japanese fricatives
Funatsu, S Kiritani, S
Hiroshima Women's University, JAPAN

We5C4     Volume 5, p. 1739
Assessment of Dutch pronunciation by means of automatic speech recognition technology
Cucchiarini, C de Wet, F Strik, H Boves, L
University of Nijmegen, THE NETHERLANDS

We5C5     Volume 5, p. 1743
Phonetic-level Mispronunciation Detection in non-native Swedish Speech
Langlais, P Öster, A-M Granström, B
Center for Speech Technology, TMH-KTH, SWEDEN

We5C6     Volume 5, p. 1747
Computer-based second language production training by using spectrographic representation and HMM-based speech recognition scores
Akahane-Yamada, R McDermott, E Adachi, T Kawahara, H Pruitt, J S
ATR Human Information Processing Research Laboratories, JAPAN

We5D - Acoustic Phonetics 1

We5D1     Volume 5, p. 1751
Assimilation of place in Japanese and Dutch
Cutler, A Otake, T
Max-Planck-Institute for Psycholinguistics, NETHERLANDS

We5D2     Volume 5, p. 1755
Prosodic Constraint on V-to-V Coarticulation in Japanese
Kondo, Y Arai, Y
Musashino Art University, JAPAN

We5D3     Volume 5, p. 1759
Postvocalic /r/-deletion in standard Dutch: how experimental phonology can profit from ASR technology
Cucchiarini, C van den Heuvel, H
University of Nijmegen, THE NETHERLANDS

We5D4     Volume 5, p. 1763
More evidence for the perceptual basis of sound change? Suprasegmental effects in the development of distinctive nasalisation
Hajek, J Watson, I
University of Melbourne, AUSTRALIA

We5D5     Volume 5, p. 1767
Speech production of vowel sequences using a physiological articulatory model
Dang, J Honda, K
ATR Human Information Processing Research Labs,, JAPAN

We6A - Speaker Adaptation 2

We6A1     Volume 5, p. 1771
Eigenvoices for Speaker Adaptation
Kuhn, R Nguyen, P Junqua, J-C Goldwasser, L Niedzielski, N Fincke, S
Field, K Contolini, M
Panasonic Technologies, Inc., USA

We6A2     Volume 5, p. 1775
Speaker Clustering Using Direct Maximisation of the MLLR-Adapted Likelihood
Johnson, S E Woodland, P C
Cambridge University, UNITED KINGDOM

We6A3     Volume 5, p. 1779
Incremental On-line Speaker Adaptation in Adverse Conditions
Viikki, O Laurila, K
Nokia Research Center, FINLAND

We6A4     Volume 5, p. 1783
Cluster Adaptive Training for Speech Recognition
Gales, M
IBM Almaden Research Center, USA

We6B - Speech Coding 2

We6B1     Volume 5, p. 1787
Towards a unified model for low bit-rate speech coding using a recognition-synthesis approach
Holmes, W J
Defence Evaluation and Research Agency, UNITED KINGDOM

We6B2     Volume 5, p. 1791
On the Significance of Temporal Masking in Speech Coding
Skoglund, J Kleijn, W B
Royal Insitute of Technology, SWEDEN

We6B3     Volume 5, p. 1795
Waveform Interpolation Coding with Pitch-Spaced Subbands
Kleijn, W B Yang, H Deprettere, E F
Royal Insitute of Technology, SWEDEN

We6B4     Volume 5, p. 1799
An improved Decomposition Method for WI using IIR Wavelet Filter Banks
Chong, N R Burnett, I S Chicharo, J F
University of Wollongong, AUSTRALIA

We6C - Hidden Markov Model Techniques 2

We6C1     Volume 5, p. 1803
Real -Time Probabilistic Segmentation for Segment-Based Speech Recognition
Lee, S Glass, J
MIT, USA

We6C2     Volume 5, p. 1807
Toward Markov random field modeling of speech
Gravier, G Sigelle, M Chollet, G
ENST/TSI and CNRS-URA 820, FRANCE

We6C3     Volume 5, p. 1811
Hidden Markov Models for Trajectory Modeling
Iyer, R Gish, H Siu, M Zavaliagkos, G Matsoukas, S
GTE/BBN Technologies, USA

We6D - Multilingual Perception and Recognition 1

We6D1     Volume 5, p. 1815
Bilingual and Dialectal Adaptation and Retraining
Uebler, U Schuessler, M Niemann, H
Bavarian Research Center for Knowledge Based Systems (FORWISS), GERMANY

We6D2     Volume 5, p. 1819
Language Independent and Language Adaptive Large Vocabulary Speech Recognition
Schultz, T Waibel, A
Interactive Systems Laboratories, GERMANY

We6D3     Volume 5, p. 1823
A method for measuring the intelligibility and nonnativeness of phone quality in foreign language pronunciation training
Kawai, G Hirose, K
University of Tokyo, JAPAN

We6D4     Kuhn et al.     see Th4R24

Thursday 3 December, 1998

Th1A - Large Vocabulary Continuous Speech Recognition 3

Th1A1     Volume 5, p. 1827
The BBN Single-Phonetic-Tree Fast-Match Algorithm
Nguyen, L Schwartz, R
BBN Technologies, USA

Th1A2     Volume 5, p. 1831
An Efficient Two-pass Search Algorithm using Word Trellis Index
Lee, A Kawahara, T Shuji, D
Kyoto University, JAPAN

Th1A3     Volume 5, p. 1835
Nozomi - a fast, memory-efficient stack decoder for LVCSR
Schuster, M
ATR Interpreting Telecommunications Research Laboratories, JAPAN

Th1A4     Volume 5, p. 1839
Reducing the OOV Rate in Broadcast News Speech Recognition
Kemp, T Waibel, A
University of Karlsruhe, GERMANY

Th1A5     Volume 5, p. 1843
Using Automatic-Derived Acoustic Sub-Word Units in Large Vocabulary Speech Recognition
Bacchiani, M Ostendorf, M
Boston University, USA

Th1A6     Volume 5, p. 1847
Fabricating conversational speech data with acoustic models: a program to examine model-data mismatch
McAllaster, D Gillick, L Scattone, F Newman, M
Dragon Systems Inc, USA

Th1B - Articulatory Modelling 3

Th1B1     Volume 5, p. 1851
An electropalatographic, kinematic, and acoustic analysis of supralaryngeal correlates of word-level prominence contrasts in English
Harrington, J Beckman, M E Fletcher, J Palethorpe, S
Macquarie University, AUSTRALIA

Th1B2     Volume 5, p. 1855
Consistencies and inconsistencies between EPG and locus equation data on coarticulation
Tabain, M
Macquarie University, AUSTRALIA

Th1B3     Volume 5, p. 1859
Synergy Between Jaw and Lips/ Tongue Movements : Consequences in Articulatory Modelling
Bailly, G Badin, P Vilain, A
Institut Communication Parlee, FRANCE

Th1B4     Volume 5, p. 1863
Modelling tongue configuration in German vowel production
Hoole, P
Munich University, GERMANY

Th1B5     Volume 5, p. 1867
Optopalatograph: Real-time feedback of tongue movement in 3D
Wrench, A McIntosh, A D Watson, C Hardcastle, W J
Queen Margaret College, UNITED KINGDOM

Th1B6     Volume 5, p. 1871
Effects of contrastive focal accent on linguopalatal articulation and coarticulation in the French [kskl] Cluster
Meynadier, Y Pitermann, M Marchal, A
Institut De Phonetique - Laboratoire Parole Et Langage, CNRS, FRANCE

Th1C - Language Acquisition 2

Th1C1     Volume 5, p. 1875
Spoken Word Identification by Native and Nonnative Speakers of English: Effects of training, modality, context and phonetic environment
Hardison, D M
University of California, Davis, USA

Th1C2     Volume 5, p. 1879
The Effect of Background Knowledge on First and Second Language Comprehension Difficulty
Tyler, M D
University of New South Wales, AUSTRALIA

Th1C3     Volume 5, p. 1883
Comparsion of Cross-language Coarticulation: English, Japanese and Japanese-accented English
Tsukada, K
Macquarie University, AUSTRALIA

Th1C4     Volume 5, p. 1887
Plasticity of non-native phonetic perception and production: A training study
Imaizumi, S Itoh, H Tamekawa, Y Deguchi, T Mori, K
University of Tokyo, JAPAN

Th1C5     Volume 5, p. 1891
The Relation Between Perceptual and Production Categories in Acquisition
Watson, I
University of Oxford, UNITED KINGDOM

Th1C6     Volume 5, p. 1895
The development of perceptual cue-weighting in children aged 6 to 12
Hazan, V Barrett, S
Department of Phonetics and Linguistics, UCL, UNITED KINGDOM

Th1D - Speaker and Language Recognition 3

Th1D1     Volume 5, p. 1899
Robust speaker verification insensitive to session-dependent utterance variation and handset-dependent distortion
Matsui, T Aikawa, K
NTT Human Interface Labs, JAPAN

Th1D2     Volume 5, p. 1903
A Comparative Evaluation of Variance Flooring Techniques in HMM-based Speaker Verification
Melin, H Koolwaaij, J Lindberg, J Bimbot, F
KTH, SWEDEN

Th1D3     Volume 5, p. 1907
Text-Independent Speaker Verification Using Automatically Labelled Acoustic Segments
Petrovska Delacretaz, D Cernocky, J Henneber, J Chollet, G
Institute of Radioelectronics, CZECH REPUBLIC

Th1D4     Volume 5, p. 1911
A Fast Decoding Algorithm Based on Sequential Detection of the Changes in Distribution
Li, Q
Bell Labs, Lucent Technologies, USA

Th1D5     Volume 5, p. 1915
Speaker verification with ensemble classifiers based on linear speech transforms
Olsen, J
Aalborg University, DENMARK

Th1D6     Volume 5, p. 1919
Speaker recognition based on discriminative projection models
Olsen, J
Aalborg University, DENMARK

Th1P - Text-To-Speech Synthesis 4

Th1P1     Volume 5, p. 1923
A Mixed-Excitation Frequency Domain Model for Time-Scale Pitch-Scale Modification of Speech
Acero, A
Microsoft Corporation, USA

Th1P2     Volume 5, p. 1927
Analytic Generation of Synthesis Units by Closed Loop Training for Totally Speaker Driven Text to Speech System (TOS Drive TTS)
Akamine, M Kagoshima, T
Toshiba Kansai Laboratory, JAPAN

Th1P3     Volume 5, p. 1931
Modeling the Microprosody of Pitch and Loudness for Speech Synthesis with Neural Networks
Vainio, M Altosaar, T
Helsinki University of Technology, FINLAND

Th1P4     Volume 5, p. 1935
Spectral Smoothing for Concatenative Speech Synthesis
Chappell, D T Hansen, J H L H
Robust Speech Processing Laboratory, USA

Th1P5     Volume 5, p. 1939
Mimic: A Voice-Adaptive Phonetic-Tree Speech Synthesiser
Chen, A Vaseghi, S Ho, C
Queen's University of Belfast, UNITED KINGDOM

Th1P6     Volume 5, p. 1943
Automatic Generation of Korean Pronunciation Variants by Multistage Applications of Phonological Rules
Jeon, J Cha, S Chung, M Park, J Hwang, K
Sogang University, KOREA

Th1P7     Volume 5, p. 1947
Techniques for accurate automatic annotation of speech waveforms
Cox, S Brady, R Jackson, P
University of East Anglia, UNITED KINGDOM

Th1P8     Volume 5, p. 1951
Optimized Stopping Criteria for Tree-Based Unit Selection in Concatenative Synthesis
Cronk, A Macon, M
Oregon Graduate Institute, USA

Th1P9     Volume 5, p. 1955
Automatic transcription of intonation using an identified prosodic alphabet
de Tournemire, S
France Telecom, CNET, FRANCE

Th1P10     Volume 5, p. 1959
Frequency Analysis of Phonetic Units for Concatenative Synthesis in Catalan
Esquerra, I Febrer, A Nadeu, C
Polytechnical University of Catalonia, SPAIN

Th1P11     Volume 5, p. 1963
Investigating the Syntactic Characteristics of English Tone Units
Fang, A C House, J Huckvale, M
University College London, UNITED KINGDOM

Th1P12     Volume 5, p. 1967
The UPC Text-to-Speech System for Spanish and Catalan
Bonafonte, A Esquerra, I Febrer, A Fonollosa, J A R Vallverdu, F
Polytechnical University of Catalonia, SPAIN

Th1P13     Volume 5, p. 1971
The New Version of the ROMVOX Text-To-Speech Synthesis System Based on a Hybrid Time Domain-LPC Synthesis Technique
Ferencz, A Nagy, I Kovacs, T C S Ferencz, M Ratiu, T
Software ITC, ROMANIA

Th1P14     Volume 5, p. 1975
An F0 Contour Control Model for Totally Speaker Driven Text to Speech System
Kagoshima, T Morita, M Seto, S Akamine, M
Toshiba Corporation, JAPAN

Th1P15     Volume 5, p. 1979
On the Relationship of Speech Rates with Prosodic Units in Dialogue Speech
Hirose, K Kawanami, H
University of Tokyo, JAPAN

Th1P16     Volume 5, p. 1983
On the Reduction of Concatenation Artefacts in Diphone Synthesis
Klabbers, E Veldhuis, R
IPO, THE NETHERLANDS

Th1P17     Volume 5, p. 1987
Error Analysis and Confidence Measure of Chinese Word Segmentation
Kuo, C-C Ma, K-Y
Industrial Technology Research Institute, TAIWAN R.O.C.

Th1P18     Volume 5, p. 1991
Energy Contour Generation for a Sentence Using a Neural Network Learning Method
Lee, J Kang, D Kim, S Sung, K-M
ETRI, KOREA

Th1P19     Volume 5, p. 1995
A computational algorithm for F0 contour generation in Korean developed with prosodically labeled databases using K-ToBI system
Lee, Y-J Lee, S-H Kim, J-J Ho, H-J Kim, Y-I Kim, S-H Lee, J
Wonkwang University, KOREA

Th1P20     Volume 5, p. 1999
Rapid-Deployment Text-to-Speech in the DIPLOMAT System
Lenzo, K Hogan, C Allen, J
Carnegie Mellon University, USA

Th1P21     Volume 5, p. 2003
Formant diphone parameter extraction utilising a labelled single-speaker database
Mannell, R
Macquarie University, AUSTRALIA

Th1P22     Volume 5, p. 2007
A New Synthetic Speech/Sound Control Language
Mizuno, O Nakajima, S
NTT Human Interface Labs, JAPAN

Th1P23     Volume 5, p. 2011
A study on the natural-sounding Japanese phonetic word synthesis by using the VCV-balanced word database that consists of the words uttered forcibly in two types of pitch accent
Mochizuki, R Arai, Y Honda, T
AVC Research Lab., CED, Matsushita Communication Industrial, JAPAN

Th1P24     Volume 5, p. 2015
Letter to Sound Rules for Accented Lexicon Compression
Pagel, V Lenzo, K Black, A W
Facultie Polytechnique of Mons, BELGIUM

Th1P25     Volume 5, p. 2019
A Name Announcement Algorithm With Memory Size and Computational Power Constraints
Roth, Z Rosenhouse, J
DSP Semiconductors Ltd., ISRAEL

Th1P26     Volume 5, p. 2023
How a French TTS System Can Describe Loanwords
Sannier, F Belrhali, R Auberge, V
ICP - ESA 5009 CNRS, FRANCE

Th1P27     Volume 5, p. 2027
Improvements in Slovene Text-to-Speech Synthesis
Sef, T Dobnikar, A Gams, M
Jozef Stefan Institute, SLOVENIA

Th1P28     Volume 5, p. 2031
Automatic Rule Generation for Linguistic Features Analysis Using Inductive Learning Technique - Linguistic Features Analysis in TOS Drive TTS System
Seto, S Morita, M Kagoshima, T Akamine, M
Toshiba Corporation, JAPAN

Th1P29     Volume 5, p. 2035
Segmental Duration Control Based on an Articulatory Model
Shiga, Y Matsuura, H Nitta, T
Toshiba Corporation, JAPAN

Th1P30     Volume 5, p. 2039
Text Analysis for the Bell Labs French Text-to-Speech System
Tzoukermann, E
Bell Labs - Lucent Technologies, USA

Th1P31     Volume 5, p. 2043
Modeling vowel duration for Japanese text-to-speech synthesis
Venditti, J J van Santen, J P H
Ohio State University / Bell Labs, USA

Th1P32     Volume 5, p. 2047
Towards a Chinese Text-to-Speech System with Higher Naturalness
Wang, R-H Liu, Q Teng, Y Xia, D
University of Science & Technology of China, P.R. CHINA

Th1Q - Spoken Language Understanding Systems 4

Th1Q1     Volume 5, p. 2051
Grammar Fragment Acquisition using Syntactic and Semantic Clustering
Arai, K Wright, J Riccardi, G Gorin, A
NTT Human Interface Laboratories, JAPAN

Th1Q2     Volume 5, p. 2055
Non-Expert Access to Unification-based Speech Understanding
Brøndsted, T
Aalborg University, DENMARK

Th1Q3     Volume 5, p. 2059
Natural Language Call Routing: A Robust, Self-Organizing Approach
Carpenter, B Chu-Carroll, J
Lucent Technologies - Bell Labs, USA

Th1Q4     Volume 5, p. 2063
Automatic Grammar Induction from Semantic Parsing
Ghosh, D Goddeau, D
Nuance Communications, USA

Th1Q5     Volume 5, p. 2067
BTH: An efficient parsing algorithm for word-spotting
Kono, Y Yano, T Sasajima, M
Toshiba Corporation, JAPAN

Th1Q6     Volume 5, p. 2071
Syntax Coordination: Interaction of Discourse and Extrapositions
Kronenberg, S C Kummert, F
University of Bielefeld, GERMANY

Th1Q7     Volume 5, p. 2075
Hierarchical Tag-Graph Search for Spontaneous Speech Understanding in Spoken Dialog Systems
Lin, B-S Chen, B Wang, H-M Lee, L-S
National Taiwan University, TAIWAN R.O.C.

Th1Q8     Volume 5, p. 2079
Extraction of the dialog act and the topic from utterances in a spoken dialog system
Niimi, Y Takinaga, N Nishimoto, T
Kyoto Institute of Technology, JAPAN

Th1Q9     Volume 5, p. 2083
Fast Computation of Maximum Entropy / Minimum Divergence Feature Gain
Printz, H
Carnegie Mellon University, USA

Th1Q10     Volume 5, p. 2087
Stochastic Language Models for Speech Recognition and Understanding
Riccardi, G Gorin, A L
AT&T Labs-Research, USA

Th1Q11     Volume 5, p. 2091
Linguistically Engineered Tools for Speech Recognition Error Analysis
Van Ess-Dykema, C Ries, K
U.S. Department of Defense, USA

Th1Q12     Volume 5, p. 2095
Estimating Entropy of a Language from Optimal Word Insertion Penalty
Takeda, K Ogawa, A Itakura, F
Nagoya University, JAPAN

Th1Q13     Volume 5, p. 2099
A Linguistic Analysis of Repair Signals in Co-operative Spoken Dialogues
Tseng, S-C
University of Bielefeld, GERMANY

Th1Q14     Volume 5, p. 2103
A Hierarchical Language Model for CSR
Valverde-Albacete, F J Pardo, J M
Universidad Carlos III De Madrid, SPAIN

Th1Q15     Volume 5, p. 2107
Spoken Language Understanding Within Dialogues Using a Graphical Model of Task Structure
Wright, J H Gorin, A L Abella, A
AT&T Labs - Research, USA

Th1Q16     Volume 5, p. 2111
Keyword Extraction of Radio News using Domain Identification based on Categories of an Encyclopedia
Suzuki, Y Fukumoto, F Sekiguchi, Y
Yamanashi University, JAPAN

Th1R - Human Speech Perception 2

Th1R1     Volume 5, p. 2115
Fundamental frequency flunctuation in continuous vowel utterance and its perception
Akagi, M Iwaki, M Minakawa, T
Japan Advanced Institute of Science and Technology, JAPAN

Th1R2     Volume 5, p. 2119
Estimation of Mental Lexicon Size with Word Familiarity Database
Amano, S Kondo, T
NTT Basic Research Laboratories, JAPAN

Th1R3     Volume 5, p. 2123
Vowel Quality in Spontaneous Speech: What Makes a Good Vowel?
Aylett, M P Turk, A
University of Edinburgh, UNITED KINGDOM

Th1R4     Volume 5, p. 2127
Cooperation and Competition of Burst and Formant Transitions for the Perception and the Identification of French Stops
Neagu, A Bailly, G
Institut Communication Parlee, FRANCE

Th1R5     Volume 5, p. 2131
The effect of modifying formant amplitudes on the perception of French vowels generated by copy synthesis
Bonneau, A Laprie, Y
Loria & CNRS, FRANCE

Th1R6     Volume 5, p. 2135
Segmental and tonal processing in Cantonese
Chen, H-C Yip, M Wong, S-Y
Chinese University of Hong Kong, HONG KONG

Th1R7     Volume 5, p. 2139
Phonological similarity effects in Cantonese spoken-word processing
Yip, M Leung, P-Y Chen, H-C
Chinese University of Hong Kong, HONG KONG

Th1R8     Volume 5, p. 2143
On the learnability of the voicing contrast for initial stops
Damper, R I Gunn, S R
University of Southampton, UNITED KINGDOM

Th1R9     Volume 5, p. 2147
Acoustic and Perceptual Characteristics of Italian Stop Consonants
Cerrato, L Falcone, M
Fondazione Ugo Bordoni, ITALY

Th1R10     Volume 5, p. 2151
Acoustic cues for the auditory identification of the Spanish fricative /f/
Fernandez, S Feijoo, S Balsa, R Barros, N
Universidad de Santiago, SPAIN

Th1R11     Volume 5, p. 2155
Recognition of vowels in fricative context
Fernandez, S Feijoo, S Balsa, R Barros, N
Universidad De Santiago, SPAIN

Th1R12     Volume 5, p. 2159
Voicing affects perceived manner of articulation
Fernandez, S Feijoo, S Almeida, P
Universidad De Santiago, SPAIN

Th1R13     Volume 5, p. 2163
Enhancement techniques to improve the intelligibility of consonants in noise: Speaker and listener effects
Hazan, V Simpson, A Huckvale, M
Department of Phonetics and Linguistics, UCL, UNITED KINGDOM

Th1R14     Volume 5, p. 2167
Boundaries of Perception of Long Tones in Taiwanese Speech
Jian F
University of Reading, UNITED KINGDOM

Th1R15     Volume 5, p. 2171
Effects of phonetic quality and duration on perceptual acceptability of temporal changes in speech
Kato, H Tsuzaki, M Sagisaka, Y
ATR Human Information Processing Research Labs, JAPAN

Th1R16     Volume 5, p. 2175
Dynamic vs. Static Spectral Detail in the Perception of Gated Stops
Kiefte, M Neary, T M
University of Alberta, CANADA

Th1R17     Volume 5, p. 2179
Phonological Units in Speech Segmentation and Phonological Awareness
Otake, T Yoneyama, K
Dokkyo University, JAPAN

Th1R18     Volume 5, p. 2183
How Far Do Speakers Back Up in Repairs? A Quantitative Model
Shriberg, E Stolcke, A
SRI International, USA

Th1R19     Volume 5, p. 2187
Donít blame it (all) on the pause: Further ERP evidence for a
prosody-induced garden-path in running speech
Steinhauer, K Alter, K Friederici, A D
Max-Planck-Institute of Cognitive Neuroscience, GERMANY

Th1R20     Volume 5, p. 2191
The role of stress for lexical selection in Dutch
Vroomen, J de Gelder, B
University of Tilburg, THE NETHERLANDS

Th1R21     Volume 5, p. 2195
The perception of stressed syllables in Finnish
Tuomainen, J Vroomen, J de Gelder, B
University of Tilburg, THE NETHERLANDS

Th1R22     Volume 5, p. 2199
The perception of the morae with devocalized vowels in Japanese language
Yamakawa, K Baba, R
Prefectural University of Kumamoto, JAPAN

Th2A - Large Vocabulary Continuous Speech Recognition 4

Th2A1     Volume 5, p. 2203
High Resolution Decision Tree based Acoustic Modeling beyond CART
Chou, W Reichl, W
Bell Labs, Lucent Technologies, USA

Th2A2     Volume 5, p. 2207
Unsupervised Training of a Speech Recognizer Using TV Broadcasts
Kemp, T Waibel, A
University of Karlsruhe, GERMANY

Th2A3     Volume 5, p. 2211
A New Method to Achieve Fast Acoustic Matching for Speech Recognition
Lee, C Z O'Shaughnessy, D
INRS-Telecommunications, CANADA

Th2A4     Volume 5, p. 2215
Improved Parameter Tying for Efficient Acoustic Model Evaluation in Large Vocabulary Continuous Speech Recognition
Duchateau, J Demuynck, K Van Compernolle, D Wambacq, P
Kath. Univ. Leuven - ESAT/PSI, BELGIUM

Th2A5     Volume 5, p. 2219
A New Look at HMM Parameter Tying for Large Vocabulary Speech Recognition
Sankar, A
SRI International, USA

Th2A6     Volume 5, p. 2223
Factor Analysis Invariant to Linear Transformations of Data
Gopinath, R A Ramabhadran, B Dharanipragada, S
IBM T. J. Watson Research Center, USA

Th2B - Spoken Language Understanding Systems 2

Th2B1     Volume 5, p. 2227
Automatic Ambiguity Detection
Sproat, R van Santen, J
Bell Laboratories / Lucent Technologies, USA

Th2B2     Volume 5, p. 2231
Empowering Knowledge Based Speech Understanding through Statistics
Fischer, J Haas, J Noth, E Niemann, H Deinzer, P
University of Erlangen, GERMANY

Th2B3     Volume 5, p. 2235
Concept-driven speech understanding incorporated with a statistic language model
Nagai, A Ishikawa, Y
Mitsubishi Electric Corporation, JAPAN

Th2B4     Volume 5, p. 2239
On the Limitations of Stochastic Conceptual Finite-State Language Models for Speech Understanding
Colas, J Ferreiros, J Montero, J M Pastor, J Gallardo, A Pardo, J M
GTH-IEL-UPM, SPAIN

Th2B5     Volume 5, p. 2243
Towards Speech Understanding across multiple languages
Ward, T Roukos, S Neti, C Gros, J Epstein, M Dharanipragada, S
IBM T.J. Watson Research Center, USA

Th2B6     Volume 5, p. 2247
Automatic Detection of Sentence Boundaries and Disfluencies Based on Recognized Words
Stolcke, A Shriberg, E Bates, R Ostendorf, M Hakkani, D Plauche, M Tur, G Lu, Y
SRI International, USA

Th2C - Signal Processing and Speech Analysis 3

Th2C1     Volume 6, p. 2251
Determination of articulator positions from speech acoustics by applying dynamic articulatory constraints
Suzuki, S Okadome, T Honda, M
NTT Basic Research Laboratories, JAPAN

Th2C2     Volume 6, p. 2255
Recognizing Emotions in Speech Using Short-term and Long-term Features
Li, Y Zhao, Y
University of Illinois, USA

Th2C3     Volume 6, p. 2259
PeriphEar : A Nonlinear Active Model of the Auditory Periphery
Robert, A R Eriksson, J L E
CIRC, EPFL, SWITZERLAND

Th2C4     Volume 6, p. 2263
The Voicing Feature for Stop Consonants: Acoustic Phonetic Analyses and Automatic Speech Recognition Experiments
Ramesh, P Niyogi, P
Lucent Technologies, USA

Th2C5     Volume 6, p. 2267
Wavelet-based Energy Binning Cepstral Features for Automatic Speech Recognition
Basu, S Maes, S
IBM T.J. Watson Research Center, USA

Th2C6     Volume 6, p. 2271
Articulatory Analysis using a Codebook for Articulatory based Low
Bit-Rate Speech Coding

Silva, C Chennoukh, S
Universidade Do Minho, PORTUGAL

Th2D - Human Speech Perception 3

Th2D1     Volume 6, p. 2275
Categorical perception: Important phenomenon or lasting myth?
Massaro, D W
University of California, Santa Cruz, USA

Th2D2     Volume 6, p. 2279
Categorical Perception of Vowels
Gerrits, E Schouten, B
Utrecht University, THE NETHERLANDS

Th2D3     Volume 6, p. 2283
Suprasegmental Cues for the Segmentation of Identical Vowel Sequences in Japanese
Kakehi, K Hirose, Y
Nagoya University, JAPAN

Th2D4     Volume 6, p. 2287
Perception of concurrent approximant-vowel syllables
Ainsworth, W A
Keele University, UNITED KINGDOM

Th2D5     Volume 6, p. 2291
Perceived Swedish Vowel Quantity: Effects of Postvocalic Consonant Duration
Behne, D M Czigler, P E Sullivan, K P H
Norwegian University of Science & Technology, SWEDEN

Th4A - Speaker Adaptation 3

Th4A1     Volume 6, p. 2295
On-line hierarchical transformation of hidden Markov models for speaker adaptation
Chien, J-T
National Cheng Kung University, TAIWAN R.O.C.

Th4A2     Volume 6, p. 2299
High-Speed Speaker Adaptation Using Phoneme-Dependent
Tree-Structured Speaker Clustering

Suzuki, M Abe, T Mori, H Makino, S Aso, H
Tohoku University, JAPAN

Th4A3     Volume 6, p. 2303
The Use of Confidence Measures in Unsupervised Adaptation of Speech Recognizers
Anastasakos, T Balakrishnan, S
Motorola, Lexicus Division, USA

Th4A4     Volume 6, p. 2307
Speaker Normalization with All-Pass Transforms
McDonough, J Byrne, W Luo, X
Johns Hopkins University, USA

Th4A5     Volume 6, p. 2311
The CHAM model of hyperarticulate adaptation during human-computer error resolution
Oviatt, S L
Oregon Graduate Institute of Science & Technology, USA

Th4B - Spoken Language Understanding Systems 3

Th4B1     Volume 6, p. 2315
Language Modeling for Content Extraction in Human-Computer Dialogues
Reichl, W Carpenter, R Chu-Carroll, J Chou, W
Bell Labs - Lucent Technologies, USA

Th4B2     Volume 6, p. 2319
A Language Model Combining Trigrams and Stochastic Context-Free Grammars
Gillett, J Ward, W
Carnegie Mellon University, USA

Th4B3     Volume 6, p. 2323
Online Adaptation of Language Models in Spoken Dialogue Systems
Souvignier, B Kellner, A
Philips Research Laboratories, GERMANY

Th4B4     Volume 6, p. 2327
Language Model Adaptation for Spoken Language Systems
Riccardi, G Potamianos, A Narayanan, S
AT&T Labs-Research, USA

Th4B5     Volume 6, p. 2331
Detecting Topic Shifts Using a Cache Memory
Bigi, B De Mori, R El-Beze, M Spriet, T
University of Avignon, FRANCE

Th4B6     Volume 6, p. 2335
A Discourse Coding Scheme for Conversational Spanish
Levin, L Thyme-Gobbel, A Lavie, A Ries, K Zechner, K
Carnegie Mellon University, USA

Th4C - Multimodal Spoken Language Processing 3

Th4C1     Volume 6, p. 2339
Referential Features and Linguistic Indirection in Multimodal Language
Oviatt, S L Kuhn, K
Oregon Graduate Institute of Science & Technology, USA

Th4C2     Volume 6, p. 2343
Multimodal Language Processing
Johnston, M
Oregon Graduate Institute, USA

Th4C3     Volume 6, p. 2347
Implementation of Coordinative Nodding Behavior on Spoken Dialogue Systems
Hirasawa, J-I Miyazaki, N Nakano, M Kawabata, T
Nippon Telegraph & Telephone, JAPAN

Th4C4     Volume 6, p. 2351
Use of Non-verbal Information in Communication between Human and Robot
Yokoyama, M Aoyama, K Kikuchi, H Shirai, K
Waseda University, JAPAN

Th4C5     Volume 6, p. 2355
What you see is (almost) what you hear: Design principles for user interfaces for accessing speech archives
Whittaker, S Choi, J Hirschberg, J Nakatani, C
AT & T Labs - Research, USA

Th4D - Acoustic Phonetics 2

Th4D1     Volume 6, p. 2359
Regional variations in the vowels of female adolescents from Sydney
Cox, F Palethorpe, S
Macquarie University, AUSTRALIA

Th4D2     Volume 6, p. 2363
A kinematic analysis of New Zealand and Australian English vowel spaces
Watson, C Harrington, J Palethorpe, S
Macquarie University, AUSTRALIA

Th4D3     Volume 6, p. 2367
Syllable-onset acoustic properties associated with syllable-coda voicing
Nguyen, N Hawkins, S
University of Geneva, SWITZERLAND

Th4D4     Volume 6, p. 2371
Articulatory, acoustic and perceptual aspects of fricative/stop coarticulation
Nguyen, N Wrench, A Gibbon, F Hardcastle, W
University of Geneva, SWITZERLAND

Th4D5     Volume 6, p. 2375
Efficiency as an organizing principle of natural speech
Van Son, R J J H Koopmans-van Beinum, F J Pols, L C W
University of Amsterdam, THE NETHERLANDS

Th4D6     Volume 6, p. 2379
Within-speaker variability due to speaking manners
Karlsson, I Banziger, T Dankovicová, J Johnstone, T Lindberg, J Melin, H Nolan, F Scherer, K
KTH, SWEDEN

Th4P - Large Vocabulary Continuous Speech Recognition 5

Th4P1     Volume 6, p. 2383
A Thesaurus-based Statistical Language Model for Broadcast New Transcription
Ando, A Kobayashi, A Imai, T
NHK Science & Technical Research Laboratories, JAPAN

Th4P2     Volume 6, p. 2387
Effect of Task Complexity on Search Strategies for the Motorola Lexicus Continuous Speech Recognition System
Balakrishnan, S V
Motorola Lexicus Division, USA

Th4P3     Volume 6, p. 2391
New Features for Confidence Annotation
Bansal, D Ravishankar, M
Carnegie Mellon University, USA

Th4P4     Volume 6, p. 2395
Multi-Span Statistical Language Modeling for Large Vocabulary Speech Recognition
Bellegarda, J R
Apple Computer, USA

Th4P5     Volume 6, p. 2399
Maximum-Likelihood Updates of HMM Duration Parameters for Discriminative Continuous Speech Recognition
Chengalvarayan, R
Lucent Technologies, USA

Th4P6     Volume 6, p. 2403
Towards Better Integration of Semantic Predictors in Statistical Language Modeling
Coccaro, N Jurafsky, D
University of Colorado, USA

Th4P7     Volume 6, p. 2407
An Asymmetric Stochastic Language Model Based on Multi-Tagged Words
Pastor, J Colas, J San-Seguno, R Pardo, J
GTH-IEL-UPM, SPAIN

Th4P8     Volume 6, p. 2411
Product-Code Vector Quantization of Cepstral Parameters for Speech Recognition over the WWW
Digalakis, V Neumeyer, L Perakakis, M
Technical University of Crete, GREECE

Th4P9     Volume 6, p. 2415
Context Dependent Tree Based Transforms for Phonetic Speech Recognition
Doherty, B Vaseghi, S McCourt, P
Queen's University of Belfast, UNITED KINGDOM

Th4P10     Volume 6, p. 2419
Interfacing Acoustic Models with Natural Language Processing Systems
Johnson, M T Harper, M P Jamieson, L H
Purdue University, USA

Th4P11     Volume 6, p. 2423
Hierachical Cluster Language Modelling with Statistical Rule Extraction for Rescoring N-Best Hypotheses During Speech Decoding
Jang, P J Hauptmann, A
Carnegie Mellon University, USA

Th4P12     Volume 6, p. 2427
Dealing with out-of-vocabulary words and speech disfluencies in an
N-gram based speech understanding system

Kai, A Hirose, Y Nakagawa, S
Toyohashi University of Technology, JAPAN

Th4P13     Volume 6, p. 2431
Source-Extended Language Model for Large Vocabulary Continuous Speech Recognition
Kobayashi, T Wada, Y Kobayashi, N
Waseda University, JAPAN

Th4P14     Volume 6, p. 2435
Time Dependent Language Model for Broadcast News Transcription and Its Post-Correction
Kobayashi, A Onoe, K Imai, T Ando, A
NHK Science & Technical Research Laboratories, JAPAN

Th4P15     Volume 6, p. 2439
Exploiting Transitions and Focussing on Linguistic Properties for ASR
Koreman, J Barry, W J Andreeva, B
University of the Saarland, GERMANY

Th4P16     Volume 6, p. 2443
A Unified Framework for Sublexical and Linguistic Modelling Supporting Flexible Vocabulary Speech Understanding

Lau, R Seneff, S
MIT, USA

Th4P17     Volume 6, p. 2447
A method for modeling liaison in a speech recognition system for French
Bahl,L De Gennaro, S De Souza, P Epstein, E Le Roux, J M Lewis, B Waast, C
IBM, USA

Th4P18     Volume 6, p. 2451
On Variable Sampling Frequencies in Speech Recognition
Liu, F-H Picheny, M
IBM Watson Research Center, USA

Th4P19     Volume 6, p. 2455
Pronunciation Modeling for Large Vocabulary Conversational Speech Recognition
Ma, K Zavaliagkos, G Iyer, R
GTE/BBN Technologies, USA

Th4P20     Volume 6, p. 2459
Time Shift Invariant Speech Recognition
Basu, S Ittycheriah, A Maes, S H
IBM T.J. Watson Research Center, USA

Th4P21     Volume 6, p. 2463
The Demiphone versus the Triphone in a Decision-tree State-tying Framework
Mariño, J B Pachès-Leal, P Nogueiras, A
Polytechnical University of Catalonia, SPAIN

Th4P22     Volume 6, p. 2467
Word Clustering for a Word Bi-gram Model
Mori, S Nishimura, M Itoh, N
Tokyo Research Laboratory, JAPAN

Th4P23     Volume 6, p. 2471
A Large Vocabulary Continuous Speech Recognition Hybrid System for the Portuguese Language
Neto, J Martins, C Almeida, L
INESC, PORTUGAL

Th4P24     Volume 6, p. 2475
Speech Recognition Performance on a New Voicemail Transcription Task
Padmanabhan, M Ramabhadran, B Basu, S
IBM T J Watson Research Center, USA

Th4P25     Volume 6, p. 2479
Grammatical and statistical word prediction system for Spanish integrated in an aid for people with disabilities
Palazuelos, S Aguilera, S Rodrigo, J Godino, J
ETSI Telecomunicación, SPAIN

Th4P26     Volume 6, p. 2483
Segmentation using a maximum entropy approach
Papineni, K Dharanipragada, S
IBM T.J. Watson Research Center, USA

Th4P27     Volume 6, p. 2487
Recognition Performance of a Large-Scale Dependency Grammar Language Model
Berger, A Printz, H
Carnegie Mellon University, USA

Th4P28     Volume 6, p. 2491
A Bootstrap Technique for Building Domain-Dependent Language Models
Ramaswamy, G N Printz, H Gopalakrishnan, P S
I.B.M. Research Center, USA

Th4P29     Volume 6, p. 2495
Estimation of the Probability Distributions of Stochastic Context-Free Grammars from the K-Best Derivations
Sanchez, J-A Benedi, J-M
Universidad Politecnica De Valencia, SPAIN

Th4P30     Volume 6, p. 2499
Robust HMM Estimation with Gaussian Merging-Splitting and
Tied-Transform HMMs

Sankar, A
SRI International, USA

Th4P31     Volume 6, p. 2503
Nonlinear Interpolation of Topic Models for Language Model Adaptation
Seymore, K Chen, S Rosenfeld, R
Carnegie Mellon University, USA

Th4P32     Volume 6, p. 2507
Performance evaluation of word phrase and noun category language models for broadcast news speech recognition
Takagi, K Oguro, R Hashimoto, K Ozeki, K
The University of Electro-Communications, JAPAN

Th4P33     Volume 6, p. 2511
Robust Automatic Continuous-Speech Recognition Based on a Voiced-Unvoiced Decision
Tolba, H O'Shaughnessy, D
INRS-Telecommunications, CANADA

Th4P34     Volume 6, p. 2515
Double Tree Beam Search Using Hierarchical Subword Units
Torrecilla, J C Cortazar, I Hernandez, L
Telefonica I+D, SPAIN

Th4P35     Volume 6, p. 2519
Text Segmentation and Topic Tracking on Broadcast News Via a Hidden Markov Model Approach
van Mulbregt, P Carp, I Gillick, L Lowe, S Yamron, J
Dragon Systems, Inc, USA

Th4P36     Volume 6, p. 2523
Multi-Phone Strings as SubWord Units for Speech Recognition
OíNeill, P Vaseghi, S Doherty, B Tan, W McCourt, P
The Queen's University of Belfast

Th4P37     Volume 6, p. 2527
Phonetic modification of the Syllable / tu / in two spontaneous American English dialogues
Veilleux, N Shattuck-Hufnagel, S
Boston University Metropolitan College, USA

Th4P38     Volume 6, p. 2531
Efficient Lattice Representation and Generation
Weng, F Stolcke, A Sankar, A
SRI International, USA

Th4P39     Volume 6, p. 2535
Modeling pronunciation variation for a Dutch CSR: Testing three methods
Wester, M Kessens, J M Strik, H
University of Nijmegen, THE NETHERLANDS

Th4P40     Volume 6, p. 2539
Comparison of Language Modelling Techniques for Russian and English
Whittaker, E W D Woodland, P C
Cambridge University, UNITED KINGDOM

Th4P41     Volume 6, p. 2543
Optimized POS-Based Language Models for Large Vocabulary Speech Recognition
Witschel, P
Siemens AG ZT IK 5, GERMANY

Th4P42     Volume 6, p. 2547
Reducing Peak Search Effort Using Two-Tier Pruning
Wright, M Hovell, S Ringland, S
BT Laboratories, UNITED KINGDOM

Th4P43     Volume 6, p. 2551
Using Untranscribed Training Data to Improve Performance
Zavaliagkos, G Siu, M Colthurst, T Billa, J
BBN Technologies, USA

Th4P44     Volume 6, p. 2555
Telephone Band LVCSR for Hearing-Impaired Users
Jan, E Bakis, R Liu, F Picheny, M
IBM T.J. Watson Research Center, USA

Th4P45     Volume 6, p. 2559
Using X-gram for efficient speech recognition
Bonafonte, A Mariño, J B
Polytechnical University of Catalonia, SPAIN

Th4P46     Mariño et al.     (see Th4P21)

Th4Q - Speech Coding 3

Th4Q1     Volume 6, p. 2563
A New Linear Predictive Method for Compression of Speech Signals
Alku, P Varho, S
University of Turku, FINLAND

Th4Q2     Volume 6, p. 2567
Hierarchical Temporal Decomposition: A Novel Approach to Efficient Compression of Spectral Characteristics of Speech
Ghaemmaghami, S Deriche, M Sridharan, S
Queensland University of Technology, AUSTRALIA

Th4Q3     Volume 6, p. 2571
Speech intelligibility testing for new technologies
Hura, S
Lucent Technologies, USA

Th4Q4     Volume 6, p. 2575
Efficient Quantization of LSF Parameters Based on Temporal Decomposition
Kim, S-J Lee, S-H Han, W-J Oh, Y-H
KAIST, KOREA

Th4Q5     Volume 6, p. 2579
A Sinusoidal Harmonic Vocoder at 1.2 kbps Using Auditory Perceptual Characteristics
Kohata, M
Chiba Institute of Technology, JAPAN

Th4Q6     Volume 6, p. 2583
A 16 kbit/s Wideband CELP Coder Using Mel-Generalized Cepstral Analysis and Its Subjective Evaluation
Koishida, K Hirabayashi, G Tokuda, K Kobayashi, T
University of California, Santa Barbara, USA

Th4Q7     Volume 6, p. 2587
Comparison of spectral estimation techniques for low bit-rate speech coding
Molyneux, D J Parris, C I Sun, X Q Cheetham, B M G
University of Manchester, UNITED KINGDOM

Th4Q8     Volume 6, p. 2591
Low Bit Rate Coding for Speech and Audio Using Mel Linear Predictive Coding (MLPC) analysis
Nakatoh, Y Norimatsu, T Low, A H Matsumoto, H
Matsushita Electric Industrial Co Ltd, JAPAN

Th4Q9     Volume 6, p. 2595
Comparison Study on VQ Codevector Index Assignment
Pan, J S Shieh, C S Chu, S C
National Kaohsiung Institute of Technology, TAIWAN R.O.C.

Th4Q10     Volume 6, p. 2599
Using linguistic knowledge to improve the design of low-bit rate LSF quantisation
Parry, J J Burnett, I S Chicharo, J F
University of Wollongong, AUSTRALIA

Th4Q11     Volume 6, p. 2603
Transform Coding of LSF Parameters Using Wavelets
Petrinovic, D
University of Zagreb, CROATIA

Th4Q12     Volume 6, p. 2607
Source Controlled Variable Bit-Rate Speech Coder Based on Waveform Interpolation
Plante, F Cheetham, B M G Marston, D Barrett, P A
Liverpool University, UNITED KINGDOM

Th4Q13     Volume 6, p. 2611
Improving Speaker Recognizability in Phonetic Vocoders
Ribeiro, C E M Trancoso, I M
INESC, PORTUGAL

Th4R - Language Acquisition 3 / Multilingual Perception and Recognition 2

Th4R1     Volume 6, p. 2615
Speech Perception and Spoken Language in Children with Impaired Hearing
Blamey, P Sarant, J Serry, T Wales, R James, C Barry, J Clark, G Wright, M Tooher, R Psarros, C Godwin, G Rennie, M Meskin, T
University of Melbourne, AUSTRALIA

Th4R2     Volume 6, p. 2619
Quantitative assessment of second language learners' fluency: An automatic approach
Cucchiarinim, C Strik, H Boves, L
University of Nijmegen, THE NETHERLANDS

Th4R3     Volume 6, p. 2623
Cross-Language Merged Speech Units and their Descriptive Phonetic Correlates
Dalsgaard, P Andersen, O Barry, W
Aalborg University, DENMARK

Th4R4     Volume 6, p. 2627
Crosslinguistic Disfluency Modelling: A Comparative Analysis of Swedish and American English Human-Human and Human-Machine Dialogues
Eklund, R Shriberg, E
Telia Research AB, SWEDEN

Th4R5     Volume 6, p. 2631
Calibration of machine scores for pronunciation grading
Franco, H Neumeyer, L
SRI International, USA

Th4R6     Volume 6, p. 2635
Phonetic Distance-Based Hypothesis Driven Lexical Adaptation for Transcribing Multilingual Broadcast News
Geutner, P Finke, M Waibel, A
Carnegie Mellon University, USA

Th4R7     Volume 6, p. 2639
Automatic Pronunciation Error Detection and Guidance for Foreign Language Learning
Jo, C-H Kawahara, T Doshita, S Dantsuji, M
Kyoto University, JAPAN

Th4R8     Volume 6, p. 2643
Lexical Access for Large-Vocabulary Speech Recognition
Leung, H Y Leung, H C
Chinese University of Hong Kong, HONG KONG

Th4R9     Volume 6, p. 2647
The effect of fundamental frequency on Mandarin speech recognition
Liu, S Doyle, S Morris, A Ehsani, F
Nuance Communications, USA

Th4R10     Volume 6, p. 2651
The perception of nativeness: Variable speakers and flexible listeners
Markham, D
Deakin University, AUSTRALIA

Th4R11     Volume 6, p. 2655
Voice Dictation in the Secondary School Classroom
McTear, M O'Hare, E
University of Ulster, UNITED KINGDOM

Th4R12     Volume 6, p. 2659
The importance of first syllable in English spoken word recognition by adult Japanese speakers
Nakayama, K Tomita-Nakayama, K
Yamagata University, JAPAN

Th4R13     Volume 6, p. 2663
Spoken L2 Teaching with Contrastive Visual and Auditory Feedback
Öster, A-M
KTH, SWEDEN

Th4R14     Volume 6, p. 2667
The role of phonological, morphological and orthographic knowledge in the intuitive syllabification of Dutch words: A longitudinal approach
Sandra, D Gillis, S
University of Antwerp, BELGIUM

Th4R15     Volume 6, p. 2671
The Acquisition of Japanese Compound Accent Rule
Shirose, A Kubozono, H Kiritani, S
University of Tokyo, JAPAN

Th4R16     Volume 6, p. 2675
The Acquisition of Putonghua Phonology
So, L K H Zhou, J
The University of Hong Kong, HONG

Th4R17     Volume 6, p. 2679
Enhancing Speech Processing of Japanese Learners of English Utilizing Time-Scale Expansion with Constant Pitch
Tomita-Nakayama, K Nakayama, K Misaki, M
Yamagata University, JAPAN

Th4R18     Volume 6, p. 2683
A Bootstrap Training Approach for Language Model Classifiers
Warnke, V Noeth, E Buckow, J Harbeck, S Niemann, H
University of Erlangen, GERMANY

Th4R19     Volume 6, p. 2687
Voice onset time patterns in 7-, 9- and 11-year old children
Whiteside, S P Marshall, J
University of Sheffield, UNITED KINGDOM

Th4R20     Volume 6, p. 2691
Some developmental patterns in the speech of 6-, 8- and 10-year old children: An acoustic phonetic study
Whiteside, S P Hodgson, C
University of Sheffield, UNITED KINGDOM

Th4R21     Volume 6, p. 2695
Language Development After Extreme Childhood Deprivation: A Case Study
Brown, L Locke, J L Jones, P Whiteside, S
University of Sheffield, UNITED KINGDOM

Th4R22     Volume 6, p. 2699
Phonological elements as a basis for language-independent ASR
Williams, G Terry, A M P Kaye, J D
SOAS, University of London, UNITED KINGDOM

Th4R23     Volume 6, p. 2703
A Phonetic and Acoustic Study of Babbling in an Italian Child
Zmarich, C Lanni, R
Instituto Di Fonetica E Dialettologia, CNR, ITALY

Th4R24     Volume 6, p. 2707
Rescoring Multiple Pronunciations Generated from Spelled Words
Kuhn, R Junqua, J-C Martzen, P
Panasonic Technologies, Inc., USA

Th5A - Segmentation, Labelling and Speech Corpora 3

Th5A1     Volume 6, p. 2711
A recursive algorithm for the forced alignment of very long audio segments
Moreno, P J Joerg, C Van Thong, J-M Glickman, O
Compaq (Cambridge Research Lab), USA

Th5A2     Volume 6, p. 2715
The selection of pronunciation variants: comparing the performance of man and machine
Kessens, J M Wester, M Cucchiarini, C Strik, H
University of Nijmegen, THE NETHERLANDS

Th5A3     Volume 6, p. 2719
Acoustic Confidence Measures for Segmenting Broadcast News
Barker, J Williams, G Renals, S
Sheffield University, UNITED KINGDOM

Th5A4     Volume 6, p. 2723
A Duration-Based Confidence Measure for Automatic Segmentation of Noise Corrupted Speech
Pellom, B L Hansen, J H L
Duke University, USA

Th5A5     Volume 6, p. 2727
Segmentation and Classification of Broadcast News Audio
Hain, T Woodland, P C
Cambridge University, UNITED KINGDOM

Th5A6     Volume 6, p. 2731
Speaker Recruitment Methods and Speaker Coverage - Experiences From a Large Multilingual Speech Database Collection
Lindberg, B Comeyne, R Draxler, C Senia, F
Aalborg University, DENMARK

Th5B - Text-To-Speech Synthesis 5

Th5B1     Volume 6, p. 2735
A phonologically motivated method of selecting non-uniform units
Breen, A P Jackson, P
BT Labs, UNITED KINGDOM

Th5B2     Volume 6, p. 2739
A Synthesis Method Based on Concatenation of Demisyllables and a Residual Excited Vocal Tract Model
Pearson, S Kibre, N Niedzielski, N
Speech Technology Lab/Panasonic Technologies Inc., USA

Th5B3     Volume 6, p. 2743
Exploration of Acoustic Correlates in Speaker Selection for Concatenative Synthesis
Syrdal, A K Conkie, A Stylianou, Y
AT&T Labs - Research, USA

Th5B4     Volume 6, p. 2747
A Perceptual Evaluation of Distance Measures for Concatenative Speech Synthesis
Wouters, J Macon, M W
Center for Spoken Language Understanding, USA

Th5B5     Volume 6, p. 2751
HMM-Based Smoothing for Concatenative Speech Synthesis
Plumpe, M Acero, A Hon, H Huang, X
Microsoft Research, USA

Th5B6     Volume 6, p. 2755
A nonlinear unit selection strategy for concatenative speech synthesis based on syllable level features
Holzapfel, M Cambpell, N
ATR Interpreting Telecommunications Research Laboratories, JAPAN

Th5C - Spoken Language Generation and Translation 2

Th5C1     Volume 6, p. 2759
A Generic Algorithm for Generating Spoken Monologues
Klabbers, E Krahmer, E Theune, M
IPO Center for Research On User-System Interaction, THE NETHERLANDS

Th5C2     Volume 6, p. 2763
On the Use of Automatically Generated Discourse-level Information in a Concept-to-Speech Synthesis System
Hitzeman, J Black, A W Taylor, P Mellish, C Oberlander, J
University of Edinburgh, UNITED KINGDOM

Th5C3     Volume 6, p. 2767
Learning Phrase-based Head Transduction Models for Translation of Spoken Utterances
Alshawi, H Bangalore, S Douglas, S
AT&T Labs, USA

Th5C4     Volume 6, p. 2771
Probabilistic dialogue act extraction for concept based multilingual translation systems
Fukada, T Koll, D Waibel, A Tanigaki, K
ATR Interpreting Telecommunications Research Laboratories, JAPAN

Th5C5     Volume 6, p. 2775
Fast Decoding for Statistical Machine Translation
Wang, Y Waibel, A
Carnegie Mellon University, USA

Th5C6     Volume 6, p. 2779
A Japanese-to-English Speech Translation System: ATR-MATRIX
Takezawa, T Morimoto, T Sagisaka, Y Campbell, N Iida, H Sugaya, F Yokoo, A Yamamoto, S
ATR Interpreting Telecommunications Research Laboratories, JAPAN

Th5D - Human Speech Perception 4

Th5D1     Volume 6, p. 2783
Orthografik inkoncistensy ephekts in foneme detektion?
Cutler, A Treiman, R van Ooijen, B
Max Plank Institute for Psycholinguistics, THE NETHERLANDS

Th5D2     Volume 6, p. 2787
The effect of orthographic knowledge on the segmentation of speech
Derwing, B L Nearey, T M Yoon, Y B
University of Alberta, CANADA

Th5D3     Volume 6, p. 2791
Spotting (different types of) Words in (different types of) Context
McQueen, J M Cutler, A
Max-Planck-Institute for Psycholinguistics, NETHERLANDS

Th5D4     Volume 6, p. 2795
Correlation Between Consonantal VC Transitions and degree of Perceptual Confusion of Place Contrasts in Hindi
Ohala, M Ohala, J J
San Jose State University, USA

Th5D5     Volume 6, p. 2799
Perception of Tonal Rises and Falls for Accentuation and Phrasing in Swedish
House, D Hermes, D Beaugendre, F
University of Skovde, SWEDEN

Th5D6     Volume 6, p. 2803
Speech Intelligibility Derived from Exceedingly Sparse Spectral Information
Greenberg, S Arai, T Silipo, R
International Computer Science Institute, USA

Friday 4 December, 1998

Fr1A - Robust Speech Processing in Adverse Environments 5

Fr1A1     Volume 7, p. 2807
Auditory Modelling Techniques for Robust Pitch Extraction and Noise Reduction
Cosi, P Pasquin, S Zovato, E
IIFD - Institute of Phonetics and Dialectology, ITALY

Fr1A2     Volume 7, p. 2811
Wavelet Transform-based Speech Enhancement
Ambikairajah, E Tattersall, G Davis, A
Athlone Institute of Technology, IRELAND

Fr1A3     Volume 7, p. 2815
A Practical Perceptual Frequency Autoregressive HMM Enhancement System
Logan, B Robinson, A
University of Cambridge, UNITED KINGDOM

Fr1A4     Volume 7, p. 2819
An Effective Quality Evaluation Protocol for Speech Enhancement Algorithms
Hansen, J H L Pellom, B L
Robust Speech Processing Laboratory, Duke University, USA

Fr1A5     Volume 7, p. 2823
An adaptive beamforming microphone array system using a blind deconvolution
Park, J-N Usagawa, T Ebata, M
Kumamoto University, JAPAN

Fr1A6     Volume 7, p. 2827
Speech Enhancement using Critical Band Spectral Subtraction
Singh, L Sridharan, S
Queensland University of Technology, AUSTRALIA

Fr1B - Text-To-Speech Synthesis 6

Fr1B1     Volume 7, p. 2831
How to Handle 'Foreign' Sounds in Swedish Text-to-Speech Conversion: Approaching the 'Xenophone' Problem
Eklund, R Lindström, A
Telia Research AB, SWEDEN

Fr1B2     Volume 7, p. 2835
Multi-Lingual Concatenative Speech Synthesis
Campbell, N
ATR Interpreting Telecommunications Research Laboratories, JAPAN

Fr1B3     Volume 7, p. 2839
On the Use of F0 features in Automatic Segmentation for Speech Synthesis
Saito, T
IBM Japan Ltd, JAPAN

Fr1B4     Volume 7, p. 2843
A linguistic and prosodic database for data-driven Japanese TTS synthesis
Sakurai, A Natsume, T Hirose, K
University of Tokyo, JAPAN

Fr1B5     Volume 7, p. 2847
Text-to-Speech Voice Adaptation from Sparse Training Data
Kani, A Macon, M
Oregon Graduate Institute, USA

Fr1B6     Volume 7, p. 2851
Describing Intonation with a Parametric Model
Möhler, G
University of Stuttgart, GERMANY

Fr1C - Speech Technology Applications and Human- Machine Interface 2

Fr1C1     Volume 7, p. 2855
Development of CAI system employing synthesized speech responses
Shinozaki, T Abe, M
NTT Human Interface Labs, JAPAN

Fr1C2     Volume 7, p. 2859
Using Combined Decisions and Confidence Measures for Name Recognition in Automatic Directory Assistance Systems
Kellner, A Rueber, B Schramm, H
Philips Research Laboratories, GERMANY

Fr1C3     Volume 7, p. 2863
VPQ: A Spoken Language Interface to Large Scale Directory Information
Buntschuh, B Kamm, C DiFabbrizio, G Abella, A Mohri, M Narayanan, S Zeljkovic, I Sharp, R Wright, R Marcus, S Shaffer J Duncan, R Wilpon, J
AT&T Labs - Research, USA

Fr1C4     Volume 7, p. 2867
SCAN - Speech Content Based Audio Navigator: A System Overview
Choi, J Hindle, D Hirschberg, J Magrin-Chagnolleau, I Nakatani, C Pereira, F Singhal, A Whittaker, S
AT&T Labs - Research, USA

Fr1C5     Volume 7, p. 2871
Controlling a HIFI with a continuous speech understanding system
Ferreiros, J Colas, J Macias-Guarasa, J Ruiz, A Pardo, J M
GTH-IEL-UPM, SPAIN

Fr1C6     Volume 7, p. 2875
User Evaluation of the Mask Kiosk
Lamel, L Bennacef, S Gauvain, J L Dartigues, H Temem, J-N
LIMSI-CNRS, FRANCE

Fr1D - Prosody and Emotion 6

Fr1D1     Volume 7, p. 2879
A Contrastive Study of Lexical Stress Placement in Singapore English and
British English

Low, E L Grabe, E
Nayang Technological University, SINGAPORE

Fr1D2     Volume 7, p. 2883
Integrated Recognition of Words and Phrase Boundaries
Gallwitz, F Batliner, A Buckow, J Huber, R Niemann, H Noeth, E
University of Erlangen, GERMANY

Fr1D3     Volume 7, p. 2887
Phrase accents revisited: comparative evidence from Standard and Cypriot Greek
Arvaniti, A
University of Cyprus, CYPRUS

Fr1D4     Volume 7, p. 2891
Phonetic invariance and phonological stability: Lithuanian pitch accents
Dogil, G Möhler, G
University of Stuttgart, GERMANY

Fr1D5     Volume 7, p. 2895
A HMM-based recognition system for perceptive relevant pitch movements of spontaneous German speech
Brindoepke, C Fink, G Kummert, F Sagerer, G
University Bielefeld, GERMANY

Fr1D6     Volume 7, p. 2899
Towards a reversible symbolic coding of intonation
Véronis, J Campione, E
Université de Provence, FRANCE

Fr1P - Hidden Markov Model Techniques 3

Fr1P1     Volume 7, p. 2903
A Statistical Phonemic Segment Model for Speech Recognition Based on Automatic Phonemic Segmentation
Aizawa, K Furuichi, C
TOIN University of Yokohama, JAPAN

Fr1P2     Volume 7, p. 2907
Improved Feature Decorrelation for HMM-based Speech Recognition
Demuynck, K Duchateau, J Van Compernolle, D Wambacq, P
K.U. Leuven - ESAT/PSI, BELGIUM

Fr1P3     Volume 7, p. 2911
Efficient high-order Hidden Markov Modelling
du Preez, J A Weber, D M
University of Stellenbosch, SOUTH AFRICA

Fr1P4     Volume 7, p. 2915
A Time-synchronous, Tree-based Search Strategy in the Acoustic Fast Match of an Asynchronous Speech Recognition System
Eide, E Bahl, L
IBM, USA

Fr1P5     Volume 7, p. 2919
Effective Structural Adaptation of LVCSR Systems to Unseen Domains Using Hierarchical Connectionist Acoustic Models
Fritsch, J Finke, M Waibel, A
University of Karlsruhe, GERMANY

Fr1P6     Volume 7, p. 2923
Support Vector Machines for Speech Recognition
Ganapathiraju, A Hamaker, J Picone, J
Mississippi State University, USA

Fr1P7     Volume 7, p. 2927
Natural Number Recognition Using Discriminatively Trained Inter-Word Context Dependent Hidden Markov Models
Gandhi, M B
Lucent Technologies, USA

Fr1P8     Volume 7, p. 2931
Information Theoretic Approaches to Model Selection
Hamaker, J Ganapathiraju, A Picone, J
Mississippi State University, USA

Fr1P9     Volume 7, p. 2935
Continuous Speech Recognition Using Segmental Unit Input HMMs with a Mixture of Probability Density Functions and Context Dependency
Hanai, K Yamamoto, K Minematsu, N Nakagawa, S
Toyohashi University of Technology, JAPAN

Fr1P10     Volume 7, p. 2939
Gaussian density tree structure in a multi-Gaussian HMM-based speech recognition system
Simonin, J Delphin-Poulat, L Damnati, G
France Telecom - CNET, FRANCE

Fr1P11     Volume 7, p. 2943
Generalized Phone Modeling Based on Piecewise Linear Segment Lattice
Kojima, H Tanaka, K
Electrotechnical Laboratory, AIST, MITI, JAPAN

Fr1P12     Volume 7, p. 2947
A Flexible Method of Creating HMM Using Block-diagonalization of Covariance Matrices
Koshiba, R Tachimori, M Kanazawa, H
Toshiba, JAPAN

Fr1P13     Volume 7, p. 2951
HMM Topology Selection for Accurate Acoustic and Duration Modeling
Chesta, C Laface, P Ravera, F
Politecnico Di Torino, ITALY

Fr1P14     Volume 7, p. 2955
Context-dependent duration modelling for continuous speech recognition
Lee, T Carlson, R Grandstrom, B
The Chinese University of Hong Kong, HONG KONG

Fr1P15     Volume 7, p. 2959
Training of Context-Dependent Subspace Distribution Clustering Hidden Markov Model
Mak, B Bocchieri, E
Hong Kong University of Science & Technology, HONG KONG

Fr1P16     Volume 7, p. 2963
Unsupervised Training of HMMs With Variable Number of Mixture Components per State
Martin, C Villarrubia, L Gonzalez, F J Hernandez, L
Telefonica I+D, SPAIN

Fr1P17     Volume 7, p. 2967
Acoustic Observation Context Modeling in Segment Based Speech Recognition
Szarvas, M Matsunaga, S
Technical University of Budapest, HUNGARY

Fr1P18     Volume 7, p. 2971
Capturing Discriminative Information Using Multiple Modeling Techniques
Ming, J Hanna, P Stewart, D Vaseghi, S Smith, F J
Queen's University of Belfast, UNITED KINGDOM

Fr1P19     Volume 7, p. 2975
Suprasegmental Duration Modelling with Elastic Constraints in Automatic Speech Recognition
Molloy, L Isard, S
Edinburgh University, UNITED KINGDOM

Fr1P20     Volume 7, p. 2979
An Adaptive Gradient-Search-Based Algorithm for Discriminative Training of HMMs
Nogueiras-Rodriguez, A Marinho, J B Monte, E
Polytechnical University of Catalonia, SPAIN

Fr1P21     Volume 7, p. 2983
Task Adaptation of Sub-Lexical Unit Models using the Minimum Confusibility Criterion on Task Independent Databases
Nogueiras-Rodriguez, A Marinho, J B
Polytechnical University of Catalonia, SPAIN

Fr1P22     Volume 7, p. 2987
Stochastic Calculus, Non-Linear Filtering, and the Internal Model Principle: Implications for Articulatory Speech Recognition
Ramsay, G
Institut De La Communication Parlee, FRANCE

Fr1P23     Volume 7, p. 2991
The Use of Meta-HMM in Multistream HMM Training for Automatic Speech Recognition
Wellekens, C J Kangasharju, J Milesi, C
Institut Eurecom, FRANCE

Fr1P24     Volume 7, p. 2995
Enhanced ASR by acoustic feature filtering
Wellekens, C J
Institut Eurecom, FRANCE

Fr1P25     Volume 7, p. 2999
Soft State-Tying for HMM-Based Speech Recognition
Neukirchen, C Willett, D Rigoll, G
Duisburg University, GERMANY

Fr1P26     Volume 7, p. 3003
Estimation of models for non-native speech in computer-assisted language learning based on linear model combination
Witt, S M Young, S J
Cambridge University, UNITED KINGDOM

Fr1P27     Volume 7, p. 3007
Duration Modeling using Cumulative Duration Probability and Speaking Rate Compensation
Yang, T-Y Kim, J-S Lee, C Youn, D H Cha, I-W
Yonsei University, KOREA

Fr1P28     Volume 7, p. 3011
Probabilistic Modeling with Bayesian Networks for Automatic Speech Recognition
Zweig, G Russell, S
IBM T.J. Watson Research Center, USA

Fr1Q - Speech & Hearing Disorders 2 / Speech Processing for the Speech Impaired and Hearing Impaired 1

Fr1Q1     Volume 7, p. 3015
SIVHA, Visual Speech Synthesis System
Blanco, Y Cuellar, M Villanueva, A Lacunza, F Cabeza, R Marcotegui, B
Universidad Publica De Navarraica, UPNA, SPAIN

Fr1Q2     Volume 7, p. 3019
Using automatic speech recognition and its possible effects on the voice
de Bruijn, C G Whiteside, S P Cudd, P A Syder, D Rosen, M Nord, L
University of Sheffield, UNITED KINGDOM

Fr1Q3     Volume 7, p. 3023
The Importance of F0 or Voice Pitch for Perception of Tonal Language: Simulations with Cochlear Implant Speech Processing Strategies
Fearn, R
University of New South Wales, AUSTRALIA

Fr1Q4     Volume 7, p. 3027
Assessing high level language in individuals with multiple sclerosis - a pilot study
Brunnegård, K Laakso, K Hartelius, L Ahlsén, E
Goteborg University, SWEDEN

Fr1Q5     Volume 7, p. 3031
Design of Cochlear Implant Device for Transmitting Voice Pitch Information in a Speech Sound of Asian Languages
Hiki, S Imaizumi, K Fukuda, Y
Waseda University, JAPAN

Fr1Q6     Volume 7, p. 3035
Abnormal     Volume-Duration Relationship in Parkinsonian Speech
Ho, A K Bradshaw, J L Iansek, R Alfredson, R J
Monash University, AUSTRALIA

Fr1Q7     Volume 7, p. 3039
Analysis of Disordered Speech Signal Using Wavelet Transform
Jo, C W Kim, D H
Changwon National University, KOREA

Fr1Q8     Volume 7, p. 3043
Multi-channel Pulsation Strategy for Electric Stimulation of Cochlea
Kitazawa, S Kirihata, H Kitamura, T
Shizuoka University, JAPAN

Fr1Q9     Volume 7, p. 3047
Synthetic faces as a lipreading support
Agelfors, E Beskow, J Dahlquist, M Granström, B Lundeberg, M Spens, K-E Öhman, T
KTH, SWEDEN

Fr1Q10     Volume 7, p. 3051
Predicting language scores from speech perception scores of hearing impaired children
Martin, L Bench, J
La Trobe University, AUSTRALIA

Fr1Q11     Volume 7, p. 3055
Context-Independent Duration Model on Categories of Voice and Unvoice Segments
Skljarov, O P
Research Institute of Ear Throat Nose & Speech, RUSSIA

Fr1Q12     Volume 7, p. 3059
Dynamical spectrogram, an aid for the deaf
Soltani Farani, A A Chilton, E H S Shirley, R
University of Surrey, UNITED KINGDOM

Fr1Q13     Volume 7, p. 3063
Evidence of dual-route phonetic encoding from apraxia of speech: Implications for phonetic encoding models
Varley, R A Whiteside, S P
University of Sheffield, UNITED KINGDOM

Fr1Q14     Volume 7, p. 3067
Speech Communication Profiles Across the Adult Lifespan: Persons Without Self-Identified Hearing Impairment
Cheesman, M F Smilsky, K L Major, T M Lewis, F Boorman, L M
University of Western Ontario, CANADA

Fr1R - Human Speech Production

Fr1R1     Volume 7, p. 3071
Time as a factor in the acoustic variation of schwa
Barry, W J
University of the Saarland, GERMANY

Fr1R2     Volume 7, p. 3075
On the Structure of Vowel Space: A Genealogy of General Phonetic Concepts
Boshoff, H F V Botha, E C
University of Pretoria, SOUTH AFRICA

Fr1R3     Volume 7, p. 3079
The relationship between intensity and subglottal pressure with controlled pitch
Lecuit, V Demolin, D
Universite Libre De Bruxelles, BELGIUM

Fr1R4     Volume 7, p. 3083
Segmentation of the airway from the surrounding tissues on Magnetic Resonance Images: A comparative study
Soquet, A Lecuit, V Metens, T Nazarian, B Demolin, D
Universite Libre De Bruxelles, BELGIUM

Fr1R5     Volume 7, p. 3087
Recovering Vocal Tract Shapes from MFCC Parameters
Dusan, S Deng, L
University of Waterloo, CANADA

Fr1R6     Volume 7, p. 3091
Quantification of Pharyngeal Articulations Using Measurements from Laryngoscopic Images
Esling, J Clayards, J Edmondson, J Fuyuan, Q Harris, J
University of Victoria, CANADA

Fr1R7     Volume 7, p. 3095
Variance and Invariance in Speech Rate as a Reflection of Conceptual Planning
Fon, Y-J
The Ohio State University, USA

Fr1R8     Volume 7, p. 3099
Correspondence between the Glottal Gesture Overlap Pattern and Vowel Devoicing in Japanese
Fujimoto, M Murano, E Niimi, S Kiritani, S
University of Tokyo, JAPAN

Fr1R9     Volume 7, p. 3103
Evaluation of Japanese manners of generating word accent of English based on a stressed syllable detection technique
Fujisawa, Y Minematsu, N Nakagawa, S
Toyohashi University of Technology, JAPAN

Fr1R10     Volume 7, p. 3107
Independence of consonantal voicing and vocoid F0 perturbation in English and Japanese
Ishihara, S
The Australian National University, AUSTRALIA

Fr1R11     Volume 7, p. 3111
Reduction of English Function Words in Switchboard
Jurafsky, D Bell, A Fosler-Lussier, E Girand, C Raymond, W
University of Colorado, USA

Fr1R12     Volume 7, p. 3115
Duration Compensation in Non-Adjacent Consonant and Temporal Regularity
Kim, H-S
Stanford University, USA

Fr1R13     Volume 7, p. 3119
Relationship Between Lip Shapes and Acoustical Characteristics During Speech
Mori, K Sonoda, Y
Kyushu Kyoritsu University, JAPAN

Fr1R14     Volume 7, p. 3123
A model to represent propagation and radiation of higher-order modes for 3-D vocal-tract configuration
Motoki, K Matsuzaki, H
Hokkai-Gakuen University, JAPAN

Fr1R15     Volume 7, p. 3127
FEM Analysis of aspirated air flow in three-dimensional vocal tract during fricative consonant phonation
Niikawa,T Matsumura,M Tachimura,T Wada,T
Osaka Electro-Communication University, JAPAN

Fr1R16     Volume 7, p. 3131
Trajectory formation of articulatory movements for a given sequence of phonemes
Okadome, T Kaburagi, T Honda, M
NTT Basic Research Laboratories, JAPAN

Fr1R17     Volume 7, p. 3135
Contextual Effects on Voicing Profiles of German and Mandarin Consonants
Shih, C Möbius, B
Bell Labs - Lucent Technologies, USA

Fr1R18     Volume 7, p. 3139
Reconstructing the tongue surface from six cross-sectional contours: Ultrasound data
Lundberg, A Stone, M
Johns Hopkins University, USA

Fr1R19     Volume 7, p. 3143
Articulability of two consecutive morae in Japanese speech production: Evidence from sound exchange errors in spontaneous speech
Terao, Y Murata, T
Tokoha Gakuen College, JAPAN

Fr1R20     Volume 7, p. 3147
Coarticulation and degrees of freedom in the elaboration of a new articulatory plant: Gentiane
Vilain, A Abry, C Badin, P
ICP - Stendhal, FRANCE

Fr1R21     Volume 7, p. 3151
A pressure sensitive palatography: Application of new pressure sensitive sheet for measuring tongue-palatal contact pressure
Wakumoto, M Masaki, S Honda, K Ohue, T
ATR, JAPAN

Fr1R22     Volume 7, p. 3155
Dual-route phonetic encoding: Some acoustic evidence
Whiteside, S P Varley, R A
University of Sheffield, UNITED KINGDOM

Fr1R23     Volume 7, p. 3159
Fast and Slow Speech Rate: A Characterisation for French
Zellner, B
University of Lausanne, SWITZERLAND

Fr2A - Segmentation, Labelling and Speech Corpora 4

Fr2A1     Volume 7, p. 3163
A multilingual prosodic database
Campione, E Véronis, J
Université De Provence, FRANCE

Fr2A2     Volume 7, p. 3167
The CSLU Speaker Recognition Corpus
Cole, R Noel, M Noel, V
Oregon Graduate Institute, USA

Fr2A3     Volume 7, p. 3171
How effective is unsupervised data collection for children's speech recognition?
Aist, G Chan, P Huang, X D Jiang, L Kennedy, R Latimer, D Mostow, J Yueng, C
Carnegie Mellon University, USA

Fr2A4     Volume 7, p. 3175
An Algorithm for Automatic Generation of Mandarin Phonetic Balanced Corpus
Shyuu, J-S Wang, J-F
Department of Computer Science and Information Engineering, TAIWAN R.O.C.

Fr2A5     Volume 7, p. 3179
Towards a formal framework for linguistic annotations
Bird, S Liberman, M
University of Pennsylvania, USA

Fr2A6     Volume 7, p. 3181
Forming Generic Models of Speech for Uniform Database Access
Altosaar, T Vainio, M
Helsinki University of Technology, FINLAND

Fr2B - Speaker and Language Recognition 4

Fr2B1     Volume 7, p. 3185
On the Convergence of Gaussian Mixture Models: Improvements through Vector Quantization
Moody, J Slomka, S Pelecanos, J Sridharan, S
Queensland University of Technology, AUSTRALIA

Fr2B2     Volume 7, p. 3189
Modeling Dynamic Prosodic Variation for Speaker Verification
Sonmez, M K Shriberg, E Heck, L Weintraub, M
SRI International, USA

Fr2B3     Volume 7, p. 3193
Blind Clustering of Speech Utterances Based on Speaker and Language Characteristics
Reynolds, D Singer, E Carlson, B O'Leary, G McLaughlin, J Zissman, M
MIT Lincoln Laboratory, USA

Fr2B4     Volume 7, p. 3197
Spoken Language Identification Using the Speechdat Corpus
Antonio Caseiro, D Martins Trancoso, I
INESC, PORTUGAL

Fr2B5     Volume 7, p. 3201
Automatic Language Identification with Perceptually Guided Training and Recurrent Neural Networks
Braun, J Levkowitz, H
University of Massachusetts Lowell, USA

Fr2B6     Volume 7, p. 3205
On the Importance of Components of the Modulation Spectrum for Speaker Verification
Van Vuuren, S Hermansky, H
Oregon Graduate Institute, USA

Fr2C - Speech Technology Applications and Human- Machine Interface 3

Fr2C1     Volume 7, p. 3209
Is Speech the Right Thing for your Application?
Bernsen, N O Dybkjaer, L
Centre for Cognitive Science, Odense University, DENMARK

Fr2C2     Volume 7, p. 3213
A PC-Based Tool for Helping in Diagnosis of Pathologic Voice
Godino Llorente, J I Aguilera Navarro, S Palazeulos, S Nieto Altuzarra, A Gómez Vilda, P
ETSI Telecomunicación, SPAIN

Fr2C3     Volume 7, p. 3217
Web-based educational tools for speech technology
Sjölander, K Beskow, J Gustafson, J Levin, E Carlson, R Granström, B
KTH, SWEDEN

Fr2C4     Volume 7, p. 3221
Universal Speech Tools: The CSLU Toolkit
Sutton, S Cole, R de Villiers, J Schalkwyk, J Vermeulen, P Macon, M
Yan, Y-H Kaiser, E Rundle, B Shobaki, K Hosom, P Kain, A Wouters, J
Massaro, D Cohen, M
Oregon Graduate Institute, USA

Fr2C5     Volume 7, p. 3225
Creating a Mexican Spanish Version of the CSLU Toolkit
Serridge, B Barbosa, A Cole, R Munive, N Vargas, A
Universidad De Las Americas, MEXICO

Fr2C6     Volume 7, p. 3229
A Voice User Interface Demonstration System for Mexican Spanish
Garcia-Mateo, C Zhou, Q Lee, C H Pargellis, A
Universidad De Vigo, SPAIN

Fr2D - Utterance Verification and Word Spotting 2

Fr2D1     Volume 7, p. 3233
Context Dependent Anti Subword Modeling for Utterance Verification
Ramesh, P Lee, C-H Juang, B-H
Lucent Technologies, USA

Fr2D2     Volume 7, p. 3237
Combination of confidence measures in isolated word recognition
Dolfing, J G A Wendemuth, A
Philips Research Laboratories Aachen, GERMANY

Fr2D3     Volume 7, p. 3241
Confidence Measures for HMM-based Speech Recognition
Willett, D Worm, A Neukirchen, C Rigoll, G
Duisburg University, GERMANY

Fr2D4     Volume 7, p. 3245
Vocabulary-Independent Word Confidence Measure using Subword Features
Jiang, L Huang, X D
Microsoft Research, USA

Fr2D5     Volume 7, p. 3249
A New Confidence Measure Based on Rank-Ordering Subphone Scores
Lin, Q Das, S Lubensky, D Picheny, M
IBM T.J. Watson Research Center, USA

Fr2D6     Volume 7, p. 3253
Speaking-Style Dependent Lexicalized Filler Model for Key-Phrase Detection and Verification
Kawahara, T Ishizuka, K Doshita, S Lee, C-H
Kyoto University, JAPAN

Fr4A - Large Vocabulary Continuous Speech Recognition 6

Fr4A1     Volume 7, p. 3257
Sharable Software Repository for Japanese Large Vocabulary Continuous Speech Recognition

Kawahara, T Kobayashi, T Takeda, K Minematsu, N Itou, K Yamamoto, M Yamada, A Utsuro, T Shikano, K
Kyoto University, JAPAN

Fr4A2     Volume 7, p. 3261
The Design of the Newspaper-Based Japanese Large Vocabulary Continuous Speech Recognition corpus
Itou, K Yamamoto, M Takeda, K Takezawa, T Matsuoka, T Shikano, K Kobayashi, T Itahashi, S
Electrotechnical Laboratory, JAPAN

Fr4A3     Volume 7, p. 3265
Indexing and Classification of TV News Articles Based on Speech Dictation Using Word Bigram
Ogata, J Ariki, Y
Ryukoku University, JAPAN

Fr4A4     Volume 7, p. 3269
Parametric Trajectory Mixtures for LVCSR
Siu, M Iyer, R Gish, H Quillen, C
GTE/BBN Technologies, USA

Fr4B - Neural Networks, Fuzzy and Evolutionary Methods 3

Fr4B1     Volume 7, p. 3273
Fuzzy-Integration Based Normalisation for Speaker Verification
Pham, T D Wagner, M
University of Canberra, AUSTRALIA

Fr4B2     Volume 7, p. 3277
Improving the generalization performance of the MCE/GPD learning
Shimodaira, H Rokui, J Nakai, M
Japan Advanced Institute of Science and Technology, JAPAN

Fr4B3     Volume 7, p. 3281
Acoustic Speech Recognition Model by Neural Net Equation with Competition and Cooperation
Kitazoe, T Ichiki, T Kim, S-I
Miyazaki University, JAPAN

Fr4B4     Volume 7, p. 3285
Improved Surname Pronunciations Using Decision Trees
Ngan, J Ganapathiraju, A Picone, J
Mississippi State University, USA

Fr4C - Speech Processing for the Speech-Impaired and Hearing-Impaired 2

Fr4C1     Volume 7, p. 3289
A Speechreading Aid Based on Phonetic ASR
Duchnowski, P Braida, L Bratakos, M Lum, D Sexton, M Krause, J
Massachusetts Institute of Technology, USA

Fr4C2     Volume 7, p. 3293
Training Speech through Visual Feedback Patterns
Nouza, J
Technical University of Liberec, CZECH REPUBLIC

Fr4C3     Volume 7, p. 3297
Word sequence pair spotting for synchronization of speech and text in production of closed-caption TV programs for the hearing impaired
Maruyama, I Abe, Y Sawamura, E Wakao, T Ehara, T Shirai, K
Telecommunications Advancement Organization of Japan, JAPAN

Fr4C4     Volume 7, p. 3301
    Volume Regulation in Parkinsonian Speech
Ho, A K Bradshaw, J L Iansek, R Alfredson, R J
Monash University, AUSTRALIA

Fr4D - Prosody and Emotion 7

Fr4D1     Volume 7, p. 3305
On the amount and domain of focal lengthening in Swedish
Strangert, E Heldner, M
Department of Phonetics, Umeå University, SWEDEN

Fr4D2     Volume 7, p. 3309
Differential lengthening of syllabic constituents in French: the effect of accent type and speaking style
Hirst, D J Astesano, C Di Cristo, A
CNRS, Université De Provence, FRANCE

Fr4D3     Volume 7, p. 3313
Prosodic Analysis of Fillers and Self-Repair in Japanese Speech
Quimbo, F Kawahara, T Doshita, S
Kyoto University, JAPAN

Fr4D4     Volume 7, p. 3317
A synthesis-oriented model of phrasal pitch movements in standard Chinese
Ni, J Kawai, G Hirose, K
University of Tokyo, JAPAN

Fr5A - Plenary Session 2

Fr5A1     Volume 7, pp. 3321-3330    (also in Volume 2, pp. 7-16)
The Use of Linguistic Hierarchies in Speech Understanding
Seneff, S
MIT, USA