ICSLP'98 Final Program

Please note: In the following program the listed affiliation for each paper refers only to the first author of that paper and is not necessarily the affiliation of any other authors.

Tuesday 1 December, 1998


Opening Ceremony

Time & Place: 0930 - 1000, Antechinus Room
Chair: Bruce Millar


Plenary Session
Tu2A - Keynote Speech 1

Title: Cochlear Implants in the Second and Third Millennia
Speaker: Graeme Clark
University of Melbourne, AUSTRALIA
Time & Place: 1000 - 1100, Antechinus Room
Chair: Robert Dale


Tu3A - Text-To-Speech Synthesis 1

Time & Place: 1130 - 1230, Antechinus Room
Chair: Shuichi Itahashi

1130
Tu3A1

. Unsupervised training of phone duration and energy models for text-to-speech synthesis
Bagshaw, P C
France Telecom, CNET, FRANCE

1145
Tu3A2

Improved Duration Modeling of English Phonemes Using a Root Sinusoidal Transformation
Bellegarda, J Silverman, K
Apple Computer, USA

1200
Tu3A3

Efficient Adaptation of TTS Duration Model to New Speakers
Shih, C Gu, W van Santen, J P H
Bell Labs - Lucent Technologies, USA

1215
Tu3A4

Duration Modeling for HMM-Based Speech Synthesis
Yoshimura, T Tokuda, K Masuko, T Kobayashi, T Kitamura, T
Nagoya Institute of Technology, JAPAN


Tu3B - Spoken Language Models and Dialog 1

Time & Place: 1130 - 1230, Bandicoot Room
Chair: Greg Aist

1130
Tu3B1

An educational dialogue system with a user controllable dialogue manager
Gustafson, J Elmberg, P Carlson, R Jönsson, A
Center for Speech Technology, TMH-KTH, SWEDEN

1145
Tu3B2

End-user driven dialogue system design: The REWARD experience
Failenschmid, K Thornton, J H S
Vocalis Ltd, UNITED KINGDOM

1200
Tu3B3

The Design of a Multi-domain Mandarin Chinese Spoken Dialogue System
Lin, Y-C Chiang, T-H Wang, H-M Peng, C-M Chang, C-H
Industrial Technology Research Institute, TAIWAN R.O.C.

1215
Tu3B4

An Integrated Dialogue System for the Automation of Call Centre Services
Georgila, K Tsopanoglou, A Fakotakis, N Kokkinakis, G
University of Patras, GREECE


Tu3C - Prosody and Emotion 1

Time & Place: 1130 - 1230, Currawong Room
Chair: Phil Rose

1130
Tu3C1

Tones of a Tridialectal: Acoustic and Perceptual data on Ten Linguistic Tonetic contrasts between Lao, Nyo and Standard Thai
Rose, P
Australian National University, AUSTRALIA

1145
Tu3C2

Tone Sandhi between Complex Tones in a Seven-Tone Southern Thai Dialect
Thompson, N G I
Australian National University, AUSTRALIA

1200
Tu3C3

The Acoustic and Perceptual Features of Tone in the Tibeto-Burman Language Ao Naga
Coupe, A R
The Australian National University, AUSTRALIA

1215
Tu3C4

The Differential Status of Semivowels in the Acoustic Phonetic Realisation of Tone
Rose, P
Australian National University, AUSTRALIA


Tu3D - Hidden Markov Model Techniques 1

Time & Place: 1130 - 1230, Dingo Room
Chair: Roberto Togneri

1130
Tu3D1

Nonreciprocal Data Sharing in Estimating HMM parameters
Luo, X Jelinek, F
Johns Hopkins University, USA
1145

Tu3D2
Data-Driven Extensions to HMM Statistical Dependencies
Bilmes, J A
U.C. Berkeley / ICSI, USA

1200
Tu3D3

Use of High-Level Linguistic Constraints for constructing Feature Based Phonological Model in Speech Recognition
Sun, J Deng, L
University of Waterloo, CANADA


Tu3P - Speaker and Language Recognition 1

Time & Place: 1130 - 1430, Platypus Room
Chair: Kay Berkling

Tu3P1
Sub-Band Based Speaker Verification Using Dynamic Recombination Weights
Sivakumaran, P Ariyaeeinia, A M Hewitt, J
University of Hertfordshire, UNITED KINGDOM

Tu3P2
Measuring the Dynamic Encoding of Speaker Identity and Dialect in Prosodic Parameters
Barlow, M Wagner, M
Australian Defence Force Academy, AUSTRALIA

Tu3P3
German Regional Variants - A Problem for Automatic Speech Recognition?
Beringer, N Schiel, F Regel-Brietzmann, P
University of Munich, GERMANY

Tu3P4
Improving Accent Identification through Knowledge of English Syllable Structure
Berkling, K Vonwiller, J Cleirigh, C Zissman, M
MIT Lincoln Laboratory, USA

Tu3P5
Multi-dimensional Scaling of Listener Responses to Complex Auditory stimuli
Bond, Z S Fucci, D Stockmal, V McColl, D
Ohio University, USA

Tu3P6
Same Talker, Different Language
Stockmal, V Moates, D Bond, Z
Ohio University, USA

Tu3P7
The Impact of Regional Variety upon Specific Word Categories in Spontaneous German
Burger, S Oppermann, D
University of Munich, GERMANY

Tu3P8
Speech pre-processing against intentional imposture in speaker recognition
Genoud, D Chollet, G
IDIAP, SWITZERLAND

Tu3P9
A Comparison of Two Unsupervised Approaches to Accent Identification
Lincoln, M Cox, S Ringland, S
University of East Anglia, UNITED KINGDOM

Tu3P10
The Influence of Accents in Australian English Vowels and their Relation to Articulatory Tract Parameters
Dersch, D Cleirigh, C Vonwiller, J Sydney University /
APPEN Speech Technology, AUSTRALIA

Tu3P11
Automatic language recognition using high-order HMMs
du Preez, J A Weber, D M
University of Stellenbosch, SOUTH AFRICA

Tu3P12
Speaker Recognition Using Residual Signal of Linear and Nonlinear Prediction Models
Faundez-Zanuy, M Rodriguez-Porcheron, D
Escola Universitaria Politecnica De Mataro, SPAIN

Tu3P13
An Implementation and Evaluation of an On-line speaker Verification System for Field Trials
Gu, Y Thomas, T
Vocalis Ltd, UNITED KINGDOM

Tu3P14
Speaker Verification on the POLYCOST database using frequency filtered spectral energies
Hernando, J Nadeu, C
Polytechnical University of Catalonia, SPAIN

Tu3P15
A High-Performance Text-Independent Speaker Identification System Based On BCDM
Jin, Q Si, L Hu, Q
Tsinghua University, P.R. CHINA

Tu3P16
Representation of Voice Quality Features Associated with Talker Individuality
Kido, H Kasuya, H
Utsunomiya University, JAPAN

Tu3P17
Candidate Selection Based on Significance Testing and its Use in Normalisation and Scoring
Kim, J Jang, G Yun, S Oh, Y
Advanced Institute of Science and Technology, KOREA

Tu3P18
Japanese Forensic Phonetics:Non-contemporaneous within-speaker variation in Natural and Read-out Speech
Kinoshita, Y
The Australian National University, AUSTRALIA

Tu3P19
Statistical Modeling of Pronunciation and Production Variations for Speech Recognition
Korkmazskiy, F E Juang B-H
Lucent Technologies Bell Laboratories, USA

Tu3P20
Dialect Maps and Dialect Research; Useful Tools for Automatic Speech Recognition?
Foldvik, A K Kvale, K
Telenor R&D, NORWAY

Tu3P21
Text Independent Speaker Recognition Using Micro-Prosody
Kyung, Y J Lee, H S
Korea Advanced Institute of Science and Technology, KOREA

Tu3P22
Speaker Verification Using Fundamental Frequency
Cheng, Y Leung, H C
Chinese University of Hong Kong, HONG KONG

Tu3P23
On Optimum Normalization Method Used for Speaker Verification
Liu, W Isobe, T Mukawa, N
NTT Data Corporation, JAPAN

Tu3P24
Recurrent Substrings and Data Fusion for Language Recognition
Lloyd-Thomas, H Parris, E S Wright, J H
Ensigma Ltd, UNITED KINGDOM

Tu3P25
Text-Independent Speaker Recognition using Multiple Information Sources
Markov, K Nakagawa, S
Toyohashi University of Technology, JAPAN

Tu3P26
Discriminative Training of GMM using a Modified EM Algorithm for Speaker Recognition
Markov, K Nakagawa, S
Toyohashi University of Technology, JAPAN

Tu3P27
Language Identification Incorporating Lexical Information
Matrouf, D Adda-Decker, M Lamel, L Gauvain, J-L
LIMSI-CNRS, FRANCE

Tu3P28
A VQ based Speaker Recognition System Based in Histogram Distances. Text Independent and for Noisy Environments
Monte, E Arqué, R Miró, X
Polytechnical University of Catalonia, SPAIN

Tu3P29
Spanish Dialects: Phonetic Transcription
Moreno, A Marino, J B
Polytechnical University of Catalonia, SPAIN

Tu3P30
Acoustic analysis of Japanese English prosody: Comparison between Fukushima dialect speakers and Tokyo dialect speakers in declarative sentences and yes-no questions
Muramatsu, M
Fukushima Medical University, JAPAN

Tu3P31
A Context-Dependent Approach for Speaker Verification Using Sequential Decision
Noda, H Harada, K Kawaguchi, E Sawai, H
Kyushu Institute of Technology, JAPAN

Tu3P32
Quantitative Influence of Speech Variability Factors for Automatic Speaker Verification in Forensic Tasks
Ortega-Garcia, J Cruz-Llanas, S Gonzalez-Rodriguez, J
Universidad Politecnica Madrid, SPAIN

Tu3P33
Creating Hidden Markov Models for Fast Speech
Pfau, T Ruske, G
Technical University of Munich, GERMANY

Tu3P34
Speaker Identification using Relaxation Labeling
Pham, T D Wagner, M
University of Canberra, AUSTRALIA

Tu3P35
A Novel Technique for the Combination of Utterance and Speaker Verification Systems in a Text-dependent Speaker Verification Task
Rodriguez-Linares, L Garcia-Mateo, C
University of Vigo, SPAIN

Tu3P36
A Forensic Phonetic Investigation into Non-contemporaneous Variation in the F-pattern of Similar-sounding Speakers
Rose, P
Australian National University, AUSTRALIA

Tu3P37
Human vs. Machine Speaker Identification with Telephone Speech
Schmidt-Nielsen, A Crystal, T H
U.S. Naval Research Laboratory, USA

Tu3P38
A Comparison of Fusion Techniques in Mel-Cepstral based Speaker Identification
Slomka, S Sridharan, S Chandran, V
Queensland University of Technology, AUSTRALIA

Tu3P39
On the Influence of Hyperarticulated Speech on Recognition Performance
Soltau, H Waibel, A
Carnegie Mellon University, USA

Tu3P40
Text-independent speaker identification and verification using the TIMIT Database
Ward, N C Dersch, D R
Alcatel Australia, AUSTRALIA

Tu3P41
Incorporating Linguistic Knowledge into Automatic Dialect Identification of Spanish
Yanguas, L O'Leary, G Zissman, M
Mit Lincoln Laboratory, USA

Tu3P42
A Novel Text-Independent Speaker Verification Method Using the Global Speaker Model
Zhang, Y Zhu, X-Y
Tsinghua University, P.R. CHINA


Tu3Q - Multimodal Spoken Language Processing 1

Time & Place: 1130 - 1430, Quokka Room
Chair: Sharon Oviatt

Tu3Q1
A fast method of producing talking head mouth shapes from real speech
Breen, A P Gloaguen, O Stern, P
BT Labs, UNITED KINGDOM

Tu3Q2
The Efficiency of Multimodal Interaction: A Case Study
Cohen, P R Johnston, M McGee, D Oviatt, S L Clow, J Smith, I
Oregon Graduate Institute, USA

Tu3Q3
Audio and Audio-visual Perception of Consonants Disturbed by White Noise and 'Cocktail Party'
Czap, L
University of Miskolc, HUNGARY

Tu3Q4
Overview of the Maya Spoken Language System
Downey, S Breen, A P Fernandez, M Kaneen, E
BT Labs, UNITED KINGDOM

Tu3Q5
Automatic Recognition of Spontaneous Speech Dialogues
Cettolo, M Falavigna, D
ITC-IRST, ITALY

Tu3Q6
Using an Animated Talking Character in a web-based City Guide Demonstrator
Fries, G Feldes, S Corbet, A
Deutsche Telekom Berkom, GERMANY

Tu3Q7
Influence of facial views on the McGurk effect in auditory noise
Kanzaki, R Kato, T
ATR Information Processing Research Laboratories, JAPAN

Tu3Q8
The Intellimedia WorkBench - a generic environment for multimodal systems
Brondsted, T Larsen, L B Manthey, M Moeslund, T B McKevitt, P Olesen, K G
Aalborg University, DENMARK

Tu3Q9
Stamp: A Suite of Tools for Analyzing Multimodal System Processing
Clow, J Oviatt, S L
Oregon Graduate Institute of Science & Technology, USA

Tu3Q10
Cultural similarities and differences in the recognition of audio-visual speech stimuli
Shigeno, S
Kitasato University, JAPAN

Tu3Q11
A Multimodal-Input Multimedia-Output Guidance System: MMGS
Takezawa, T Morimoto, T
ATR Interpreting Telecommunications Research Laboratories, JAPAN

Tu3Q12
HMM-based Visual Speech Recognition using Intensity and Location Normalization
Vanegas, O Tanaka, A Tokuda, K Kitamura, T
Nagoya Institute of Technology, JAPAN

Tu3Q13
A Hierarchy Probability-based Visual Features Extraction Method for Speechreading
Xu, Y Du, L Li, G Hou, Z
Chinese Academy of Science, P.R. CHINA

Tu3Q14
Integration of Talking Heads and Text-to-Speech Synthesizers for Visual TTS
Ostermann, J Beutnagel, M Fischer, A Wang, Y
AT&T Labs Research


Tu3R - Isolated Word Recognition

Time & Place: 1130 - 1430, Rosella Room
Chair: Li Deng

Tu3R1
Improving Accuracy of telephony-based, speaker-independent speech recognition
Azzopardi D Semnani S Milner B Wiseman R
BT Labs, UNITED KINGDOM

Tu3R2
Rejection in Speech Recognition Systems with Limited Training
Bayya, A
Rockwell Semiconductor Systems, USA

Tu3R3
A Four Layer Sharing HMM System for Very Large Vocabulary Isolated Word Recognition
Chen, R Tanaka, M Wu, D Olorenshaw, L Amador, M
SONY Research Labs, USA

Tu3R4
A Comparative Study of Hybrid Modelling Techniques for Improved Telephone Speech Recognition
Chengalvarayan, R
Lucent Technologies, USA

Tu3R5
Smoothing and Tying for Korean Flexible Vocabulary Isolated Word Recognition
Choi, J-S Lee, J-S Lee, H-Y
LG Corporate Institute of Technology, KOREA

Tu3R6
Recent Work on a Preselection Module for a Flexible Large Vocabulary Speech Recognition System in Telephone Environment
Ferreiros, J Macias-Guarasa, J Gallardo, A Colas, J Cordoba, R Pardo, J Villarrubia, L
GTH-IEL-UPM, SPAIN

Tu3R7
A Study of Noise Robustness for Speaker Independent Speech Recognition Method Using Phoneme Similarity Vector
Hoshimi, M Yamada, M Niyada, K Makino, S
Matsushita Research Institute Tokyo, Inc., JAPAN

Tu3R8
Classification of Taiwanese tones based on pitch and energy movements
Jian, F
University of Reading, UNITED KINGDOM

Tu3R9
Phoneme-based recognition for the Norwegian SpeechDat(II) database
Johansen, F T
Telenor R&D, NORWAY

Tu3R10
Robust Feature Extraction for Alphabet Recognition
Karnjanadecha, M Zahorian, S A
Old Dominion University, USA

Tu3R11
Recognition of Connected Digit Speech in Japanese Collected over the Telephone Network
Kawai, H Higuchi, N
KDD R&D Laboratories Inc., JAPAN

Tu3R12
Improving the Speaker-Dependency of Subword-Unit-Based Isolated Word Recognition
Koizumi, T Taniguchi, S Kohtoh, K
Fukui University, JAPAN

Tu3R13
Speaker Independent Speech Recognition Method using Constrained Time Alignment near Phoneme Discriminative Frame
Konuma, T Suzuki, T Yamada, M Ono, Y Hoshimi, M Niyada, K
Matsushita Research Institute Tokyo, Inc, JAPAN

Tu3R14
A Nonatationary Autoregressive HMM with Gain Adaptation for Speech Recognition
Lee, K Y Lee, J
Soongsil University, KOREA

Tu3R15
A Large-Vocabulary Taiwanese (Min-nan) Multi-syllabic Word Recognition System Based upon Right-Context-Dependent Phones with State Clustering by Acoustic Decision Tree
Lyu, R-Y Chiang, Y-C Hsieh, W-P
Chang Gung University, TAIWAN R.O.C.

Tu3R16
Speech Recognition Based on the Distance Calculation Between Intermediate Phonetic Code Sequences in Symbolic Domain
Tanaka, K Kojima, H
Electrotechnical Laboratory, JAPAN

Tu3R17
High Accuracy Chinese Speech Recognition Approach with Chinese Input Technology for Telecommunication Use
Yang, Y C-H Kuo, J-J
Matsushita Institute of Technology, Taipei, TAIWAN R.O.C.


Tu4A - Robust Speech Processing in Adverse Environments 1

Time & Place: 1330 - 1500, Antechinus Room
Chair: Sridha Sridharan

1330
Tu4A1

Robust Speech Recognition using HMM's With Toeplitz State Covariance Matrices
Roberts, W Ephraim, Y
Defence Science & Technology Organisation, AUSTRALIA

1345
Tu4A2

Modeling of Output Probability Distributions to improve Small Vocabulary Speech Recognition in Adverse Environments
Thambiratnam, D Sridharan, S
Queensland University of Technology, AUSTRALIA

1400
Tu4A3

Robust and compact multilingual word recognizers using features extracted from a phoneme similarity front-end
Morin, P Applebaum, T H Boman, R Zhao, Y Junqua, J-C
Panasonic Technologies Inc., USA

1415
Tu4A4

An Effect of Adaptive Beamforming on Hands-free Speech Recognition Based on 3-D Viterbi Search
Yamada, T Nakamura, S Shikano, K
Nara Institute of Science and Technology, JAPAN

1430
Tu4A5

Coherence-based Subband Decomposition for Robust Speech and Speaker Recognition in Noisy and Reverberant Rooms
Gonzalez-Rodriguez, J Cruz-Llanas, S Ortega-Garcia, J
Universidad Politecnica Madrid, SPAIN

1445
Tu4A6

A Minimax Search Algorithm for CDHMM based Robust Continuous Speech Recognition
Jiang, H Hirose, K Huo, Q
The University of Tokyo, JAPAN


Tu4B - Spoken Language Models and Dialog 2

Time & Place: 1330 - 1500, Bandicoot Room
Chair: Rolf Carlson

1330
Tu4B1

An event driven model for dialogue systems
Wang, K
Microsoft Research, USA

1345
Tu4B2

Automatic Classification of Dialogue Contexts for Dialogue Predictions
Popovici, C Baggia, P Laface, P Moisa, L
Politecnico Di Torino, ITALY

1400
Tu4B3

Automatic Identification of Command Boundaries in a Conversational Natural Language User Interface
Ramaswamy, G N Kleindienst, J
I.B.M. Research Center, USA

1415
Tu4B4

The Predictive Power of Game Structure in Dialogue Act Recognition: Experimental Results Using Maximum Entropy Estimation
Poesio, M Mikheev, A
University of Edinburgh, HCRC, UNITED KINGDOM

1430
Tu4B5

A schema based approach to dialog control
Constantinides, P Hansma, S Tchou, C Rudnicky, A
Carnegie Mellon University, USA

1445
Tu4B6

Expanding a time-sensitive conversational architecture for turn-taking to handle content-driven interruption
Aist, G
Carnegie Mellon University, USA


Tu4C - Articulatory Modelling 1

Time & Place: 1330 - 1500, Currawong Room
Chair: Alain Marchal

1330
Tu4C1

A Three-Dimensional Linear Articulatory Model Based on MRI Data
Badin, P Bailly, G Raybaudi, M Segebarth, C
Institut Communication Parlee, FRANCE

1345
Tu4C2

On Loops and Articulatory Biomechanics
Perrier, P Payan, Y Perkell, J Jolly, F Zandipour, M Matthies, M
Institut De La Communication Parlée - INPG, FRANCE

1400
Tu4C3

Magnetic Resonance Measurements of the Velum port opening
Demolin, D Lecuit, V Metens, T Nazarian, B Soquet, A
Universite Libre De Bruxelles, BELGIUM

1415
Tu4C4

Cantilever-type force-sensor-mounted palatal plate for measuring palatolingual contact stress and pattern during speech phonation.
Matsumura, M Niikawa, T Tanabe, T Tachimura, T Wada, T
Osaka Electro-Communication University, JAPAN

1430
Tu4C5

Determination of the vocal tract spectrum from the articulatory movements based on the search of an articulatory-acoustic database
Kaburagi, T Honda, M
NTT Basic Research Laboratories, JAPAN

1445
Tu4C6

An MRI study on the relationship between oral cavity shape and larynx position
Honda, K Tiede, M
ATR Human Information Processing Research Laboratories, JAPAN


Tu4D - Talking To Infants, Pets and Lovers

Time & Place: 1330 - 1500, Dingo Room
Chair: Cecile Pereira

1330
Tu4D1

Acoustic and Affective Qualities of IDS in English
Kitamura, C Burnham, D
University of New South Wales, AUSTRALIA

1345
Tu4D2

Acoustic Qualities of IDS and ADS in Thai
Thanavisuth, C Luksaneeyanawin, S
Chulalongkorn University, THAILAND

1400
Tu4D3

Pragmatic Characteristics of Infant Directed Speech
Luksaneeyanawin, S Thanavisuth, C Sittigasorn, S Rukkarangsarit, O
Chulalongkorn University, THAILAND

1415
Tu4D4

Are You My Little Pussy-Cat? Acoustic, Phonetic and Affective Qualities of Infant- and Pet-Directed Speech
Burnham, D Francis, E Vollmer-Conna, U Kitamura, C Averkiou, V Olley,
A Nguyen, M Paterson, C
University of New South Wales, AUSTRALIA

1430
Tu4D5

Special Speech Registers: Talking To Australian and Thai Infants, and To Pets.
Burnham, D
University of New South Wales, AUSTRALIA


Tu5A - Robust Speech Processing in Adverse Environments 2

Time & Place: 1530 - 1700, Antechinus Room
Chair: Phil Woodland

1530
Tu5A1

Performance improvements through combining phone- and syllable-scale information in automatic speech recognition
Wu, S-L Kingsbury, B E D Morgan, N Greenberg, S
Nuance Communications, USA

1545
Tu5A2

Predictive Adaptation and Compensation for Robust Speech Recognition
Surendran, A C Lee, C-H
Bell Labs, Lucent Technologies, USA

1600
Tu5A3

Influence of the speaking style and the noise spectral tilt on the Lombard reflex and automatic speech recognition
Junqua, J-C Fincke, S Field, K
Speech Technology Laboratory, USA

1615
Tu5A4

Data-Driven PMC and Bayesian Learning Integration for Fast Model Adaptation in Noisy Conditions
Crafa, S Fissore, L Vair, C
CSELT, ITALY

1630
Tu5A5

Improving the noise and spectral robustness of an isolated-word recognizer using an auditory-model front end
Hunke, M Hyun, M Love, S Holton, T
San Francisco State University, USA

1645
Tu5A6

A Model for Speech Reverberation and Intelligibility Restoring Filters
Kenny, O P Nelson, D J
USA


Tu5B - Spoken Language Models and Dialog 3

Time & Place: 1530 - 1700, Bandicoot Room
Chair: Michael Barlow

1530
Tu5B1

On Different Functions of Repetitive Utterances
Swerts, M Koiso, H Shimojima, A Katagiri, Y
IPO, Center for Research On User-System Interaction, THE NETHERLANDS

1545
Tu5B2

Prosody based detection of the context of backchannel responses
Noguchi, H Den, Y
NAIST, JAPAN

1600
Tu5B3

Robust Interpretation for Spoken Dialogue Systems
Strömbäck, L Jonsson, A
Linkoping University, SWEDEN

1615
Tu5B4

System-User Interaction and Response Strategy in Spoken Dialog System
Okato, Y Kato, K Yamamoto, M Itahashi, S
University of Tsukuba, JAPAN

1630
Tu5B5

Organizing Self-Motivated Dialogue with Autonomous Creatures
Suzuki, N Ishii, K Okada, M
ATR Media Integration & Communications Research Labs, JAPAN  

1645
Tu5B6

Fly with the EAGLES: Evaluation of the ACCeSS Spoken Language Dialogue System
Hanrieder, G Heisterkamp, P Brey, T
Daimler-Benz AG, GERMANY


Tu5C - Speech Coding 1

Time & Place: 1530 - 1700, Currawong Room
Chair: Ian Burnett

1530
Tu5C1

A very low bit rate speech coder using HMM with speaker adaptation
Masuko, T Tokuda, K Kobayashi, T
Tokyo Institute of Technology, JAPAN

1545
Tu5C2

ITU-T G.729 extension at 6.4 kbps
Ekudden, E Hagen, R Johansson, B Hayashi, S Kataoka, A Kurihara, S
Ericsson Radio Systems AB, SWEDEN

1600
Tu5C3

Adaptive Transformation for Segmented Parametric Speech Coding
Mudugamuwa D J Bradley A B
RMIT, AUSTRALIA

1615
Tu5C4

Speech enhancement using STC-based bandwidth extension
Epps, J Holmes, W H
University of NSW, AUSTRALIA

1630
Tu5C5

Performance and Optimization of the SEEVOC Algorithm
Zhang, W Holmes, WH
The University of New South Wales, AUSTRALIA


Tu5D - Articulatory Modelling 2

Time & Place: 1530 - 1700, Dingo Room
Chair: Frantz Clermont

1530
Tu5D1

Acoustic-articulatory evaluation of the upper vowel-formant region and its presumed speaker-specific potency
Clermont, F Mokhtari, P
Australian Defence Force Academy, AUSTRALIA

1545
Tu5D2

Control of larynx height in vowel production
Hoole, P Kroos, C
Munich University, GERMANY

1600
Tu5D3

Analyzing the effect of secondary excitations of the vocal tract on vocal intensity in different loudness conditions
Alku, P Vintturi, J Vilkman, E
University of Turku, FINLAND

1615
Tu5D4

An Analysis of Modal Coupling Effects During the Glottal Cycle : Formant Synthesizers From Time-Domain Finite-Difference Simulations
Ramsay, G
Institut De La Communication Parlee, FRANCE

1630
Tu5D5

Laryngoscopic Analysis of Pharyngeal Articulations and Larynx-Height Voice Quality Settings
Esling, J
University of Victoria, CANADA

1645
Tu5D6

Effects of shapes of radiational aperture on radiation characteristics
Matsuzaki, H Motoki, K Miki, N
Hokkai-Gakuen University, JAPAN


Tu5P - Prosody and Emotion 2

Time & Place: 1530 - 1800, Platypus Room
Chair: David House

Tu5P1
De-accentuation: Linguistic Environments and Prosodic Realizations
Alter, K Steinhauer, K Friederici, A D
Max-Planck-Institute of Cognitive Neuroscience, GERMANY

Tu5P2
Towards an automatic classification of emotions in speech
Amir, N Ron, S
Center for Technological Education Holon, ISRAEL

Tu5P3
Can we hear smile?
Schröder, M Auberge, V Cathiard, M
ICP - ESA 5009 CNRS, FRANCE

Tu5P4
The Automatic Marking of Prominence in Spontaneous Speech Using Duration and Part of Speech Information
Aylett, M P Bull, M
University of Edinburgh, UNITED KINGDOM

Tu5P5
On a Pitch Alteration Technique in Excited Cepstral Spectrum for High
Quality TTS

Kim, J D Baek, S J Bae, M J
Soongsil University, KOREA

Tu5P6
Dovetailing of acoustics and prosody in spontaneous speech recognition
Buckow, J Batliner, A Huber, R Noeth, E Warnke, V Niemann, H
University of Erlangen, GERMANY

Tu5P7
A Computational Memory and Processing Model for Prosody
Cahn, J E
M.I.T. Media Laboratory, USA

Tu5P8
Convergence of Fundamental Frequencies in Conversation: If it Happens, Does it Matter?
Collins, B M
Australian National University, AUSTRALIA

Tu5P9
Analysis and Interpretation of Fundamental Frequency Contours of British English in Terms of a Command-Response Model
Fujisaki, H Ohno, S Yagi, T Ono, T
Science University of Tokyo, JAPAN

Tu5P10
Common Patterns in Word Level Prosody
Holm, F Hata, K
Speech Technology Laboratory, USA

Tu5P11
Prosodic Structure in Japanese Spontaneous Speech
Horiuchi, Y Ichikawa, A
Chiba University, JAPAN

Tu5P12
An acoustic-phonetic description of word tone in Kagoshima Japanese
Ishihara, S
The Australian National University, AUSTRALIA

Tu5P13
Representing Prosodic Words Using Statistical Models of Moraic Transition of Fundamental Frequency Contours of Japanese
Iwano, K Hirose, K
University of Tokyo, JAPAN

Tu5P14
Disambiguation of Korean Utterances Using Automatic Intonation Recognition
Jang, T Song, M Lee, K
University of Edinburgh, UNITED KINGDOM

Tu5P15
Multi-level rhythm control for speech synthesis using hybrid data driven and rule-based approaches
Jokisch, O Hirschfeld, D Eichner, M Hoffmann, R
Dresden University of Technology, GERMANY

Tu5P16
EGG Model of Ditoneme in Mandarin*
Kong, J
City University of Hong Kong, HONG KONG

Tu5P17
Temporal Organization of Speech for normal and fast rates
Krishnan, G Ward, W
Carnegie Mellon University, USA

Tu5P18
A syllable-based generalization of Japanese accentuation
Kubozono, H
Kobe University, JAPAN

Tu5P19
Non-adjacent Segmental Effects in Tonal Realization of Accentual Phrase in Seoul Korean
Lee, H-J
UCLA, USA

Tu5P20
Improvement on Connected Numbers Recognition Using Prosodic Information
Lopez-Gonzalo, E Caminero, J Cortazar, I Hernandez-Gomez, L
ETSIT-UPM, SPAIN

Tu5P21
Phonetic investigation of boundary pitch movements in Japanese
Maeda, K Venditti, J J
University of Pennsylvania, USA

Tu5P22
Phonetic and phonological characteristics of paralinguistic information in spoken Japanese
Maekawa, K
The National Language Research Institute, JAPAN

Tu5P23
ToBI Accent Type Recognition
Maghbouleh, A
Stanford University, USA

Tu5P24
The Influence of Syllable Structure on the Timing of Intonational Events in German
Mixdorff, H Fujisaki, H
Dresden University of Technology, GERMANY

Tu5P25
New Prosodic Control Rules for Expressive Synthetic Speech
Mizuno, O Nakajima, S
NTT Human Interface Labs, JAPAN

Tu5P26
The Use of F0 Reliability Function for Prosodic Command Analysis on F0 Contour Generation Model
Nakai, M Shimodaira, H
JAIST, JAPAN

Tu5P27
Analysis of Effects of Lexical Accent, Syntax and Global Speech Rate upon the Local Speech Rate
Ohno, S Fujisaki, H Taguchi, H
Department of Applied Electronics, Science University of Tokyo, JAPAN

Tu5P28
On the Effects of Speech Rate upon Parameters of the Command-Response Model for the Fundamental Frequency Contours of Speech
Ohno, S Fujisaki, H Hara, Y
Science University of Tokyo, JAPAN

Tu5P29
The maximum-based description of F0 contours and its application to English
Portele, T Heuft, B
IKP, University of Bonn, GERMANY

Tu5P30
Perceived prominence and acoustic parameters in American English
Portele, T
IKP, University of Bonn, GERMANY

Tu5P31
Generating Emotional Speech with a Concatenative Synthesizer
Rank, E Pirker, H
OFAI, AUSTRIA

Tu5P32
A Perceptive Measure of Pure Prosody Linguistic Functions with Reiterant Sentences
Rilliard, A Auberge, V
Institut De La Communication Parlee, FRANCE

Tu5P33
Prosodic Parameters in Emotional Speech
Koike, K Suzuki, H Saito, H
Keio University, JAPAN

Tu5P34
Automatic detection of prominence (as defined by listeners’ judgments) in read aloud Dutch sentences
Streefkerk, B M Pols, L C W ten Bosch, L F M
Institute of Phonetic Science Amsterdam, THE NETHERLANDS

Tu5P35
A Schema for Illocutionary Act Identification with Prosodic Feature
Tamoto, M Kawabata, T
NTT Basic Research Laboratory, JAPAN

Tu5P36
An Algorithm for Choosing Japanese Acknowledgments using Prosodic Cues and Context
Tsukahara, W
University of Tokyo, JAPAN

Tu5P37
A Study of Tones and Tempo in Continuous Mandarin Digit Strings and their Application in Telephone Quality Speech Recognition
Wang, C Seneff, S
MIT Laboratory for Computer Science, USA

Tu5P38
Simulated emotions: an acoustic study of voice and perturbation measures
Whiteside, S P
University of Sheffield, UNITED KINGDOM

Tu5P39
A Robust Tone Recognition Method of Chinese Based on Sub-syllabic F0 Contours
Zhang, J-S Hirose, K
The University of Tokyo, JAPAN

Tu5P40
The Microprosodics of Tone Sandhi in Shanghai Disyllabic Compounds
Zhu, X S
Australian National University, AUSTRALIA

Tu5P41
Jitter and Shimmer Differences Between Pathological Voices of School Children
Bolfan-Stosic, N Prizl, T
University of Zagreb, CROATIA


Tu5Q - Neural Networks, Fuzzy and Evolutionary Methods 1

Time & Place: 1530 - 1800, Quokka Room
Chair: Christian Wellekens

Tu5Q1
A Comparison of Thai Speech Recognition Systems using Hidden Markov Model, Neural Network, and Fuzzy-Neural Network
Ahkuputra, V Jitapunkul, S Jittiwarangkul, N Maneenoi, E Kasuriya, S
Chulalongkorn University, THAILAND

Tu5Q2
Phoneme recognition with statistical modeling of the prediction error of neural networks
Freitag, F Monte, E
Polytechnical University of Catalonia, SPAIN

Tu5Q3
Neural network based pronunciation modeling with applications to speech recognition
Fukada, T Yoshimura, T Sagisaka, Y
ATR Interpreting Telecommunications Research Laboratories, JAPAN

Tu5Q4
A Comparative Study of OCON and MLP Architectures for Phoneme Recognition
Haskey, S Datta, S
Loughborough University, UNITED KINGDOM

Tu5Q5
Evaluation and Integration of Neural-Network Training Techniques for Continuous Digit Recognition
Hosom, J P Cole, R A Cosi, P
Oregon Graduate Institute, USA

Tu5Q6
Hierarchical Neural Networks (HNN) for Chinese Continuous Speech Recognition
Jia, Y Du, L Hou, Z
Chinese Academy of Sciences, P.R. CHINA

Tu5Q7
Neural Network Motivation for Segmental Distribution
Keller, E
University of Lausanne, SWITZERLAND

Tu5Q8
Combining Connectionist Multi-Band and Full-Band Probability Streams for Speech Recognition of Natural Numbers
Mirghafori, N Morgan, N
International Computer Science Institute, USA

Tu5Q9
Initial Speech Recognition Results using the Multinet Architecture
Pizzolato, E B Reynolds, TJ
University of Essex, UNITED KINGDOM

Tu5Q10
Selection of the Optimal Structure of the Continuous HMM Using the Genetic Algorithm
Takara, T Iha, Y Nagayama, I
University of the Ryukyus, JAPAN

Tu5Q11
A Proposed Decision Rule for Speaker Recognition Based on Fuzzy C-Means Clustering
Tran, D Wagner, M Le, T V
University of Canberra, AUSTRALIA

Tu5Q12
Fuzzy Gaussian Mixture Models for Speaker Recognition
Tran, D Le, T V Wagner, M
University of Canberra, AUSTRALIA

Tu5Q13
A New Strategy of Fuzzy-Neural Network for Thai Numeral Speech Recognition
Wutiwiwatchai, C Jitapunkul, S Ahkuputra, V Maneenoi, E Luksaneeyanawin, S
Chulalongkorn University, THAILAND

Tu5Q14
Thai Polysyllabic Word Recognition Using Fuzzy-Neural Network
Wutiwiwatchai, C Jitapunkul, S Ahkuputra, V Maneenoi, E Luksaneeyanawin, S
Chulalongkorn University, THAILAND


Tu5R - Utterance Verification and Word Spotting 1 / Speaker Adaptation 1

Time & Place: 1530 - 1800, Rosella Room
Chair: Douglas O'Shaughnessy

Tu5R1
Word Verification Using Confidence Measures in Speech Recognition
Benitez, C Rubio Ayuso, A Garcia Teodoro, P Diaz Verdejo, J E
Universidad De Granada, SPAIN

Tu5R2
Improving Posterior Based Confidence Measures in Hybrid HMM/ANN Speech Recognition Systems
Bernardis, G Bourlard, H
IDIAP, SWITZERLAND

Tu5R3
Two-pass Utterance Verification Algorithm for Long Natural Numbers Recognition
Caminero, J Lopez, E Hernandez, L
ETSIT-UPM, SPAIN

Tu5R4
A*-Admissible Key-Phrase Spotting with Sub-Syllable Level Utterance Verification
Chen, B Wang, H-M Shien, L-F Lee, L-S
Academica Sinica, Taiwan, TAIWAN R.O.C.

Tu5R5
Speaker-Independent Upfront Dialect Adaptation in a Large Vocabulary Continuous Speech Recognizer
Fischer, V Gao, Y Janke, E
IBM Speech Systems, GERMANY

Tu5R6
Word-Based Acoustic Confidence Measures for Large-Vocabulary Speech Recognition
Gunawardana, A Hon, H-W Jiang, L
Microsoft Research, USA

Tu5R7
Improved Utterance Rejection using Length Dependent Thresholds
Gupta, S K Soong, F
Bell Laboratories - Lucent Technologies, USA

Tu5R8
Bayesian Constrained Frequency Warping HMMs for Speaker Normalization
Ho, C H Vaseghi, S Chen, A
Queen's University of Belfast, UNITED KINGDOM

Tu5R9
An Evaluation of Keyword Spotting Performance utilizing False Alarm Rejection based on Prosodic Information
Ida, M Yamasaki, R
OMRON Corporation, JAPAN

Tu5R10
Predictive Speaker Adaptation and Its Prior Training
Tran, D Iso, K-I
NEC, JAPAN

Tu5R11
Powerful Syllabic Fillers for General-Task Keyword-Spotting and Unlimited-Vocabulary Continuous-Speech Recognition
El Meliani, R O'Shaughnessy, D
INRS-Telecommunications, CANADA

Tu5R12
Confidence Scoring for Speech Understanding Systems
Pao, C Schmid, P Glass, J
MIT Laboratory for Computer Science, USA

Tu5R13
Phonological Rules for Enhancing Acoustic Enrollment of Unknown Words
Ramabhadran, B Ittycheriah, A
IBM T.J. Watson Research Center, USA

Tu5R14
Recognition-Based Word Counting for Reliable Barge-in and Early Endpoint Detection in Continuous Speech Recognition
Setlur, A R Sukkar, R A
Lucent Technologies, USA

Tu5R15
Linear Discriminant - A New Criterion for Speaker Normalization
Westphal, M Schultz, T Waibel, A
Carnegie Mellon University, USA

Tu5R16
Confidence Measures Derived from an Acceptor HMM
Williams, G Renals, S
Sheffield University, UNITED KINGDOM

Tu5R17
Telephone Speech Multi-Keyword Spotting Using Fuzzy Search Algorithm and Prosodic Verification
Wu, C-H Chen, Y-J Hung, Y-C
National Cheng Kung University, TAIWAN R.O.C.

Tu5R18
Topic Recognition for News Speech Based on Keyword Spotting
Yamashita, Y Tsunekawa, T Mizoguchi, R
Ritsumeikan University, JAPAN


Tu6A - Text-To-Speech Synthesis 2

Time & Place: 1700 - 1800, Antechinus Room
Chair: Susan Hertz

1700
Tu6A1

Prosody Prediction for Speech Synthesis using Transformational Rule-based Learning
Fordyce, C Ostendorf, M
Lernout & Hauspie Speech Products, USA

1715
Tu6A2

Representing the environments for phonological processes in an accent-independent lexicon for synthesis of English
Fitt, S Isard, S
University of Edinburgh, UNITED KINGDOM

1730
Tu6A3

Efficient Lexical Retrieval for English Text-to-Speech Synthesis
Faulkner, D Bryant, C
Aculab Plc, UNITED KINGDOM


Tu6B - Spoken Language Models and Dialog 4

Time & Place: 1700 - 1800, Bandicoot Room
Chair: James Glass

1700
Tu6B1

SQEL: A Multilingual and Multifunctional Dialog System
Aretoulaki, M Harbeck, S Gallwitz, F Noeth, E Niemann, H Ivanecky, J Ipsic, I Pavesic, N Matousek, V
University of Erlangen, GERMANY

1715
Tu6B2

Semi-automated incremental prototyping of spoken dialog systems
Kaspar, S Hoffmann, A
University of New South Wales, AUSTRALIA

1730
Tu6B3

Beyond Structured Dialogues: Factoring out Grounding
Heeman, P Johnston, M Denney, J Kaiser, E
Oregon Graduate Institute, USA


Tu6D - Human Speech Perception 1

Time & Place: 1700 - 1800, Dingo Room
Chair: Anne Cutler

1700
Tu6D1

Heads and tails in word perception: Evidence for 'Early-to-Late' Processing in listening and reading
Nooteboom, S G Van Dijk, M
Utrecht University, THE NETHERLANDS

1715
Tu6D2

Evidence for Early Effects of Sentence Context on Word Segmentation
te Riele, S M M Quené, H
Utrecht University, THE NETHERLANDS

1730
Tu6D3 Assimilation and anticipation in word perception

Quené, H van Rossum, M van Wijck, M
Utrecht University, THE NETHERLANDS

1745
Tu6D4

Lexical activation by assimilated and reduced tokens
Kelly, M L Bard, E G Sotillo, C
University of Edinburgh, UNITED KINGDOM



Wednesday 2 December, 1998


We1A - Robust Speech Processing in Adverse Environments 3

Time & Place: 0900 - 1000, Antechinus Room
Chair: Douglas Reynolds

0900
We1A1

Linear and Nonlinear Speech Feature Analysis for Stress Classification
Zhou, G Hansen, J H L Kaiser, J F
Robust Speech Processing Laboratory, USA

0915
We1A2

Speech Feature Modeling for Robust Stressed Speech Recognition
Bou-Ghazale, S E Hansen, J H L
Robust Speech Processing Laboratory, USA

0930
We1A3

Combining Articulatory and Acoustic Information for Speech Recognition in Noisy and Reverberant Environments
Kirchhoff, K
University of Bielefeld, GERMANY

0945
We1A4

Improving Speaker Identification Performance in Reverberant Conditions using Lip Information
Wark, T Sridharan, S
Queensland University of Technology, AUSTRALIA


We1B - Speech and Hearing Disorders 1

Time & Place: 0900 - 1000, Bandicoot Room
Chair: Florien Koopmans-van-Bienum

0900
We1B1

Adults with a severe-to-profound hearing impairment. Investigating the effects of linguistic context on speech perception.
Flynn, M Dowell, R Clark, G
University of Melbourne, AUSTRALIA

0915
We1B2

Speech Perception in Dyslexia: Measurements from Birth Onwards
Koopmans-van Beinum, F J Schwippert, C Kuijpers, C T L
University of Amsterdam, THE NETHERLANDS

0930
We1B3

An Acoustic Analysis of Vowel Production Across Tasks in a Case of Non-fluent Progressive Aphasia
Croot, K
University of Sydney / Macquarie University, AUSTRALIA

0945
We1B4

Speech technology in clinical environments:
van Doorn, J McLeod, S Baker, E Purcell, A Thorpe, W
The University of Sydney (Cumberland Campus), AUSTRALIA


We1C - Prosody and Emotion 3

Time & Place: 0900 - 1000, Currawong Room
Chair: Mark Swerts

0900
We1C1

What Spreads, and How? Tonal Rightward Spreading On Shanghai Disyllabic Compounds
Zhu, X S
Australian National University, AUSTRALIA

0915
We1C2

Tonal Complexity as a Dialectal Feature: 25 Different Citation Tones from Four Zhejiang Wu Dialects
Zhu, S X Rose, P
Australian National University, AUSTRALIA

0930
We1C3

Emotional Speech Synthesis: from speech database to TTS
Montero, J M Gutierrez-Arriola, J Palazuelos, S Enriquez, E Aguilera, S Pardo, J M
GTH-IEL-UPM, SPAIN

0945
We1C4

Some Acoustic Characteristics of Emotion
Pereira, C Watson, C
SHLRC, Macquarie University, AUSTRALIA


We1D - Spoken Language Understanding Systems 1

Time & Place: 0900 - 1000, Dingo Room
Chair: David Grayden

0900
We1D1

Galaxy-II: A Reference Architecture for Conversational System Development
Seneff, S Hurley, E Lau, R Pao, C Schmid, P Zue, V
MIT, USA

0915
We1D2

Improvements in Speech Understanding Accuracy Through the Integration of Hierarchical Linguistic, Prosodic, and Phonological Constraints in the Jupiter Domain
Chung, G Seneff, S
MIT Laboratory for Computer Science, USA

0930
We1D3

Towards Robust Methods for Spoken Document Retrieval
Ng, K
MIT, USA


We1P - Signal Processing and Speech Analysis 1

Time & Place: 0900 - 1130, Platypus Room
Chair: Aaron Rosenberg

We1P1
Maximum a Posteriori Pitch Tracking
Droppo, J Acero, A
Microsoft Corporation, USA

We1P2
Vowel Separation Using the Reassigned Amplitude-Modulation Spectrum
Yang, D Meyer, G F Ainsworth, W A
Keele University, UNITED KINGDOM

We1P3
Feature Decorrelation Methods in Speech Recognition. A Comparative Study
Batlle, E Nadeu, C Fonollosa, J A R
Polytechnical University of Catalonia, SPAIN

We1P4
Multi-resolution for Speech Analysis
Caraty, M-J Montacie, C
Universite Pierre Et Marie Curie, FRANCE

We1P5
Dynamic features in Children's Vowels
Cassidy, S Watson, C
Macquarie University, AUSTRALIA

We1P6
Effectiveness of phase-corrected RASTA for continuous speech recognition
de Veth, J Boves, L
University of Nijmegen, THE NETHERLANDS

We1P7
Techniques for capturing temporal variations in speech signals with fixed-rate processing
Dharanipragada, S Gopinath, R Rao, B
IBM T. J. Watson Research Center, USA

We1P8
Automatic Detection of Landmark for Nasal Consonants from Speech Waveform
Du, L Stevens, K N
Chinese Academy of Sciences, P.R. CHINA

We1P9
Plug and Play Software for Designing High-Level Speech Processing Systems
Dutoit, T Schroeter, J
Faculte Polytechnique De Mons, BELGIUM

We1P10
Creating speaker independent HMM models for restricted database using STRAIGHT-TEMPO morphing
Girardi, A Shikano, K Nakamura, S
Nara Institute of Science and Technology, JAPAN

We1P11
Restoration of hyperbaric speech by correction of the formants and the
pitch

Charonnat, L Guitton, M Crestel, J Allee, J
ENSSAT, FRANCE

We1P12
Voice Conversion based on parameter transformation
Gutiérrez-Arriola, J M Hsiao, Y S Montero, J M Pardo, J M Childers, D G
GTH-IEL-UPM, SPAIN

We1P13
Noise Robust Two-Stream Auditory Feature Extraction Method for Speech Recognition
Tian, J Hariharan, R Laurila, K
Nokia Research Center, FINLAND

We1P14
Heterogeneous Measurements and Multiple Classifiers for Speech Recognition
Halberstadt, A K Glass, J R
MIT, USA

We1P15
Joint Recognition and Segmentation using Phonetically Derived Features and a Hybrid Phoneme Model
Harte, N Vaseghi, S Milner, B
Queen's University of Belfast, UNITED KINGDOM

We1P16
TRAPS - Classifiers of Temporal Patterns
Hermansky, H Sharma, S
Oregon Graduate Institute, USA

We1P17
Robust measurement of fundamental frequency and degree of voicing
Holmes, J N
UNITED KINGDOM

We1P18
Micropower Electro-Magnetic Sensors for Speech Characterization, Recognition, Verification, and Other Applications
Holzrichter, J F Burnett, G C Gable, T J Ng, L C
Lawrence Livermore National Laboratory, USA

We1P19
Robust Entropy-based Endpoint Detection for Speech Recognition in Noisy Environments
Shen, J-L Hung, J-W Lee, L-S
Academia Sinica, TAIWAN R.O.C.

We1P20
Statistical Integration of Temporal Filter Banks for Robust Speech Recognition Using Linear Discriminant Analysis (LDA)
Shen, J-L Hwang, W-L
Academia Sinica, TAIWAN R.O.C.

We1P21
Feature-Based Approach to Speech Recognition
Iskra, D J Edmondson, W H
University of Birmingham, UNITED KINGDOM

We1P22
Periodicity Emphasis of Voice Wave using Nonlinear IIR Digital Filters and Its Applications
Kamata, H Kaneko, A Ishida, Y
Meiji University, JAPAN

We1P23
Speech recognition via phonetically featured syllables
King, S Stephenson, T Isard, S Taylor, P Strachan, A
CSTR, University of Edinburgh, UNITED KINGDOM

We1P24
Do Phonetic Features Help to Improve Consonant Identification in ASR?
Koreman, J Andreeva, B Barry, W J
University of the Saarland, GERMANY

We1P25
Perceptual and acoustic properties of phonemes in continuous speech for different speaking rate
Kuwabara, H
Teikyo University of Science & Technology, JAPAN

We1P26
On Robust Sequential Estimator Based on t-distribution with Forgetting Factor for Speech Analysis
Lee, J Lee, K Y
Dong-Ah Broadcasting College, KOREA

We1P27
Discriminant Wavelet Basis Construction for Speech Recognition
Long, C Datta, S
Loughborough University, UNITED KINGDOM

We1P28
An Efficient Mel-LPC Analysis Method for Speech Recognition
Matsumoto, H Nakatoh, Y Furuhata, Y
Shinshu University, JAPAN

We1P29
Discriminative Weighting of Multi-resolution Sub-band Cepstral Features for Speech Recognition
McMahon, P McCourt, P Vaseghi, S
Queens University of Belfast, NORTHERN IRELAND

We1P30
Separation of Singing and Piano Sounds
Meron, Y Hirose, K
Tokyo University, JAPAN

We1P31
Modeling of Variations in Cepstral Coefficients Caused by F0 Changes and its Application to Speech Processing
Minematsu, N Nakagawa, S
Toyohashi University of Technology, JAPAN

We1P32
A Detection Framework for Locating Phonetic Events
Niyogi, P Mitra, P P Sondhi, M M
Bell Laboratories, Lucent Technologies, USA

We1P33
On Frequency Averaging for Spectral Analysis in Speech Recognition
Nadeu, C Galindo, F Padrell, J
Polytechnical University of Catalonia, SPAIN

We1P34
Wavelet transform domain blind equalization and its application to speech analysis
Namba, M Ishida, Y
Meiji University, JAPAN

We1P35
A Novel Method of Formant Analysis and Glottal Inverse Filtering
Pearson, S
Speech Technology Lab/Panasonic Technologies Inc., USA

We1P36
Vector Quantizer Acceleration for an Automatic Speech Recognition Application
Araújo, A Pera, V Souza, M
FEUP, PORTUGAL

We1P37
Local Speech Rate as a Combination of Syllable and Phone Rate
Pfitzinger, H R
University of Munich, GERMANY

We1P38
Recovering gestures from speech signals: a preliminary study for nasal vowels
Rossato, S Feng, G Laboissière, R
Institut De La Communication Parlee, FRANCE

We1P39
Extended Linear Discriminant Analysis (ELDA) for Speech Recognition
Ruske, G Faltlhauser, R Pfau, T
Technical University of Munich, GERMANY

We1P40
Speech, Silence, Music and Noise Classification of TV Broadcast Material
Samouelian, A Robert-Ribes, J Plumpe, M
University of Wollongong, AUSTRALIA

We1P41
The relation between vocal tract shape and formant frequencies can be described by means of a system of coupled differential equations
Schoentgen, J Soquet, A Lecuit, V Ciocea, S
Universite Libre De Bruxelles, BELGIUM

We1P42
Improving speech recognizer by broader acoustic-phonetic group
classification

Suh, Y Hwang, K Kwon, O Park, J
ETRI, KOREA

We1P43
Separation of speech source and filter by time-domain deconvolution
Thorpe, C W
National Voice Centre, Australia, AUSTRALIA

We1P44
On the Application of the AM-FM Model for the Recovery of Missing Frequency Bands of Telephone Speech
Tolba, H O'Shaughnessy, D
INRS-Telecommunications, CANADA

We1P45
Estimation of Voice Source and Vocal Tract Parameters Using Combined Subspace-based and Amplitude Spectrum-based Algorithm
Yang, C-S Kasuya, H
Utsunomiya University, JAPAN

We1P46
The Distance Measure for Line Spectrum Pairs Applied to Speech Recognition
Zheng, F Song, Z-J Li, L Yu, W-J Zheng, F-Z Wu, W-H
Tsinghua University, P.R. CHINA


We1Q - Spoken Language Generation and Translation 1

Time & Place: 0900 - 1130, Quokka Room
Chair: Yoshinori Sagisaka

We1Q1
The Modeling and Realization of Natural Speech Generation System
Chen, F Yuan, B
Northern Jiaotong University, P.R. CHINA

We1Q2
'Ko Tok Ples Ensin bilong Tok Pisin' or the TP-CLE: A first report from a pilot speech-to-speech translation project from Swedish to Tok Pisin
Eklund, R
Telia Research AB, SWEDEN

We1Q3
An Iterative, DP-based Search Algorithm for Statistical Machine Translation
Garcia-Varea, I Casacuberta, F Ney, H
Universidad Politecnic De Valencia, SPAIN

We1Q4
Information extraction and text generation of news reports for a Swedish-English bilingual spoken dialogue system
Gawronska, B House, D
University of Skovde, SWEDEN

We1Q5
Utterance Generation for Transaction Dialogues
Hulstijn, J Van Hessen, A
University of Twente, THE NETHERLANDS

We1Q6
Example-Based Error Recovery Method for Speech Translation: Repairing Sub-Trees According to the Semantic Distance
Ishikawa, K Sumita, E Iida, H
ATR Interpreting Telecommunications Research Laboratories, JAPAN

We1Q7
Context Sensitive Generation of Descriptions
Krahmer, E Theune, M
IPO Center for Research On User-System Interaction, THE NETHERLANDS

We1Q8
An Interlingua Based on Domain Actions for Machine Translation of Task-Oriented Dialogues
Levin, L Gates, D Lavie, A Waibel, A
Carnegie Mellon University, USA

We1Q9
Generating Pitch Accents in a Concept-To-Speech System using a Knowledge Base
Williams, S H
Microsoft Research Institute, Macquarie University, AUSTRALIA

We1Q10
Making the most of multiplicity: a multi-parser, multi-strategy architecture for the robust processing of spoken language
Ruland, T Rupp, C J Spilker, J Weber, H Worm, K L
Siemens AG / University of the Saarland, GERMANY

We1Q11
Natural-Sounding Speech Synthesis Using Variable-Length Units
Yi, J R W Glass, J R
MIT, USA


We1R - Spoken Language Models and Dialog 5

Time & Place: 0900 - 1130, Rosella Room
Chair: Pietro Laface

We1R1
A robust dialogue model for spoken dialogue processing
Araki, M Doshita, S
Kyoto University, JAPAN

We1R2
The REWARD Service Creation Environment. An Overview
Brøndsted, T Bai, B Olsen, J
Aalborg University, DENMARK

We1R3
An Analysis of the Timing of Turn-Taking in a Corpus of Goal-Oriented Dialogue
Bull, M C Aylett, M
University of Edinburgh, UNITED KINGDOM

We1R4
The provision of corrective feedback in a spoken dialogue CALL system
Davies, S Poesio, M
University of Edinburgh, HCRC, UNITED KINGDOM

We1R5
Evaluation of dialog strategies for a tourist information retrieval system
Devillers, L Bonneau-Maynard, H
LIMSI/CNRS, FRANCE

We1R6
Designing a Multimodal Dialogue System for Information Retrieval
Furui, S Yamaguchi, K
Tokyo Institute of Technology, JAPAN

We1R7
The Research Project of Man-Computer dialogue System in Chinese
Guan, D Chu, M Zhang, Q Liu, J Zhang, H
Chinese Academy of Sciences, P.R. CHINA

We1R8
Interfaces for Speech Recognition Systems: The Impact of Vocabulary Constraints and Syntax on Performance
Hone, K Golightly, D
ICL Institute of Information Technology, UNITED KINGDOM

We1R9
Pacing Spoken Directions to Suit the Listener
Iwase, T Ward, N
University of Tokyo, JAPAN

We1R10
A Spoken Dialogue System Utilizing Spatial Information
Flycht-Eriksson, A Jonsson, A
Linkoping University, SWEDEN

We1R11
From Novice to Expert: The Effect of Tutorials on User Expertise with Spoken Language Dialogue Systems - An Empirical Analysis of the Efficacy of Tutorial Dialogues
Kamm, C A Litman, D J Walker, M A
AT&T Labs Research, USA

We1R12
Emergent Computational Dialogue Management Architecture for Task-Oriented Spoken Dialogue Systems
Kawabata, T
NTT Basic Research Labs, JAPAN

We1R13
An Analysis of Dialogues with Our Dialogue System through a WWW page
Kumamoto, T Ito, A
Communications Research Laboratory, JAPAN

We1R14
Modelling spoken dialogues with state transition diagrams: experiences of the CSLU toolkit
McTear, M
University of Ulster, UNITED KINGDOM

We1R15
Situated Dialogue Coordination for Spoken Dialogue Systems
Okada, M Suzuki, N Terken, J
ATR MI & C Research Laboratories, JAPAN

We1R16
Robust spoken dialogue systems for consumer products: a concrete application
Pouteau, X Arevalo, L
IPO, Center for Research On User-System Interaction, THE NETHERLANDS

We1R17
A German Dialogue System for Scheduling Dates and Meetings by Naturally Spoken Continuous Speech
Willett, D Roemer, A Rottland, J Rigoll, G
Duisburg University, GERMANY

We1R18
Spoken Dialogue System Using Corpus-Based Hidden Markov Model
Wu, C-H Yan, G-L Lin, C-L
National Cheng Kung University, TAIWAN R.O.C.

We1R19
A Realistic Wizard of Oz simulation of a Multimodal Spoken Language System
Wyard, P Churcher, G
BT Laboratories, UNITED KINGDOM

We1R20
A Syllable-Based Chinese Spoken Dialogue System for Telephone Directory Services Primarily Trained with a Corpus
Yang, Y-J Lee, L-S
National Taiwan University, TAIWAN R.O.C.

We1R21
How disagreement expressions are used in cooperative tasks
Yano, H Ito, A
Communications Research Laboratory, JAPAN


We2A - Segmentation, Labelling and Speech Corpora 1

Time & Place: 1030 - 1130, Antechinus Room
Chair: Bill Barry

1030
We2A1

Acoustic Indicators of Topic Segmentation
Hirschberg, J Nakatani, C
AT & T Labs - Research, USA

1045
We2A2

IViE - A Comparative Transcription system for Intonational Variation in English
Grabe, E Nolan, F Farrar, K J
University of Cambridge, UNITED KINGDOM

1100
We2A3

Automatic Segmental and Prosodic Labeling of Mandarin Speech Database
Chou, F-C Tseng, C-Y Lee, L-S
National Taiwan University, TAIWAN R.O.C.

1115
We2A4

Automatic Labelling of German Prosody
Rapp, S
Sony International (Europe) GmbH, GERMANY


We2B - Multimodal Spoken Language Processing 2

Time & Place: 1030 - 1130, Bandicoot Room
Chair: Dominic Massaro

1030
We2B1

Speech Driven 3-D Face Point Trajectory Synthesis Algorithm
Arslan, L M Talkin, D
Entropic Research Lab, USA

1045
We2B2

Speech-to-Lip Movement Synthesis based on the EM Algorithm using Audio-Visual HMMs
Yamamoto, E Nakamura, S Shikano, K
Nara Institute of Science and Technology, JAPAN

1100
We2B3

Learning Words from Natural Audio-Visual Input
Roy, D Pentland, A
MIT Media Laboratory, USA

1115
We2B4

Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition: Experiments on the M2VTS Database
Dupont, S Luettin, J
Faculte Polytechnique De Mons, BELGIUM


We2C - Prosody and Emotion 4

Time & Place: 1030 - 1130, Currawong Room
Chair: Hiroya Fujisaki

1030
We2C1

Intonative structure as a determinant of word order variation in Dutch verbal endgroups
Swerts, M
IPO, Center for Research On User-System Interaction, THE NETHERLANDS

1045
We2C2

Experiments on the Meaning of Two Pitch Accent Types: The 'Pointed Hat' Versus the Accent-Lending Fall in Dutch
Caspers, J
Holland Institute of Generative Linguistics, NETHERLANDS

1100
We2C3

Phonetic and Phonological Markers of Contrastive Focus in Korean
Jun, S-A Lee, H-J
UCLA, USA

1115
We2C4

Reconciling two competing views on contrastiveness
Krahmer, E Swerts, M
IPO, Center for Research On User-System Interaction, THE NETHERLANDS


We2D - Neural Networks, Fuzzy and Evolutionary Methods 2

Time & Place: 1030 - 1130, Dingo Room
Chair: Tuan Pham

1030
We2D1

Modular Neural Networks for Low-Complex Phoneme Recognition
Glaeser, A
Ascom Systec Ltd, SWITZERLAND

1045
We2D2

Global Optimisation of Neural Network Models Via Sequential Sampling-Importance Resampling
de Freitas, N Johnson, S Niranjan, M Gee, A
Cambridge University, UNITED KINGDOM

1100
We2D3

Efficient computation of MMI neural networks for large vocabulary speech recognition systems
Rottland, J Luedecke, A Rigoll, G
Duisburg University, GERMANY

1115
We2D4

Modular Connectionist Systems for Identifying Complex Arabic Phonetic Features
Selouani, S A Caelen, J
HB University, ALGERIA


Plenary Session
We3A - Plenary Panel Discussion

Title: Automatic Speech Recognition Is A Solved Problem: A Fact Or Convenient Fiction?
Time & Place: 1135 - 1235, Antechinus Room
Chair: Roberto Togneri
Moderator: Roger Moore
Panelists:
Hynek Hermansky (Oregon Graduate Institute)
Sadaoki Furui (Tokyo Institute of Technology)
Chin-Hui Lee (Bell Laboratories)
Clive Summerfield (Syrinx Speech Systems)



We4A - Large Vocabulary Continuous Speech Recognition 1

Time & Place: 1330 - 1500, Antechinus Room
Chair: Sadaoki Furui

1330
We4A1

Real-Time Recognition of Broadcast News
Cook, G Robinson, T Christie, J
Cambridge University, UNITED KINGDOM

1345
We4A2

Automatic Recognition of Korean Broadcast News Speech
Yu, H-J Kim, H Choi, J-S Hong, J-M Park, K-S Lee, J-S Lee, Y-G Lee, H-Y
LG Corporate Institute of Technology, KOREA

1400
We4A3

Telephone-Based Conversational Speech Recognition in the JUPITER
Domain

Glass, J R Hazen, T J
MIT, USA

1415
We4A4

Japanese Large-Vocabulary Continuous Speech Recognition System Based on Microsoft Whisper
Hon, H Ju, Y Otani, K
Microsoft Research, USA

1430
We4A5

Partitioning and Transcription of Broadcast News Data
Gauvain, J-L Lamel, L Adda, G
LIMSI-CNRS, FRANCE


We4B - Speaker and Language Recognition 2

Time & Place: 1330 - 1500, Bandicoot Room
Chair: Marc Zissman

1330
We4B1

Speaker Detection in Broadcast Speech Databases
Rosenberg, A E Magrin-Chagnolleau, I Parthsarathy, S Huang, Q
AT&T Labs-Research, USA

1345
We4B2

Multilateral Techniques for Speaker Recognition
Parris, E S Carey, M J
Ensigma, UNITED KINGDOM

1400
We4B3

Real Time Speaker Indexing based on subspace method - Application to TV News Articles and Debate -
Nishida, M Ariki, Y
Ryukoku University, JAPAN

1415
We4B4

SHEEP, GOATS, LAMBS and WOLVES: A Statistical Analysis of Speaker Performance in the NIST 1998 Speaker Recognition Evaluation
Doddington, G Liggett, W Martin, A Przybocki, M Reynolds, D
MIT Lincoln Laboratory, USA

1430
We4B5

Progress in Speaker Recognition at Dragon Systems
Corrada-Emmanuel, A Newman, M Peskin, B Gillick, L Roth, R
Dragon Systems, USA

1445
We4B6

A Comparative Study of Speaker Verification Systems using the Polycost Database
Nordström, T Melin, H Lindberg, J
KTH, SWEDEN


We4C - Signal Processing and Speech Analysis 2

Time & Place: 1330 - 1500, Currawong Room
Chair: Kazuyo Tanaka

1300
We4C1

Improving Pitch Estimation with Short Duration Speech Samples
Ainsworth, W A Day, C R Meyer, G F
Keele University, UNITED KINGDOM

1345
We4C2

An instantaneous-frequency-based pitch extraction method for high-quality speech transformation: revised TEMPO in the STRAIGHT-suite
Kawahara, H de Cheveigne, A Patterson, R
Wakayama University, JAPAN

1400
We4C3

Speaker-Independent Speech Recognition using Micro Segment Spectrum Integration
Aikawa, K
NTT Human Interface Laboratories, JAPAN

1415
We4C4

On Robust Speech Analysis Based on Time-Varying Complex AR Model
Funaki, K Miyanaga, Y Tochinai, K
Hokkaido University, JAPAN

1430
We4C5

Spectral Basis Functions from Discriminant Analysis
Hermansky, H Malayath, N
Oregon Graduate Institute, USA


We4D - Prosody and Emotion 5

Time & Place: 1330 - 1500, Dingo Room
Chair: Daniel Hirst

1330
We4D1

The Tilt Intonation Model
Taylor, P
University of Edinburgh, UNITED KINGDOM

1345
We4D2

Analysis of Occurrences of Pauses and Their Durations in Japanese Text Reading
Fujisaki, H Ohno, S Yamada, S
Science University of Tokyo, JAPAN

1400
We4D3

A statistical study of pitch target points in five languages
Campione, E Véronis, J
Université De Provence, FRANCE

1415
We4D4

Fully Automatic Prosody Generator for Text-to-Speech
Malfrere, F Dutoit, T Mertens, P
Faculte Polytechnique De Mons, BELGIUM

1430
We4D5

Automatic Prosodic Labeling of 6 Languages
Vereecken, H Martens, J-P Grover, C Fackrell, J Van Coile, B
Elis-University of Ghent, BELGIUM

1445
We4D6

Automatic Utterance Type Detection Using Suprasegmental Features
Wright, H
The University of Edinburgh, UNITED KINGDOM


We4P- Robust Speech Processing in Adverse Environments 4

Time & Place: 1330 - 1700, Platypus Room
Chair: Keikichi Hirose

We4P1
Spectral Sequence Compensation based on Continuity of Spectral Sequence
Akagi, M Iwaki, M Sakaguchi, N
Japan Advanced Institute of Science and Technology, JAPAN

We4P2
Robust Features for Speech Recognition Systems
Bayya, A Yegnanarayana, B
Rockwell Semiconductor Systems, USA

We4P3
Interfacing of CASA and Partial Recognition Based on a Multistream Technique
Berthommier, F Glotin, H Tessier, E Bourlard, H
Institut De La Communication Parlee, FRANCE

We4P4
An RNN-based compensation method for Mandarin telephone speech recognition
Chang, S-C Chien, S-C Kuo, C-C
Industrial Technology Research Institute, TAIWAN R.O.C.

We4P5
Robust Speech Recognition Using Discriminative Stream Weighting and Parameter Interpolation
Chu, S Zhao, Y
University of Illinois, USA

We4P6
Acoustic backing-off in the local distance computation for robust automatic speech recognition
de Veth, J Cranen, B Boves, L
University of Nijmegen, THE NETHERLANDS

We4P7
Noise Model Selection for Robust Speech Recognition
Docio-Fernandez,L Garcia-Mateo,C
University of Vigo, SPAIN

We4P8
A novel iterative signal enhancement algorithm for noise reduction in speech
Doclo, S Dologlou, I Moonen, M
KU Leuven - ESAT/SISTA, BELGIUM

We4P9
Missing Data Reconstruction for Robust Automatic Speech Recognition in the Framework of Hybrid HMM/ANN Systems
Dupont, S
Faculte Polytechnique De Mons, BELGIUM

We4P10
Recognition from GSM Digital Speech
Gallardo-Antolin, A Diaz-de-Maria, F Valverde-Albacete, F
Universidad Carlos III De Madrid, SPAIN

We4P11
Conversational Speech Systems for On-Board Car Navigation and Assistance
Geutner, P Denecke, M Meier, U Westphal, M Waibel, A
Universitaet Karlsruhe, GERMANY

We4P12
A signal processing system for having the sound "pop-out" in noise thanks to the image of the speaker’s lips: New advances using Multi-Layer Perceptrons
Girin, L Varin, L Feng, G Schwartz, J L
Institut De La Communication Parlée, FRANCE

We4P13
Robust Speech Activity Detection in the Presence of Noise
Sarikaya, R Hansen, J H L
Robust Speech Processing Laboratory, USA

We4P14
Robust Automatic Speech Recognition by the Application of a Temporal-Correlation-Based Recurrent Multilayer Neural Network to the Mel-based Cepstral Coefficients
Heon, M Tolba, H O'Shaughnessy, D
INRS-Telecommunication, CANADA

We4P15
Speech Recognition from GSM Codec Parameters
Huerta, J M Stern, R M
Carnegie Mellon University, USA

We4P16
Improved Parallel Model Combination Based on Better Domain Transformation for Speech Recognition Under Noisy Environments
Hung, J W Shen, J L Lee, L S
Academia Sinica, TAIWAN R.O.C.

We4P17
Robust Speech/Non-Speech Detection in Adverse Conditions Based on Noise and Speech Statistics
Karray, L Monne, J
France Telecom - CNET, FRANCE

We4P18
Speech Recognition in Car Noise Environments using Multiple Models According to Noise Masking Levels
Song, M G Jung, H I Shim, K-J Kim, H S
Pusan National University, KOREA

We4P19
Spectral Noise Subtraction with Recursive Gain Curves
Linhard, K Haulick, T
Daimler Benz AG, GERMANY

We4P20
A Novel Robust Speech Recognition Algorithm Based on Multi-models and Integrated Decision Method
Pan, S Liu, J Jiang, J-T Wang, Z-Y Lu, D-J
Tsinghua University, P.R. CHINA

We4P21
On the Interaction Between Time and Frequency Filtering of Speech Parameters for Robust Speech Recognition
Macho, D Nadeu, C
Polytechnical University of Catalonia, SPAIN

We4P22
Inference of Missing Spectrographic Features for Robust Speech Recognition
Raj, B Singh, R Stern, R M
Carnegie Mellon University, USA

We4P23
SNR-dependent flooring and noise overestimation for joint application of spectral subtraction and model combination
Schless, V Class, F
Daimler-Benz AG, GERMANY

We4P24
Improved Robust Speech Recognition Considering Signal Correlation Approximated by Taylor Series
Shen, J-L Hung, J-W Lee, L-S
Academia Sinica, TAIWAN R.O.C.

We4P25
Speech Recognition in Noisy Environment Using Weighted Projection-Based Likelihood Measure
Shin, W H Kim, W G Lee, C Cha, I W
Yonsei University, KOREA

We4P26
Evaluation of Model Adaptation by HMM Decomposition on Telephone Speech Recognition
Takiguchi, T Nakamura, S Shikano, K Morishima, M Isobe, T
Nara Institute of Science and Technology, JAPAN

We4P27
Comparative Experiments to Evaluate a Voiced-Unvoiced-Based Pre-Processing Approach to Robust Automatic Speech Recognition in Low-SNR Environments
Tolba, H O'Shaughnessy, D
INRS-Telecommunications, CANADA

We4P28
Signal Extraction from Noisy Signal based on Auditory Scene Analysis
Unoki, M Akagi, M
Japan Advanced Institute of Science and Technology, JAPAN

We4P29
Frequency domain binaural model as the front end of speech recognition system
Usagawa, T Sakai, K Ebata, M
Kumamoto University, JAPAN

We4P30
A Study on the Recognition of Low Bit-Rate Encoded Speech
Yu, A-T Wang, H-C
National Tsing Hua University, TAIWAN R.O.C.

We4P31
Weighted Parallel Model Combination for Noisy Speech Recognition
Hwang, T-H Wang, H-C
National Tsing-Hua University, TAIWAN R.O.C.

We4P32
Favourable and Unfavourable Short Duration Segments of Speech in Noise
Woo, D
The University of New South Wales, AUSTRALIA


We4Q - Segmentation, Labelling and Speech Corpora 2

Time & Place: 1330 - 1700, Quokka Room
Chair: Paul Dalsgaard

We4Q1
An Efficient Labeling Tool for the QuickSig Speech Database
Karjalainen, M Altosaar, T Huttunen, M
Helsinki University of Technology, FINLAND

We4Q2
Collection and Detailed Transcription of a Speech Database for Development of Language Learning Technologies
Bratt, H Neumeyer, L Shriberg, E Franco, H
SRI International, USA

We4Q3
Resegmentation of SWITCHBOARD
Deshmukh, N Ganapathiraju, A Gleeson, A Hamaker, J Picone, J
Mississippi State University, USA

We4Q4
Automatic Generation of Visual Scenarios for Spoken Corpora Acquisition
Aiello, D Delogu, C De Mori, R Di Carlo, A Nisi, M Tummeacciu, S
Fondazione Ugo Bordoni, ITALY

We4Q5
Automatic Detection of Semantic Boundaries based on Acoustic and Lexical Knowledge
Cettolo, M Falavigna, D
ITC-IRST, ITALY

We4Q6
A New Fast Algorithm for Automatic Segmentation of Continuous Speech
Gholampour, I Nayebi, K
Sharif University of Technology, IRAN

We4Q7
Acoustic Nature and Perceptual Testing of Corpora of Emotional Speech
Iida, A Campbell, W N Iga, S Higuchi, F Yasumura, M
Keio University, JAPAN

We4Q8
Korean Prosodic Break Index Labelling by a new Mixed Method of LDA and VQ
Kang, P S Kang, J Y Kim, J Y
Chonnam National University, KOREA

We4Q9
MOOSE: Management of Otago Speech Environment
Laws, M R Kilgour, R I
University of Otago, NEW ZEALAND

We4Q10
Phonetic Alignment: Speech Synthesis Based vs. Hybrid HMM/ANN
Malfrere, F Deroo, O Dutoit, T
Faculte Polytechnique De Mons, BELGIUM

We4Q11
Customisation and Quality Assessment of Spoken Language Description
Millar, J B
Australian National University, AUSTRALIA

We4Q12
A Silence/Noise/Music/Speech Splitting Algorithm
Montacie, C Caraty, M-J
Universite Pierre Et Marie Curie, FRANCE

We4Q13
Audio-Visual Segmentation for Content-based Retrieval
Pye, D Hollinghurst, N Mills, T Wood, K
Oracle & Olivetti Research Laboratory, UNITED KINGDOM

We4Q14
Same News is Good News: Automatically Collecting Reoccuring Radio News Stories
Rapp, S Dogil, G
Sony International (Europe) GmbH, GERMANY

We4Q15
An annotation system for melodic aspects of German spontaneous speech
Brindoepke, C Schaffranietz, B
University Bielefeld, GERMANY

We4Q16
Additional use of phoneme duration hypotheses in automatic speech segmentation
Stöber, K Hess, W
Bonn University, GERMANY

We4Q17
Towards a minimal standard for dialogue transcripts:A new SGML architecture for the HCRC Map Task Corpus
Isard, A McKelvie, D Thompson, H S
University of Edinburgh, UNITED KINGDOM


We4R - Speech Technology Applications and Human- Machine Interface 1

Time & Place: 1330 - 1700, Rosella Room
Chair: Jordi Robert-Ribes

We4R1
Steps toward the integration of speaker recognition in real-world telecom applications
Glaeser, A Bimbot, F
Ascom Systec Ltd, SWITZERLAND

We4R2
A Bimodal Korean Address Entry/Retrieval System
Chung, H Y Hwang, C J Lee, S W
Yeungnam University, KOREA

We4R3
Usability Evaluation of IVR systems with DTMF and ASR
Delogu, C Di Carlo, A Rotundi, P Sartori, D
Fondazione Ugo Bordoni, ITALY

We4R4
SALSA Version 1.0: A Speech-based Web Browser for Hong Kong English
Fung, P Cheung, C S Lam, K L Liu, W K Lo, Y Y
University of Science & Technology, HONG KONG

We4R5
A Language for Creating Speech Applications
Pargellis, A Zhou, Q Saad, A Lee, C-H
Bell Labs, Lucent Technologies

We4R6
The use of automatic speech recognition to reduce the interference between concurrent tasks of driving and phoning
Graham, R Carter, C Mellor, B
Loughborough University, UNITED KINGDOM

We4R7
Interactive Listening to Structured Speech Content on the Internet
Hirayama, M J Sugahara, T Peng, Z Yamazaki, J
Hewlett Packard Laboratories Japan, JAPAN

We4R8
MSF Format for the Representation of Speech Synchronized Moving Image
Jo, C W
Changwon National University, KOREA

We4R9
Effects of Using Speech in Timetable Information Systems for WWW
Qvarfordt, P Jönsson, A
Linkoping University, SWEDEN

We4R10
The Interactive Systems Labs View4You video indexing system
Kemp, T Geutner, P Schmidt, M Tomaz, B Weber, M Westphal, M
Waibel, A
University of Karlsruhe, GERMANY

We4R11
SEMOLE: A Robust Framework for Gathering Information from the World Wide Web
Kim, H J Hetherington, L
MIT Spoken Language Systems, USA

We4R12
Enhancing a WIMP Based Interface with Speech, Gaze Tracking and Agents
Bakman, L Blidegn, M Wittrup, M Larsen, L B Moeslund, T B
Aalborg University, DENMARK

We4R13
Now You Hear It, Now You Don't: Empirical Studies of Audio Browsing Behavior
Nakatani, C Whittaker, S Hirschberg, J
AT & T Labs - Research, USA

We4R14
A Voice Verifier for Face/Voice Based Person Verification System
Qiao, R-Y Choi, Y Agbinya, J I
CSIRO, Australia, AUSTRALIA

We4R15
On the use of automatic speech recognition for TV captioning
Robert-Ribes, J
CSIRO-MIS, AUSTRALIA

We4R16
An Undergraduate Course on Speech Recognition Based on the CSLU Toolkit
Serridge, B
Universidad De Las Americas, MEXICO

We4R17
Real time voice alteration based on Linear Prediction
Yang, P-F Stylianou, Y
AT&T Labs - Research, USA

We4R18
Evaluation and Implementation of a Voice-Activated Dialing System with Utterance Verification
Tan, B T Gu, Y Thomas, T
Vocalis Ltd, UNITED KINGDOM

We4R19
Towards A Mandarin Voice Memo System
Wang, H-M Lin, B-S Chen, B Bai, B-R
Academia Sinica, TAIWAN R.O.C.


We5A - Large Vocabulary Continuous Speech Recognition 2

Time & Place: 1530 - 1700, Antechinus Room
Chair: Gary Cook

1530
We5A1

Grammatical word graph re-generation for spontaneous speech
recognition

Tsukada, H Yamamoto, H Takezawa, T Sagisaka, Y
ATR Interpreting Telecommunications Research Laboratories, JAPAN

1545
We5A2

Compression Algorithm of Trigram Language Models based on Maximum Likelihood Estimation
Yodo, N Shikano, K Nakamura, S
Nara Institute of Science and Technology, JAPAN

1600
We5A3

Morphological Modeling of Word Classes for Langauge Models
Uebler, U Niemann, H
Bavarian Research Center for Knowledge Based Systems (FORWISS), GERMANY

1615
We5A4

A Comparative Study Between Polyclass and Multiclass Language
Models

Zitouni, I Smaili, K Haton, J-P Deligne, S Bimbot, F
LORIA/INRIA Lorraine, FRANCE

1630
We5A5

Log-Linear Interpolation of Language Models
Klakow, D
Philips Research Laboratories, GERMANY

1645
We5A6

The Applicability of Adaptive Language Modelling for the Broadcast News Task
Clarkson, P R Robinson, A J
Cambridge University, UNITED KINGDOM


We5B - Text-To-Speech Synthesis 3

Time & Place: 1530 - 1700, Bandicoot Room
Chair: Nick Campbell

1530
We5B1

The IBM Trainable Speech Synthesis System
Donovan, R E Eide, E M
IBM, USA

1545
We5B2

ProSynth: An Integrated Prosodic Approach to Device-Independent, Natural-Sounding Speech Synthesis
Hawkins, S House, J Huckvale, M Local, J K Ogden, R
University of Cambridge, UNITED KINGDOM

1600
We5B3

Total Quality Evaluation of Speech Synthesis Systems
Zhang, J Dong, S Yu, G
Institute of Acoustics, P.R. CHINA

1615
We5B4

Comparative Evaluation of Synthetic Prosody with the PURR Method
Sonntag, G P Portele, T
IKP, University of Bonn, GERMANY

1630
We5B5

SABLE: A Standard for TTS Markup
Sproat, R Hunt, A Ostendorf, M Taylor, P Black, A Lenzo, K
Bell Laboratories / Lucent Technologies, USA

1645
We5B6

Prosodic vs. segmental contributions to naturalness in a diphone synthesizer
Bunnell, H T Hoskins, S R Yarrington, D M
DuPont Hospital for Children / University of Delaware, USA


We5C - Language Acquisition 1

Time & Place: 1530 - 1700, Currawong Room
Chair: Jean Vroomen

1530
We5C1

Non-native productions of Japanese single stops that are too long for one mora unit
Minagawa-Kawai, Y Kiritani, S
University of Tokyo, JAPAN

1545
We5C2

The Process of Generation and Development of Second Language Japanese Accentuation
Yamada, N
Ibaraki University, JAPAN

1600
We5C3

Perceptual properties of Russians with Japanese fricatives
Funatsu, S Kiritani, S
Hiroshima Women's University, JAPAN

1615
We5C4

Assessment of Dutch pronunciation by means of automatic speech recognition technology
Cucchiarini, C de Wet, F Strik, H Boves, L
University of Nijmegen, THE NETHERLANDS

1630
We5C5

Phonetic-level Mispronunciation Detection in non-native Swedish Speech
Langlais, P Öster, A-M Granström, B
Center for Speech Technology, TMH-KTH, SWEDEN

1645
We5C6

Computer-based second language production training by using spectrographic representation and HMM-based speech recognition scores
Akahane-Yamada, R McDermott, E Adachi, T Kawahara, H Pruitt, J S
ATR Human Information Processing Research Laboratories, JAPAN


We5D - Acoustic Phonetics 1

Time & Place: 1530 - 1700, Dingo Room
Chair: Louis Pols

1530
We5D1

Assimilation of place in Japanese and Dutch
Cutler, A Otake, T
Max-Planck-Institute for Psycholinguistics, NETHERLANDS

1545
We5D2

Prosodic Constraint on V-to-V Coarticulation in Japanese
Kondo, Y Arai, Y
Musashino Art University, JAPAN

1600
We5D3

Postvocalic /r/-deletion in standard Dutch: how experimental phonology can profit from ASR technology
Cucchiarini, C van den Heuvel, H
University of Nijmegen, THE NETHERLANDS

1615
We5D4

More evidence for the perceptual basis of sound change? Suprasegmental effects in the development of distinctive nasalisation
Hajek, J Watson, I
University of Melbourne, AUSTRALIA

1630
We5D5

Speech production of vowel sequences using a physiological articulatory model
Dang, J Honda, K
ATR Human Information Processing Research Labs,, JAPAN


We6A - Speaker Adaptation 2

Time & Place: 1700 - 1800, Antechinus Room
Chair: Frank Soong

1700
We6A1

Eigenvoices for Speaker Adaptation
Kuhn, R Nguyen, P Junqua, J-C Goldwasser, L Niedzielski, N Fincke, S Field, K Contolini, M
Panasonic Technologies, Inc., USA

1715
We6A2

Speaker Clustering Using Direct Maximisation of the MLLR-Adapted Likelihood
Johnson, S E Woodland, P C
Cambridge University, UNITED KINGDOM

1730
We6A3

Incremental On-line Speaker Adaptation in Adverse Conditions
Viikki, O Laurila, K
Nokia Research Center, FINLAND

1745
We6A4

Cluster Adaptive Training for Speech Recognition
Gales, M
IBM Almaden Research Center, USA


We6B - Speech Coding 2

Time & Place: 1700 - 1800, Bandicoot Room
Chair: Barry Cheetham

1700
We6B1

Towards a unified model for low bit-rate speech coding using a recognition-synthesis approach
Holmes, W J
Defence Evaluation and Research Agency, UNITED KINGDOM

1715
We6B2

On the Significance of Temporal Masking in Speech Coding
Skoglund, J Kleijn, W B
Royal Insitute of Technology, SWEDEN

1730
We6B3

Waveform Interpolation Coding with Pitch-Spaced Subbands
Kleijn, W B Yang, H Deprettere, E F
Royal Insitute of Technology, SWEDEN

1745
We6B4

An improved Decomposition Method for WI using IIR Wavelet Filter Banks
Chong, N R Burnett, I S Chicharo, J F
University of Wollongong, AUSTRALIA


We6C - Hidden Markov Model Techniques 2

Time & Place: 1700 - 1800, Currawong Room
Chair: Jeff Bilmes

1700
We6C1

Real -Time Probabilistic Segmentation for Segment-Based Speech Recognition
Lee, S Glass, J
MIT, USA

1715
We6C2

Toward Markov random field modeling of speech
Gravier, G Sigelle, M Chollet, G
ENST/TSI and CNRS-URA 820, FRANCE

1730
We6C3

Hidden Markov Models for Trajectory Modeling
Iyer, R Gish, H Siu, M Zavaliagkos, G Matsoukas, S
GTE/BBN Technologies, USA


We6D - Multilingual Perception and Recognition 1

Time & Place: 1700 - 1800, Dingo Room
Chair: Valerie Hazan

1700
We6D1

Bilingual and Dialectal Adaptation and Retraining
Uebler, U Schuessler, M Niemann, H
Bavarian Research Center for Knowledge Based Systems (FORWISS), GERMANY

1715
We6D2

Language Independent and Language Adaptive Large Vocabulary Speech Recognition
Schultz, T Waibel, A
Interactive Systems Laboratories, GERMANY

1730
We6D3

A method for measuring the intelligibility and nonnativeness of phone quality in foreign language pronunciation training
Kawai, G Hirose, K
University of Tokyo, JAPAN



Thursday 3 December, 1998


Th1A - Large Vocabulary Continuous Speech Recognition 3

Time & Place: 0900 - 1030, Antechinus Room
Chair: Michael Wagner

0900
Th1A1

The BBN Single-Phonetic-Tree Fast-Match Algorithm
Nguyen, L Schwartz, R
BBN Technologies, USA

0915
Th1A2

An Efficient Two-pass Search Algorithm using Word Trellis Index
Lee, A Kawahara, T Shuji, D
Kyoto University, JAPAN

0930
Th1A3

Nozomi - a fast, memory-efficient stack decoder for LVCSR
Schuster, M
ATR Interpreting Telecommunications Research Laboratories, JAPAN

0945
Th1A4

Reducing the OOV Rate in Broadcast News Speech Recognition
Kemp, T Waibel, A
University of Karlsruhe, GERMANY

1000
Th1A5

Using Automatic-Derived Acoustic Sub-Word Units in Large Vocabulary Speech Recognition
Bacchiani, M Ostendorf, M
Boston University, USA

1015
Th1A6

Fabricating conversational speech data with acoustic models: a program to examine model-data mismatch
McAllaster, D Gillick, L Scattone, F Newman, M
Dragon Systems Inc, USA


Th1B - Articulatory Modelling 3

Time & Place: 0900 - 1030, Bandicoot Room
Chair: Philip Hoole

0900
Th1B1

An electropalatographic, kinematic, and acoustic analysis of supralaryngeal correlates of word-level prominence contrasts in English
Harrington, J Beckman, M E Fletcher, J Palethorpe, S
Macquarie University, AUSTRALIA

0915
Th1B2

Consistencies and inconsistencies between EPG and locus equation data on coarticulation
Tabain, M
Macquarie University, AUSTRALIA

0930
Th1B3

Synergy Between Jaw and Lips/ Tongue Movements : Consequences in Articulatory Modelling
Bailly, G Badin, P Vilain, A
Institut Communication Parlee, FRANCE

0945
Th1B4

Modelling tongue configuration in German vowel production
Hoole, P
Munich University, GERMANY

1000
Th1B5

Optopalatograph: Real-time feedback of tongue movement in 3D
Wrench, A McIntosh, A D Watson, C Hardcastle, W J
Queen Margaret College, UNITED KINGDOM

1015
Th1B6

Effects of contrastive focal accent on linguopalatal articulation and coarticulation in the French [kskl] Cluster
Meynadier, Y Pitermann, M Marchal, A
Institut De Phonetique - Laboratoire Parole Et Langage, CNRS, FRANCE


Th1C - Language Acquisition 2

Time & Place: 0900 - 1030, Currawong Room
Chair: Shigeru Kiritani

0900
Th1C1

Spoken Word Identification by Native and Nonnative Speakers of English: Effects of training, modality, context and phonetic environment
Hardison, D M
University of California, Davis, USA

0915
Th1C2

The Effect of Background Knowledge on First and Second Language Comprehension Difficulty
Tyler, M D
University of New South Wales, AUSTRALIA

0930
Th1C3

Comparsion of Cross-language Coarticulation: English, Japanese and Japanese-accented English
Tsukada, K
Macquarie University, AUSTRALIA

0945
Th1C4

Plasticity of non-native phonetic perception and production: A training study
Imaizumi, S Itoh, H Tamekawa, Y Deguchi, T Mori, K
University of Tokyo, JAPAN

1000
Th1C5

The Relation Between Perceptual and Production Categories in Acquisition
Watson, I
University of Oxford, UNITED KINGDOM

1015
Th1C6

The development of perceptual cue-weighting in children aged 6 to 12
Hazan, V Barrett, S
Department of Phonetics and Linguistics, UCL, UNITED KINGDOM


Th1D - Speaker and Language Recognition 3

Time & Place: 0900 - 1030, Dingo Room
Chair: Ara Samoulian

0900
Th1D1

Robust speaker verification insensitive to session-dependent utterance variation and handset-dependent distortion
Matsui, T Aikawa, K
NTT Human Interface Labs, JAPAN

0915
Th1D2

A Comparative Evaluation of Variance Flooring Techniques in HMM-based Speaker Verification
Melin, H Koolwaaij, J Lindberg, J Bimbot, F
KTH, SWEDEN

0930
Th1D3

Text-Independent Speaker Verification Using Automatically Labelled Acoustic Segments
Petrovska Delacretaz, D Cernocky, J Henneber, J Chollet, G
Institute of Radioelectronics, CZECH REPUBLIC

0945
Th1D4

A Fast Decoding Algorithm Based on Sequential Detection of the Changes in Distribution
Li, Q
Bell Labs, Lucent Technologies, USA

1000
Th1D5

Speaker verification with ensemble classifiers based on linear speech transforms
Olsen, J
Aalborg University, DENMARK

1015
Th1D6

Speaker recognition based on discriminative projection models
Olsen, J
Aalborg University, DENMARK


Th1P - Text-To-Speech Synthesis 4

Time & Place: 0900 - 1230, Platypus Room
Chair: Robert Mannell

Th1P1
A Mixed-Excitation Frequency Domain Model for Time-Scale Pitch-Scale Modification of Speech
Acero, A
Microsoft Corporation, USA

Th1P2
Analytic Generation of Synthesis Units by Closed Loop Training for Totally Speaker Driven Text to Speech System (TOS Drive TTS)
Akamine, M Kagoshima, T
Toshiba Kansai Laboratory, JAPAN

Th1P3
Modeling the Microprosody of Pitch and Loudness for Speech Synthesis with Neural Networks
Vainio, M Altosaar, T
Helsinki University of Technology, FINLAND

Th1P4
Spectral Smoothing for Concatenative Speech Synthesis
Chappell, D T Hansen, J H L H
Robust Speech Processing Laboratory, USA

Th1P5
Mimic: A Voice-Adaptive Phonetic-Tree Speech Synthesiser
Chen, A Vaseghi, S Ho, C
Queen's University of Belfast, UNITED KINGDOM

Th1P6
Automatic Generation of Korean Pronunciation Variants by Multistage Applications of Phonological Rules
Jeon, J Cha, S Chung, M Park, J Hwang, K
Sogang University, KOREA

Th1P7
Techniques for accurate automatic annotation of speech waveforms
Cox, S Brady, R Jackson, P
University of East Anglia, UNITED KINGDOM

Th1P8
Optimized Stopping Criteria for Tree-Based Unit Selection in Concatenative Synthesis
Cronk, A Macon, M
Oregon Graduate Institute, USA

Th1P9
Automatic transcription of intonation using an identified prosodic alphabet
de Tournemire, S
France Telecom, CNET, FRANCE

Th1P10
Frequency Analysis of Phonetic Units for Concatenative Synthesis in Catalan
Esquerra, I Febrer, A Nadeu, C
Polytechnical University of Catalonia, SPAIN

Th1P11
Investigating the Syntactic Characteristics of English Tone Units
Fang, A C House, J Huckvale, M
University College London, UNITED KINGDOM

Th1P12
The UPC Text-to-Speech System for Spanish and Catalan
Bonafonte, A Esquerra, I Febrer, A Fonollosa, J A R Vallverdu, F
Polytechnical University of Catalonia, SPAIN

Th1P13
The New Version of the ROMVOX Text-To-Speech Synthesis System Based on a Hybrid Time Domain-LPC Synthesis Technique
Ferencz, A Nagy, I Kovacs, T C S Ferencz, M Ratiu, T
Software ITC, ROMANIA

Th1P14
An F0 Contour Control Model for Totally Speaker Driven Text to Speech System
Kagoshima, T Morita, M Seto, S Akamine, M
Toshiba Corporation, JAPAN

Th1P15
On the Relationship of Speech Rates with Prosodic Units in Dialogue Speech
Hirose, K Kawanami, H
University of Tokyo, JAPAN

Th1P16
On the Reduction of Concatenation Artefacts in Diphone Synthesis
Klabbers, E Veldhuis, R
IPO, THE NETHERLANDS

Th1P17
Error Analysis and Confidence Measure of Chinese Word Segmentation
Kuo, C-C Ma, K-Y
Industrial Technology Research Institute, TAIWAN R.O.C.

Th1P18
Energy Contour Generation for a Sentence Using a Neural Network Learning Method
Lee, J Kang, D Kim, S Sung, K-M
ETRI, KOREA

Th1P19
A computational algorithm for F0 contour generation in Korean developed with prosodically labeled databases using K-ToBI system
Lee, Y-J Lee, S-H Kim, J-J Ho, H-J Kim, Y-I Kim, S-H Lee, J
Wonkwang University, KOREA

Th1P20
Rapid-Deployment Text-to-Speech in the DIPLOMAT System
Lenzo, K Hogan, C Allen, J
Carnegie Mellon University, USA

Th1P21
Formant diphone parameter extraction utilising a labelled single-speaker database
Mannell, R
Macquarie University, AUSTRALIA

Th1P22
A New Synthetic Speech/Sound Control Language
Mizuno, O Nakajima, S
NTT Human Interface Labs, JAPAN

Th1P23
A study on the natural-sounding Japanese phonetic word synthesis by using the VCV-balanced word database that consists of the words uttered forcibly in two types of pitch accent
Mochizuki, R Arai, Y Honda, T
AVC Research Lab., CED, Matsushita Communication Industrial, JAPAN

Th1P24
Letter to Sound Rules for Accented Lexicon Compression
Pagel, V Lenzo, K Black, A W
Facultie Polytechnique of Mons, BELGIUM

Th1P25
A Name Announcement Algorithm With Memory Size and Computational Power Constraints
Roth, Z Rosenhouse, J
DSP Semiconductors Ltd., ISRAEL

Th1P26
How a French TTS System Can Describe Loanwords
Sannier, F Belrhali, R Auberge, V
ICP - ESA 5009 CNRS, FRANCE

Th1P27
Improvements in Slovene Text-to-Speech Synthesis
Sef, T Dobnikar, A Gams, M
Jozef Stefan Institute, SLOVENIA

Th1P28
Automatic Rule Generation for Linguistic Features Analysis Using Inductive Learning Technique - Linguistic Features Analysis in TOS Drive TTS System
Seto, S Morita, M Kagoshima, T Akamine, M
Toshiba Corporation, JAPAN

Th1P29
Segmental Duration Control Based on an Articulatory Model
Shiga, Y Matsuura, H Nitta, T
Toshiba Corporation, JAPAN

Th1P30
Text Analysis for the Bell Labs French Text-to-Speech System
Tzoukermann, E
Bell Labs - Lucent Technologies, USA

Th1P31
Modeling vowel duration for Japanese text-to-speech synthesis
Venditti, J J van Santen, J P H
Ohio State University / Bell Labs, USA

Th1P32
Towards a Chinese Text-to-Speech System with Higher Naturalness
Wang, R-H Liu, Q Teng, Y Xia, D
University of Science & Technology of China, P.R. CHINA


Th1Q - Spoken Language Understanding Systems 4

Time & Place: 0900 - 1230, Quokka Room
Chair: Robert Dale

Th1Q1
Grammar Fragment Acquisition using Syntactic and Semantic Clustering
Arai, K Wright, J Riccardi, G Gorin, A
NTT Human Interface Laboratories, JAPAN

Th1Q2
Non-Expert Access to Unification-based Speech Understanding
Brøndsted, T
Aalborg University, DENMARK

Th1Q3
Natural Language Call Routing: A Robust, Self-Organizing Approach
Carpenter, B Chu-Carroll, J
Lucent Technologies - Bell Labs, USA

Th1Q4
Automatic Grammar Induction from Semantic Parsing
Ghosh, D Goddeau, D
Nuance Communications, USA

Th1Q5
BTH: An efficient parsing algorithm for word-spotting
Kono, Y Yano, T Sasajima, M
Toshiba Corporation, JAPAN

Th1Q6
Syntax Coordination: Interaction of Discourse and Extrapositions
Kronenberg, S C Kummert, F
University of Bielefeld, GERMANY

Th1Q7
Hierarchical Tag-Graph Search for Spontaneous Speech Understanding in Spoken Dialog Systems
Lin, B-S Chen, B Wang, H-M Lee, L-S
National Taiwan University, TAIWAN R.O.C.

Th1Q8
Extraction of the dialog act and the topic from utterances in a spoken dialog system
Niimi, Y Takinaga, N Nishimoto, T
Kyoto Institute of Technology, JAPAN

Th1Q9
Fast Computation of Maximum Entropy / Minimum Divergence Feature Gain
Printz, H
Carnegie Mellon University, USA

Th1Q10
Stochastic Language Models for Speech Recognition and Understanding
Riccardi, G Gorin, A L
AT&T Labs-Research, USA

Th1Q11
Linguistically Engineered Tools for Speech Recognition Error Analysis
Van Ess-Dykema, C Ries, K
U.S. Department of Defense, USA

Th1Q12
Estimating Entropy of a Language from Optimal Word Insertion Penalty
Takeda, K Ogawa, A Itakura, F
Nagoya University, JAPAN

Th1Q13
A Linguistic Analysis of Repair Signals in Co-operative Spoken Dialogues
Tseng, S-C
University of Bielefeld, GERMANY

Th1Q14
A Hierarchical Language Model for CSR
Valverde-Albacete, F J Pardo, J M
Universidad Carlos III De Madrid, SPAIN

Th1Q15
Spoken Language Understanding Within Dialogues Using a Graphical Model of Task Structure
Wright, J H Gorin, A L Abella, A
AT&T Labs - Research, USA

Th1Q16
Keyword Extraction of Radio News using Domain Identification based on Categories of an Encyclopedia
Suzuki, Y Fukumoto, F Sekiguchi, Y
Yamanashi University, JAPAN


Th1R - Human Speech Perception 2

Time & Place: 0900 - 1230, Rosella Room
Chair: Kerrie Lee

Th1R1
Fundamental frequency flunctuation in continuous vowel utterance and its perception
Akagi, M Iwaki, M Minakawa, T
Japan Advanced Institute of Science and Technology, JAPAN

Th1R2
Estimation of Mental Lexicon Size with Word Familiarity Database
Amano, S Kondo, T
NTT Basic Research Laboratories, JAPAN

Th1R3
Vowel Quality in Spontaneous Speech: What Makes a Good Vowel?
Aylett, M P Turk, A
University of Edinburgh, UNITED KINGDOM

Th1R4
Cooperation and Competition of Burst and Formant Transitions for the Perception and the Identification of French Stops
Neagu, A Bailly, G
Institut Communication Parlee, FRANCE

Th1R5
The effect of modifying formant amplitudes on the perception of French vowels generated by copy synthesis
Bonneau, A Laprie, Y
Loria & CNRS, FRANCE

Th1R6
Segmental and tonal processing in Cantonese
Chen, H-C Yip, M Wong, S-Y
Chinese University of Hong Kong, HONG KONG

Th1R7
Phonological similarity effects in Cantonese spoken-word processing
Yip, M Leung, P-Y Chen, H-C
Chinese University of Hong Kong, HONG KONG

Th1R8
On the learnability of the voicing contrast for initial stops
Damper, R I Gunn, S R
University of Southampton, UNITED KINGDOM

Th1R9
Acoustic and Perceptual Characteristics of Italian Stop Consonants
Cerrato, L Falcone, M
Fondazione Ugo Bordoni, ITALY

Th1R10
Acoustic cues for the auditory identification of the Spanish fricative /f/
Fernandez, S Feijoo, S Balsa, R Barros, N
Universidad de Santiago, SPAIN

Th1R11
Recognition of vowels in fricative context
Fernandez, S Feijoo, S Balsa, R Barros, N
Universidad De Santiago, SPAIN

Th1R12
Voicing affects perceived manner of articulation
Fernandez, S Feijoo, S Almeida, P
Universidad De Santiago, SPAIN

Th1R13
Enhancement techniques to improve the intelligibility of consonants in noise: Speaker and listener effects

Hazan, V Simpson, A Huckvale, M
Department of Phonetics and Linguistics, UCL, UNITED KINGDOM

Th1R14
Boundaries of Perception of Long Tones in Taiwanese Speech
Jian F
University of Reading, UNITED KINGDOM

Th1R15
Effects of phonetic quality and duration on perceptual acceptability of temporal changes in speech
Kato, H Tsuzaki, M Sagisaka, Y
ATR Human Information Processing Research Labs, JAPAN

Th1R16
Dynamic vs. Static Spectral Detail in the Perception of Gated Stops
Kiefte, M Neary, T M
University of Alberta, CANADA

Th1R17
Phonological Units in Speech Segmentation and Phonological Awareness
Otake, T Yoneyama, K
Dokkyo University, JAPAN

Th1R18
How Far Do Speakers Back Up in Repairs? A Quantitative Model
Shriberg, E Stolcke, A
SRI International, USA

Th1R19
Don’t blame it (all) on the pause: Further ERP evidence for a prosody-induced garden-path in running speech
Steinhauer, K Alter, K Friederici, A D
Max-Planck-Institute of Cognitive Neuroscience, GERMANY

Th1R20
The role of stress for lexical selection in Dutch
Vroomen, J de Gelder, B
University of Tilburg, THE NETHERLANDS

Th1R21
The perception of stressed syllables in Finnish
Tuomainen, J Vroomen, J de Gelder, B
University of Tilburg, THE NETHERLANDS

Th1R22
The perception of the morae with devocalized vowels in Japanese language
Yamakawa, K Baba, R
Prefectural University of Kumamoto, JAPAN


Th2A - Large Vocabulary Continuous Speech Recognition 4

Time & Place: 1100 - 1230, Antechinus Room
Chair: Richard Stern

1100
Th2A1

High Resolution Decision Tree based Acoustic Modeling beyond CART
Chou, W Reichl, W
Bell Labs, Lucent Technologies, USA

1115
Th2A2

Unsupervised Training of a Speech Recognizer Using TV Broadcasts
Kemp, T Waibel, A
University of Karlsruhe, GERMANY

1130
Th2A3

A New Method to Achieve Fast Acoustic Matching for Speech Recognition
Lee, C Z O'Shaughnessy, D
INRS-Telecommunications, CANADA

1145
Th2A4

Improved Parameter Tying for Efficient Acoustic Model Evaluation in Large Vocabulary Continuous Speech Recognition
Duchateau, J Demuynck, K Van Compernolle, D Wambacq, P
Kath. Univ. Leuven - ESAT/PSI, BELGIUM

1200
Th2A5

A New Look at HMM Parameter Tying for Large Vocabulary Speech Recognition
Sankar, A
SRI International, USA

1215
Th2A6

Factor Analysis Invariant to Linear Transformations of Data
Gopinath, R A Ramabhadran, B Dharanipragada, S
IBM T. J. Watson Research Center, USA


Th2B - Spoken Language Understanding Systems 2

Time & Place: 1100 - 1230, Bandicoot Room
Chair: Sandra Williams

1100
Th2B1

Automatic Ambiguity Detection
Sproat, R van Santen, J
Bell Laboratories / Lucent Technologies, USA

1115
Th2B2

Empowering Knowledge Based Speech Understanding through Statistics
Fischer, J Haas, J Noth, E Niemann, H Deinzer, P
University of Erlangen, GERMANY

1130
Th2B3

Concept-driven speech understanding incorporated with a statistic language model
Nagai, A Ishikawa, Y
Mitsubishi Electric Corporation, JAPAN

1145
Th2B4

On the Limitations of Stochastic Conceptual Finite-State Language Models for Speech Understanding
Colas, J Ferreiros, J Montero, J M Pastor, J Gallardo, A Pardo, J M
GTH-IEL-UPM, SPAIN

1200
Th2B5

Towards Speech Understanding across multiple languages
Ward, T Roukos, S Neti, C Gros, J Epstein, M Dharanipragada, S
IBM T.J. Watson Research Center, USA

1215
Th2B6

Automatic Detection of Sentence Boundaries and Disfluencies Based on Recognized Words
Stolcke, A Shriberg, E Bates, R Ostendorf, M Hakkani, D Plauche, M Tur, G Lu, Y
SRI International, USA


Th2C - Signal Processing and Speech Analysis 3

Time & Place: 1100 - 1230, Currawong Room
Chair: Wolfgang Hess

1100
Th2C1

Determination of articulator positions from speech acoustics by applying dynamic articulatory constraints
Suzuki, S Okadome, T Honda, M
NTT Basic Research Laboratories, JAPAN

1115
Th2C2

Recognizing Emotions in Speech Using Short-term and Long-term
Features

Li, Y Zhao, Y
University of Illinois, USA

1130
Th2C3

PeriphEar : A Nonlinear Active Model of the Auditory Periphery
Robert, A R Eriksson, J L E
CIRC, EPFL, SWITZERLAND

1145
Th2C4

The Voicing Feature for Stop Consonants: Acoustic Phonetic Analyses and Automatic Speech Recognition Experiments
Ramesh, P Niyogi, P
Lucent Technologies, USA

1200
Th2C5

Wavelet-based Energy Binning Cepstral Features for Automatic Speech Recognition
Basu, S Maes, S
IBM T.J. Watson Research Center, USA

1215
Th2C6

Articulatory Analysis using a Codebook for Articulatory based Low Bit-Rate Speech Coding
Silva, C Chennoukh, S
Universidade Do Minho, PORTUGAL


Th2D - Human Speech Perception 3

Time & Place: 1100 - 1215, Dingo Room
Chair: Philip Rubin

1100
Th2D1

Categorical perception: Important phenomenon or lasting myth?
Massaro, D W
University of California, Santa Cruz, USA

1115
Th2D2

Categorical Perception of Vowels
Gerrits, E Schouten, B
Utrecht University, THE NETHERLANDS

1130
Th2D3

Suprasegmental Cues for the Segmentation of Identical Vowel Sequences in Japanese
Kakehi, K Hirose, Y
Nagoya University, JAPAN

1145
Th2D4

Perception of concurrent approximant-vowel syllables
Ainsworth, W A
Keele University, UNITED KINGDOM

1200
Th2D5

Perceived Swedish Vowel Quantity: Effects of Postvocalic Consonant Duration
Behne, D M Czigler, P E Sullivan, K P H
Norwegian University of Science & Technology, SWEDEN


Th4A - Speaker Adaptation 3

Time & Place: 1330 - 1500, Antechinus Room
Chair: Jean-Claude Junqua

1330
Th4A1

On-line hierarchical transformation of hidden Markov models for speaker adaptation
Chien, J-T
National Cheng Kung University, TAIWAN R.O.C.

1345
Th4A2

High-Speed Speaker Adaptation Using Phoneme-Dependent Tree-Structured Speaker Clustering
Suzuki, M Abe, T Mori, H Makino, S Aso, H
Tohoku University, JAPAN

1400
Th4A3

The Use of Confidence Measures in Unsupervised Adaptation of Speech Recognizers
Anastasakos, T Balakrishnan, S
Motorola, Lexicus Division, USA

1415
Th4A4

Speaker Normalization with All-Pass Transforms
McDonough, J Byrne, W Luo, X
Johns Hopkins University, USA

1430
Th4A5

The CHAM model of hyperarticulate adaptation during human-computer error resolution
Oviatt, S L
Oregon Graduate Institute of Science & Technology, USA


Th4B - Spoken Language Understanding Systems 3

Time & Place: 1330 - 1500, Bandicoot Room
Chair: Richard Sproat

1330
Th4B1

Language Modeling for Content Extraction in Human-Computer
Dialogues

Reichl, W Carpenter, R Chu-Carroll, J Chou, W
Bell Labs - Lucent Technologies, USA

1345
Th4B2

A Language Model Combining Trigrams and Stochastic Context-Free Grammars
Gillett, J Ward, W
Carnegie Mellon University, USA

1400
Th4B3

Online Adaptation of Language Models in Spoken Dialogue Systems
Souvignier, B Kellner, A
Philips Research Laboratories, GERMANY

1415
Th4B4

Language Model Adaptation for Spoken Language Systems
Riccardi, G Potamianos, A Narayanan, S
AT&T Labs-Research, USA

1430
Th4B5

Detecting Topic Shifts Using a Cache Memory
Bigi, B De Mori, R El-Beze, M Spriet, T
University of Avignon, FRANCE

1445
Th4B6

A Discourse Coding Scheme for Conversational Spanish
Levin, L Thyme-Gobbel, A Lavie, A Ries, K Zechner, K
Carnegie Mellon University, USA


Th4C - Multimodal Spoken Language Processing 3

Time & Place: 1330 - 1500, Currawong Room
Chair: Eric Vatikiotis-Bateson

1330
Th4C1

Referential Features and Linguistic Indirection in Multimodal Language
Oviatt, S L Kuhn, K
Oregon Graduate Institute of Science & Technology, USA

1345
Th4C2

Multimodal Language Processing
Johnston, M
Oregon Graduate Institute, USA

1400
Th4C3

Implementation of Coordinative Nodding Behavior on Spoken Dialogue Systems
Hirasawa, J-I Miyazaki, N Nakano, M Kawabata, T
Nippon Telegraph & Telephone, JAPAN

1415
Th4C4

Use of Non-verbal Information in Communication between Human and Robot
Yokoyama, M Aoyama, K Kikuchi, H Shirai, K
Waseda University, JAPAN

1430
Th4C5

What you see is (almost) what you hear: Design principles for user interfaces for accessing speech archives
Whittaker, S Choi, J Hirschberg, J Nakatani, C
AT & T Labs - Research, USA


Th4D - Acoustic Phonetics 2

Time & Place: 1330 - 1500, Dingo Room
Chair: John Hajek

1330
Th4D1

Regional variations in the vowels of female adolescents from Sydney
Cox, F Palethorpe, S
Macquarie University, AUSTRALIA

1345
Th4D2

A kinematic analysis of New Zealand and Australian English vowel spaces
Watson, C Harrington, J Palethorpe, S
Macquarie University, AUSTRALIA

1400
Th4D3

Syllable-onset acoustic properties associated with syllable-coda voicing
Nguyen, N Hawkins, S
University of Geneva, SWITZERLAND

1415
Th4D4

Articulatory, acoustic and perceptual aspects of fricative/stop coarticulation
Nguyen, N Wrench, A Gibbon, F Hardcastle, W
University of Geneva, SWITZERLAND

1430
Th4D5

Efficiency as an organizing principle of natural speech
Van Son, R J J H Koopmans-van Beinum, F J Pols, L C W
University of Amsterdam, THE NETHERLANDS

1445
Th4D6

Within-speaker variability due to speaking manners
Karlsson, I Banziger, T Dankovicová, J Johnstone, T Lindberg, J Melin, H Nolan, F Scherer, K
KTH, SWEDEN


Th4P - Large Vocabulary Continuous Speech Recognition 5

Time & Place: 1330 - 1700, Platypus Room
Chair: Herve Bourlard

Th4P1
A Thesaurus-based Statistical Language Model for Broadcast New Transcription
Ando, A Kobayashi, A Imai, T
NHK Science & Technical Research Laboratories, JAPAN

Th4P2
Effect of Task Complexity on Search Strategies for the Motorola Lexicus Continuous Speech Recognition System
Balakrishnan, S V
Motorola Lexicus Division, USA

Th4P3
New Features for Confidence Annotation
Bansal, D Ravishankar, M
Carnegie Mellon University, USA

Th4P4
Multi-Span Statistical Language Modeling for Large Vocabulary Speech Recognition
Bellegarda, J R
Apple Computer, USA

Th4P5
Maximum-Likelihood Updates of HMM Duration Parameters for Discriminative Continuous Speech Recognition
Chengalvarayan, R
Lucent Technologies, USA

Th4P6
Towards Better Integration of Semantic Predictors in Statistical Language Modeling
Coccaro, N Jurafsky, D
University of Colorado, USA

Th4P7
An Asymmetric Stochastic Language Model Based on Multi-Tagged Words
Pastor, J Colas, J San-Seguno, R Pardo, J
GTH-IEL-UPM, SPAIN

Th4P8
Product-Code Vector Quantization of Cepstral Parameters for Speech Recognition over the WWW
Digalakis, V Neumeyer, L Perakakis, M
Technical University of Crete, GREECE

Th4P9
Context Dependent Tree Based Transforms for Phonetic Speech Recognition
Doherty, B Vaseghi, S McCourt, P
Queen's University of Belfast, UNITED KINGDOM

Th4P10
Interfacing Acoustic Models with Natural Language Processing Systems
Johnson, M T Harper, M P Jamieson, L H
Purdue University, USA

Th4P11
Hierachical Cluster Language Modelling with Statistical Rule Extraction for Rescoring N-Best Hypotheses During Speech Decoding
Jang, P J Hauptmann, A
Carnegie Mellon University, USA

Th4P12
Dealing with out-of-vocabulary words and speech disfluencies in an N-gram based speech understanding system
Kai, A Hirose, Y Nakagawa, S
Toyohashi University of Technology, JAPAN

Th4P13
Source-Extended Language Model for Large Vocabulary Continuous Speech Recognition
Kobayashi, T Wada, Y Kobayashi, N
Waseda University, JAPAN

Th4P14
Time Dependent Language Model for Broadcast News Transcription and Its Post-Correction
Kobayashi, A Onoe, K Imai, T Ando, A
NHK Science & Technical Research Laboratories, JAPAN

Th4P15
Exploiting Transitions and Focussing on Linguistic Properties for ASR
Koreman, J Barry, W J Andreeva, B
University of the Saarland, GERMANY

Th4P16
A Unified Framework for Sublexical and Linguistic Modelling Supporting Flexible Vocabulary Speech Understanding
Lau, R Seneff, S
MIT, USA

Th4P17
A method for modeling liaison in a speech recognition system for French
Bahl,L De Gennaro, S De Souza, P Epstein, E Le Roux, J M Lewis, B Waast, C
IBM, USA

Th4P18
On Variable Sampling Frequencies in Speech Recognition
Liu, F-H Picheny, M
IBM Watson Research Center, USA

Th4P19
Pronunciation Modeling for Large Vocabulary Conversational Speech Recognition
Ma, K Zavaliagkos, G Iyer, R
GTE/BBN Technologies, USA

Th4P20
Time Shift Invariant Speech Recognition
Basu, S Ittycheriah, A Maes, S H
IBM T.J. Watson Research Center, USA

Th4P21
The Demiphone versus the Triphone in a Decision-tree State-tying Framework
Mariño, J B Pachès-Leal, P Nogueiras, A
Polytechnical University of Catalonia, SPAIN

Th4P22
Word Clustering for a Word Bi-gram Model
Mori, S Nishimura, M Itoh, N
Tokyo Research Laboratory, JAPAN

Th4P23
A Large Vocabulary Continuous Speech Recognition Hybrid System for the Portuguese Language
Neto, J Martins, C Almeida, L
INESC, PORTUGAL

Th4P24
Speech Recognition Performance on a New Voicemail Transcription Task
Padmanabhan, M Ramabhadran, B Basu, S
IBM T J Watson Research Center, USA

Th4P25
Grammatical and statistical word prediction system for Spanish integrated in an aid for people with disabilities
Palazuelos, S Aguilera, S Rodrigo, J Godino, J
ETSI Telecomunicación, SPAIN

Th4P26
Segmentation using a maximum entropy approach
Papineni, K Dharanipragada, S
IBM T.J. Watson Research Center, USA

Th4P27
Recognition Performance of a Large-Scale Dependency Grammar Language Model
Berger, A Printz, H
Carnegie Mellon University, USA

Th4P28
A Bootstrap Technique for Building Domain-Dependent Language Models
Ramaswamy, G N Printz, H Gopalakrishnan, P S
I.B.M. Research Center, USA

Th4P29
Estimation of the Probability Distributions of Stochastic Context-Free Grammars from the K-Best Derivations
Sanchez, J-A Benedi, J-M
Universidad Politecnica De Valencia, SPAIN

Th4P30
Robust HMM Estimation with Gaussian Merging-Splitting and Tied-Transform HMMs
Sankar, A
SRI International, USA

Th4P31
Nonlinear Interpolation of Topic Models for Language Model Adaptation
Seymore, K Chen, S Rosenfeld, R
Carnegie Mellon University, USA

Th4P32
Performance evaluation of word phrase and noun category language models for broadcast news speech recognition
Takagi, K Oguro, R Hashimoto, K Ozeki, K
The University of Electro-Communications, JAPAN

Th4P33
Robust Automatic Continuous-Speech Recognition Based on a Voiced-Unvoiced Decision
Tolba, H O'Shaughnessy, D
INRS-Telecommunications, CANADA

Th4P34
Double Tree Beam Search Using Hierarchical Subword Units
Torrecilla, J C Cortazar, I Hernandez, L
Telefonica I+D, SPAIN

Th4P35
Text Segmentation and Topic Tracking on Broadcast News Via a Hidden Markov Model Approach
van Mulbregt, P Carp, I Gillick, L Lowe, S Yamron, J
Dragon Systems, Inc, USA

Th4P36
Multi-Phone Strings as SubWord Units for Speech Recognition
O’Neill, P Vaseghi, S Doherty, B Tan, W McCourt, P
The Queen's University of Belfast

Th4P37
Phonetic modification of the Syllable / tu / in two spontaneous American English dialogues
Veilleux, N Shattuck-Hufnagel, S
Boston University Metropolitan College, USA

Th4P38
Efficient Lattice Representation and Generation
Weng, F Stolcke, A Sankar, A
SRI International, USA

Th4P39
Modeling pronunciation variation for a Dutch CSR: Testing three methods
Wester, M Kessens, J M Strik, H
University of Nijmegen, THE NETHERLANDS
Th4P40
Comparison of Language Modelling Techniques for Russian and English
Whittaker, E W D Woodland, P C
Cambridge University, UNITED KINGDOM

Th4P41
Optimized POS-Based Language Models for Large Vocabulary Speech Recognition
Witschel, P
Siemens AG ZT IK 5, GERMANY

Th4P42
Reducing Peak Search Effort Using Two-Tier Pruning
Wright, M Hovell, S Ringland, S
BT Laboratories, UNITED KINGDOM

Th4P43
Using Untranscribed Training Data to Improve Performance
Zavaliagkos, G Siu, M Colthurst, T Billa, J
BBN Technologies, USA

Th4P44
Telephone Band LVCSR for Hearing-Impaired Users
Jan, E Bakis, R Liu, F Picheny, M
IBM T.J. Watson Research Center, USA

Th4P45
Using X-gram for efficient speech recognition
Bonafonte, A Mariño, J B
Polytechnical University of Catalonia, SPAIN


Th4Q - Speech Coding 3

Time & Place: 1330 - 1700, Quokka Room
Chair: Alan Bradley

Th4Q1
A New Linear Predictive Method for Compression of Speech Signals
Alku, P Varho, S
University of Turku, FINLAND

Th4Q2
Hierarchical Temporal Decomposition: A Novel Approach to Efficient Compression of Spectral Characteristics of Speech
Ghaemmaghami, S Deriche, M Sridharan, S
Queensland University of Technology, AUSTRALIA

Th4Q3
Speech intelligibility testing for new technologies
Hura, S
Lucent Technologies, USA

Th4Q4
Efficient Quantization of LSF Parameters Based on Temporal Decomposition
Kim, S-J Lee, S-H Han, W-J Oh, Y-H
KAIST, KOREA

Th4Q5
A Sinusoidal Harmonic Vocoder at 1.2 kbps Using Auditory Perceptual Characteristics
Kohata, M
Chiba Institute of Technology, JAPAN

Th4Q6
A 16 kbit/s Wideband CELP Coder Using Mel-Generalized Cepstral Analysis and Its Subjective Evaluation
Koishida, K Hirabayashi, G Tokuda, K Kobayashi, T
University of California, Santa Barbara, USA

Th4Q7
Comparison of spectral estimation techniques for low bit-rate speech coding
Molyneux, D J Parris, C I Sun, X Q Cheetham, B M G
University of Manchester, UNITED KINGDOM

Th4Q8
Low Bit Rate Coding for Speech and Audio Using Mel Linear Predictive Coding (MLPC) analysis
Nakatoh, Y Norimatsu, T Low, A H Matsumoto, H
Matsushita Electric Industrial Co Ltd, JAPAN

Th4Q9
Comparison Study on VQ Codevector Index Assignment
Pan, J S Shieh, C S Chu, S C
National Kaohsiung Institute of Technology, TAIWAN R.O.C.

Th4Q10
Using linguistic knowledge to improve the design of low-bit rate LSF quantisation
Parry, J J Burnett, I S Chicharo, J F
University of Wollongong, AUSTRALIA

Th4Q11
Transform Coding of LSF Parameters Using Wavelets
Petrinovic, D
University of Zagreb, CROATIA

Th4Q12
Source Controlled Variable Bit-Rate Speech Coder Based on Waveform Interpolation
Plante, F Cheetham, B M G Marston, D Barrett, P A
Liverpool University, UNITED KINGDOM

Th4Q13
Improving Speaker Recognizability in Phonetic Vocoders
Ribeiro, C E M Trancoso, I M
INESC, PORTUGAL


Th4R - Language Acquisition 3 / Multilingual Perception and Recognition 2

Time & Place: 1330 - 1700, Rosella Room
Chair: Sudaporn Luksaneeyyanawin

Th4R1
Speech Perception and Spoken Language in Children with Impaired Hearing
Blamey, P Sarant, J Serry, T Wales, R James, C Barry, J Clark, G Wright, M Tooher, R Psarros, C Godwin, G Rennie, M Meskin, T
University of Melbourne, AUSTRALIA

Th4R2
Quantitative assessment of second language learners' fluency: An automatic approach
Cucchiarinim, C Strik, H Boves, L
University of Nijmegen, THE NETHERLANDS

Th4R3
Cross-Language Merged Speech Units and their Descriptive Phonetic Correlates
Dalsgaard, P Andersen, O Barry, W
Aalborg University, DENMARK

Th4R4
Crosslinguistic Disfluency Modelling: A Comparative Analysis of Swedish and American English Human-Human and Human-Machine Dialogues
Eklund, R Shriberg, E
Telia Research AB, SWEDEN

Th4R5
Calibration of machine scores for pronunciation grading
Franco, H Neumeyer, L
SRI International, USA

Th4R6
Phonetic Distance-Based Hypothesis Driven Lexical Adaptation for Transcribing Multilingual Broadcast News
Geutner, P Finke, M Waibel, A
Carnegie Mellon University, USA

Th4R7
Automatic Pronunciation Error Detection and Guidance for Foreign Language Learning
Jo, C-H Kawahara, T Doshita, S Dantsuji, M
Kyoto University, JAPAN

Th4R8
Lexical Access for Large-Vocabulary Speech Recognition
Leung, H Y Leung, H C
Chinese University of Hong Kong, HONG KONG

Th4R9
The effect of fundamental frequency on Mandarin speech recognition
Liu, S Doyle, S Morris, A Ehsani, F
Nuance Communications, USA

Th4R10
The perception of nativeness: Variable speakers and flexible listeners
Markham, D
Deakin University, AUSTRALIA

Th4R11
Voice Dictation in the Secondary School Classroom
McTear, M O'Hare, E
University of Ulster, UNITED KINGDOM

Th4R12
The importance of first syllable in English spoken word recognition by adult Japanese speakers
Nakayama, K Tomita-Nakayama, K
Yamagata University, JAPAN

Th4R13
Spoken L2 Teaching with Contrastive Visual and Auditory Feedback
Öster, A-M
KTH, SWEDEN

Th4R14
The role of phonological, morphological and orthographic knowledge in the intuitive syllabification of Dutch words: A longitudinal approach
Sandra, D Gillis, S
University of Antwerp, BELGIUM

Th4R15
The Acquisition of Japanese Compound Accent Rule
Shirose, A Kubozono, H Kiritani, S
University of Tokyo, JAPAN

Th4R16
The Acquisition of Putonghua Phonology
So, L K H Zhou, J
The University of Hong Kong, HONG

Th4R17
Enhancing Speech Processing of Japanese Learners of English Utilizing Time-Scale Expansion with Constant Pitch
Tomita-Nakayama, K Nakayama, K Misaki, M
Yamagata University, JAPAN

Th4R18
A Bootstrap Training Approach for Language Model Classifiers
Warnke, V Noeth, E Buckow, J Harbeck, S Niemann, H
University of Erlangen, GERMANY

Th4R19
Voice onset time patterns in 7-, 9- and 11-year old children
Whiteside, S P Marshall, J
University of Sheffield, UNITED KINGDOM

Th4R20
Some developmental patterns in the speech of 6-, 8- and 10-year old children: An acoustic phonetic study
Whiteside, S P Hodgson, C
University of Sheffield, UNITED KINGDOM

Th4R21
Language Development After Extreme Childhood Deprivation: A Case Study
Brown, L Locke, J L Jones, P Whiteside, S
University of Sheffield, UNITED KINGDOM

Th4R22
Phonological elements as a basis for language-independent ASR
Williams, G Terry, A M P Kaye, J D
SOAS, University of London, UNITED KINGDOM

Th4R23
A Phonetic and Acoustic Study of Babbling in an Italian Child
Zmarich, C Lanni, R
Instituto Di Fonetica E Dialettologia, CNR, ITALY

Th4R24
Rescoring Multiple Pronunciations Generated from Spelled Words
Kuhn, R Junqua, J-C Martzen, P
Panasonic Technologies, Inc., USA


Th5A - Segmentation, Labelling and Speech Corpora 3

Time & Place: 1530 - 1700, Antechinus Room
Chair: Toomas Altosaar

1530
Th5A1

A recursive algorithm for the forced alignment of very long audio segments
Moreno, P J Joerg, C Van Thong, J-M Glickman, O
Compaq (Cambridge Research Lab), USA

1545
Th5A2

The selection of pronunciation variants: comparing the performance of man and machine
Kessens, J M Wester, M Cucchiarini, C Strik, H
University of Nijmegen, THE NETHERLANDS

1600
Th5A3

Acoustic Confidence Measures for Segmenting Broadcast News
Barker, J Williams, G Renals, S
Sheffield University, UNITED KINGDOM

1615
Th5A4

A Duration-Based Confidence Measure for Automatic Segmentation of Noise Corrupted Speech
Pellom, B L Hansen, J H L
Duke University, USA

1630
Th5A5

Segmentation and Classification of Broadcast News Audio
Hain, T Woodland, P C
Cambridge University, UNITED KINGDOM

1645
Th5A6

Speaker Recruitment Methods and Speaker Coverage - Experiences From a Large Multilingual Speech Database Collection
Lindberg, B Comeyne, R Draxler, C Senia, F
Aalborg University, DENMARK


Th5B - Text-To-Speech Synthesis 5

Time & Place: 1530 - 1700, Bandicoot Room
Chair: Ron van Son

1530
Th5B1

A phonologically motivated method of selecting non-uniform units
Breen, A P Jackson, P
BT Labs, UNITED KINGDOM

1545
Th5B2

A Synthesis Method Based on Concatenation of Demisyllables and a Residual Excited Vocal Tract Model
Pearson, S Kibre, N Niedzielski, N
Speech Technology Lab/Panasonic Technologies Inc., USA

1600
Th5B3

Exploration of Acoustic Correlates in Speaker Selection for Concatenative Synthesis
Syrdal, A K Conkie, A Stylianou, Y
AT&T Labs - Research, USA

1615
Th5B4

A Perceptual Evaluation of Distance Measures for Concatenative Speech Synthesis
Wouters, J Macon, M W
Center for Spoken Language Understanding, USA

1630
Th5B5

HMM-Based Smoothing for Concatenative Speech Synthesis
Plumpe, M Acero, A Hon, H Huang, X
Microsoft Research, USA

1645
Th5B6

A nonlinear unit selection strategy for concatenative speech synthesis based on syllable level features
Holzapfel, M Cambpell, N
ATR Interpreting Telecommunications Research Laboratories, JAPAN


Th5C - Spoken Language Generation and Translation 2

Time & Place: 1530 - 1700, Currawong Room
Chair: Julia Hirschberg

1530
Th5C1

A Generic Algorithm for Generating Spoken Monologues
Klabbers, E Krahmer, E Theune, M
IPO Center for Research On User-System Interaction, THE NETHERLANDS

1545
Th5C2

On the Use of Automatically Generated Discourse-level Information in a Concept-to-Speech Synthesis System
Hitzeman, J Black, A W Taylor, P Mellish, C Oberlander, J
University of Edinburgh, UNITED KINGDOM

1600
Th5C3

Learning Phrase-based Head Transduction Models for Translation of Spoken Utterances
Alshawi, H Bangalore, S Douglas, S
AT&T Labs, USA

1615
Th5C4

Probabilistic dialogue act extraction for concept based multilingual translation systems
Fukada, T Koll, D Waibel, A Tanigaki, K
ATR Interpreting Telecommunications Research Laboratories, JAPAN

1630
Th5C5

Fast Decoding for Statistical Machine Translation
Wang, Y Waibel, A
Carnegie Mellon University, USA

1645
Th5C6

A Japanese-to-English Speech Translation System: ATR-MATRIX
Takezawa, T Morimoto, T Sagisaka, Y Campbell, N Iida, H Sugaya, F Yokoo, A Yamamoto, S
ATR Interpreting Telecommunications Research Laboratories, JAPAN


Th5D - Human Speech Perception 4

Time & Place: 1530 - 1700, Dingo Room
Chair: Sieb Nooteboom

1530
Th5D1

Orthografik inkoncistensy ephekts in foneme detektion?
Cutler, A Treiman, R van Ooijen, B
Max Plank Institute for Psycholinguistics, THE NETHERLANDS

1545
Th5D2

The effect of orthographic knowledge on the segmentation of speech
Derwing, B L Nearey, T M Yoon, Y B
University of Alberta, CANADA

1600
Th5D3

Spotting (different types of) Words in (different types of) Context
McQueen, J M Cutler, A
Max-Planck-Institute for Psycholinguistics, NETHERLANDS

1615
Th5D4

Correlation Between Consonantal VC Transitions and degree of Perceptual Confusion of Place Contrasts in Hindi
Ohala, M Ohala, J J
San Jose State University, USA

1630
Th5D5

Perception of Tonal Rises and Falls for Accentuation and Phrasing in Swedish
House, D Hermes, D Beaugendre, F
University of Skovde, SWEDEN

1645
Th5D6

Speech Intelligibility Derived from Exceedingly Sparse Spectral Information
Greenberg, S Arai, T Silipo, R
International Computer Science Institute, USA



Friday 4 December, 1998


Fr1A - Robust Speech Processing in Adverse Environments 5

Time & Place: 0900 - 1030, Antechinus Room
Chair: John Hansen

0900
Fr1A1

Auditory Modelling Techniques for Robust Pitch Extraction and Noise Reduction
Cosi, P Pasquin, S Zovato, E
IIFD - Institute of Phonetics and Dialectology, ITALY

0915
Fr1A2

Wavelet Transform-based Speech Enhancement
Ambikairajah, E Tattersall, G Davis, A
Athlone Institute of Technology, IRELAND

0930
Fr1A3

A Practical Perceptual Frequency Autoregressive HMM Enhancement System
Logan, B Robinson, A
University of Cambridge, UNITED KINGDOM

0945
Fr1A4

An Effective Quality Evaluation Protocol for Speech Enhancement Algorithms
Hansen, J H L Pellom, B L
Robust Speech Processing Laboratory, Duke University, USA

1000
Fr1A5

An adaptive beamforming microphone array system using a blind deconvolution
Park, J-N Usagawa, T Ebata, M
Kumamoto University, JAPAN

1015
Fr1A6

Speech Enhancement using Critical Band Spectral Subtraction
Singh, L Sridharan, S
Queensland University of Technology, AUSTRALIA


Fr1B - Text-To-Speech Synthesis 6

Time & Place: 0900 - 1030, Bandicoot Room
Chair: Catherine Watson

0900
Fr1B1

How to Handle 'Foreign' Sounds in Swedish Text-to-Speech Conversion: Approaching the 'Xenophone' Problem
Eklund, R Lindström, A
Telia Research AB, SWEDEN

0915
Fr1B2

Multi-Lingual Concatenative Speech Synthesis
Campbell, N
ATR Interpreting Telecommunications Research Laboratories, JAPAN

0930
Fr1B3

On the Use of F0 features in Automatic Segmentation for Speech Synthesis
Saito, T
IBM Japan Ltd, JAPAN

0945
Fr1B4

A linguistic and prosodic database for data-driven Japanese TTS synthesis
Sakurai, A Natsume, T Hirose, K
University of Tokyo, JAPAN

1000
Fr1B5

Text-to-Speech Voice Adaptation from Sparse Training Data
Kani, A Macon, M
Oregon Graduate Institute, USA

1015
Fr1B6

Describing Intonation with a Parametric Model
Möhler, G
University of Stuttgart, GERMANY


Fr1C - Speech Technology Applications and Human- Machine Interface 2

Time & Place: 0900 - 1030, Currawong Room
Chair: Ben Serridge

0900
Fr1C1

Development of CAI system employing synthesized speech responses
Shinozaki, T Abe, M
NTT Human Interface Labs, JAPAN

0915
Fr1C2

Using Combined Decisions and Confidence Measures for Name Recognition in Automatic Directory Assistance Systems
Kellner, A Rueber, B Schramm, H
Philips Research Laboratories, GERMANY

0930
Fr1C3

VPQ: A Spoken Language Interface to Large Scale Directory Information
Buntschuh, B Kamm, C DiFabbrizio, G Abella, A Mohri, M Narayanan, S Zeljkovic, I Sharp, R Wright, R Marcus, S Shaffer J Duncan, R Wilpon, J
AT&T Labs - Research, USA

0945
Fr1C4

SCAN - Speech Content Based Audio Navigator: A System Overview
Choi, J Hindle, D Hirschberg, J Magrin-Chagnolleau, I Nakatani, C Pereira, F Singhal, A Whittaker, S
AT&T Labs - Research, USA

1000
Fr1C5

Controlling a HIFI with a continuous speech understanding system
Ferreiros, J Colas, J Macias-Guarasa, J Ruiz, A Pardo, J M
GTH-IEL-UPM, SPAIN

1015
Fr1C6

User Evaluation of the Mask Kiosk
Lamel, L Bennacef, S Gauvain, J L Dartigues, H Temem, J-N
LIMSI-CNRS, FRANCE


Fr1D - Prosody and Emotion 6

Time & Place: 0900 - 1030, Dingo Room
Chair: Mary Beckman

0900
Fr1D1

A Contrastive Study of Lexical Stress Placement in Singapore English and British English
Low, E L Grabe, E
Nayang Technological University, SINGAPORE

0915
Fr1D2

Integrated Recognition of Words and Phrase Boundaries
Gallwitz, F Batliner, A Buckow, J Huber, R Niemann, H Noeth, E
University of Erlangen, GERMANY

0930
Fr1D3

Phrase accents revisited: comparative evidence from Standard and Cypriot Greek
Arvaniti, A
University of Cyprus, CYPRUS

0945
Fr1D4

Phonetic invariance and phonological stability: Lithuanian pitch accents
Dogil, G Möhler, G
University of Stuttgart, GERMANY

1000
Fr1D5

A HMM-based recognition system for perceptive relevant pitch movements of spontaneous German speech
Brindoepke, C Fink, G Kummert, F Sagerer, G
University Bielefeld, GERMANY

1015
Fr1D6

Towards a reversible symbolic coding of intonation
Véronis, J Campione, E
Université de Provence, FRANCE


Fr1P - Hidden Markov Model Techniques 3

Time & Place: 0900 - 1230, Platypus Room
Chair: Joe Picone

Fr1P1
A Statistical Phonemic Segment Model for Speech Recognition Based on Automatic Phonemic Segmentation
Aizawa, K Furuichi, C
TOIN University of Yokohama, JAPAN

Fr1P2
Improved Feature Decorrelation for HMM-based Speech Recognition
Demuynck, K Duchateau, J Van Compernolle, D Wambacq, P
K.U. Leuven - ESAT/PSI, BELGIUM

Fr1P3
Efficient high-order Hidden Markov Modelling
du Preez, J A Weber, D M
University of Stellenbosch, SOUTH AFRICA

Fr1P4
A Time-synchronous, Tree-based Search Strategy in the Acoustic Fast Match of an Asynchronous Speech Recognition System
Eide, E Bahl, L
IBM, USA

Fr1P5
Effective Structural Adaptation of LVCSR Systems to Unseen Domains Using Hierarchical Connectionist Acoustic Models
Fritsch, J Finke, M Waibel, A
University of Karlsruhe, GERMANY

Fr1P6
Support Vector Machines for Speech Recognition
Ganapathiraju, A Hamaker, J Picone, J
Mississippi State University, USA

Fr1P7
Natural Number Recognition Using Discriminatively Trained Inter-Word Context Dependent Hidden Markov Models
Gandhi, M B
Lucent Technologies, USA

Fr1P8
Information Theoretic Approaches to Model Selection
Hamaker, J Ganapathiraju, A Picone, J
Mississippi State University, USA

Fr1P9
Continuous Speech Recognition Using Segmental Unit Input HMMs with a Mixture of Probability Density Functions and Context Dependency
Hanai, K Yamamoto, K Minematsu, N Nakagawa, S
Toyohashi University of Technology, JAPAN

Fr1P10
Gaussian density tree structure in a multi-Gaussian HMM-based speech recognition system
Simonin, J Delphin-Poulat, L Damnati, G
France Telecom - CNET, FRANCE

Fr1P11
Generalized Phone Modeling Based on Piecewise Linear Segment Lattice
Kojima, H Tanaka, K
Electrotechnical Laboratory, AIST, MITI, JAPAN

Fr1P12
A Flexible Method of Creating HMM Using Block-diagonalization of Covariance Matrices
Koshiba, R Tachimori, M Kanazawa, H
Toshiba, JAPAN

Fr1P13
HMM Topology Selection for Accurate Acoustic and Duration Modeling
Chesta, C Laface, P Ravera, F
Politecnico Di Torino, ITALY

Fr1P14
Context-dependent duration modelling for continuous speech recognition
Lee, T Carlson, R Grandstrom, B
The Chinese University of Hong Kong, HONG KONG

Fr1P15
Training of Context-Dependent Subspace Distribution Clustering Hidden Markov Model
Mak, B Bocchieri, E
Hong Kong University of Science & Technology, HONG KONG

Fr1P16
Unsupervised Training of HMMs With Variable Number of Mixture Components per State
Martin, C Villarrubia, L Gonzalez, F J Hernandez, L
Telefonica I+D, SPAIN

Fr1P17
Acoustic Observation Context Modeling in Segment Based Speech Recognition
Szarvas, M Matsunaga, S
Technical University of Budapest, HUNGARY

Fr1P18
Capturing Discriminative Information Using Multiple Modeling Techniques
Ming, J Hanna, P Stewart, D Vaseghi, S Smith, F J
Queen's University of Belfast, UNITED KINGDOM

Fr1P19
Suprasegmental Duration Modelling with Elastic Constraints in Automatic Speech Recognition
Molloy, L Isard, S
Edinburgh University, UNITED KINGDOM

Fr1P20
An Adaptive Gradient-Search-Based Algorithm for Discriminative Training of HMMs
Nogueiras-Rodriguez, A Marinho, J B Monte, E
Polytechnical University of Catalonia, SPAIN

Fr1P21
Task Adaptation of Sub-Lexical Unit Models using the Minimum Confusibility Criterion on Task Independent Databases
Nogueiras-Rodriguez, A Marinho, J B
Polytechnical University of Catalonia, SPAIN

Fr1P22
Stochastic Calculus, Non-Linear Filtering, and the Internal Model Principle: Implications for Articulatory Speech Recognition
Ramsay, G
Institut De La Communication Parlee, FRANCE

Fr1P23
The Use of Meta-HMM in Multistream HMM Training for Automatic Speech Recognition
Wellekens, C J Kangasharju, J Milesi, C
Institut Eurecom, FRANCE

Fr1P24
Enhanced ASR by acoustic feature filtering
Wellekens, C J
Institut Eurecom, FRANCE

Fr1P25
Soft State-Tying for HMM-Based Speech Recognition
Neukirchen, C Willett, D Rigoll, G
Duisburg University, GERMANY

Fr1P26
Estimation of models for non-native speech in computer-assisted language learning based on linear model combination
Witt, S M Young, S J
Cambridge University, UNITED KINGDOM

Fr1P27
Duration Modeling using Cumulative Duration Probability and Speaking Rate Compensation
Yang, T-Y Kim, J-S Lee, C Youn, D H Cha, I-W
Yonsei University, KOREA

Fr1P28
Probabilistic Modeling with Bayesian Networks for Automatic Speech Recognition
Zweig, G Russell, S
IBM T.J. Watson Research Center, USA


Fr1Q - Speech & Hearing Disorders 2 / Speech Processing for the Speech Impaired and Hearing Impaired 1

Time & Place: 0900 - 1230, Quokka Room
Chair: Chris James

Fr1Q1
SIVHA, Visual Speech Synthesis System
Blanco, Y Cuellar, M Villanueva, A Lacunza, F Cabeza, R Marcotegui, B
Universidad Publica De Navarraica, UPNA, SPAIN

Fr1Q2
Using automatic speech recognition and its possible effects on the voice
de Bruijn, C G Whiteside, S P Cudd, P A Syder, D Rosen, M Nord, L
University of Sheffield, UNITED KINGDOM

Fr1Q3
The Importance of F0 or Voice Pitch for Perception of Tonal Language: Simulations with Cochlear Implant Speech Processing Strategies
Fearn, R
University of New South Wales, AUSTRALIA

Fr1Q4
Assessing high level language in individuals with multiple sclerosis - a pilot study
Brunnegård, K Laakso, K Hartelius, L Ahlsén, E
Goteborg University, SWEDEN

Fr1Q5
Design of Cochlear Implant Device for Transmitting Voice Pitch Information in a Speech Sound of Asian Languages
Hiki, S Imaizumi, K Fukuda, Y
Waseda University, JAPAN

Fr1Q6
Abnormal Volume-Duration Relationship in Parkinsonian Speech
Ho, A K Bradshaw, J L Iansek, R Alfredson, R J
Monash University, AUSTRALIA

Fr1Q7
Analysis of Disordered Speech Signal Using Wavelet Transform
Jo, C W Kim, D H
Changwon National University, KOREA

Fr1Q8
Multi-channel Pulsation Strategy for Electric Stimulation of Cochlea
Kitazawa, S Kirihata, H Kitamura, T
Shizuoka University, JAPAN

Fr1Q9
Synthetic faces as a lipreading support
Agelfors, E Beskow, J Dahlquist, M Granström, B Lundeberg, M Spens, K-E Öhman, T
KTH, SWEDEN

Fr1Q10
Predicting language scores from speech perception scores of hearing impaired children
Martin, L Bench, J
La Trobe University, AUSTRALIA

Fr1Q11
Context-Independent Duration Model on Categories of Voice and Unvoice Segments
Skljarov, O P
Research Institute of Ear Throat Nose & Speech, RUSSIA

Fr1Q12
Dynamical spectrogram, an aid for the deaf
Soltani Farani, A A Chilton, E H S Shirley, R
University of Surrey, UNITED KINGDOM

Fr1Q13
Evidence of dual-route phonetic encoding from apraxia of speech: Implications for phonetic encoding models
Varley, R A Whiteside, S P
University of Sheffield, UNITED KINGDOM

Fr1Q14
Speech Communication Profiles Across the Adult Lifespan: Persons Without Self-Identified Hearing Impairment
Cheesman, M F Smilsky, K L Major, T M Lewis, F Boorman, L M
University of Western Ontario, CANADA


Fr1R - Human Speech Production

Time & Place: 0900 - 1230, Rosella Room
Chair: Jan van Doorn

Fr1R1
Time as a factor in the acoustic variation of schwa
Barry, W J
University of the Saarland, GERMANY

Fr1R2
On the Structure of Vowel Space: A Genealogy of General Phonetic Concepts
Boshoff, H F V Botha, E C
University of Pretoria, SOUTH AFRICA

Fr1R3
The relationship between intensity and subglottal pressure with controlled pitch
Lecuit, V Demolin, D
Universite Libre De Bruxelles, BELGIUM

Fr1R4
Segmentation of the airway from the surrounding tissues on Magnetic Resonance Images: A comparative study
Soquet, A Lecuit, V Metens, T Nazarian, B Demolin, D
Universite Libre De Bruxelles, BELGIUM

Fr1R5
Recovering Vocal Tract Shapes from MFCC Parameters
Dusan, S Deng, L
University of Waterloo, CANADA

Fr1R6
Quantification of Pharyngeal Articulations Using Measurements from Laryngoscopic Images
Esling, J Clayards, J Edmondson, J Fuyuan, Q Harris, J
University of Victoria, CANADA

Fr1R7
Variance and Invariance in Speech Rate as a Reflection of Conceptual Planning
Fon, Y-J
The Ohio State University, USA

Fr1R8
Correspondence between the Glottal Gesture Overlap Pattern and Vowel Devoicing in Japanese
Fujimoto, M Murano, E Niimi, S Kiritani, S
University of Tokyo, JAPAN

Fr1R9
Evaluation of Japanese manners of generating word accent of English based on a stressed syllable detection technique
Fujisawa, Y Minematsu, N Nakagawa, S
Toyohashi University of Technology, JAPAN

Fr1R10
Independence of consonantal voicing and vocoid F0 perturbation in English and Japanese
Ishihara, S
The Australian National University, AUSTRALIA

Fr1R11
Reduction of English Function Words in Switchboard
Jurafsky, D Bell, A Fosler-Lussier, E Girand, C Raymond, W
University of Colorado, USA

Fr1R12
Duration Compensation in Non-Adjacent Consonant and Temporal Regularity
Kim, H-S
Stanford University, USA

Fr1R13
Relationship Between Lip Shapes and Acoustical Characteristics During Speech
Mori, K Sonoda, Y
Kyushu Kyoritsu University, JAPAN

Fr1R14
A model to represent propagation and radiation of higher-order modes for 3-D vocal-tract configuration
Motoki, K Matsuzaki, H
Hokkai-Gakuen University, JAPAN

Fr1R15
FEM Analysis of aspirated air flow in three-dimensional vocal tract during fricative consonant phonation
Niikawa,T Matsumura,M Tachimura,T Wada,T
Osaka Electro-Communication University, JAPAN

Fr1R16
Trajectory formation of articulatory movements for a given sequence of phonemes
Okadome, T Kaburagi, T Honda, M
NTT Basic Research Laboratories, JAPAN

Fr1R17
Contextual Effects on Voicing Profiles of German and Mandarin Consonants
Shih, C Möbius, B
Bell Labs - Lucent Technologies, USA

Fr1R18
Reconstructing the tongue surface from six cross-sectional contours: Ultrasound data
Lundberg, A Stone, M
Johns Hopkins University, USA

Fr1R19
Articulability of two consecutive morae in Japanese speech production: Evidence from sound exchange errors in spontaneous speech
Terao, Y Murata, T
Tokoha Gakuen College, JAPAN

Fr1R20
Coarticulation and degrees of freedom in the elaboration of a new articulatory plant: Gentiane
Vilain, A Abry, C Badin, P
ICP - Stendhal, FRANCE

Fr1R21
A pressure sensitive palatography: Application of new pressure sensitive sheet for measuring tongue-palatal contact pressure
Wakumoto, M Masaki, S Honda, K Ohue, T
ATR, JAPAN

Fr1R22
Dual-route phonetic encoding: Some acoustic evidence
Whiteside, S P Varley, R A
University of Sheffield, UNITED KINGDOM

Fr1R23
Fast and Slow Speech Rate: A Characterisation for French
Zellner, B
University of Lausanne, SWITZERLAND


Fr2A - Segmentation, Labelling and Speech Corpora 4

Time & Place: 1100 - 1230, Antechinus Room
Chair: Khalid Choukri

1100
Fr2A1

A multilingual prosodic database
Campione, E Véronis, J
Université De Provence, FRANCE

1115
Fr2A2

The CSLU Speaker Recognition Corpus
Cole, R Noel, M Noel, V
Oregon Graduate Institute, USA

1130
Fr2A3

How effective is unsupervised data collection for children's speech recognition?
Aist, G Chan, P Huang, X D Jiang, L Kennedy, R Latimer, D Mostow, J Yueng, C
Carnegie Mellon University, USA

1145
Fr2A4

An Algorithm for Automatic Generation of Mandarin Phonetic Balanced Corpus
Shyuu, J-S Wang, J-F
Department of Computer Science and Information Engineering, TAIWAN R.O.C.

1200
Fr2A5

Towards a formal framework for linguistic annotations
Bird, S Liberman, M
University of Pennsylvania, USA

1215
Fr2A6

Forming Generic Models of Speech for Uniform Database Access
Altosaar, T Vainio, M
Helsinki University of Technology, FINLAND


Fr2B - Speaker and Language Recognition 4

Time & Place: 1100 - 1230, Bandicoot Room
Chair: Jesper Olsen

1100
Fr2B1

On the Convergence of Gaussian Mixture Models: Improvements through Vector Quantization
Moody, J Slomka, S Pelecanos, J Sridharan, S
Queensland University of Technology, AUSTRALIA

1115
Fr2B2

Modeling Dynamic Prosodic Variation for Speaker Verification
Sonmez, M K Shriberg, E Heck, L Weintraub, M
SRI International, USA

1130
Fr2B3

Blind Clustering of Speech Utterances Based on Speaker and Language Characteristics
Reynolds, D Singer, E Carlson, B O'Leary, G McLaughlin, J Zissman, M
MIT Lincoln Laboratory, USA

1145
Fr2B4

Spoken Language Identification Using the Speechdat Corpus
Antonio Caseiro, D Martins Trancoso, I
INESC, PORTUGAL

1200
Fr2B5

Automatic Language Identification with Perceptually Guided Training and Recurrent Neural Networks
Braun, J Levkowitz, H
University of Massachusetts Lowell, USA

1215
Fr2B6

On the Importance of Components of the Modulation Spectrum for Speaker Verification
Van Vuuren, S Hermansky, H
Oregon Graduate Institute, USA


Fr2C - Speech Technology Applications and Human- Machine Interface 3

Time & Place: 1100 - 1230, Currawong Room
Chair: George Kokkinakis

1100
Fr2C1

Is Speech the Right Thing for your Application?
Bernsen, N O Dybkjaer, L
Centre for Cognitive Science, Odense University, DENMARK

1115
Fr2C2

A PC-Based Tool for Helping in Diagnosis of Pathologic Voice
Godino Llorente, J I Aguilera Navarro, S Palazeulos, S Nieto Altuzarra, A Gómez Vilda, P
ETSI Telecomunicación, SPAIN

1130
Fr2C3

Web-based educational tools for speech technology
Sjölander, K Beskow, J Gustafson, J Levin, E Carlson, R Granström, B
KTH, SWEDEN

1145
Fr2C4

Universal Speech Tools: The CSLU Toolkit
Sutton, S Cole, R de Villiers, J Schalkwyk, J Vermeulen, P Macon, M
Yan, Y-H Kaiser, E Rundle, B Shobaki, K Hosom, P Kain, A Wouters, J Massaro, D Cohen, M
Oregon Graduate Institute, USA

1200
Fr2C5

Creating a Mexican Spanish Version of the CSLU Toolkit
Serridge, B Barbosa, A Cole, R Munive, N Vargas, A
Universidad De Las Americas, MEXICO

1215
Fr2C6

A Voice User Interface Demonstration System for Mexican Spanish
Garcia-Mateo, C Zhou, Q Lee, C H Pargellis, A
Universidad De Vigo, SPAIN


Fr2D - Utterance Verification and Word Spotting 2

Time & Place: 1100 - 1230, Dingo Room
Chair: Jean-Luc Gauvain

1100
Fr2D1

Context Dependent Anti Subword Modeling for Utterance Verification
Ramesh, P Lee, C-H Juang, B-H
Lucent Technologies, USA

1115
Fr2D2

Combination of confidence measures in isolated word recognition
Dolfing, J G A Wendemuth, A
Philips Research Laboratories Aachen, GERMANY

1130
Fr2D3

Confidence Measures for HMM-based Speech Recognition
Willett, D Worm, A Neukirchen, C Rigoll, G
Duisburg University, GERMANY

1145
Fr2D4

Vocabulary-Independent Word Confidence Measure using Subword Features
Jiang, L Huang, X D
Microsoft Research, USA

1200
Fr2D5

A New Confidence Measure Based on Rank-Ordering Subphone Scores
Lin, Q Das, S Lubensky, D Picheny, M
IBM T.J. Watson Research Center, USA

1215
Fr2D6

Speaking-Style Dependent Lexicalized Filler Model for Key-Phrase Detection and Verification
Kawahara, T Ishizuka, K Doshita, S Lee, C-H
Kyoto University, JAPAN


Fr4A - Large Vocabulary Continuous Speech Recognition 6

Time & Place: 1330 - 1430, Antechinus Room
Chair: Lori Lamel

1330
Fr4A1

Sharable Software Repository for Japanese Large Vocabulary Continuous Speech Recognition
Kawahara, T Kobayashi, T Takeda, K Minematsu, N Itou, K Yamamoto, M Yamada, A Utsuro, T Shikano, K
Kyoto University, JAPAN

1345
Fr4A2

The Design of the Newspaper-Based Japanese Large Vocabulary Continuous Speech Recognition corpus
Itou, K Yamamoto, M Takeda, K Takezawa, T Matsuoka, T Shikano, K Kobayashi, T Itahashi, S
Electrotechnical Laboratory, JAPAN

1400
Fr4A3

Indexing and Classification of TV News Articles Based on Speech Dictation Using Word Bigram
Ogata, J Ariki, Y
Ryukoku University, JAPAN

1415
Fr4A4

Parametric Trajectory Mixtures for LVCSR
Siu, M Iyer, R Gish, H Quillen, C
GTE/BBN Technologies, USA


Fr4B - Neural Networks, Fuzzy and Evolutionary Methods 3

Time & Place: 1330 - 1430, Bandicoot Room
Chair: Joerg Rottland

1330
Fr4B1

Fuzzy-Integration Based Normalisation for Speaker Verification
Pham, T D Wagner, M
University of Canberra, AUSTRALIA

1345
Fr4B2

Improving the generalization performance of the MCE/GPD learning
Shimodaira, H Rokui, J Nakai, M
Japan Advanced Institute of Science and Technology, JAPAN

1400
Fr4B3

Acoustic Speech Recognition Model by Neural Net Equation with Competition and Cooperation
Kitazoe, T Ichiki, T Kim, S-I
Miyazaki University, JAPAN

1415
Fr4B4

Improved Surname Pronunciations Using Decision Trees
Ngan, J Ganapathiraju, A Picone, J
Mississippi State University, USA


Fr4C - Speech Processing for the Speech-Impaired and Hearing-Impaired 2

Time & Place: 1330 - 1430, Currawong Room
Chair: Björn Granström

1330
Fr4C1

A Speechreading Aid Based on Phonetic ASR
Duchnowski, P Braida, L Bratakos, M Lum, D Sexton, M Krause, J
Massachusetts Institute of Technology, USA

1345
Fr4C2

Training Speech through Visual Feedback Patterns
Nouza, J
Technical University of Liberec, CZECH REPUBLIC

1400
Fr4C3

Word sequence pair spotting for synchronization of speech and text in production of closed-caption TV programs for the hearing impaired
Maruyama, I Abe, Y Sawamura, E Wakao, T Ehara, T Shirai, K
Telecommunications Advancement Organization of Japan, JAPAN

1415
Fr4C4

Volume Regulation in Parkinsonian Speech
Ho, A K Bradshaw, J L Iansek, R Alfredson, R J
Monash University, AUSTRALIA


Fr4D - Prosody and Emotion 7

Time & Place: 1330 - 1430, Dingo Room
Chair: Gosta Bruce

1330
Fr4D1

On the amount and domain of focal lengthening in Swedish
Strangert, E Heldner, M
Department of Phonetics, Umeå University, SWEDEN

1345
Fr4D2

Differential lengthening of syllabic constituents in French: the effect of accent type and speaking style
Hirst, D J Astesano, C Di Cristo, A
CNRS, Université De Provence, FRANCE

1400
Fr4D3

Prosodic Analysis of Fillers and Self-Repair in Japanese Speech
Quimbo, F Kawahara, T Doshita, S
Kyoto University, JAPAN

1415
Fr4D4

A synthesis-oriented model of phrasal pitch movements in standard Chinese
Ni, J Kawai, G Hirose, K
University of Tokyo, JAPAN


Plenary Session
Fr5A - Keynote Speech 2

Title: The Use of Linguistic Hierarchies in Speech Understanding
Speaker: Stephanie Seneff
MIT, USA
Time & Place: 1440 - 1540, Antechinus Room
Chair: Michael Wagner


Closing Ceremony

Time & Place: 1545 - 1615, Antechinus Room
Chair: Peter Blamey