Please note: In the following program the listed affiliation for each paper refers only to the first author of that paper and is not necessarily the affiliation of any other authors.
Tuesday 1 December, 1998
Opening Ceremony
Time & Place: 0930 - 1000, Antechinus Room
Chair: Bruce Millar
Plenary Session
Tu2A - Keynote Speech 1
Title: Cochlear Implants in the Second and Third Millennia
Speaker: Graeme Clark
University of Melbourne, AUSTRALIA
Time & Place: 1000 - 1100, Antechinus Room
Chair: Robert Dale
Tu3A - Text-To-Speech Synthesis 1
Time & Place: 1130 - 1230, Antechinus Room
Chair: Shuichi Itahashi
1130
Tu3A1
.
Unsupervised training of phone duration and energy models for
text-to-speech synthesis
Bagshaw, P C
France Telecom, CNET, FRANCE
1145
Tu3A2
Improved Duration Modeling of English Phonemes Using a Root
Sinusoidal Transformation
Bellegarda, J Silverman, K
Apple Computer, USA
1200
Tu3A3
Efficient Adaptation of TTS Duration Model to New Speakers
Shih, C Gu, W van Santen, J P H
Bell Labs - Lucent Technologies, USA
1215
Tu3A4
Duration Modeling for HMM-Based Speech Synthesis
Yoshimura, T Tokuda, K Masuko, T Kobayashi, T Kitamura, T
Nagoya Institute of Technology, JAPAN
Tu3B - Spoken Language Models and Dialog 1
Time & Place: 1130 - 1230, Bandicoot Room
Chair: Greg Aist
1130
Tu3B1
An educational dialogue system with a user controllable dialogue
manager
Gustafson, J Elmberg, P Carlson, R Jönsson, A
Center for Speech Technology, TMH-KTH, SWEDEN
1145
Tu3B2
End-user driven dialogue system design: The REWARD experience
Failenschmid, K Thornton, J H S
Vocalis Ltd, UNITED KINGDOM
1200
Tu3B3
The Design of a Multi-domain Mandarin Chinese Spoken Dialogue
System
Lin, Y-C Chiang, T-H Wang, H-M Peng, C-M Chang, C-H
Industrial Technology Research Institute, TAIWAN R.O.C.
1215
Tu3B4
An Integrated Dialogue System for the Automation of Call Centre
Services
Georgila, K Tsopanoglou, A Fakotakis, N Kokkinakis, G
University of Patras, GREECE
Tu3C - Prosody and Emotion 1
Time & Place: 1130 - 1230, Currawong Room
Chair: Phil Rose
1130
Tu3C1
Tones of a Tridialectal: Acoustic and Perceptual data on Ten Linguistic
Tonetic contrasts between Lao, Nyo and Standard Thai
Rose, P
Australian National University, AUSTRALIA
1145
Tu3C2
Tone Sandhi between Complex Tones in a Seven-Tone Southern Thai
Dialect
Thompson, N G I
Australian National University, AUSTRALIA
1200
Tu3C3
The Acoustic and Perceptual Features of Tone in the Tibeto-Burman
Language Ao Naga
Coupe, A R
The Australian National University, AUSTRALIA
1215
Tu3C4
The Differential Status of Semivowels in the Acoustic Phonetic
Realisation of Tone
Rose, P
Australian National University, AUSTRALIA
Tu3D - Hidden Markov Model Techniques 1
Time & Place: 1130 - 1230, Dingo Room
Chair: Roberto Togneri
1130
Tu3D1
Nonreciprocal Data Sharing in Estimating HMM parameters
Luo, X Jelinek, F
Johns Hopkins University, USA
1145
Tu3D2
Data-Driven Extensions to HMM Statistical Dependencies
Bilmes, J A
U.C. Berkeley / ICSI, USA
1200
Tu3D3
Use of High-Level Linguistic Constraints for constructing Feature Based
Phonological Model in Speech Recognition
Sun, J Deng, L
University of Waterloo, CANADA
Tu3P - Speaker and Language Recognition 1
Time & Place: 1130 - 1430, Platypus Room
Chair: Kay Berkling
Tu3P1
Sub-Band Based Speaker Verification Using Dynamic Recombination
Weights
Sivakumaran, P Ariyaeeinia, A M Hewitt, J
University of Hertfordshire, UNITED KINGDOM
Tu3P2
Measuring the Dynamic Encoding of Speaker Identity and Dialect in
Prosodic Parameters
Barlow, M Wagner, M
Australian Defence Force Academy, AUSTRALIA
Tu3P3
German Regional Variants - A Problem for Automatic Speech
Recognition?
Beringer, N Schiel, F Regel-Brietzmann, P
University of Munich, GERMANY
Tu3P4
Improving Accent Identification through Knowledge of English Syllable
Structure
Berkling, K Vonwiller, J Cleirigh, C Zissman, M
MIT Lincoln Laboratory, USA
Tu3P5
Multi-dimensional Scaling of Listener Responses to Complex Auditory
stimuli
Bond, Z S Fucci, D Stockmal, V McColl, D
Ohio University, USA
Tu3P6
Same Talker, Different Language
Stockmal, V Moates, D Bond, Z
Ohio University, USA
Tu3P7
The Impact of Regional Variety upon Specific Word Categories in
Spontaneous German
Burger, S Oppermann, D
University of Munich, GERMANY
Tu3P8
Speech pre-processing against intentional imposture in speaker
recognition
Genoud, D Chollet, G
IDIAP, SWITZERLAND
Tu3P9
A Comparison of Two Unsupervised Approaches to Accent Identification
Lincoln, M Cox, S Ringland, S
University of East Anglia, UNITED KINGDOM
Tu3P10
The Influence of Accents in Australian English Vowels and their Relation
to Articulatory Tract Parameters
Dersch, D Cleirigh, C Vonwiller, J
Sydney University /
APPEN Speech Technology, AUSTRALIA
Tu3P11
Automatic language recognition using high-order HMMs
du Preez, J A Weber, D M
University of Stellenbosch, SOUTH AFRICA
Tu3P12
Speaker Recognition Using Residual Signal of Linear and Nonlinear
Prediction Models
Faundez-Zanuy, M Rodriguez-Porcheron, D
Escola Universitaria Politecnica De Mataro, SPAIN
Tu3P13
An Implementation and Evaluation of an On-line speaker Verification
System for Field Trials
Gu, Y Thomas, T
Vocalis Ltd, UNITED KINGDOM
Tu3P14
Speaker Verification on the POLYCOST database using frequency
filtered spectral energies
Hernando, J Nadeu, C
Polytechnical University of Catalonia, SPAIN
Tu3P15
A High-Performance Text-Independent Speaker Identification System
Based On BCDM
Jin, Q Si, L Hu, Q
Tsinghua University, P.R. CHINA
Tu3P16
Representation of Voice Quality Features Associated with Talker
Individuality
Kido, H Kasuya, H
Utsunomiya University, JAPAN
Tu3P17
Candidate Selection Based on Significance Testing and its Use in
Normalisation and Scoring
Kim, J Jang, G Yun, S Oh, Y
Advanced Institute of Science and Technology, KOREA
Tu3P18
Japanese Forensic Phonetics:Non-contemporaneous within-speaker
variation in Natural and Read-out Speech
Kinoshita, Y
The Australian National University, AUSTRALIA
Tu3P19
Statistical Modeling of Pronunciation and Production Variations for
Speech Recognition
Korkmazskiy, F E Juang B-H
Lucent Technologies Bell Laboratories, USA
Tu3P20
Dialect Maps and Dialect Research; Useful Tools for Automatic Speech
Recognition?
Foldvik, A K Kvale, K
Telenor R&D, NORWAY
Tu3P21
Text Independent Speaker Recognition Using Micro-Prosody
Kyung, Y J Lee, H S
Korea Advanced Institute of Science and Technology, KOREA
Tu3P22
Speaker Verification Using Fundamental Frequency
Cheng, Y Leung, H C
Chinese University of Hong Kong, HONG KONG
Tu3P23
On Optimum Normalization Method Used for Speaker Verification
Liu, W Isobe, T Mukawa, N
NTT Data Corporation, JAPAN
Tu3P24
Recurrent Substrings and Data Fusion for Language Recognition
Lloyd-Thomas, H Parris, E S Wright, J H
Ensigma Ltd, UNITED KINGDOM
Tu3P25
Text-Independent Speaker Recognition using Multiple Information
Sources
Markov, K Nakagawa, S
Toyohashi University of Technology, JAPAN
Tu3P26
Discriminative Training of GMM using a Modified EM Algorithm for
Speaker Recognition
Markov, K Nakagawa, S
Toyohashi University of Technology, JAPAN
Tu3P27
Language Identification Incorporating Lexical Information
Matrouf, D Adda-Decker, M Lamel, L Gauvain, J-L
LIMSI-CNRS, FRANCE
Tu3P28
A VQ based Speaker Recognition System Based in Histogram Distances.
Text Independent and for Noisy Environments
Monte, E Arqué, R Miró, X
Polytechnical University of Catalonia, SPAIN
Tu3P29
Spanish Dialects: Phonetic Transcription
Moreno, A Marino, J B
Polytechnical University of Catalonia, SPAIN
Tu3P30
Acoustic analysis of Japanese English prosody: Comparison between
Fukushima dialect speakers and Tokyo dialect speakers in declarative
sentences and yes-no questions
Muramatsu, M
Fukushima Medical University, JAPAN
Tu3P31
A Context-Dependent Approach for Speaker Verification Using
Sequential Decision
Noda, H Harada, K Kawaguchi, E Sawai, H
Kyushu Institute of Technology, JAPAN
Tu3P32
Quantitative Influence of Speech Variability Factors for Automatic
Speaker Verification in Forensic Tasks
Ortega-Garcia, J Cruz-Llanas, S Gonzalez-Rodriguez, J
Universidad Politecnica Madrid, SPAIN
Tu3P33
Creating Hidden Markov Models for Fast Speech
Pfau, T Ruske, G
Technical University of Munich, GERMANY
Tu3P34
Speaker Identification using Relaxation Labeling
Pham, T D Wagner, M
University of Canberra, AUSTRALIA
Tu3P35
A Novel Technique for the Combination of Utterance and Speaker
Verification Systems in a Text-dependent Speaker Verification Task
Rodriguez-Linares, L Garcia-Mateo, C
University of Vigo, SPAIN
Tu3P36
A Forensic Phonetic Investigation into Non-contemporaneous Variation
in the F-pattern of Similar-sounding Speakers
Rose, P
Australian National University, AUSTRALIA
Tu3P37
Human vs. Machine Speaker Identification with Telephone Speech
Schmidt-Nielsen, A Crystal, T H
U.S. Naval Research Laboratory, USA
Tu3P38
A Comparison of Fusion Techniques in Mel-Cepstral based Speaker
Identification
Slomka, S Sridharan, S Chandran, V
Queensland University of Technology, AUSTRALIA
Tu3P39
On the Influence of Hyperarticulated Speech on Recognition Performance
Soltau, H Waibel, A
Carnegie Mellon University, USA
Tu3P40
Text-independent speaker identification and verification using the TIMIT
Database
Ward, N C Dersch, D R
Alcatel Australia, AUSTRALIA
Tu3P41
Incorporating Linguistic Knowledge into Automatic Dialect Identification
of Spanish
Yanguas, L O'Leary, G Zissman, M
Mit Lincoln Laboratory, USA
Tu3P42
A Novel Text-Independent Speaker Verification Method Using the Global
Speaker Model
Zhang, Y Zhu, X-Y
Tsinghua University, P.R. CHINA
Tu3Q - Multimodal Spoken Language Processing 1
Time & Place: 1130 - 1430, Quokka Room
Chair: Sharon Oviatt
Tu3Q1
A fast method of producing talking head mouth shapes from real speech
Breen, A P Gloaguen, O Stern, P
BT Labs, UNITED KINGDOM
Tu3Q2
The Efficiency of Multimodal Interaction: A Case Study
Cohen, P R Johnston, M McGee, D Oviatt, S L Clow, J Smith, I
Oregon Graduate Institute, USA
Tu3Q3
Audio and Audio-visual Perception of Consonants Disturbed by White
Noise and 'Cocktail Party'
Czap, L
University of Miskolc, HUNGARY
Tu3Q4
Overview of the Maya Spoken Language System
Downey, S Breen, A P Fernandez, M Kaneen, E
BT Labs, UNITED KINGDOM
Tu3Q5
Automatic Recognition of Spontaneous Speech Dialogues
Cettolo, M Falavigna, D
ITC-IRST, ITALY
Tu3Q6
Using an Animated Talking Character in a web-based City Guide
Demonstrator
Fries, G Feldes, S Corbet, A
Deutsche Telekom Berkom, GERMANY
Tu3Q7
Influence of facial views on the McGurk effect in auditory noise
Kanzaki, R Kato, T
ATR Information Processing Research Laboratories, JAPAN
Tu3Q8
The Intellimedia WorkBench - a generic environment for multimodal
systems
Brondsted, T Larsen, L B Manthey, M Moeslund, T B McKevitt, P
Olesen, K G
Aalborg University, DENMARK
Tu3Q9
Stamp: A Suite of Tools for Analyzing Multimodal System Processing
Clow, J Oviatt, S L
Oregon Graduate Institute of Science & Technology, USA
Tu3Q10
Cultural similarities and differences in the recognition of audio-visual
speech stimuli
Shigeno, S
Kitasato University, JAPAN
Tu3Q11
A Multimodal-Input Multimedia-Output Guidance System: MMGS
Takezawa, T Morimoto, T
ATR Interpreting Telecommunications Research Laboratories, JAPAN
Tu3Q12
HMM-based Visual Speech Recognition using Intensity and Location
Normalization
Vanegas, O Tanaka, A Tokuda, K Kitamura, T
Nagoya Institute of Technology, JAPAN
Tu3Q13
A Hierarchy Probability-based Visual Features Extraction Method for
Speechreading
Xu, Y Du, L Li, G Hou, Z
Chinese Academy of Science, P.R. CHINA
Tu3Q14
Integration of Talking Heads and Text-to-Speech Synthesizers for Visual
TTS
Ostermann, J Beutnagel, M Fischer, A Wang, Y
AT&T Labs Research
Tu3R - Isolated Word Recognition
Time & Place: 1130 - 1430, Rosella Room
Chair: Li Deng
Tu3R1
Improving Accuracy of telephony-based, speaker-independent speech
recognition
Azzopardi D Semnani S Milner B Wiseman R
BT Labs, UNITED KINGDOM
Tu3R2
Rejection in Speech Recognition Systems with Limited Training
Bayya, A
Rockwell Semiconductor Systems, USA
Tu3R3
A Four Layer Sharing HMM System for Very Large Vocabulary Isolated
Word Recognition
Chen, R Tanaka, M Wu, D Olorenshaw, L Amador, M
SONY Research Labs, USA
Tu3R4
A Comparative Study of Hybrid Modelling Techniques for Improved
Telephone Speech Recognition
Chengalvarayan, R
Lucent Technologies, USA
Tu3R5
Smoothing and Tying for Korean Flexible Vocabulary Isolated Word
Recognition
Choi, J-S Lee, J-S Lee, H-Y
LG Corporate Institute of Technology, KOREA
Tu3R6
Recent Work on a Preselection Module for a Flexible Large Vocabulary
Speech Recognition System in Telephone Environment
Ferreiros, J Macias-Guarasa, J Gallardo, A Colas, J Cordoba, R Pardo, J
Villarrubia, L
GTH-IEL-UPM, SPAIN
Tu3R7
A Study of Noise Robustness for Speaker Independent Speech Recognition
Method Using Phoneme Similarity Vector
Hoshimi, M Yamada, M Niyada, K Makino, S
Matsushita Research Institute Tokyo, Inc., JAPAN
Tu3R8
Classification of Taiwanese tones based on pitch and energy movements
Jian, F
University of Reading, UNITED KINGDOM
Tu3R9
Phoneme-based recognition for the Norwegian SpeechDat(II) database
Johansen, F T
Telenor R&D, NORWAY
Tu3R10
Robust Feature Extraction for Alphabet Recognition
Karnjanadecha, M Zahorian, S A
Old Dominion University, USA
Tu3R11
Recognition of Connected Digit Speech in Japanese Collected over the
Telephone Network
Kawai, H Higuchi, N
KDD R&D Laboratories Inc., JAPAN
Tu3R12
Improving the Speaker-Dependency of Subword-Unit-Based Isolated
Word Recognition
Koizumi, T Taniguchi, S Kohtoh, K
Fukui University, JAPAN
Tu3R13
Speaker Independent Speech Recognition Method using Constrained Time
Alignment near Phoneme Discriminative Frame
Konuma, T Suzuki, T Yamada, M Ono, Y Hoshimi, M Niyada, K
Matsushita Research Institute Tokyo, Inc, JAPAN
Tu3R14
A Nonatationary Autoregressive HMM with Gain Adaptation for Speech
Recognition
Lee, K Y Lee, J
Soongsil University, KOREA
Tu3R15
A Large-Vocabulary Taiwanese (Min-nan) Multi-syllabic Word
Recognition System Based upon Right-Context-Dependent Phones with
State Clustering by Acoustic Decision Tree
Lyu, R-Y Chiang, Y-C Hsieh, W-P
Chang Gung University, TAIWAN R.O.C.
Tu3R16
Speech Recognition Based on the Distance Calculation Between
Intermediate Phonetic Code Sequences in Symbolic Domain
Tanaka, K Kojima, H
Electrotechnical Laboratory, JAPAN
Tu3R17
High Accuracy Chinese Speech Recognition Approach with Chinese
Input Technology for Telecommunication Use
Yang, Y C-H Kuo, J-J
Matsushita Institute of Technology, Taipei, TAIWAN R.O.C.
Tu4A - Robust Speech Processing in Adverse
Environments 1
Time & Place: 1330 - 1500, Antechinus Room
Chair: Sridha Sridharan
1330
Tu4A1
Robust Speech Recognition using HMM's With Toeplitz State Covariance
Matrices
Roberts, W Ephraim, Y
Defence Science & Technology Organisation, AUSTRALIA
1345
Tu4A2
Modeling of Output Probability Distributions to improve Small
Vocabulary Speech Recognition in Adverse Environments
Thambiratnam, D Sridharan, S
Queensland University of Technology, AUSTRALIA
1400
Tu4A3
Robust and compact multilingual word recognizers using features
extracted from a phoneme similarity front-end
Morin, P Applebaum, T H Boman, R Zhao, Y Junqua, J-C
Panasonic Technologies Inc., USA
1415
Tu4A4
An Effect of Adaptive Beamforming on Hands-free Speech Recognition
Based on 3-D Viterbi Search
Yamada, T Nakamura, S Shikano, K
Nara Institute of Science and Technology, JAPAN
1430
Tu4A5
Coherence-based Subband Decomposition for Robust Speech and
Speaker Recognition in Noisy and Reverberant Rooms
Gonzalez-Rodriguez, J Cruz-Llanas, S Ortega-Garcia, J
Universidad Politecnica Madrid, SPAIN
1445
Tu4A6
A Minimax Search Algorithm for CDHMM based Robust Continuous
Speech Recognition
Jiang, H Hirose, K Huo, Q
The University of Tokyo, JAPAN
Tu4B - Spoken Language Models and Dialog 2
Time & Place: 1330 - 1500, Bandicoot Room
Chair: Rolf Carlson
1330
Tu4B1
An event driven model for dialogue systems
Wang, K
Microsoft Research, USA
1345
Tu4B2
Automatic Classification of Dialogue Contexts for Dialogue Predictions
Popovici, C Baggia, P Laface, P Moisa, L
Politecnico Di Torino, ITALY
1400
Tu4B3
Automatic Identification of Command Boundaries in a Conversational
Natural Language User Interface
Ramaswamy, G N Kleindienst, J
I.B.M. Research Center, USA
1415
Tu4B4
The Predictive Power of Game Structure in Dialogue Act Recognition:
Experimental Results Using Maximum Entropy Estimation
Poesio, M Mikheev, A
University of Edinburgh, HCRC, UNITED KINGDOM
1430
Tu4B5
A schema based approach to dialog control
Constantinides, P Hansma, S Tchou, C Rudnicky, A
Carnegie Mellon University, USA
1445
Tu4B6
Expanding a time-sensitive conversational architecture for turn-taking to
handle content-driven interruption
Aist, G
Carnegie Mellon University, USA
Tu4C - Articulatory Modelling 1
Time & Place: 1330 - 1500, Currawong Room
Chair: Alain Marchal
1330
Tu4C1
A Three-Dimensional Linear Articulatory Model Based on MRI Data
Badin, P Bailly, G Raybaudi, M Segebarth, C
Institut Communication Parlee, FRANCE
1345
Tu4C2
On Loops and Articulatory Biomechanics
Perrier, P Payan, Y Perkell, J Jolly, F Zandipour, M Matthies, M
Institut De La Communication Parlée - INPG, FRANCE
1400
Tu4C3
Magnetic Resonance Measurements of the Velum port opening
Demolin, D Lecuit, V Metens, T Nazarian, B Soquet, A
Universite Libre De Bruxelles, BELGIUM
1415
Tu4C4
Cantilever-type force-sensor-mounted palatal plate for measuring
palatolingual contact stress and pattern during speech phonation.
Matsumura, M Niikawa, T Tanabe, T Tachimura, T Wada, T
Osaka Electro-Communication University, JAPAN
1430
Tu4C5
Determination of the vocal tract spectrum from the articulatory
movements based on the search of an articulatory-acoustic database
Kaburagi, T Honda, M
NTT Basic Research Laboratories, JAPAN
1445
Tu4C6
An MRI study on the relationship between oral cavity shape and larynx
position
Honda, K Tiede, M
ATR Human Information Processing Research Laboratories, JAPAN
Tu4D - Talking To Infants, Pets and Lovers
Time & Place: 1330 - 1500, Dingo Room
Chair: Cecile Pereira
1330
Tu4D1
Acoustic and Affective Qualities of IDS in English
Kitamura, C Burnham, D
University of New South Wales, AUSTRALIA
1345
Tu4D2
Acoustic Qualities of IDS and ADS in Thai
Thanavisuth, C Luksaneeyanawin, S
Chulalongkorn University, THAILAND
1400
Tu4D3
Pragmatic Characteristics of Infant Directed Speech
Luksaneeyanawin, S Thanavisuth, C Sittigasorn, S Rukkarangsarit, O
Chulalongkorn University, THAILAND
1415
Tu4D4
Are You My Little Pussy-Cat? Acoustic, Phonetic and Affective Qualities
of Infant- and Pet-Directed Speech
Burnham, D Francis, E Vollmer-Conna, U Kitamura, C Averkiou, V Olley,
A Nguyen, M Paterson, C
University of New South Wales, AUSTRALIA
1430
Tu4D5
Special Speech Registers: Talking To Australian and Thai Infants, and
To Pets.
Burnham, D
University of New South Wales, AUSTRALIA
Tu5A - Robust Speech Processing in Adverse
Environments 2
Time & Place: 1530 - 1700, Antechinus Room
Chair: Phil Woodland
1530
Tu5A1
Performance improvements through combining phone- and syllable-scale
information in automatic speech recognition
Wu, S-L Kingsbury, B E D Morgan, N Greenberg, S
Nuance Communications, USA
1545
Tu5A2
Predictive Adaptation and Compensation for Robust Speech Recognition
Surendran, A C Lee, C-H
Bell Labs, Lucent Technologies, USA
1600
Tu5A3
Influence of the speaking style and the noise spectral tilt on the Lombard
reflex and automatic speech recognition
Junqua, J-C Fincke, S Field, K
Speech Technology Laboratory, USA
1615
Tu5A4
Data-Driven PMC and Bayesian Learning Integration for Fast Model
Adaptation in Noisy Conditions
Crafa, S Fissore, L Vair, C
CSELT, ITALY
1630
Tu5A5
Improving the noise and spectral robustness of an isolated-word
recognizer using an auditory-model front end
Hunke, M Hyun, M Love, S Holton, T
San Francisco State University, USA
1645
Tu5A6
A Model for Speech Reverberation and Intelligibility Restoring Filters
Kenny, O P Nelson, D J
USA
Tu5B - Spoken Language Models and Dialog 3
Time & Place: 1530 - 1700, Bandicoot Room
Chair: Michael Barlow
1530
Tu5B1
On Different Functions of Repetitive Utterances
Swerts, M Koiso, H Shimojima, A Katagiri, Y
IPO, Center for Research On User-System Interaction, THE NETHERLANDS
1545
Tu5B2
Prosody based detection of the context of backchannel responses
Noguchi, H Den, Y
NAIST, JAPAN
1600
Tu5B3
Robust Interpretation for Spoken Dialogue Systems
Strömbäck, L Jonsson, A
Linkoping University, SWEDEN
1615
Tu5B4
System-User Interaction and Response Strategy in Spoken Dialog System
Okato, Y Kato, K Yamamoto, M Itahashi, S
University of Tsukuba, JAPAN
1630
Tu5B5
Organizing Self-Motivated Dialogue with Autonomous Creatures
Suzuki, N Ishii, K Okada, M
ATR Media Integration & Communications Research Labs, JAPAN
1645
Tu5B6
Fly with the EAGLES: Evaluation of the ACCeSS Spoken Language
Dialogue System
Hanrieder, G Heisterkamp, P Brey, T
Daimler-Benz AG, GERMANY
Tu5C - Speech Coding 1
Time & Place: 1530 - 1700, Currawong Room
Chair: Ian Burnett
1530
Tu5C1
A very low bit rate speech coder using HMM with speaker adaptation
Masuko, T Tokuda, K Kobayashi, T
Tokyo Institute of Technology, JAPAN
1545
Tu5C2
ITU-T G.729 extension at 6.4 kbps
Ekudden, E Hagen, R Johansson, B Hayashi, S Kataoka, A Kurihara, S
Ericsson Radio Systems AB, SWEDEN
1600
Tu5C3
Adaptive Transformation for Segmented Parametric Speech Coding
Mudugamuwa D J Bradley A B
RMIT, AUSTRALIA
1615
Tu5C4
Speech enhancement using STC-based bandwidth extension
Epps, J Holmes, W H
University of NSW, AUSTRALIA
1630
Tu5C5
Performance and Optimization of the SEEVOC Algorithm
Zhang, W Holmes, WH
The University of New South Wales, AUSTRALIA
Tu5D - Articulatory Modelling 2
Time & Place: 1530 - 1700, Dingo Room
Chair: Frantz Clermont
1530
Tu5D1
Acoustic-articulatory evaluation of the upper vowel-formant region and
its presumed speaker-specific potency
Clermont, F Mokhtari, P
Australian Defence Force Academy, AUSTRALIA
1545
Tu5D2
Control of larynx height in vowel production
Hoole, P Kroos, C
Munich University, GERMANY
1600
Tu5D3
Analyzing the effect of secondary excitations of the vocal tract on vocal
intensity in different loudness conditions
Alku, P Vintturi, J Vilkman, E
University of Turku, FINLAND
1615
Tu5D4
An Analysis of Modal Coupling Effects During the Glottal Cycle :
Formant Synthesizers From Time-Domain Finite-Difference Simulations
Ramsay, G
Institut De La Communication Parlee, FRANCE
1630
Tu5D5
Laryngoscopic Analysis of Pharyngeal Articulations and Larynx-Height
Voice Quality Settings
Esling, J
University of Victoria, CANADA
1645
Tu5D6
Effects of shapes of radiational aperture on radiation characteristics
Matsuzaki, H Motoki, K Miki, N
Hokkai-Gakuen University, JAPAN
Tu5P - Prosody and Emotion 2
Time & Place: 1530 - 1800, Platypus Room
Chair: David House
Tu5P1
De-accentuation: Linguistic Environments and Prosodic Realizations
Alter, K Steinhauer, K Friederici, A D
Max-Planck-Institute of Cognitive Neuroscience, GERMANY
Tu5P2
Towards an automatic classification of emotions in speech
Amir, N Ron, S
Center for Technological Education Holon, ISRAEL
Tu5P3
Can we hear smile?
Schröder, M Auberge, V Cathiard, M
ICP - ESA 5009 CNRS, FRANCE
Tu5P4
The Automatic Marking of Prominence in Spontaneous Speech Using
Duration and Part of Speech Information
Aylett, M P Bull, M
University of Edinburgh, UNITED KINGDOM
Tu5P5
On a Pitch Alteration Technique in Excited Cepstral Spectrum for High
Quality TTS
Kim, J D Baek, S J Bae, M J
Soongsil University, KOREA
Tu5P6
Dovetailing of acoustics and prosody in spontaneous speech recognition
Buckow, J Batliner, A Huber, R Noeth, E Warnke, V Niemann, H
University of Erlangen, GERMANY
Tu5P7
A Computational Memory and Processing Model for Prosody
Cahn, J E
M.I.T. Media Laboratory, USA
Tu5P8
Convergence of Fundamental Frequencies in Conversation: If it Happens,
Does it Matter?
Collins, B M
Australian National University, AUSTRALIA
Tu5P9
Analysis and Interpretation of Fundamental Frequency Contours of
British English in Terms of a Command-Response Model
Fujisaki, H Ohno, S Yagi, T Ono, T
Science University of Tokyo, JAPAN
Tu5P10
Common Patterns in Word Level Prosody
Holm, F Hata, K
Speech Technology Laboratory, USA
Tu5P11
Prosodic Structure in Japanese Spontaneous Speech
Horiuchi, Y Ichikawa, A
Chiba University, JAPAN
Tu5P12
An acoustic-phonetic description of word tone in Kagoshima Japanese
Ishihara, S
The Australian National University, AUSTRALIA
Tu5P13
Representing Prosodic Words Using Statistical Models of Moraic
Transition of Fundamental Frequency Contours of Japanese
Iwano, K Hirose, K
University of Tokyo, JAPAN
Tu5P14
Disambiguation of Korean Utterances Using Automatic Intonation
Recognition
Jang, T Song, M Lee, K
University of Edinburgh, UNITED KINGDOM
Tu5P15
Multi-level rhythm control for speech synthesis using hybrid data driven
and rule-based approaches
Jokisch, O Hirschfeld, D Eichner, M Hoffmann, R
Dresden University of Technology, GERMANY
Tu5P16
EGG Model of Ditoneme in Mandarin*
Kong, J
City University of Hong Kong, HONG KONG
Tu5P17
Temporal Organization of Speech for normal and fast rates
Krishnan, G Ward, W
Carnegie Mellon University, USA
Tu5P18
A syllable-based generalization of Japanese accentuation
Kubozono, H
Kobe University, JAPAN
Tu5P19
Non-adjacent Segmental Effects in Tonal Realization of Accentual Phrase
in Seoul Korean
Lee, H-J
UCLA, USA
Tu5P20
Improvement on Connected Numbers Recognition Using Prosodic
Information
Lopez-Gonzalo, E Caminero, J Cortazar, I Hernandez-Gomez, L
ETSIT-UPM, SPAIN
Tu5P21
Phonetic investigation of boundary pitch movements in Japanese
Maeda, K Venditti, J J
University of Pennsylvania, USA
Tu5P22
Phonetic and phonological characteristics of paralinguistic information in
spoken Japanese
Maekawa, K
The National Language Research Institute, JAPAN
Tu5P23
ToBI Accent Type Recognition
Maghbouleh, A
Stanford University, USA
Tu5P24
The Influence of Syllable Structure on the Timing of Intonational Events
in German
Mixdorff, H Fujisaki, H
Dresden University of Technology, GERMANY
Tu5P25
New Prosodic Control Rules for Expressive Synthetic Speech
Mizuno, O Nakajima, S
NTT Human Interface Labs, JAPAN
Tu5P26
The Use of F0 Reliability Function for Prosodic Command Analysis on F0
Contour Generation Model
Nakai, M Shimodaira, H
JAIST, JAPAN
Tu5P27
Analysis of Effects of Lexical Accent, Syntax and Global Speech Rate
upon the Local Speech Rate
Ohno, S Fujisaki, H Taguchi, H
Department of Applied Electronics, Science University of Tokyo, JAPAN
Tu5P28
On the Effects of Speech Rate upon Parameters of the Command-Response
Model for the Fundamental Frequency Contours of Speech
Ohno, S Fujisaki, H Hara, Y
Science University of Tokyo, JAPAN
Tu5P29
The maximum-based description of F0 contours and its application to
English
Portele, T Heuft, B
IKP, University of Bonn, GERMANY
Tu5P30
Perceived prominence and acoustic parameters in American English
Portele, T
IKP, University of Bonn, GERMANY
Tu5P31
Generating Emotional Speech with a Concatenative Synthesizer
Rank, E Pirker, H
OFAI, AUSTRIA
Tu5P32
A Perceptive Measure of Pure Prosody Linguistic Functions with
Reiterant Sentences
Rilliard, A Auberge, V
Institut De La Communication Parlee, FRANCE
Tu5P33
Prosodic Parameters in Emotional Speech
Koike, K Suzuki, H Saito, H
Keio University, JAPAN
Tu5P34
Automatic detection of prominence (as defined by listeners’ judgments) in
read aloud Dutch sentences
Streefkerk, B M Pols, L C W ten Bosch, L F M
Institute of Phonetic Science Amsterdam, THE NETHERLANDS
Tu5P35
A Schema for Illocutionary Act Identification with Prosodic Feature
Tamoto, M Kawabata, T
NTT Basic Research Laboratory, JAPAN
Tu5P36
An Algorithm for Choosing Japanese Acknowledgments using Prosodic
Cues and Context
Tsukahara, W
University of Tokyo, JAPAN
Tu5P37
A Study of Tones and Tempo in Continuous Mandarin Digit Strings and
their Application in Telephone Quality Speech Recognition
Wang, C Seneff, S
MIT Laboratory for Computer Science, USA
Tu5P38
Simulated emotions: an acoustic study of voice and perturbation
measures
Whiteside, S P
University of Sheffield, UNITED KINGDOM
Tu5P39
A Robust Tone Recognition Method of Chinese Based on Sub-syllabic F0
Contours
Zhang, J-S Hirose, K
The University of Tokyo, JAPAN
Tu5P40
The Microprosodics of Tone Sandhi in Shanghai Disyllabic Compounds
Zhu, X S
Australian National University, AUSTRALIA
Tu5P41
Jitter and Shimmer Differences Between Pathological Voices of School
Children
Bolfan-Stosic, N Prizl, T
University of Zagreb, CROATIA
Tu5Q - Neural Networks, Fuzzy and Evolutionary
Methods 1
Time & Place: 1530 - 1800, Quokka Room
Chair: Christian Wellekens
Tu5Q1
A Comparison of Thai Speech Recognition Systems using Hidden Markov
Model, Neural Network, and Fuzzy-Neural Network
Ahkuputra, V Jitapunkul, S Jittiwarangkul, N Maneenoi, E Kasuriya, S
Chulalongkorn University, THAILAND
Tu5Q2
Phoneme recognition with statistical modeling of the prediction error of
neural networks
Freitag, F Monte, E
Polytechnical University of Catalonia, SPAIN
Tu5Q3
Neural network based pronunciation modeling with applications to
speech recognition
Fukada, T Yoshimura, T Sagisaka, Y
ATR Interpreting Telecommunications Research Laboratories, JAPAN
Tu5Q4
A Comparative Study of OCON and MLP Architectures for Phoneme
Recognition
Haskey, S Datta, S
Loughborough University, UNITED KINGDOM
Tu5Q5
Evaluation and Integration of Neural-Network Training Techniques for
Continuous Digit Recognition
Hosom, J P Cole, R A Cosi, P
Oregon Graduate Institute, USA
Tu5Q6
Hierarchical Neural Networks (HNN) for Chinese Continuous Speech
Recognition
Jia, Y Du, L Hou, Z
Chinese Academy of Sciences, P.R. CHINA
Tu5Q7
Neural Network Motivation for Segmental Distribution
Keller, E
University of Lausanne, SWITZERLAND
Tu5Q8
Combining Connectionist Multi-Band and Full-Band Probability Streams
for Speech Recognition of Natural Numbers
Mirghafori, N Morgan, N
International Computer Science Institute, USA
Tu5Q9
Initial Speech Recognition Results using the Multinet Architecture
Pizzolato, E B Reynolds, TJ
University of Essex, UNITED KINGDOM
Tu5Q10
Selection of the Optimal Structure of the Continuous HMM Using the
Genetic Algorithm
Takara, T Iha, Y Nagayama, I
University of the Ryukyus, JAPAN
Tu5Q11
A Proposed Decision Rule for Speaker Recognition Based on Fuzzy
C-Means Clustering
Tran, D Wagner, M Le, T V
University of Canberra, AUSTRALIA
Tu5Q12
Fuzzy Gaussian Mixture Models for Speaker Recognition
Tran, D Le, T V Wagner, M
University of Canberra, AUSTRALIA
Tu5Q13
A New Strategy of Fuzzy-Neural Network for Thai Numeral Speech
Recognition
Wutiwiwatchai, C Jitapunkul, S Ahkuputra, V Maneenoi, E
Luksaneeyanawin, S
Chulalongkorn University, THAILAND
Tu5Q14
Thai Polysyllabic Word Recognition Using Fuzzy-Neural Network
Wutiwiwatchai, C Jitapunkul, S Ahkuputra, V Maneenoi, E
Luksaneeyanawin, S
Chulalongkorn University, THAILAND
Tu5R - Utterance Verification and Word Spotting 1 /
Speaker Adaptation 1
Time & Place: 1530 - 1800, Rosella Room
Chair: Douglas O'Shaughnessy
Tu5R1
Word Verification Using Confidence Measures in Speech Recognition
Benitez, C Rubio Ayuso, A Garcia Teodoro, P Diaz Verdejo, J E
Universidad De Granada, SPAIN
Tu5R2
Improving Posterior Based Confidence Measures in Hybrid HMM/ANN
Speech Recognition Systems
Bernardis, G Bourlard, H
IDIAP, SWITZERLAND
Tu5R3
Two-pass Utterance Verification Algorithm for Long Natural Numbers
Recognition
Caminero, J Lopez, E Hernandez, L
ETSIT-UPM, SPAIN
Tu5R4
A*-Admissible Key-Phrase Spotting with Sub-Syllable Level Utterance
Verification
Chen, B Wang, H-M Shien, L-F Lee, L-S
Academica Sinica, Taiwan, TAIWAN R.O.C.
Tu5R5
Speaker-Independent Upfront Dialect Adaptation in a Large Vocabulary
Continuous Speech Recognizer
Fischer, V Gao, Y Janke, E
IBM Speech Systems, GERMANY
Tu5R6
Word-Based Acoustic Confidence Measures for Large-Vocabulary
Speech Recognition
Gunawardana, A Hon, H-W Jiang, L
Microsoft Research, USA
Tu5R7
Improved Utterance Rejection using Length Dependent Thresholds
Gupta, S K Soong, F
Bell Laboratories - Lucent Technologies, USA
Tu5R8
Bayesian Constrained Frequency Warping HMMs for Speaker
Normalization
Ho, C H Vaseghi, S Chen, A
Queen's University of Belfast, UNITED KINGDOM
Tu5R9
An Evaluation of Keyword Spotting Performance utilizing False Alarm
Rejection based on Prosodic Information
Ida, M Yamasaki, R
OMRON Corporation, JAPAN
Tu5R10
Predictive Speaker Adaptation and Its Prior Training
Tran, D Iso, K-I
NEC, JAPAN
Tu5R11
Powerful Syllabic Fillers for General-Task Keyword-Spotting and
Unlimited-Vocabulary Continuous-Speech Recognition
El Meliani, R O'Shaughnessy, D
INRS-Telecommunications, CANADA
Tu5R12
Confidence Scoring for Speech Understanding Systems
Pao, C Schmid, P Glass, J
MIT Laboratory for Computer Science, USA
Tu5R13
Phonological Rules for Enhancing Acoustic Enrollment of Unknown
Words
Ramabhadran, B Ittycheriah, A
IBM T.J. Watson Research Center, USA
Tu5R14
Recognition-Based Word Counting for Reliable Barge-in and Early
Endpoint Detection in Continuous Speech Recognition
Setlur, A R Sukkar, R A
Lucent Technologies, USA
Tu5R15
Linear Discriminant - A New Criterion for Speaker Normalization
Westphal, M Schultz, T Waibel, A
Carnegie Mellon University, USA
Tu5R16
Confidence Measures Derived from an Acceptor HMM
Williams, G Renals, S
Sheffield University, UNITED KINGDOM
Tu5R17
Telephone Speech Multi-Keyword Spotting Using Fuzzy Search
Algorithm and Prosodic Verification
Wu, C-H Chen, Y-J Hung, Y-C
National Cheng Kung University, TAIWAN R.O.C.
Tu5R18
Topic Recognition for News Speech Based on Keyword Spotting
Yamashita, Y Tsunekawa, T Mizoguchi, R
Ritsumeikan University, JAPAN
Tu6A - Text-To-Speech Synthesis 2
Time & Place: 1700 - 1800, Antechinus Room
Chair: Susan Hertz
1700
Tu6A1
Prosody Prediction for Speech Synthesis using Transformational
Rule-based Learning
Fordyce, C Ostendorf, M
Lernout & Hauspie Speech Products, USA
1715
Tu6A2
Representing the environments for phonological processes in an
accent-independent lexicon for synthesis of English
Fitt, S Isard, S
University of Edinburgh, UNITED KINGDOM
1730
Tu6A3
Efficient Lexical Retrieval for English Text-to-Speech Synthesis
Faulkner, D Bryant, C
Aculab Plc, UNITED KINGDOM
Tu6B - Spoken Language Models and Dialog 4
Time & Place: 1700 - 1800, Bandicoot Room
Chair: James Glass
1700
Tu6B1
SQEL: A Multilingual and Multifunctional Dialog System
Aretoulaki, M Harbeck, S Gallwitz, F Noeth, E Niemann, H Ivanecky, J
Ipsic, I Pavesic, N Matousek, V
University of Erlangen, GERMANY
1715
Tu6B2
Semi-automated incremental prototyping of spoken dialog systems
Kaspar, S Hoffmann, A
University of New South Wales, AUSTRALIA
1730
Tu6B3
Beyond Structured Dialogues: Factoring out Grounding
Heeman, P Johnston, M Denney, J Kaiser, E
Oregon Graduate Institute, USA
Tu6D - Human Speech Perception 1
Time & Place: 1700 - 1800, Dingo Room
Chair: Anne Cutler
1700
Tu6D1
Heads and tails in word perception: Evidence for 'Early-to-Late'
Processing in listening and reading
Nooteboom, S G Van Dijk, M
Utrecht University, THE NETHERLANDS
1715
Tu6D2
Evidence for Early Effects of Sentence Context on Word Segmentation
te Riele, S M M Quené, H
Utrecht University, THE NETHERLANDS
1730
Tu6D3
Assimilation and anticipation in word perception
Quené, H van Rossum, M van Wijck, M
Utrecht University, THE NETHERLANDS
1745
Tu6D4
Lexical activation by assimilated and reduced tokens
Kelly, M L Bard, E G Sotillo, C
University of Edinburgh, UNITED KINGDOM
Wednesday 2 December, 1998
We1A - Robust Speech Processing in Adverse
Environments 3
Time & Place: 0900 - 1000, Antechinus Room
Chair: Douglas Reynolds
0900
We1A1
Linear and Nonlinear Speech Feature Analysis for Stress Classification
Zhou, G Hansen, J H L Kaiser, J F
Robust Speech Processing Laboratory, USA
0915
We1A2
Speech Feature Modeling for Robust Stressed Speech Recognition
Bou-Ghazale, S E Hansen, J H L
Robust Speech Processing Laboratory, USA
0930
We1A3
Combining Articulatory and Acoustic Information for Speech Recognition
in Noisy and Reverberant Environments
Kirchhoff, K
University of Bielefeld, GERMANY
0945
We1A4
Improving Speaker Identification Performance in Reverberant
Conditions using Lip Information
Wark, T Sridharan, S
Queensland University of Technology, AUSTRALIA
We1B - Speech and Hearing Disorders 1
Time & Place: 0900 - 1000, Bandicoot Room
Chair: Florien Koopmans-van-Bienum
0900
We1B1
Adults with a severe-to-profound hearing impairment. Investigating the
effects of linguistic context on speech perception.
Flynn, M Dowell, R Clark, G
University of Melbourne, AUSTRALIA
0915
We1B2
Speech Perception in Dyslexia: Measurements from Birth Onwards
Koopmans-van Beinum, F J Schwippert, C Kuijpers, C T L
University of Amsterdam, THE NETHERLANDS
0930
We1B3
An Acoustic Analysis of Vowel Production Across Tasks in a Case of
Non-fluent Progressive Aphasia
Croot, K
University of Sydney / Macquarie University, AUSTRALIA
0945
We1B4
Speech technology in clinical environments:
van Doorn, J McLeod, S Baker, E Purcell, A Thorpe, W
The University of Sydney (Cumberland Campus), AUSTRALIA
We1C - Prosody and Emotion 3
Time & Place: 0900 - 1000, Currawong Room
Chair: Mark Swerts
0900
We1C1
What Spreads, and How? Tonal Rightward Spreading On Shanghai
Disyllabic Compounds
Zhu, X S
Australian National University, AUSTRALIA
0915
We1C2
Tonal Complexity as a Dialectal Feature: 25 Different Citation Tones from
Four Zhejiang Wu Dialects
Zhu, S X Rose, P
Australian National University, AUSTRALIA
0930
We1C3
Emotional Speech Synthesis: from speech database to TTS
Montero, J M Gutierrez-Arriola, J Palazuelos, S Enriquez, E Aguilera, S
Pardo, J M
GTH-IEL-UPM, SPAIN
0945
We1C4
Some Acoustic Characteristics of Emotion
Pereira, C Watson, C
SHLRC, Macquarie University, AUSTRALIA
We1D - Spoken Language Understanding Systems 1
Time & Place: 0900 - 1000, Dingo Room
Chair: David Grayden
0900
We1D1
Galaxy-II: A Reference Architecture for Conversational System
Development
Seneff, S Hurley, E Lau, R Pao, C Schmid, P Zue, V
MIT, USA
0915
We1D2
Improvements in Speech Understanding Accuracy Through the
Integration of Hierarchical Linguistic, Prosodic, and Phonological
Constraints in the Jupiter Domain
Chung, G Seneff, S
MIT Laboratory for Computer Science, USA
0930
We1D3
Towards Robust Methods for Spoken Document Retrieval
Ng, K
MIT, USA
We1P - Signal Processing and Speech Analysis 1
Time & Place: 0900 - 1130, Platypus Room
Chair: Aaron Rosenberg
We1P1
Maximum a Posteriori Pitch Tracking
Droppo, J Acero, A
Microsoft Corporation, USA
We1P2
Vowel Separation Using the Reassigned Amplitude-Modulation Spectrum
Yang, D Meyer, G F Ainsworth, W A
Keele University, UNITED KINGDOM
We1P3
Feature Decorrelation Methods in Speech Recognition. A Comparative
Study
Batlle, E Nadeu, C Fonollosa, J A R
Polytechnical University of Catalonia, SPAIN
We1P4
Multi-resolution for Speech Analysis
Caraty, M-J Montacie, C
Universite Pierre Et Marie Curie, FRANCE
We1P5
Dynamic features in Children's Vowels
Cassidy, S Watson, C
Macquarie University, AUSTRALIA
We1P6
Effectiveness of phase-corrected RASTA for continuous speech
recognition
de Veth, J Boves, L
University of Nijmegen, THE NETHERLANDS
We1P7
Techniques for capturing temporal variations in speech signals with
fixed-rate processing
Dharanipragada, S Gopinath, R Rao, B
IBM T. J. Watson Research Center, USA
We1P8
Automatic Detection of Landmark for Nasal Consonants from Speech
Waveform
Du, L Stevens, K N
Chinese Academy of Sciences, P.R. CHINA
We1P9
Plug and Play Software for Designing High-Level Speech Processing
Systems
Dutoit, T Schroeter, J
Faculte Polytechnique De Mons, BELGIUM
We1P10
Creating speaker independent HMM models for restricted database using
STRAIGHT-TEMPO morphing
Girardi, A Shikano, K Nakamura, S
Nara Institute of Science and Technology, JAPAN
We1P11
Restoration of hyperbaric speech by correction of the formants and the
pitch
Charonnat, L Guitton, M Crestel, J Allee, J
ENSSAT, FRANCE
We1P12
Voice Conversion based on parameter transformation
Gutiérrez-Arriola, J M Hsiao, Y S Montero, J M
Pardo, J M Childers, D G
GTH-IEL-UPM, SPAIN
We1P13
Noise Robust Two-Stream Auditory Feature Extraction Method for
Speech Recognition
Tian, J Hariharan, R Laurila, K
Nokia Research Center, FINLAND
We1P14
Heterogeneous Measurements and Multiple Classifiers for Speech
Recognition
Halberstadt, A K Glass, J R
MIT, USA
We1P15
Joint Recognition and Segmentation using Phonetically Derived Features
and a Hybrid Phoneme Model
Harte, N Vaseghi, S Milner, B
Queen's University of Belfast, UNITED KINGDOM
We1P16
TRAPS - Classifiers of Temporal Patterns
Hermansky, H Sharma, S
Oregon Graduate Institute, USA
We1P17
Robust measurement of fundamental frequency and degree of voicing
Holmes, J N
UNITED KINGDOM
We1P18
Micropower Electro-Magnetic Sensors for Speech Characterization,
Recognition, Verification, and Other Applications
Holzrichter, J F Burnett, G C Gable, T J Ng, L C
Lawrence Livermore National Laboratory, USA
We1P19
Robust Entropy-based Endpoint Detection for Speech Recognition in
Noisy Environments
Shen, J-L Hung, J-W Lee, L-S
Academia Sinica, TAIWAN R.O.C.
We1P20
Statistical Integration of Temporal Filter Banks for Robust Speech
Recognition Using Linear Discriminant Analysis (LDA)
Shen, J-L Hwang, W-L
Academia Sinica, TAIWAN R.O.C.
We1P21
Feature-Based Approach to Speech Recognition
Iskra, D J Edmondson, W H
University of Birmingham, UNITED KINGDOM
We1P22
Periodicity Emphasis of Voice Wave using Nonlinear IIR Digital Filters
and Its Applications
Kamata, H Kaneko, A Ishida, Y
Meiji University, JAPAN
We1P23
Speech recognition via phonetically featured syllables
King, S Stephenson, T Isard, S Taylor, P Strachan, A
CSTR, University of Edinburgh, UNITED KINGDOM
We1P24
Do Phonetic Features Help to Improve Consonant Identification in ASR?
Koreman, J Andreeva, B Barry, W J
University of the Saarland, GERMANY
We1P25
Perceptual and acoustic properties of phonemes in continuous speech for
different speaking rate
Kuwabara, H
Teikyo University of Science & Technology, JAPAN
We1P26
On Robust Sequential Estimator Based on t-distribution with Forgetting
Factor for Speech Analysis
Lee, J Lee, K Y
Dong-Ah Broadcasting College, KOREA
We1P27
Discriminant Wavelet Basis Construction for Speech Recognition
Long, C Datta, S
Loughborough University, UNITED KINGDOM
We1P28
An Efficient Mel-LPC Analysis Method for Speech Recognition
Matsumoto, H Nakatoh, Y Furuhata, Y
Shinshu University, JAPAN
We1P29
Discriminative Weighting of Multi-resolution Sub-band Cepstral
Features for Speech Recognition
McMahon, P McCourt, P Vaseghi, S
Queens University of Belfast, NORTHERN IRELAND
We1P30
Separation of Singing and Piano Sounds
Meron, Y Hirose, K
Tokyo University, JAPAN
We1P31
Modeling of Variations in Cepstral Coefficients Caused by F0 Changes
and its Application to Speech Processing
Minematsu, N Nakagawa, S
Toyohashi University of Technology, JAPAN
We1P32
A Detection Framework for Locating Phonetic Events
Niyogi, P Mitra, P P Sondhi, M M
Bell Laboratories, Lucent Technologies, USA
We1P33
On Frequency Averaging for Spectral Analysis in Speech Recognition
Nadeu, C Galindo, F Padrell, J
Polytechnical University of Catalonia, SPAIN
We1P34
Wavelet transform domain blind equalization and its application to
speech analysis
Namba, M Ishida, Y
Meiji University, JAPAN
We1P35
A Novel Method of Formant Analysis and Glottal Inverse Filtering
Pearson, S
Speech Technology Lab/Panasonic Technologies Inc., USA
We1P36
Vector Quantizer Acceleration for an Automatic Speech Recognition
Application
Araújo, A Pera, V Souza, M
FEUP, PORTUGAL
We1P37
Local Speech Rate as a Combination of Syllable and Phone Rate
Pfitzinger, H R
University of Munich, GERMANY
We1P38
Recovering gestures from speech signals: a preliminary study for nasal
vowels
Rossato, S Feng, G Laboissière, R
Institut De La Communication Parlee, FRANCE
We1P39
Extended Linear Discriminant Analysis (ELDA) for Speech Recognition
Ruske, G Faltlhauser, R Pfau, T
Technical University of Munich, GERMANY
We1P40
Speech, Silence, Music and Noise Classification of TV Broadcast Material
Samouelian, A Robert-Ribes, J Plumpe, M
University of Wollongong, AUSTRALIA
We1P41
The relation between vocal tract shape and formant frequencies can be
described by means of a system of coupled differential equations
Schoentgen, J Soquet, A Lecuit, V Ciocea, S
Universite Libre De Bruxelles, BELGIUM
We1P42
Improving speech recognizer by broader acoustic-phonetic group
classification
Suh, Y Hwang, K Kwon, O Park, J
ETRI, KOREA
We1P43
Separation of speech source and filter by time-domain deconvolution
Thorpe, C W
National Voice Centre, Australia, AUSTRALIA
We1P44
On the Application of the AM-FM Model for the Recovery of Missing
Frequency Bands of Telephone Speech
Tolba, H O'Shaughnessy, D
INRS-Telecommunications, CANADA
We1P45
Estimation of Voice Source and Vocal Tract Parameters Using Combined
Subspace-based and Amplitude Spectrum-based Algorithm
Yang, C-S Kasuya, H
Utsunomiya University, JAPAN
We1P46
The Distance Measure for Line Spectrum Pairs Applied to Speech
Recognition
Zheng, F Song, Z-J Li, L Yu, W-J Zheng, F-Z Wu, W-H
Tsinghua University, P.R. CHINA
We1Q - Spoken Language Generation and Translation 1
Time & Place: 0900 - 1130, Quokka Room
Chair: Yoshinori Sagisaka
We1Q1
The Modeling and Realization of Natural Speech Generation System
Chen, F Yuan, B
Northern Jiaotong University, P.R. CHINA
We1Q2
'Ko Tok Ples Ensin bilong Tok Pisin' or the TP-CLE: A first report from
a pilot speech-to-speech translation project from Swedish to Tok Pisin
Eklund, R
Telia Research AB, SWEDEN
We1Q3
An Iterative, DP-based Search Algorithm for Statistical Machine
Translation
Garcia-Varea, I Casacuberta, F Ney, H
Universidad Politecnic De Valencia, SPAIN
We1Q4
Information extraction and text generation of news reports for a
Swedish-English bilingual spoken dialogue system
Gawronska, B House, D
University of Skovde, SWEDEN
We1Q5
Utterance Generation for Transaction Dialogues
Hulstijn, J Van Hessen, A
University of Twente, THE NETHERLANDS
We1Q6
Example-Based Error Recovery Method for Speech Translation:
Repairing Sub-Trees According to the Semantic Distance
Ishikawa, K Sumita, E Iida, H
ATR Interpreting Telecommunications Research Laboratories, JAPAN
We1Q7
Context Sensitive Generation of Descriptions
Krahmer, E Theune, M
IPO Center for Research On User-System Interaction, THE NETHERLANDS
We1Q8
An Interlingua Based on Domain Actions for Machine Translation of
Task-Oriented Dialogues
Levin, L Gates, D Lavie, A Waibel, A
Carnegie Mellon University, USA
We1Q9
Generating Pitch Accents in a Concept-To-Speech System using a
Knowledge Base
Williams, S H
Microsoft Research Institute, Macquarie University, AUSTRALIA
We1Q10
Making the most of multiplicity: a multi-parser, multi-strategy
architecture for the robust processing of spoken language
Ruland, T Rupp, C J Spilker, J Weber, H Worm, K L
Siemens AG / University of the Saarland, GERMANY
We1Q11
Natural-Sounding Speech Synthesis Using Variable-Length Units
Yi, J R W Glass, J R
MIT, USA
We1R - Spoken Language Models and Dialog 5
Time & Place: 0900 - 1130, Rosella Room
Chair: Pietro Laface
We1R1
A robust dialogue model for spoken dialogue processing
Araki, M Doshita, S
Kyoto University, JAPAN
We1R2
The REWARD Service Creation Environment. An Overview
Brøndsted, T Bai, B Olsen, J
Aalborg University, DENMARK
We1R3
An Analysis of the Timing of Turn-Taking in a Corpus of Goal-Oriented
Dialogue
Bull, M C Aylett, M
University of Edinburgh, UNITED KINGDOM
We1R4
The provision of corrective feedback in a spoken dialogue CALL system
Davies, S Poesio, M
University of Edinburgh, HCRC, UNITED KINGDOM
We1R5
Evaluation of dialog strategies for a tourist information retrieval system
Devillers, L Bonneau-Maynard, H
LIMSI/CNRS, FRANCE
We1R6
Designing a Multimodal Dialogue System for Information Retrieval
Furui, S Yamaguchi, K
Tokyo Institute of Technology, JAPAN
We1R7
The Research Project of Man-Computer dialogue System in Chinese
Guan, D Chu, M Zhang, Q Liu, J Zhang, H
Chinese Academy of Sciences, P.R. CHINA
We1R8
Interfaces for Speech Recognition Systems: The Impact of Vocabulary
Constraints and Syntax on Performance
Hone, K Golightly, D
ICL Institute of Information Technology, UNITED KINGDOM
We1R9
Pacing Spoken Directions to Suit the Listener
Iwase, T Ward, N
University of Tokyo, JAPAN
We1R10
A Spoken Dialogue System Utilizing Spatial Information
Flycht-Eriksson, A Jonsson, A
Linkoping University, SWEDEN
We1R11
From Novice to Expert: The Effect of Tutorials on User Expertise with
Spoken Language Dialogue Systems - An Empirical Analysis of the Efficacy
of Tutorial Dialogues
Kamm, C A Litman, D J Walker, M A
AT&T Labs Research, USA
We1R12
Emergent Computational Dialogue Management Architecture for
Task-Oriented Spoken Dialogue Systems
Kawabata, T
NTT Basic Research Labs, JAPAN
We1R13
An Analysis of Dialogues with Our Dialogue System through a WWW
page
Kumamoto, T Ito, A
Communications Research Laboratory, JAPAN
We1R14
Modelling spoken dialogues with state transition diagrams: experiences of
the CSLU toolkit
McTear, M
University of Ulster, UNITED KINGDOM
We1R15
Situated Dialogue Coordination for Spoken Dialogue Systems
Okada, M Suzuki, N Terken, J
ATR MI & C Research Laboratories, JAPAN
We1R16
Robust spoken dialogue systems for consumer products: a concrete
application
Pouteau, X Arevalo, L
IPO, Center for Research On User-System Interaction, THE NETHERLANDS
We1R17
A German Dialogue System for Scheduling Dates and Meetings by
Naturally Spoken Continuous Speech
Willett, D Roemer, A Rottland, J Rigoll, G
Duisburg University, GERMANY
We1R18
Spoken Dialogue System Using Corpus-Based Hidden Markov Model
Wu, C-H Yan, G-L Lin, C-L
National Cheng Kung University, TAIWAN R.O.C.
We1R19
A Realistic Wizard of Oz simulation of a Multimodal Spoken Language
System
Wyard, P Churcher, G
BT Laboratories, UNITED KINGDOM
We1R20
A Syllable-Based Chinese Spoken Dialogue System for Telephone
Directory Services Primarily Trained with a Corpus
Yang, Y-J Lee, L-S
National Taiwan University, TAIWAN R.O.C.
We1R21
How disagreement expressions are used in cooperative tasks
Yano, H Ito, A
Communications Research Laboratory, JAPAN
We2A - Segmentation, Labelling and Speech Corpora 1
Time & Place: 1030 - 1130, Antechinus Room
Chair: Bill Barry
1030
We2A1
Acoustic Indicators of Topic Segmentation
Hirschberg, J Nakatani, C
AT & T Labs - Research, USA
1045
We2A2
IViE - A Comparative Transcription system for Intonational Variation in
English
Grabe, E Nolan, F Farrar, K J
University of Cambridge, UNITED KINGDOM
1100
We2A3
Automatic Segmental and Prosodic Labeling of Mandarin Speech
Database
Chou, F-C Tseng, C-Y Lee, L-S
National Taiwan University, TAIWAN R.O.C.
1115
We2A4
Automatic Labelling of German Prosody
Rapp, S
Sony International (Europe) GmbH, GERMANY
We2B - Multimodal Spoken Language Processing 2
Time & Place: 1030 - 1130, Bandicoot Room
Chair: Dominic Massaro
1030
We2B1
Speech Driven 3-D Face Point Trajectory Synthesis Algorithm
Arslan, L M Talkin, D
Entropic Research Lab, USA
1045
We2B2
Speech-to-Lip Movement Synthesis based on the EM Algorithm using
Audio-Visual HMMs
Yamamoto, E Nakamura, S Shikano, K
Nara Institute of Science and Technology, JAPAN
1100
We2B3
Learning Words from Natural Audio-Visual Input
Roy, D Pentland, A
MIT Media Laboratory, USA
1115
We2B4
Using the Multi-Stream Approach for Continuous Audio-Visual Speech
Recognition: Experiments on the M2VTS Database
Dupont, S Luettin, J
Faculte Polytechnique De Mons, BELGIUM
We2C - Prosody and Emotion 4
Time & Place: 1030 - 1130, Currawong Room
Chair: Hiroya Fujisaki
1030
We2C1
Intonative structure as a determinant of word order variation in Dutch
verbal endgroups
Swerts, M
IPO, Center for Research On User-System Interaction, THE NETHERLANDS
1045
We2C2
Experiments on the Meaning of Two Pitch Accent Types: The 'Pointed
Hat' Versus the Accent-Lending Fall in Dutch
Caspers, J
Holland Institute of Generative Linguistics, NETHERLANDS
1100
We2C3
Phonetic and Phonological Markers of Contrastive Focus in Korean
Jun, S-A Lee, H-J
UCLA, USA
1115
We2C4
Reconciling two competing views on contrastiveness
Krahmer, E Swerts, M
IPO, Center for Research On User-System Interaction, THE NETHERLANDS
We2D - Neural Networks, Fuzzy and Evolutionary
Methods 2
Time & Place: 1030 - 1130, Dingo Room
Chair: Tuan Pham
1030
We2D1
Modular Neural Networks for Low-Complex Phoneme Recognition
Glaeser, A
Ascom Systec Ltd, SWITZERLAND
1045
We2D2
Global Optimisation of Neural Network Models Via Sequential
Sampling-Importance Resampling
de Freitas, N Johnson, S Niranjan, M Gee, A
Cambridge University, UNITED KINGDOM
1100
We2D3
Efficient computation of MMI neural networks for large vocabulary
speech recognition systems
Rottland, J Luedecke, A Rigoll, G
Duisburg University, GERMANY
1115
We2D4
Modular Connectionist Systems for Identifying Complex Arabic Phonetic
Features
Selouani, S A Caelen, J
HB University, ALGERIA
Plenary Session
We3A - Plenary Panel Discussion
Title: Automatic Speech Recognition Is A Solved Problem:
A Fact Or Convenient Fiction?
Time & Place: 1135 - 1235, Antechinus Room
Chair: Roberto Togneri
Moderator: Roger Moore
Panelists:
Hynek Hermansky (Oregon Graduate Institute)
Sadaoki Furui (Tokyo Institute of Technology)
Chin-Hui Lee (Bell Laboratories)
Clive Summerfield (Syrinx Speech Systems)
We4A - Large Vocabulary Continuous Speech
Recognition 1
Time & Place: 1330 - 1500, Antechinus Room
Chair: Sadaoki Furui
1330
We4A1
Real-Time Recognition of Broadcast News
Cook, G Robinson, T Christie, J
Cambridge University, UNITED KINGDOM
1345
We4A2
Automatic Recognition of Korean Broadcast News Speech
Yu, H-J Kim, H Choi, J-S Hong, J-M Park, K-S Lee, J-S Lee, Y-G Lee, H-Y
LG Corporate Institute of Technology, KOREA
1400
We4A3
Telephone-Based Conversational Speech Recognition in the JUPITER
Domain
Glass, J R Hazen, T J
MIT, USA
1415
We4A4
Japanese Large-Vocabulary Continuous Speech Recognition System
Based on Microsoft Whisper
Hon, H Ju, Y Otani, K
Microsoft Research, USA
1430
We4A5
Partitioning and Transcription of Broadcast News Data
Gauvain, J-L Lamel, L Adda, G
LIMSI-CNRS, FRANCE
We4B - Speaker and Language Recognition 2
Time & Place: 1330 - 1500, Bandicoot Room
Chair: Marc Zissman
1330
We4B1
Speaker Detection in Broadcast Speech Databases
Rosenberg, A E Magrin-Chagnolleau, I Parthsarathy, S Huang, Q
AT&T Labs-Research, USA
1345
We4B2
Multilateral Techniques for Speaker Recognition
Parris, E S Carey, M J
Ensigma, UNITED KINGDOM
1400
We4B3
Real Time Speaker Indexing based on subspace method - Application to
TV News Articles and Debate -
Nishida, M Ariki, Y
Ryukoku University, JAPAN
1415
We4B4
SHEEP, GOATS, LAMBS and WOLVES: A Statistical Analysis of
Speaker Performance in the NIST 1998 Speaker Recognition Evaluation
Doddington, G Liggett, W Martin, A Przybocki, M Reynolds, D
MIT Lincoln Laboratory, USA
1430
We4B5
Progress in Speaker Recognition at Dragon Systems
Corrada-Emmanuel, A Newman, M Peskin, B Gillick, L Roth, R
Dragon Systems, USA
1445
We4B6
A Comparative Study of Speaker Verification Systems using the Polycost
Database
Nordström, T Melin, H Lindberg, J
KTH, SWEDEN
We4C - Signal Processing and Speech Analysis 2
Time & Place: 1330 - 1500, Currawong Room
Chair: Kazuyo Tanaka
1300
We4C1
Improving Pitch Estimation with Short Duration Speech Samples
Ainsworth, W A Day, C R Meyer, G F
Keele University, UNITED KINGDOM
1345
We4C2
An instantaneous-frequency-based pitch extraction method for
high-quality speech transformation: revised TEMPO in the STRAIGHT-suite
Kawahara, H de Cheveigne, A Patterson, R
Wakayama University, JAPAN
1400
We4C3
Speaker-Independent Speech Recognition using Micro Segment Spectrum
Integration
Aikawa, K
NTT Human Interface Laboratories, JAPAN
1415
We4C4
On Robust Speech Analysis Based on Time-Varying Complex AR Model
Funaki, K Miyanaga, Y Tochinai, K
Hokkaido University, JAPAN
1430
We4C5
Spectral Basis Functions from Discriminant Analysis
Hermansky, H Malayath, N
Oregon Graduate Institute, USA
We4D - Prosody and Emotion 5
Time & Place: 1330 - 1500, Dingo Room
Chair: Daniel Hirst
1330
We4D1
The Tilt Intonation Model
Taylor, P
University of Edinburgh, UNITED KINGDOM
1345
We4D2
Analysis of Occurrences of Pauses and Their Durations in Japanese Text
Reading
Fujisaki, H Ohno, S Yamada, S
Science University of Tokyo, JAPAN
1400
We4D3
A statistical study of pitch target points in five languages
Campione, E Véronis, J
Université De Provence, FRANCE
1415
We4D4
Fully Automatic Prosody Generator for Text-to-Speech
Malfrere, F Dutoit, T Mertens, P
Faculte Polytechnique De Mons, BELGIUM
1430
We4D5
Automatic Prosodic Labeling of 6 Languages
Vereecken, H Martens, J-P Grover, C Fackrell, J Van Coile, B
Elis-University of Ghent, BELGIUM
1445
We4D6
Automatic Utterance Type Detection Using Suprasegmental Features
Wright, H
The University of Edinburgh, UNITED KINGDOM
We4P- Robust Speech Processing in Adverse
Environments 4
Time & Place: 1330 - 1700, Platypus Room
Chair: Keikichi Hirose
We4P1
Spectral Sequence Compensation based on Continuity of Spectral
Sequence
Akagi, M Iwaki, M Sakaguchi, N
Japan Advanced Institute of Science and Technology, JAPAN
We4P2
Robust Features for Speech Recognition Systems
Bayya, A Yegnanarayana, B
Rockwell Semiconductor Systems, USA
We4P3
Interfacing of CASA and Partial Recognition Based on a Multistream
Technique
Berthommier, F Glotin, H Tessier, E Bourlard, H
Institut De La Communication Parlee, FRANCE
We4P4
An RNN-based compensation method for Mandarin telephone speech
recognition
Chang, S-C Chien, S-C Kuo, C-C
Industrial Technology Research Institute, TAIWAN R.O.C.
We4P5
Robust Speech Recognition Using Discriminative Stream Weighting and
Parameter Interpolation
Chu, S Zhao, Y
University of Illinois, USA
We4P6
Acoustic backing-off in the local distance computation for robust
automatic speech recognition
de Veth, J Cranen, B Boves, L
University of Nijmegen, THE NETHERLANDS
We4P7
Noise Model Selection for Robust Speech Recognition
Docio-Fernandez,L Garcia-Mateo,C
University of Vigo, SPAIN
We4P8
A novel iterative signal enhancement algorithm for noise reduction in
speech
Doclo, S Dologlou, I Moonen, M
KU Leuven - ESAT/SISTA, BELGIUM
We4P9
Missing Data Reconstruction for Robust Automatic Speech Recognition
in the Framework of Hybrid HMM/ANN Systems
Dupont, S
Faculte Polytechnique De Mons, BELGIUM
We4P10
Recognition from GSM Digital Speech
Gallardo-Antolin, A Diaz-de-Maria, F Valverde-Albacete, F
Universidad Carlos III De Madrid, SPAIN
We4P11
Conversational Speech Systems for On-Board Car Navigation and
Assistance
Geutner, P Denecke, M Meier, U Westphal, M Waibel, A
Universitaet Karlsruhe, GERMANY
We4P12
A signal processing system for having the sound "pop-out"
in noise thanks to the image of the speaker’s lips: New advances using Multi-Layer Perceptrons
Girin, L Varin, L Feng, G Schwartz, J L
Institut De La Communication Parlée, FRANCE
We4P13
Robust Speech Activity Detection in the Presence of Noise
Sarikaya, R Hansen, J H L
Robust Speech Processing Laboratory, USA
We4P14
Robust Automatic Speech Recognition by the Application of a
Temporal-Correlation-Based Recurrent Multilayer Neural Network to
the Mel-based Cepstral Coefficients
Heon, M Tolba, H O'Shaughnessy, D
INRS-Telecommunication, CANADA
We4P15
Speech Recognition from GSM Codec Parameters
Huerta, J M Stern, R M
Carnegie Mellon University, USA
We4P16
Improved Parallel Model Combination Based on Better Domain
Transformation for Speech Recognition Under Noisy Environments
Hung, J W Shen, J L Lee, L S
Academia Sinica, TAIWAN R.O.C.
We4P17
Robust Speech/Non-Speech Detection in Adverse Conditions Based on
Noise and Speech Statistics
Karray, L Monne, J
France Telecom - CNET, FRANCE
We4P18
Speech Recognition in Car Noise Environments using Multiple Models
According to Noise Masking Levels
Song, M G Jung, H I Shim, K-J Kim, H S
Pusan National University, KOREA
We4P19
Spectral Noise Subtraction with Recursive Gain Curves
Linhard, K Haulick, T
Daimler Benz AG, GERMANY
We4P20
A Novel Robust Speech Recognition Algorithm Based on Multi-models
and Integrated Decision Method
Pan, S Liu, J Jiang, J-T Wang, Z-Y Lu, D-J
Tsinghua University, P.R. CHINA
We4P21
On the Interaction Between Time and Frequency Filtering of Speech
Parameters for Robust Speech Recognition
Macho, D Nadeu, C
Polytechnical University of Catalonia, SPAIN
We4P22
Inference of Missing Spectrographic Features for Robust Speech Recognition
Raj, B Singh, R Stern, R M
Carnegie Mellon University, USA
We4P23
SNR-dependent flooring and noise overestimation for joint application of
spectral subtraction and model combination
Schless, V Class, F
Daimler-Benz AG, GERMANY
We4P24
Improved Robust Speech Recognition Considering Signal Correlation
Approximated by Taylor Series
Shen, J-L Hung, J-W Lee, L-S
Academia Sinica, TAIWAN R.O.C.
We4P25
Speech Recognition in Noisy Environment Using Weighted
Projection-Based Likelihood Measure
Shin, W H Kim, W G Lee, C Cha, I W
Yonsei University, KOREA
We4P26
Evaluation of Model Adaptation by HMM Decomposition on Telephone
Speech Recognition
Takiguchi, T Nakamura, S Shikano, K Morishima, M Isobe, T
Nara Institute of Science and Technology, JAPAN
We4P27
Comparative Experiments to Evaluate a Voiced-Unvoiced-Based
Pre-Processing Approach to Robust Automatic Speech Recognition in
Low-SNR Environments
Tolba, H O'Shaughnessy, D
INRS-Telecommunications, CANADA
We4P28
Signal Extraction from Noisy Signal based on Auditory Scene Analysis
Unoki, M Akagi, M
Japan Advanced Institute of Science and Technology, JAPAN
We4P29
Frequency domain binaural model as the front end of speech recognition
system
Usagawa, T Sakai, K Ebata, M
Kumamoto University, JAPAN
We4P30
A Study on the Recognition of Low Bit-Rate Encoded Speech
Yu, A-T Wang, H-C
National Tsing Hua University, TAIWAN R.O.C.
We4P31
Weighted Parallel Model Combination for Noisy Speech Recognition
Hwang, T-H Wang, H-C
National Tsing-Hua University, TAIWAN R.O.C.
We4P32
Favourable and Unfavourable Short Duration Segments of Speech in
Noise
Woo, D
The University of New South Wales, AUSTRALIA
We4Q - Segmentation, Labelling and Speech Corpora 2
Time & Place: 1330 - 1700, Quokka Room
Chair: Paul Dalsgaard
We4Q1
An Efficient Labeling Tool for the QuickSig Speech Database
Karjalainen, M Altosaar, T Huttunen, M
Helsinki University of Technology, FINLAND
We4Q2
Collection and Detailed Transcription of a Speech Database for
Development of Language Learning Technologies
Bratt, H Neumeyer, L Shriberg, E Franco, H
SRI International, USA
We4Q3
Resegmentation of SWITCHBOARD
Deshmukh, N Ganapathiraju, A Gleeson, A Hamaker, J Picone, J
Mississippi State University, USA
We4Q4
Automatic Generation of Visual Scenarios for Spoken Corpora
Acquisition
Aiello, D Delogu, C De Mori, R Di Carlo, A Nisi, M Tummeacciu, S
Fondazione Ugo Bordoni, ITALY
We4Q5
Automatic Detection of Semantic Boundaries based on Acoustic and
Lexical Knowledge
Cettolo, M Falavigna, D
ITC-IRST, ITALY
We4Q6
A New Fast Algorithm for Automatic Segmentation of Continuous Speech
Gholampour, I Nayebi, K
Sharif University of Technology, IRAN
We4Q7
Acoustic Nature and Perceptual Testing of Corpora of Emotional Speech
Iida, A Campbell, W N Iga, S Higuchi, F Yasumura, M
Keio University, JAPAN
We4Q8
Korean Prosodic Break Index Labelling by a new Mixed Method of LDA
and VQ
Kang, P S Kang, J Y Kim, J Y
Chonnam National University, KOREA
We4Q9
MOOSE: Management of Otago Speech Environment
Laws, M R Kilgour, R I
University of Otago, NEW ZEALAND
We4Q10
Phonetic Alignment: Speech Synthesis Based vs. Hybrid HMM/ANN
Malfrere, F Deroo, O Dutoit, T
Faculte Polytechnique De Mons, BELGIUM
We4Q11
Customisation and Quality Assessment of Spoken Language Description
Millar, J B
Australian National University, AUSTRALIA
We4Q12
A Silence/Noise/Music/Speech Splitting Algorithm
Montacie, C Caraty, M-J
Universite Pierre Et Marie Curie, FRANCE
We4Q13
Audio-Visual Segmentation for Content-based Retrieval
Pye, D Hollinghurst, N Mills, T Wood, K
Oracle & Olivetti Research Laboratory, UNITED KINGDOM
We4Q14
Same News is Good News: Automatically Collecting Reoccuring Radio
News Stories
Rapp, S Dogil, G
Sony International (Europe) GmbH, GERMANY
We4Q15
An annotation system for melodic aspects of German spontaneous speech
Brindoepke, C Schaffranietz, B
University Bielefeld, GERMANY
We4Q16
Additional use of phoneme duration hypotheses in automatic speech
segmentation
Stöber, K Hess, W
Bonn University, GERMANY
We4Q17
Towards a minimal standard for dialogue transcripts:A new SGML
architecture for the HCRC Map Task Corpus
Isard, A McKelvie, D Thompson, H S
University of Edinburgh, UNITED KINGDOM
We4R - Speech Technology Applications and Human-
Machine Interface 1
Time & Place: 1330 - 1700, Rosella Room
Chair: Jordi Robert-Ribes
We4R1
Steps toward the integration of speaker recognition in real-world telecom
applications
Glaeser, A Bimbot, F
Ascom Systec Ltd, SWITZERLAND
We4R2
A Bimodal Korean Address Entry/Retrieval System
Chung, H Y Hwang, C J Lee, S W
Yeungnam University, KOREA
We4R3
Usability Evaluation of IVR systems with DTMF and ASR
Delogu, C Di Carlo, A Rotundi, P Sartori, D
Fondazione Ugo Bordoni, ITALY
We4R4
SALSA Version 1.0: A Speech-based Web Browser for Hong Kong
English
Fung, P Cheung, C S Lam, K L Liu, W K Lo, Y Y
University of Science & Technology, HONG KONG
We4R5
A Language for Creating Speech Applications
Pargellis, A Zhou, Q Saad, A Lee, C-H
Bell Labs, Lucent Technologies
We4R6
The use of automatic speech recognition to reduce the interference
between concurrent tasks of driving and phoning
Graham, R Carter, C Mellor, B
Loughborough University, UNITED KINGDOM
We4R7
Interactive Listening to Structured Speech Content on the Internet
Hirayama, M J Sugahara, T Peng, Z Yamazaki, J
Hewlett Packard Laboratories Japan, JAPAN
We4R8
MSF Format for the Representation of Speech Synchronized Moving
Image
Jo, C W
Changwon National University, KOREA
We4R9
Effects of Using Speech in Timetable Information Systems for WWW
Qvarfordt, P Jönsson, A
Linkoping University, SWEDEN
We4R10
The Interactive Systems Labs View4You video indexing system
Kemp, T Geutner, P Schmidt, M Tomaz, B Weber, M Westphal, M
Waibel, A
University of Karlsruhe, GERMANY
We4R11
SEMOLE: A Robust Framework for Gathering Information from the
World Wide Web
Kim, H J Hetherington, L
MIT Spoken Language Systems, USA
We4R12
Enhancing a WIMP Based Interface with Speech, Gaze Tracking and
Agents
Bakman, L Blidegn, M Wittrup, M Larsen, L B Moeslund, T B
Aalborg University, DENMARK
We4R13
Now You Hear It, Now You Don't: Empirical Studies of Audio Browsing
Behavior
Nakatani, C Whittaker, S Hirschberg, J
AT & T Labs - Research, USA
We4R14
A Voice Verifier for Face/Voice Based Person Verification System
Qiao, R-Y Choi, Y Agbinya, J I
CSIRO, Australia, AUSTRALIA
We4R15
On the use of automatic speech recognition for TV captioning
Robert-Ribes, J
CSIRO-MIS, AUSTRALIA
We4R16
An Undergraduate Course on Speech Recognition Based on the CSLU
Toolkit
Serridge, B
Universidad De Las Americas, MEXICO
We4R17
Real time voice alteration based on Linear Prediction
Yang, P-F Stylianou, Y
AT&T Labs - Research, USA
We4R18
Evaluation and Implementation of a Voice-Activated Dialing System with
Utterance Verification
Tan, B T Gu, Y Thomas, T
Vocalis Ltd, UNITED KINGDOM
We4R19
Towards A Mandarin Voice Memo System
Wang, H-M Lin, B-S Chen, B Bai, B-R
Academia Sinica, TAIWAN R.O.C.
We5A - Large Vocabulary Continuous Speech
Recognition 2
Time & Place: 1530 - 1700, Antechinus Room
Chair: Gary Cook
1530
We5A1
Grammatical word graph re-generation for spontaneous speech
recognition
Tsukada, H Yamamoto, H Takezawa, T Sagisaka, Y
ATR Interpreting Telecommunications Research Laboratories, JAPAN
1545
We5A2
Compression Algorithm of Trigram Language Models based on
Maximum Likelihood Estimation
Yodo, N Shikano, K Nakamura, S
Nara Institute of Science and Technology, JAPAN
1600
We5A3
Morphological Modeling of Word Classes for Langauge Models
Uebler, U Niemann, H
Bavarian Research Center for Knowledge Based Systems (FORWISS),
GERMANY
1615
We5A4
A Comparative Study Between Polyclass and Multiclass Language
Models
Zitouni, I Smaili, K Haton, J-P Deligne, S Bimbot, F
LORIA/INRIA Lorraine, FRANCE
1630
We5A5
Log-Linear Interpolation of Language Models
Klakow, D
Philips Research Laboratories, GERMANY
1645
We5A6
The Applicability of Adaptive Language Modelling for the Broadcast
News Task
Clarkson, P R Robinson, A J
Cambridge University, UNITED KINGDOM
We5B - Text-To-Speech Synthesis 3
Time & Place: 1530 - 1700, Bandicoot Room
Chair: Nick Campbell
1530
We5B1
The IBM Trainable Speech Synthesis System
Donovan, R E Eide, E M
IBM, USA
1545
We5B2
ProSynth: An Integrated Prosodic Approach to Device-Independent,
Natural-Sounding Speech Synthesis
Hawkins, S House, J Huckvale, M Local, J K Ogden, R
University of Cambridge, UNITED KINGDOM
1600
We5B3
Total Quality Evaluation of Speech Synthesis Systems
Zhang, J Dong, S Yu, G
Institute of Acoustics, P.R. CHINA
1615
We5B4
Comparative Evaluation of Synthetic Prosody with the PURR Method
Sonntag, G P Portele, T
IKP, University of Bonn, GERMANY
1630
We5B5
SABLE: A Standard for TTS Markup
Sproat, R Hunt, A Ostendorf, M Taylor, P Black, A Lenzo, K
Bell Laboratories / Lucent Technologies, USA
1645
We5B6
Prosodic vs. segmental contributions to naturalness in a diphone
synthesizer
Bunnell, H T Hoskins, S R Yarrington, D M
DuPont Hospital for Children / University of Delaware, USA
We5C - Language Acquisition 1
Time & Place: 1530 - 1700, Currawong Room
Chair: Jean Vroomen
1530
We5C1
Non-native productions of Japanese single stops that are too long for one
mora unit
Minagawa-Kawai, Y Kiritani, S
University of Tokyo, JAPAN
1545
We5C2
The Process of Generation and Development of Second Language
Japanese Accentuation
Yamada, N
Ibaraki University, JAPAN
1600
We5C3
Perceptual properties of Russians with Japanese fricatives
Funatsu, S Kiritani, S
Hiroshima Women's University, JAPAN
1615
We5C4
Assessment of Dutch pronunciation by means of automatic speech
recognition technology
Cucchiarini, C de Wet, F Strik, H Boves, L
University of Nijmegen, THE NETHERLANDS
1630
We5C5
Phonetic-level Mispronunciation Detection in non-native Swedish Speech
Langlais, P Öster, A-M Granström, B
Center for Speech Technology, TMH-KTH, SWEDEN
1645
We5C6
Computer-based second language production training by using
spectrographic representation and HMM-based speech recognition scores
Akahane-Yamada, R McDermott, E Adachi, T Kawahara, H Pruitt, J S
ATR Human Information Processing Research Laboratories, JAPAN
We5D - Acoustic Phonetics 1
Time & Place: 1530 - 1700, Dingo Room
Chair: Louis Pols
1530
We5D1
Assimilation of place in Japanese and Dutch
Cutler, A Otake, T
Max-Planck-Institute for Psycholinguistics, NETHERLANDS
1545
We5D2
Prosodic Constraint on V-to-V Coarticulation in Japanese
Kondo, Y Arai, Y
Musashino Art University, JAPAN
1600
We5D3
Postvocalic /r/-deletion in standard Dutch: how experimental phonology
can profit from ASR technology
Cucchiarini, C van den Heuvel, H
University of Nijmegen, THE NETHERLANDS
1615
We5D4
More evidence for the perceptual basis of sound change? Suprasegmental
effects in the development of distinctive nasalisation
Hajek, J Watson, I
University of Melbourne, AUSTRALIA
1630
We5D5
Speech production of vowel sequences using a physiological articulatory
model
Dang, J Honda, K
ATR Human Information Processing Research Labs,, JAPAN
We6A - Speaker Adaptation 2
Time & Place: 1700 - 1800, Antechinus Room
Chair: Frank Soong
1700
We6A1
Eigenvoices for Speaker Adaptation
Kuhn, R Nguyen, P Junqua, J-C Goldwasser, L Niedzielski, N Fincke, S
Field, K Contolini, M
Panasonic Technologies, Inc., USA
1715
We6A2
Speaker Clustering Using Direct Maximisation of the MLLR-Adapted
Likelihood
Johnson, S E Woodland, P C
Cambridge University, UNITED KINGDOM
1730
We6A3
Incremental On-line Speaker Adaptation in Adverse Conditions
Viikki, O Laurila, K
Nokia Research Center, FINLAND
1745
We6A4
Cluster Adaptive Training for Speech Recognition
Gales, M
IBM Almaden Research Center, USA
We6B - Speech Coding 2
Time & Place: 1700 - 1800, Bandicoot Room
Chair: Barry Cheetham
1700
We6B1
Towards a unified model for low bit-rate speech coding using a
recognition-synthesis approach
Holmes, W J
Defence Evaluation and Research Agency, UNITED KINGDOM
1715
We6B2
On the Significance of Temporal Masking in Speech Coding
Skoglund, J Kleijn, W B
Royal Insitute of Technology, SWEDEN
1730
We6B3
Waveform Interpolation Coding with Pitch-Spaced Subbands
Kleijn, W B Yang, H Deprettere, E F
Royal Insitute of Technology, SWEDEN
1745
We6B4
An improved Decomposition Method for WI using IIR Wavelet Filter
Banks
Chong, N R Burnett, I S Chicharo, J F
University of Wollongong, AUSTRALIA
We6C - Hidden Markov Model Techniques 2
Time & Place: 1700 - 1800, Currawong Room
Chair: Jeff Bilmes
1700
We6C1
Real -Time Probabilistic Segmentation for Segment-Based Speech
Recognition
Lee, S Glass, J
MIT, USA
1715
We6C2
Toward Markov random field modeling of speech
Gravier, G Sigelle, M Chollet, G
ENST/TSI and CNRS-URA 820, FRANCE
1730
We6C3
Hidden Markov Models for Trajectory Modeling
Iyer, R Gish, H Siu, M Zavaliagkos, G Matsoukas, S
GTE/BBN Technologies, USA
We6D - Multilingual Perception and Recognition 1
Time & Place: 1700 - 1800, Dingo Room
Chair: Valerie Hazan
1700
We6D1
Bilingual and Dialectal Adaptation and Retraining
Uebler, U Schuessler, M Niemann, H
Bavarian Research Center for Knowledge Based Systems (FORWISS),
GERMANY
1715
We6D2
Language Independent and Language Adaptive Large Vocabulary
Speech Recognition
Schultz, T Waibel, A
Interactive Systems Laboratories, GERMANY
1730
We6D3
A method for measuring the intelligibility and nonnativeness of phone
quality in foreign language pronunciation training
Kawai, G Hirose, K
University of Tokyo, JAPAN
Thursday 3 December, 1998
Th1A - Large Vocabulary Continuous Speech
Recognition 3
Time & Place: 0900 - 1030, Antechinus Room
Chair: Michael Wagner
0900
Th1A1
The BBN Single-Phonetic-Tree Fast-Match Algorithm
Nguyen, L Schwartz, R
BBN Technologies, USA
0915
Th1A2
An Efficient Two-pass Search Algorithm using Word Trellis Index
Lee, A Kawahara, T Shuji, D
Kyoto University, JAPAN
0930
Th1A3
Nozomi - a fast, memory-efficient stack decoder for LVCSR
Schuster, M
ATR Interpreting Telecommunications Research Laboratories, JAPAN
0945
Th1A4
Reducing the OOV Rate in Broadcast News Speech Recognition
Kemp, T Waibel, A
University of Karlsruhe, GERMANY
1000
Th1A5
Using Automatic-Derived Acoustic Sub-Word Units in Large Vocabulary
Speech Recognition
Bacchiani, M Ostendorf, M
Boston University, USA
1015
Th1A6
Fabricating conversational speech data with acoustic models: a program
to examine model-data mismatch
McAllaster, D Gillick, L Scattone, F Newman, M
Dragon Systems Inc, USA
Th1B - Articulatory Modelling 3
Time & Place: 0900 - 1030, Bandicoot Room
Chair: Philip Hoole
0900
Th1B1
An electropalatographic, kinematic, and acoustic analysis of
supralaryngeal correlates of word-level prominence contrasts in English
Harrington, J Beckman, M E Fletcher, J Palethorpe, S
Macquarie University, AUSTRALIA
0915
Th1B2
Consistencies and inconsistencies between EPG and locus equation data
on coarticulation
Tabain, M
Macquarie University, AUSTRALIA
0930
Th1B3
Synergy Between Jaw and Lips/ Tongue Movements : Consequences in
Articulatory Modelling
Bailly, G Badin, P Vilain, A
Institut Communication Parlee, FRANCE
0945
Th1B4
Modelling tongue configuration in German vowel production
Hoole, P
Munich University, GERMANY
1000
Th1B5
Optopalatograph: Real-time feedback of tongue movement in 3D
Wrench, A McIntosh, A D Watson, C Hardcastle, W J
Queen Margaret College, UNITED KINGDOM
1015
Th1B6
Effects of contrastive focal accent on linguopalatal articulation and
coarticulation in the French [kskl] Cluster
Meynadier, Y Pitermann, M Marchal, A
Institut De Phonetique - Laboratoire Parole Et Langage, CNRS, FRANCE
Th1C - Language Acquisition 2
Time & Place: 0900 - 1030, Currawong Room
Chair: Shigeru Kiritani
0900
Th1C1
Spoken Word Identification by Native and Nonnative Speakers of
English: Effects of training, modality, context and phonetic environment
Hardison, D M
University of California, Davis, USA
0915
Th1C2
The Effect of Background Knowledge on First and Second Language
Comprehension Difficulty
Tyler, M D
University of New South Wales, AUSTRALIA
0930
Th1C3
Comparsion of Cross-language Coarticulation: English, Japanese and
Japanese-accented English
Tsukada, K
Macquarie University, AUSTRALIA
0945
Th1C4
Plasticity of non-native phonetic perception and production: A training
study
Imaizumi, S Itoh, H Tamekawa, Y Deguchi, T Mori, K
University of Tokyo, JAPAN
1000
Th1C5
The Relation Between Perceptual and Production Categories in
Acquisition
Watson, I
University of Oxford, UNITED KINGDOM
1015
Th1C6
The development of perceptual cue-weighting in children aged 6 to 12
Hazan, V Barrett, S
Department of Phonetics and Linguistics, UCL, UNITED KINGDOM
Th1D - Speaker and Language Recognition 3
Time & Place: 0900 - 1030, Dingo Room
Chair: Ara Samoulian
0900
Th1D1
Robust speaker verification insensitive to session-dependent utterance
variation and handset-dependent distortion
Matsui, T Aikawa, K
NTT Human Interface Labs, JAPAN
0915
Th1D2
A Comparative Evaluation of Variance Flooring Techniques in
HMM-based Speaker Verification
Melin, H Koolwaaij, J Lindberg, J Bimbot, F
KTH, SWEDEN
0930
Th1D3
Text-Independent Speaker Verification Using Automatically Labelled
Acoustic Segments
Petrovska Delacretaz, D Cernocky, J Henneber, J Chollet, G
Institute of Radioelectronics, CZECH REPUBLIC
0945
Th1D4
A Fast Decoding Algorithm Based on Sequential Detection of the Changes
in Distribution
Li, Q
Bell Labs, Lucent Technologies, USA
1000
Th1D5
Speaker verification with ensemble classifiers based on linear speech
transforms
Olsen, J
Aalborg University, DENMARK
1015
Th1D6
Speaker recognition based on discriminative projection models
Olsen, J
Aalborg University, DENMARK
Th1P - Text-To-Speech Synthesis 4
Time & Place: 0900 - 1230, Platypus Room
Chair: Robert Mannell
Th1P1
A Mixed-Excitation Frequency Domain Model for Time-Scale Pitch-Scale
Modification of Speech
Acero, A
Microsoft Corporation, USA
Th1P2
Analytic Generation of Synthesis Units by Closed Loop Training for
Totally Speaker Driven Text to Speech System (TOS Drive TTS)
Akamine, M Kagoshima, T
Toshiba Kansai Laboratory, JAPAN
Th1P3
Modeling the Microprosody of Pitch and Loudness for Speech Synthesis
with Neural Networks
Vainio, M Altosaar, T
Helsinki University of Technology, FINLAND
Th1P4
Spectral Smoothing for Concatenative Speech Synthesis
Chappell, D T Hansen, J H L H
Robust Speech Processing Laboratory, USA
Th1P5
Mimic: A Voice-Adaptive Phonetic-Tree Speech Synthesiser
Chen, A Vaseghi, S Ho, C
Queen's University of Belfast, UNITED KINGDOM
Th1P6
Automatic Generation of Korean Pronunciation Variants by Multistage
Applications of Phonological Rules
Jeon, J Cha, S Chung, M Park, J Hwang, K
Sogang University, KOREA
Th1P7
Techniques for accurate automatic annotation of speech waveforms
Cox, S Brady, R Jackson, P
University of East Anglia, UNITED KINGDOM
Th1P8
Optimized Stopping Criteria for Tree-Based Unit Selection in
Concatenative Synthesis
Cronk, A Macon, M
Oregon Graduate Institute, USA
Th1P9
Automatic transcription of intonation using an identified prosodic
alphabet
de Tournemire, S
France Telecom, CNET, FRANCE
Th1P10
Frequency Analysis of Phonetic Units for Concatenative Synthesis in
Catalan
Esquerra, I Febrer, A Nadeu, C
Polytechnical University of Catalonia, SPAIN
Th1P11
Investigating the Syntactic Characteristics of English Tone Units
Fang, A C House, J Huckvale, M
University College London, UNITED KINGDOM
Th1P12
The UPC Text-to-Speech System for Spanish and Catalan
Bonafonte, A Esquerra, I Febrer, A Fonollosa, J A R Vallverdu, F
Polytechnical University of Catalonia, SPAIN
Th1P13
The New Version of the ROMVOX Text-To-Speech Synthesis System
Based on a Hybrid Time Domain-LPC Synthesis Technique
Ferencz, A Nagy, I Kovacs, T C S Ferencz, M Ratiu, T
Software ITC, ROMANIA
Th1P14
An F0 Contour Control Model for Totally Speaker Driven Text to Speech
System
Kagoshima, T Morita, M Seto, S Akamine, M
Toshiba Corporation, JAPAN
Th1P15
On the Relationship of Speech Rates with Prosodic Units in Dialogue
Speech
Hirose, K Kawanami, H
University of Tokyo, JAPAN
Th1P16
On the Reduction of Concatenation Artefacts in Diphone Synthesis
Klabbers, E Veldhuis, R
IPO, THE NETHERLANDS
Th1P17
Error Analysis and Confidence Measure of Chinese Word Segmentation
Kuo, C-C Ma, K-Y
Industrial Technology Research Institute, TAIWAN R.O.C.
Th1P18
Energy Contour Generation for a Sentence Using a Neural Network
Learning Method
Lee, J Kang, D Kim, S Sung, K-M
ETRI, KOREA
Th1P19
A computational algorithm for F0 contour generation in Korean
developed with prosodically labeled databases using K-ToBI system
Lee, Y-J Lee, S-H Kim, J-J Ho, H-J Kim, Y-I Kim, S-H Lee, J
Wonkwang University, KOREA
Th1P20
Rapid-Deployment Text-to-Speech in the DIPLOMAT System
Lenzo, K Hogan, C Allen, J
Carnegie Mellon University, USA
Th1P21
Formant diphone parameter extraction utilising a labelled single-speaker
database
Mannell, R
Macquarie University, AUSTRALIA
Th1P22
A New Synthetic Speech/Sound Control Language
Mizuno, O Nakajima, S
NTT Human Interface Labs, JAPAN
Th1P23
A study on the natural-sounding Japanese phonetic word synthesis by
using the VCV-balanced word database that consists of the words uttered
forcibly in two types of pitch accent
Mochizuki, R Arai, Y Honda, T
AVC Research Lab., CED, Matsushita Communication Industrial, JAPAN
Th1P24
Letter to Sound Rules for Accented Lexicon Compression
Pagel, V Lenzo, K Black, A W
Facultie Polytechnique of Mons, BELGIUM
Th1P25
A Name Announcement Algorithm With Memory Size and
Computational Power Constraints
Roth, Z Rosenhouse, J
DSP Semiconductors Ltd., ISRAEL
Th1P26
How a French TTS System Can Describe Loanwords
Sannier, F Belrhali, R Auberge, V
ICP - ESA 5009 CNRS, FRANCE
Th1P27
Improvements in Slovene Text-to-Speech Synthesis
Sef, T Dobnikar, A Gams, M
Jozef Stefan Institute, SLOVENIA
Th1P28
Automatic Rule Generation for Linguistic Features Analysis Using
Inductive Learning Technique - Linguistic Features Analysis in TOS
Drive TTS System
Seto, S Morita, M Kagoshima, T Akamine, M
Toshiba Corporation, JAPAN
Th1P29
Segmental Duration Control Based on an Articulatory Model
Shiga, Y Matsuura, H Nitta, T
Toshiba Corporation, JAPAN
Th1P30
Text Analysis for the Bell Labs French Text-to-Speech System
Tzoukermann, E
Bell Labs - Lucent Technologies, USA
Th1P31
Modeling vowel duration for Japanese text-to-speech synthesis
Venditti, J J van Santen, J P H
Ohio State University / Bell Labs, USA
Th1P32
Towards a Chinese Text-to-Speech System with Higher Naturalness
Wang, R-H Liu, Q Teng, Y Xia, D
University of Science & Technology of China, P.R. CHINA
Th1Q - Spoken Language Understanding Systems 4
Time & Place: 0900 - 1230, Quokka Room
Chair: Robert Dale
Th1Q1
Grammar Fragment Acquisition using Syntactic and Semantic Clustering
Arai, K Wright, J Riccardi, G Gorin, A
NTT Human Interface Laboratories, JAPAN
Th1Q2
Non-Expert Access to Unification-based Speech Understanding
Brøndsted, T
Aalborg University, DENMARK
Th1Q3
Natural Language Call Routing: A Robust, Self-Organizing Approach
Carpenter, B Chu-Carroll, J
Lucent Technologies - Bell Labs, USA
Th1Q4
Automatic Grammar Induction from Semantic Parsing
Ghosh, D Goddeau, D
Nuance Communications, USA
Th1Q5
BTH: An efficient parsing algorithm for word-spotting
Kono, Y Yano, T Sasajima, M
Toshiba Corporation, JAPAN
Th1Q6
Syntax Coordination: Interaction of Discourse and Extrapositions
Kronenberg, S C Kummert, F
University of Bielefeld, GERMANY
Th1Q7
Hierarchical Tag-Graph Search for Spontaneous Speech Understanding
in Spoken Dialog Systems
Lin, B-S Chen, B Wang, H-M Lee, L-S
National Taiwan University, TAIWAN R.O.C.
Th1Q8
Extraction of the dialog act and the topic from utterances in a spoken
dialog system
Niimi, Y Takinaga, N Nishimoto, T
Kyoto Institute of Technology, JAPAN
Th1Q9
Fast Computation of Maximum Entropy / Minimum Divergence Feature
Gain
Printz, H
Carnegie Mellon University, USA
Th1Q10
Stochastic Language Models for Speech Recognition and Understanding
Riccardi, G Gorin, A L
AT&T Labs-Research, USA
Th1Q11
Linguistically Engineered Tools for Speech Recognition Error Analysis
Van Ess-Dykema, C Ries, K
U.S. Department of Defense, USA
Th1Q12
Estimating Entropy of a Language from Optimal Word Insertion Penalty
Takeda, K Ogawa, A Itakura, F
Nagoya University, JAPAN
Th1Q13
A Linguistic Analysis of Repair Signals in Co-operative Spoken Dialogues
Tseng, S-C
University of Bielefeld, GERMANY
Th1Q14
A Hierarchical Language Model for CSR
Valverde-Albacete, F J Pardo, J M
Universidad Carlos III De Madrid, SPAIN
Th1Q15
Spoken Language Understanding Within Dialogues Using a Graphical
Model of Task Structure
Wright, J H Gorin, A L Abella, A
AT&T Labs - Research, USA
Th1Q16
Keyword Extraction of Radio News using Domain Identification based on
Categories of an Encyclopedia
Suzuki, Y Fukumoto, F Sekiguchi, Y
Yamanashi University, JAPAN
Th1R - Human Speech Perception 2
Time & Place: 0900 - 1230, Rosella Room
Chair: Kerrie Lee
Th1R1
Fundamental frequency flunctuation in continuous vowel utterance and
its perception
Akagi, M Iwaki, M Minakawa, T
Japan Advanced Institute of Science and Technology, JAPAN
Th1R2
Estimation of Mental Lexicon Size with Word Familiarity Database
Amano, S Kondo, T
NTT Basic Research Laboratories, JAPAN
Th1R3
Vowel Quality in Spontaneous Speech: What Makes a Good Vowel?
Aylett, M P Turk, A
University of Edinburgh, UNITED KINGDOM
Th1R4
Cooperation and Competition of Burst and Formant Transitions for the
Perception and the Identification of French Stops
Neagu, A Bailly, G
Institut Communication Parlee, FRANCE
Th1R5
The effect of modifying formant amplitudes on the perception of French
vowels generated by copy synthesis
Bonneau, A Laprie, Y
Loria & CNRS, FRANCE
Th1R6
Segmental and tonal processing in Cantonese
Chen, H-C Yip, M Wong, S-Y
Chinese University of Hong Kong, HONG KONG
Th1R7
Phonological similarity effects in Cantonese spoken-word processing
Yip, M Leung, P-Y Chen, H-C
Chinese University of Hong Kong, HONG KONG
Th1R8
On the learnability of the voicing contrast for initial stops
Damper, R I Gunn, S R
University of Southampton, UNITED KINGDOM
Th1R9
Acoustic and Perceptual Characteristics of Italian Stop Consonants
Cerrato, L Falcone, M
Fondazione Ugo Bordoni, ITALY
Th1R10
Acoustic cues for the auditory identification of the Spanish fricative /f/
Fernandez, S Feijoo, S Balsa, R Barros, N
Universidad de Santiago, SPAIN
Th1R11
Recognition of vowels in fricative context
Fernandez, S Feijoo, S Balsa, R Barros, N
Universidad De Santiago, SPAIN
Th1R12
Voicing affects perceived manner of articulation
Fernandez, S Feijoo, S Almeida, P
Universidad De Santiago, SPAIN
Th1R13
Enhancement techniques to improve the intelligibility of consonants in
noise: Speaker and listener effects
Hazan, V Simpson, A Huckvale, M
Department of Phonetics and Linguistics, UCL, UNITED KINGDOM
Th1R14
Boundaries of Perception of Long Tones in Taiwanese Speech
Jian F
University of Reading, UNITED KINGDOM
Th1R15
Effects of phonetic quality and duration on perceptual acceptability of
temporal changes in speech
Kato, H Tsuzaki, M Sagisaka, Y
ATR Human Information Processing Research Labs, JAPAN
Th1R16
Dynamic vs. Static Spectral Detail in the Perception of Gated Stops
Kiefte, M Neary, T M
University of Alberta, CANADA
Th1R17
Phonological Units in Speech Segmentation and Phonological Awareness
Otake, T Yoneyama, K
Dokkyo University, JAPAN
Th1R18
How Far Do Speakers Back Up in Repairs? A Quantitative Model
Shriberg, E Stolcke, A
SRI International, USA
Th1R19
Don’t blame it (all) on the pause: Further ERP evidence for a
prosody-induced garden-path in running speech
Steinhauer, K Alter, K Friederici, A D
Max-Planck-Institute of Cognitive Neuroscience, GERMANY
Th1R20
The role of stress for lexical selection in Dutch
Vroomen, J de Gelder, B
University of Tilburg, THE NETHERLANDS
Th1R21
The perception of stressed syllables in Finnish
Tuomainen, J Vroomen, J de Gelder, B
University of Tilburg, THE NETHERLANDS
Th1R22
The perception of the morae with devocalized vowels in Japanese
language
Yamakawa, K Baba, R
Prefectural University of Kumamoto, JAPAN
Th2A - Large Vocabulary Continuous Speech
Recognition 4
Time & Place: 1100 - 1230, Antechinus Room
Chair: Richard Stern
1100
Th2A1
High Resolution Decision Tree based Acoustic Modeling beyond CART
Chou, W Reichl, W
Bell Labs, Lucent Technologies, USA
1115
Th2A2
Unsupervised Training of a Speech Recognizer Using TV Broadcasts
Kemp, T Waibel, A
University of Karlsruhe, GERMANY
1130
Th2A3
A New Method to Achieve Fast Acoustic Matching for Speech Recognition
Lee, C Z O'Shaughnessy, D
INRS-Telecommunications, CANADA
1145
Th2A4
Improved Parameter Tying for Efficient Acoustic Model Evaluation in
Large Vocabulary Continuous Speech Recognition
Duchateau, J Demuynck, K Van Compernolle, D Wambacq, P
Kath. Univ. Leuven - ESAT/PSI, BELGIUM
1200
Th2A5
A New Look at HMM Parameter Tying for Large Vocabulary Speech
Recognition
Sankar, A
SRI International, USA
1215
Th2A6
Factor Analysis Invariant to Linear Transformations of Data
Gopinath, R A Ramabhadran, B Dharanipragada, S
IBM T. J. Watson Research Center, USA
Th2B - Spoken Language Understanding Systems 2
Time & Place: 1100 - 1230, Bandicoot Room
Chair: Sandra Williams
1100
Th2B1
Automatic Ambiguity Detection
Sproat, R van Santen, J
Bell Laboratories / Lucent Technologies, USA
1115
Th2B2
Empowering Knowledge Based Speech Understanding through Statistics
Fischer, J Haas, J Noth, E Niemann, H Deinzer, P
University of Erlangen, GERMANY
1130
Th2B3
Concept-driven speech understanding incorporated with a statistic
language model
Nagai, A Ishikawa, Y
Mitsubishi Electric Corporation, JAPAN
1145
Th2B4
On the Limitations of Stochastic Conceptual Finite-State Language
Models for Speech Understanding
Colas, J Ferreiros, J Montero, J M Pastor, J Gallardo, A Pardo, J M
GTH-IEL-UPM, SPAIN
1200
Th2B5
Towards Speech Understanding across multiple languages
Ward, T Roukos, S Neti, C Gros, J Epstein, M Dharanipragada, S
IBM T.J. Watson Research Center, USA
1215
Th2B6
Automatic Detection of Sentence Boundaries and Disfluencies Based on
Recognized Words
Stolcke, A Shriberg, E Bates, R Ostendorf, M Hakkani, D Plauche, M Tur,
G Lu, Y
SRI International, USA
Th2C - Signal Processing and Speech Analysis 3
Time & Place: 1100 - 1230, Currawong Room
Chair: Wolfgang Hess
1100
Th2C1
Determination of articulator positions from speech acoustics by applying
dynamic articulatory constraints
Suzuki, S Okadome, T Honda, M
NTT Basic Research Laboratories, JAPAN
1115
Th2C2
Recognizing Emotions in Speech Using Short-term and Long-term
Features
Li, Y Zhao, Y
University of Illinois, USA
1130
Th2C3
PeriphEar : A Nonlinear Active Model of the Auditory Periphery
Robert, A R Eriksson, J L E
CIRC, EPFL, SWITZERLAND
1145
Th2C4
The Voicing Feature for Stop Consonants: Acoustic Phonetic Analyses
and Automatic Speech Recognition Experiments
Ramesh, P Niyogi, P
Lucent Technologies, USA
1200
Th2C5
Wavelet-based Energy Binning Cepstral Features for Automatic Speech
Recognition
Basu, S Maes, S
IBM T.J. Watson Research Center, USA
1215
Th2C6
Articulatory Analysis using a Codebook for Articulatory based Low
Bit-Rate Speech Coding
Silva, C Chennoukh, S
Universidade Do Minho, PORTUGAL
Th2D - Human Speech Perception 3
Time & Place: 1100 - 1215, Dingo Room
Chair: Philip Rubin
1100
Th2D1
Categorical perception: Important phenomenon or lasting myth?
Massaro, D W
University of California, Santa Cruz, USA
1115
Th2D2
Categorical Perception of Vowels
Gerrits, E Schouten, B
Utrecht University, THE NETHERLANDS
1130
Th2D3
Suprasegmental Cues for the Segmentation of Identical Vowel Sequences
in Japanese
Kakehi, K Hirose, Y
Nagoya University, JAPAN
1145
Th2D4
Perception of concurrent approximant-vowel syllables
Ainsworth, W A
Keele University, UNITED KINGDOM
1200
Th2D5
Perceived Swedish Vowel Quantity: Effects of Postvocalic Consonant
Duration
Behne, D M Czigler, P E Sullivan, K P H
Norwegian University of Science & Technology, SWEDEN
Th4A - Speaker Adaptation 3
Time & Place: 1330 - 1500, Antechinus Room
Chair: Jean-Claude Junqua
1330
Th4A1
On-line hierarchical transformation of hidden Markov models for
speaker adaptation
Chien, J-T
National Cheng Kung University, TAIWAN R.O.C.
1345
Th4A2
High-Speed Speaker Adaptation Using Phoneme-Dependent
Tree-Structured Speaker Clustering
Suzuki, M Abe, T Mori, H Makino, S Aso, H
Tohoku University, JAPAN
1400
Th4A3
The Use of Confidence Measures in Unsupervised Adaptation of Speech
Recognizers
Anastasakos, T Balakrishnan, S
Motorola, Lexicus Division, USA
1415
Th4A4
Speaker Normalization with All-Pass Transforms
McDonough, J Byrne, W Luo, X
Johns Hopkins University, USA
1430
Th4A5
The CHAM model of hyperarticulate adaptation during human-computer
error resolution
Oviatt, S L
Oregon Graduate Institute of Science & Technology, USA
Th4B - Spoken Language Understanding Systems 3
Time & Place: 1330 - 1500, Bandicoot Room
Chair: Richard Sproat
1330
Th4B1
Language Modeling for Content Extraction in Human-Computer
Dialogues
Reichl, W Carpenter, R Chu-Carroll, J Chou, W
Bell Labs - Lucent Technologies, USA
1345
Th4B2
A Language Model Combining Trigrams and Stochastic Context-Free
Grammars
Gillett, J Ward, W
Carnegie Mellon University, USA
1400
Th4B3
Online Adaptation of Language Models in Spoken Dialogue Systems
Souvignier, B Kellner, A
Philips Research Laboratories, GERMANY
1415
Th4B4
Language Model Adaptation for Spoken Language Systems
Riccardi, G Potamianos, A Narayanan, S
AT&T Labs-Research, USA
1430
Th4B5
Detecting Topic Shifts Using a Cache Memory
Bigi, B De Mori, R El-Beze, M Spriet, T
University of Avignon, FRANCE
1445
Th4B6
A Discourse Coding Scheme for Conversational Spanish
Levin, L Thyme-Gobbel, A Lavie, A Ries, K Zechner, K
Carnegie Mellon University, USA
Th4C - Multimodal Spoken Language Processing 3
Time & Place: 1330 - 1500, Currawong Room
Chair: Eric Vatikiotis-Bateson
1330
Th4C1
Referential Features and Linguistic Indirection in Multimodal Language
Oviatt, S L Kuhn, K
Oregon Graduate Institute of Science & Technology, USA
1345
Th4C2
Multimodal Language Processing
Johnston, M
Oregon Graduate Institute, USA
1400
Th4C3
Implementation of Coordinative Nodding Behavior on Spoken Dialogue
Systems
Hirasawa, J-I Miyazaki, N Nakano, M Kawabata, T
Nippon Telegraph & Telephone, JAPAN
1415
Th4C4
Use of Non-verbal Information in Communication between Human and
Robot
Yokoyama, M Aoyama, K Kikuchi, H Shirai, K
Waseda University, JAPAN
1430
Th4C5
What you see is (almost) what you hear: Design principles for user
interfaces for accessing speech archives
Whittaker, S Choi, J Hirschberg, J Nakatani, C
AT & T Labs - Research, USA
Th4D - Acoustic Phonetics 2
Time & Place: 1330 - 1500, Dingo Room
Chair: John Hajek
1330
Th4D1
Regional variations in the vowels of female adolescents from Sydney
Cox, F Palethorpe, S
Macquarie University, AUSTRALIA
1345
Th4D2
A kinematic analysis of New Zealand and Australian English vowel
spaces
Watson, C Harrington, J Palethorpe, S
Macquarie University, AUSTRALIA
1400
Th4D3
Syllable-onset acoustic properties associated with syllable-coda voicing
Nguyen, N Hawkins, S
University of Geneva, SWITZERLAND
1415
Th4D4
Articulatory, acoustic and perceptual aspects of fricative/stop
coarticulation
Nguyen, N Wrench, A Gibbon, F Hardcastle, W
University of Geneva, SWITZERLAND
1430
Th4D5
Efficiency as an organizing principle of natural speech
Van Son, R J J H Koopmans-van Beinum, F J Pols, L C W
University of Amsterdam, THE NETHERLANDS
1445
Th4D6
Within-speaker variability due to speaking manners
Karlsson, I Banziger, T Dankovicová, J Johnstone, T
Lindberg, J Melin, H Nolan, F Scherer, K
KTH, SWEDEN
Th4P - Large Vocabulary Continuous Speech
Recognition 5
Time & Place: 1330 - 1700, Platypus Room
Chair: Herve Bourlard
Th4P1
A Thesaurus-based Statistical Language Model for Broadcast New
Transcription
Ando, A Kobayashi, A Imai, T
NHK Science & Technical Research Laboratories, JAPAN
Th4P2
Effect of Task Complexity on Search Strategies for the Motorola Lexicus
Continuous Speech Recognition System
Balakrishnan, S V
Motorola Lexicus Division, USA
Th4P3
New Features for Confidence Annotation
Bansal, D Ravishankar, M
Carnegie Mellon University, USA
Th4P4
Multi-Span Statistical Language Modeling for Large Vocabulary Speech
Recognition
Bellegarda, J R
Apple Computer, USA
Th4P5
Maximum-Likelihood Updates of HMM Duration Parameters for
Discriminative Continuous Speech Recognition
Chengalvarayan, R
Lucent Technologies, USA
Th4P6
Towards Better Integration of Semantic Predictors in Statistical
Language Modeling
Coccaro, N Jurafsky, D
University of Colorado, USA
Th4P7
An Asymmetric Stochastic Language Model Based on Multi-Tagged
Words
Pastor, J Colas, J San-Seguno, R Pardo, J
GTH-IEL-UPM, SPAIN
Th4P8
Product-Code Vector Quantization of Cepstral Parameters for Speech
Recognition over the WWW
Digalakis, V Neumeyer, L Perakakis, M
Technical University of Crete, GREECE
Th4P9
Context Dependent Tree Based Transforms for Phonetic Speech
Recognition
Doherty, B Vaseghi, S McCourt, P
Queen's University of Belfast, UNITED KINGDOM
Th4P10
Interfacing Acoustic Models with Natural Language Processing Systems
Johnson, M T Harper, M P Jamieson, L H
Purdue University, USA
Th4P11
Hierachical Cluster Language Modelling with Statistical Rule Extraction
for Rescoring N-Best Hypotheses During Speech Decoding
Jang, P J Hauptmann, A
Carnegie Mellon University, USA
Th4P12
Dealing with out-of-vocabulary words and speech disfluencies in an
N-gram based speech understanding system
Kai, A Hirose, Y Nakagawa, S
Toyohashi University of Technology, JAPAN
Th4P13
Source-Extended Language Model for Large Vocabulary Continuous
Speech Recognition
Kobayashi, T Wada, Y Kobayashi, N
Waseda University, JAPAN
Th4P14
Time Dependent Language Model for Broadcast News Transcription and
Its Post-Correction
Kobayashi, A Onoe, K Imai, T Ando, A
NHK Science & Technical Research Laboratories, JAPAN
Th4P15
Exploiting Transitions and Focussing on Linguistic Properties for ASR
Koreman, J Barry, W J Andreeva, B
University of the Saarland, GERMANY
Th4P16
A Unified Framework for Sublexical and Linguistic Modelling Supporting
Flexible Vocabulary Speech Understanding
Lau, R Seneff, S
MIT, USA
Th4P17
A method for modeling liaison in a speech recognition system for French
Bahl,L De Gennaro, S De Souza, P Epstein, E Le Roux, J M Lewis, B
Waast, C
IBM, USA
Th4P18
On Variable Sampling Frequencies in Speech Recognition
Liu, F-H Picheny, M
IBM Watson Research Center, USA
Th4P19
Pronunciation Modeling for Large Vocabulary Conversational Speech
Recognition
Ma, K Zavaliagkos, G Iyer, R
GTE/BBN Technologies, USA
Th4P20
Time Shift Invariant Speech Recognition
Basu, S Ittycheriah, A Maes, S H
IBM T.J. Watson Research Center, USA
Th4P21
The Demiphone versus the Triphone in a Decision-tree State-tying
Framework
Mariño, J B Pachès-Leal, P Nogueiras, A
Polytechnical University of Catalonia, SPAIN
Th4P22
Word Clustering for a Word Bi-gram Model
Mori, S Nishimura, M Itoh, N
Tokyo Research Laboratory, JAPAN
Th4P23
A Large Vocabulary Continuous Speech Recognition Hybrid System for
the Portuguese Language
Neto, J Martins, C Almeida, L
INESC, PORTUGAL
Th4P24
Speech Recognition Performance on a New Voicemail Transcription Task
Padmanabhan, M Ramabhadran, B Basu, S
IBM T J Watson Research Center, USA
Th4P25
Grammatical and statistical word prediction system for Spanish
integrated in an aid for people with disabilities
Palazuelos, S Aguilera, S Rodrigo, J Godino, J
ETSI Telecomunicación, SPAIN
Th4P26
Segmentation using a maximum entropy approach
Papineni, K Dharanipragada, S
IBM T.J. Watson Research Center, USA
Th4P27
Recognition Performance of a Large-Scale Dependency Grammar
Language Model
Berger, A Printz, H
Carnegie Mellon University, USA
Th4P28
A Bootstrap Technique for Building Domain-Dependent Language
Models
Ramaswamy, G N Printz, H Gopalakrishnan, P S
I.B.M. Research Center, USA
Th4P29
Estimation of the Probability Distributions of Stochastic Context-Free
Grammars from the K-Best Derivations
Sanchez, J-A Benedi, J-M
Universidad Politecnica De Valencia, SPAIN
Th4P30
Robust HMM Estimation with Gaussian Merging-Splitting and
Tied-Transform HMMs
Sankar, A
SRI International, USA
Th4P31
Nonlinear Interpolation of Topic Models for Language Model Adaptation
Seymore, K Chen, S Rosenfeld, R
Carnegie Mellon University, USA
Th4P32
Performance evaluation of word phrase and noun category language
models for broadcast news speech recognition
Takagi, K Oguro, R Hashimoto, K Ozeki, K
The University of Electro-Communications, JAPAN
Th4P33
Robust Automatic Continuous-Speech Recognition Based on a
Voiced-Unvoiced Decision
Tolba, H O'Shaughnessy, D
INRS-Telecommunications, CANADA
Th4P34
Double Tree Beam Search Using Hierarchical Subword Units
Torrecilla, J C Cortazar, I Hernandez, L
Telefonica I+D, SPAIN
Th4P35
Text Segmentation and Topic Tracking on Broadcast News Via a Hidden
Markov Model Approach
van Mulbregt, P Carp, I Gillick, L Lowe, S Yamron, J
Dragon Systems, Inc, USA
Th4P36
Multi-Phone Strings as SubWord Units for Speech Recognition
O’Neill, P Vaseghi, S Doherty, B Tan, W McCourt, P
The Queen's University of Belfast
Th4P37
Phonetic modification of the Syllable / tu / in two spontaneous American
English dialogues
Veilleux, N Shattuck-Hufnagel, S
Boston University Metropolitan College, USA
Th4P38
Efficient Lattice Representation and Generation
Weng, F Stolcke, A Sankar, A
SRI International, USA
Th4P39
Modeling pronunciation variation for a Dutch CSR: Testing three
methods
Wester, M Kessens, J M Strik, H
University of Nijmegen, THE NETHERLANDS
Th4P40
Comparison of Language Modelling Techniques for Russian and English
Whittaker, E W D Woodland, P C
Cambridge University, UNITED KINGDOM
Th4P41
Optimized POS-Based Language Models for Large Vocabulary Speech
Recognition
Witschel, P
Siemens AG ZT IK 5, GERMANY
Th4P42
Reducing Peak Search Effort Using Two-Tier Pruning
Wright, M Hovell, S Ringland, S
BT Laboratories, UNITED KINGDOM
Th4P43
Using Untranscribed Training Data to Improve Performance
Zavaliagkos, G Siu, M Colthurst, T Billa, J
BBN Technologies, USA
Th4P44
Telephone Band LVCSR for Hearing-Impaired Users
Jan, E Bakis, R Liu, F Picheny, M
IBM T.J. Watson Research Center, USA
Th4P45
Using X-gram for efficient speech recognition
Bonafonte, A Mariño, J B
Polytechnical University of Catalonia, SPAIN
Th4Q - Speech Coding 3
Time & Place: 1330 - 1700, Quokka Room
Chair: Alan Bradley
Th4Q1
A New Linear Predictive Method for Compression of Speech Signals
Alku, P Varho, S
University of Turku, FINLAND
Th4Q2
Hierarchical Temporal Decomposition: A Novel Approach to Efficient
Compression of Spectral Characteristics of Speech
Ghaemmaghami, S Deriche, M Sridharan, S
Queensland University of Technology, AUSTRALIA
Th4Q3
Speech intelligibility testing for new technologies
Hura, S
Lucent Technologies, USA
Th4Q4
Efficient Quantization of LSF Parameters Based on Temporal
Decomposition
Kim, S-J Lee, S-H Han, W-J Oh, Y-H
KAIST, KOREA
Th4Q5
A Sinusoidal Harmonic Vocoder at 1.2 kbps Using Auditory Perceptual
Characteristics
Kohata, M
Chiba Institute of Technology, JAPAN
Th4Q6
A 16 kbit/s Wideband CELP Coder Using Mel-Generalized Cepstral
Analysis and Its Subjective Evaluation
Koishida, K Hirabayashi, G Tokuda, K Kobayashi, T
University of California, Santa Barbara, USA
Th4Q7
Comparison of spectral estimation techniques for low bit-rate speech
coding
Molyneux, D J Parris, C I Sun, X Q Cheetham, B M G
University of Manchester, UNITED KINGDOM
Th4Q8
Low Bit Rate Coding for Speech and Audio Using Mel Linear Predictive
Coding (MLPC) analysis
Nakatoh, Y Norimatsu, T Low, A H Matsumoto, H
Matsushita Electric Industrial Co Ltd, JAPAN
Th4Q9
Comparison Study on VQ Codevector Index Assignment
Pan, J S Shieh, C S Chu, S C
National Kaohsiung Institute of Technology, TAIWAN R.O.C.
Th4Q10
Using linguistic knowledge to improve the design of low-bit rate LSF
quantisation
Parry, J J Burnett, I S Chicharo, J F
University of Wollongong, AUSTRALIA
Th4Q11
Transform Coding of LSF Parameters Using Wavelets
Petrinovic, D
University of Zagreb, CROATIA
Th4Q12
Source Controlled Variable Bit-Rate Speech Coder Based on Waveform
Interpolation
Plante, F Cheetham, B M G Marston, D Barrett, P A
Liverpool University, UNITED KINGDOM
Th4Q13
Improving Speaker Recognizability in Phonetic Vocoders
Ribeiro, C E M Trancoso, I M
INESC, PORTUGAL
Th4R - Language Acquisition 3 / Multilingual
Perception and Recognition 2
Time & Place: 1330 - 1700, Rosella Room
Chair: Sudaporn Luksaneeyyanawin
Th4R1
Speech Perception and Spoken Language in Children with Impaired
Hearing
Blamey, P Sarant, J Serry, T Wales, R James, C Barry, J Clark, G Wright,
M Tooher, R Psarros, C Godwin, G Rennie, M Meskin, T
University of Melbourne, AUSTRALIA
Th4R2
Quantitative assessment of second language learners' fluency: An
automatic approach
Cucchiarinim, C Strik, H Boves, L
University of Nijmegen, THE NETHERLANDS
Th4R3
Cross-Language Merged Speech Units and their Descriptive Phonetic Correlates
Dalsgaard, P Andersen, O Barry, W
Aalborg University, DENMARK
Th4R4
Crosslinguistic Disfluency Modelling: A Comparative Analysis of Swedish
and American English Human-Human and Human-Machine Dialogues
Eklund, R Shriberg, E
Telia Research AB, SWEDEN
Th4R5
Calibration of machine scores for pronunciation grading
Franco, H Neumeyer, L
SRI International, USA
Th4R6
Phonetic Distance-Based Hypothesis Driven Lexical Adaptation for
Transcribing Multilingual Broadcast News
Geutner, P Finke, M Waibel, A
Carnegie Mellon University, USA
Th4R7
Automatic Pronunciation Error Detection and Guidance for Foreign
Language Learning
Jo, C-H Kawahara, T Doshita, S Dantsuji, M
Kyoto University, JAPAN
Th4R8
Lexical Access for Large-Vocabulary Speech Recognition
Leung, H Y Leung, H C
Chinese University of Hong Kong, HONG KONG
Th4R9
The effect of fundamental frequency on Mandarin speech recognition
Liu, S Doyle, S Morris, A Ehsani, F
Nuance Communications, USA
Th4R10
The perception of nativeness: Variable speakers and flexible listeners
Markham, D
Deakin University, AUSTRALIA
Th4R11
Voice Dictation in the Secondary School Classroom
McTear, M O'Hare, E
University of Ulster, UNITED KINGDOM
Th4R12
The importance of first syllable in English spoken word recognition by
adult Japanese speakers
Nakayama, K Tomita-Nakayama, K
Yamagata University, JAPAN
Th4R13
Spoken L2 Teaching with Contrastive Visual and Auditory Feedback
Öster, A-M
KTH, SWEDEN
Th4R14
The role of phonological, morphological and orthographic knowledge in
the intuitive syllabification of Dutch words: A longitudinal approach
Sandra, D Gillis, S
University of Antwerp, BELGIUM
Th4R15
The Acquisition of Japanese Compound Accent Rule
Shirose, A Kubozono, H Kiritani, S
University of Tokyo, JAPAN
Th4R16
The Acquisition of Putonghua Phonology
So, L K H Zhou, J
The University of Hong Kong, HONG
Th4R17
Enhancing Speech Processing of Japanese Learners of English Utilizing
Time-Scale Expansion with Constant Pitch
Tomita-Nakayama, K Nakayama, K Misaki, M
Yamagata University, JAPAN
Th4R18
A Bootstrap Training Approach for Language Model Classifiers
Warnke, V Noeth, E Buckow, J Harbeck, S Niemann, H
University of Erlangen, GERMANY
Th4R19
Voice onset time patterns in 7-, 9- and 11-year old children
Whiteside, S P Marshall, J
University of Sheffield, UNITED KINGDOM
Th4R20
Some developmental patterns in the speech of 6-, 8- and 10-year old
children: An acoustic phonetic study
Whiteside, S P Hodgson, C
University of Sheffield, UNITED KINGDOM
Th4R21
Language Development After Extreme Childhood Deprivation: A Case
Study
Brown, L Locke, J L Jones, P Whiteside, S
University of Sheffield, UNITED KINGDOM
Th4R22
Phonological elements as a basis for language-independent ASR
Williams, G Terry, A M P Kaye, J D
SOAS, University of London, UNITED KINGDOM
Th4R23
A Phonetic and Acoustic Study of Babbling in an Italian Child
Zmarich, C Lanni, R
Instituto Di Fonetica E Dialettologia, CNR, ITALY
Th4R24
Rescoring Multiple Pronunciations Generated from Spelled Words
Kuhn, R Junqua, J-C Martzen, P
Panasonic Technologies, Inc., USA
Th5A - Segmentation, Labelling and Speech Corpora 3
Time & Place: 1530 - 1700, Antechinus Room
Chair: Toomas Altosaar
1530
Th5A1
A recursive algorithm for the forced alignment of very long audio
segments
Moreno, P J Joerg, C Van Thong, J-M Glickman, O
Compaq (Cambridge Research Lab), USA
1545
Th5A2
The selection of pronunciation variants: comparing the performance of
man and machine
Kessens, J M Wester, M Cucchiarini, C Strik, H
University of Nijmegen, THE NETHERLANDS
1600
Th5A3
Acoustic Confidence Measures for Segmenting Broadcast News
Barker, J Williams, G Renals, S
Sheffield University, UNITED KINGDOM
1615
Th5A4
A Duration-Based Confidence Measure for Automatic Segmentation of
Noise Corrupted Speech
Pellom, B L Hansen, J H L
Duke University, USA
1630
Th5A5
Segmentation and Classification of Broadcast News Audio
Hain, T Woodland, P C
Cambridge University, UNITED KINGDOM
1645
Th5A6
Speaker Recruitment Methods and Speaker Coverage - Experiences
From a Large Multilingual Speech Database Collection
Lindberg, B Comeyne, R Draxler, C Senia, F
Aalborg University, DENMARK
Th5B - Text-To-Speech Synthesis 5
Time & Place: 1530 - 1700, Bandicoot Room
Chair: Ron van Son
1530
Th5B1
A phonologically motivated method of selecting non-uniform units
Breen, A P Jackson, P
BT Labs, UNITED KINGDOM
1545
Th5B2
A Synthesis Method Based on Concatenation of Demisyllables and a
Residual Excited Vocal Tract Model
Pearson, S Kibre, N Niedzielski, N
Speech Technology Lab/Panasonic Technologies Inc., USA
1600
Th5B3
Exploration of Acoustic Correlates in Speaker Selection for
Concatenative Synthesis
Syrdal, A K Conkie, A Stylianou, Y
AT&T Labs - Research, USA
1615
Th5B4
A Perceptual Evaluation of Distance Measures for Concatenative Speech
Synthesis
Wouters, J Macon, M W
Center for Spoken Language Understanding, USA
1630
Th5B5
HMM-Based Smoothing for Concatenative Speech Synthesis
Plumpe, M Acero, A Hon, H Huang, X
Microsoft Research, USA
1645
Th5B6
A nonlinear unit selection strategy for concatenative speech synthesis
based on syllable level features
Holzapfel, M Cambpell, N
ATR Interpreting Telecommunications Research Laboratories, JAPAN
Th5C - Spoken Language Generation and Translation 2
Time & Place: 1530 - 1700, Currawong Room
Chair: Julia Hirschberg
1530
Th5C1
A Generic Algorithm for Generating Spoken Monologues
Klabbers, E Krahmer, E Theune, M
IPO Center for Research On User-System Interaction, THE NETHERLANDS
1545
Th5C2
On the Use of Automatically Generated Discourse-level Information in a
Concept-to-Speech Synthesis System
Hitzeman, J Black, A W Taylor, P Mellish, C Oberlander, J
University of Edinburgh, UNITED KINGDOM
1600
Th5C3
Learning Phrase-based Head Transduction Models for Translation of
Spoken Utterances
Alshawi, H Bangalore, S Douglas, S
AT&T Labs, USA
1615
Th5C4
Probabilistic dialogue act extraction for concept based multilingual
translation systems
Fukada, T Koll, D Waibel, A Tanigaki, K
ATR Interpreting Telecommunications Research Laboratories, JAPAN
1630
Th5C5
Fast Decoding for Statistical Machine Translation
Wang, Y Waibel, A
Carnegie Mellon University, USA
1645
Th5C6
A Japanese-to-English Speech Translation System: ATR-MATRIX
Takezawa, T Morimoto, T Sagisaka, Y Campbell, N Iida, H Sugaya, F
Yokoo, A Yamamoto, S
ATR Interpreting Telecommunications Research Laboratories, JAPAN
Th5D - Human Speech Perception 4
Time & Place: 1530 - 1700, Dingo Room
Chair: Sieb Nooteboom
1530
Th5D1
Orthografik inkoncistensy ephekts in foneme detektion?
Cutler, A Treiman, R van Ooijen, B
Max Plank Institute for Psycholinguistics, THE NETHERLANDS
1545
Th5D2
The effect of orthographic knowledge on the segmentation of speech
Derwing, B L Nearey, T M Yoon, Y B
University of Alberta, CANADA
1600
Th5D3
Spotting (different types of) Words in (different types of) Context
McQueen, J M Cutler, A
Max-Planck-Institute for Psycholinguistics, NETHERLANDS
1615
Th5D4
Correlation Between Consonantal VC Transitions and degree of
Perceptual Confusion of Place Contrasts in Hindi
Ohala, M Ohala, J J
San Jose State University, USA
1630
Th5D5
Perception of Tonal Rises and Falls for Accentuation and Phrasing in
Swedish
House, D Hermes, D Beaugendre, F
University of Skovde, SWEDEN
1645
Th5D6
Speech Intelligibility Derived from Exceedingly Sparse Spectral
Information
Greenberg, S Arai, T Silipo, R
International Computer Science Institute, USA
Friday 4 December, 1998
Fr1A - Robust Speech Processing in Adverse
Environments 5
Time & Place: 0900 - 1030, Antechinus Room
Chair: John Hansen
0900
Fr1A1
Auditory Modelling Techniques for Robust Pitch Extraction and Noise
Reduction
Cosi, P Pasquin, S Zovato, E
IIFD - Institute of Phonetics and Dialectology, ITALY
0915
Fr1A2
Wavelet Transform-based Speech Enhancement
Ambikairajah, E Tattersall, G Davis, A
Athlone Institute of Technology, IRELAND
0930
Fr1A3
A Practical Perceptual Frequency Autoregressive HMM Enhancement
System
Logan, B Robinson, A
University of Cambridge, UNITED KINGDOM
0945
Fr1A4
An Effective Quality Evaluation Protocol for Speech Enhancement
Algorithms
Hansen, J H L Pellom, B L
Robust Speech Processing Laboratory, Duke University, USA
1000
Fr1A5
An adaptive beamforming microphone array system using a blind
deconvolution
Park, J-N Usagawa, T Ebata, M
Kumamoto University, JAPAN
1015
Fr1A6
Speech Enhancement using Critical Band Spectral Subtraction
Singh, L Sridharan, S
Queensland University of Technology, AUSTRALIA
Fr1B - Text-To-Speech Synthesis 6
Time & Place: 0900 - 1030, Bandicoot Room
Chair: Catherine Watson
0900
Fr1B1
How to Handle 'Foreign' Sounds in Swedish Text-to-Speech Conversion:
Approaching the 'Xenophone' Problem
Eklund, R Lindström, A
Telia Research AB, SWEDEN
0915
Fr1B2
Multi-Lingual Concatenative Speech Synthesis
Campbell, N
ATR Interpreting Telecommunications Research Laboratories, JAPAN
0930
Fr1B3
On the Use of F0 features in Automatic Segmentation for Speech
Synthesis
Saito, T
IBM Japan Ltd, JAPAN
0945
Fr1B4
A linguistic and prosodic database for data-driven Japanese TTS synthesis
Sakurai, A Natsume, T Hirose, K
University of Tokyo, JAPAN
1000
Fr1B5
Text-to-Speech Voice Adaptation from Sparse Training Data
Kani, A Macon, M
Oregon Graduate Institute, USA
1015
Fr1B6
Describing Intonation with a Parametric Model
Möhler, G
University of Stuttgart, GERMANY
Fr1C - Speech Technology Applications and Human-
Machine Interface 2
Time & Place: 0900 - 1030, Currawong Room
Chair: Ben Serridge
0900
Fr1C1
Development of CAI system employing synthesized speech responses
Shinozaki, T Abe, M
NTT Human Interface Labs, JAPAN
0915
Fr1C2
Using Combined Decisions and Confidence Measures for Name
Recognition in Automatic Directory Assistance Systems
Kellner, A Rueber, B Schramm, H
Philips Research Laboratories, GERMANY
0930
Fr1C3
VPQ: A Spoken Language Interface to Large Scale Directory Information
Buntschuh, B Kamm, C DiFabbrizio, G Abella, A Mohri, M Narayanan, S
Zeljkovic, I Sharp, R Wright, R Marcus, S Shaffer J Duncan, R Wilpon, J
AT&T Labs - Research, USA
0945
Fr1C4
SCAN - Speech Content Based Audio Navigator: A System Overview
Choi, J Hindle, D Hirschberg, J Magrin-Chagnolleau, I Nakatani, C
Pereira, F Singhal, A Whittaker, S
AT&T Labs - Research, USA
1000
Fr1C5
Controlling a HIFI with a continuous speech understanding system
Ferreiros, J Colas, J Macias-Guarasa, J Ruiz, A Pardo, J M
GTH-IEL-UPM, SPAIN
1015
Fr1C6
User Evaluation of the Mask Kiosk
Lamel, L Bennacef, S Gauvain, J L Dartigues, H Temem, J-N
LIMSI-CNRS, FRANCE
Fr1D - Prosody and Emotion 6
Time & Place: 0900 - 1030, Dingo Room
Chair: Mary Beckman
0900
Fr1D1
A Contrastive Study of Lexical Stress Placement in Singapore English
and British English
Low, E L Grabe, E
Nayang Technological University, SINGAPORE
0915
Fr1D2
Integrated Recognition of Words and Phrase Boundaries
Gallwitz, F Batliner, A Buckow, J Huber, R Niemann, H Noeth, E
University of Erlangen, GERMANY
0930
Fr1D3
Phrase accents revisited: comparative evidence from Standard and
Cypriot Greek
Arvaniti, A
University of Cyprus, CYPRUS
0945
Fr1D4
Phonetic invariance and phonological stability: Lithuanian pitch accents
Dogil, G Möhler, G
University of Stuttgart, GERMANY
1000
Fr1D5
A HMM-based recognition system for perceptive relevant pitch
movements of spontaneous German speech
Brindoepke, C Fink, G Kummert, F Sagerer, G
University Bielefeld, GERMANY
1015
Fr1D6
Towards a reversible symbolic coding of intonation
Véronis, J Campione, E
Université de Provence, FRANCE
Fr1P - Hidden Markov Model Techniques 3
Time & Place: 0900 - 1230, Platypus Room
Chair: Joe Picone
Fr1P1
A Statistical Phonemic Segment Model for Speech Recognition Based on
Automatic Phonemic Segmentation
Aizawa, K Furuichi, C
TOIN University of Yokohama, JAPAN
Fr1P2
Improved Feature Decorrelation for HMM-based Speech Recognition
Demuynck, K Duchateau, J Van Compernolle, D Wambacq, P
K.U. Leuven - ESAT/PSI, BELGIUM
Fr1P3
Efficient high-order Hidden Markov Modelling
du Preez, J A Weber, D M
University of Stellenbosch, SOUTH AFRICA
Fr1P4
A Time-synchronous, Tree-based Search Strategy in the Acoustic Fast
Match of an Asynchronous Speech Recognition System
Eide, E Bahl, L
IBM, USA
Fr1P5
Effective Structural Adaptation of LVCSR Systems to Unseen Domains
Using Hierarchical Connectionist Acoustic Models
Fritsch, J Finke, M Waibel, A
University of Karlsruhe, GERMANY
Fr1P6
Support Vector Machines for Speech Recognition
Ganapathiraju, A Hamaker, J Picone, J
Mississippi State University, USA
Fr1P7
Natural Number Recognition Using Discriminatively Trained Inter-Word
Context Dependent Hidden Markov Models
Gandhi, M B
Lucent Technologies, USA
Fr1P8
Information Theoretic Approaches to Model Selection
Hamaker, J Ganapathiraju, A Picone, J
Mississippi State University, USA
Fr1P9
Continuous Speech Recognition Using Segmental Unit Input HMMs with
a Mixture of Probability Density Functions and Context Dependency
Hanai, K Yamamoto, K Minematsu, N Nakagawa, S
Toyohashi University of Technology, JAPAN
Fr1P10
Gaussian density tree structure in a multi-Gaussian HMM-based speech
recognition system
Simonin, J Delphin-Poulat, L Damnati, G
France Telecom - CNET, FRANCE
Fr1P11
Generalized Phone Modeling Based on Piecewise Linear Segment Lattice
Kojima, H Tanaka, K
Electrotechnical Laboratory, AIST, MITI, JAPAN
Fr1P12
A Flexible Method of Creating HMM Using Block-diagonalization of
Covariance Matrices
Koshiba, R Tachimori, M Kanazawa, H
Toshiba, JAPAN
Fr1P13
HMM Topology Selection for Accurate Acoustic and Duration Modeling
Chesta, C Laface, P Ravera, F
Politecnico Di Torino, ITALY
Fr1P14
Context-dependent duration modelling for continuous speech recognition
Lee, T Carlson, R Grandstrom, B
The Chinese University of Hong Kong, HONG KONG
Fr1P15
Training of Context-Dependent Subspace Distribution Clustering Hidden
Markov Model
Mak, B Bocchieri, E
Hong Kong University of Science & Technology, HONG KONG
Fr1P16
Unsupervised Training of HMMs With Variable Number of Mixture
Components per State
Martin, C Villarrubia, L Gonzalez, F J Hernandez, L
Telefonica I+D, SPAIN
Fr1P17
Acoustic Observation Context Modeling in Segment Based Speech
Recognition
Szarvas, M Matsunaga, S
Technical University of Budapest, HUNGARY
Fr1P18
Capturing Discriminative Information Using Multiple Modeling
Techniques
Ming, J Hanna, P Stewart, D Vaseghi, S Smith, F J
Queen's University of Belfast, UNITED KINGDOM
Fr1P19
Suprasegmental Duration Modelling with Elastic Constraints in
Automatic Speech Recognition
Molloy, L Isard, S
Edinburgh University, UNITED KINGDOM
Fr1P20
An Adaptive Gradient-Search-Based Algorithm for Discriminative
Training of HMMs
Nogueiras-Rodriguez, A Marinho, J B Monte, E
Polytechnical University of Catalonia, SPAIN
Fr1P21
Task Adaptation of Sub-Lexical Unit Models using the Minimum
Confusibility Criterion on Task Independent Databases
Nogueiras-Rodriguez, A Marinho, J B
Polytechnical University of Catalonia, SPAIN
Fr1P22
Stochastic Calculus, Non-Linear Filtering, and the Internal Model
Principle: Implications for Articulatory Speech Recognition
Ramsay, G
Institut De La Communication Parlee, FRANCE
Fr1P23
The Use of Meta-HMM in Multistream HMM Training for Automatic
Speech Recognition
Wellekens, C J Kangasharju, J Milesi, C
Institut Eurecom, FRANCE
Fr1P24
Enhanced ASR by acoustic feature filtering
Wellekens, C J
Institut Eurecom, FRANCE
Fr1P25
Soft State-Tying for HMM-Based Speech Recognition
Neukirchen, C Willett, D Rigoll, G
Duisburg University, GERMANY
Fr1P26
Estimation of models for non-native speech in computer-assisted language
learning based on linear model combination
Witt, S M Young, S J
Cambridge University, UNITED KINGDOM
Fr1P27
Duration Modeling using Cumulative Duration Probability and Speaking
Rate Compensation
Yang, T-Y Kim, J-S Lee, C Youn, D H Cha, I-W
Yonsei University, KOREA
Fr1P28
Probabilistic Modeling with Bayesian Networks for Automatic Speech
Recognition
Zweig, G Russell, S
IBM T.J. Watson Research Center, USA
Fr1Q - Speech & Hearing Disorders 2 /
Speech Processing for the Speech Impaired and Hearing
Impaired 1
Time & Place: 0900 - 1230, Quokka Room
Chair: Chris James
Fr1Q1
SIVHA, Visual Speech Synthesis System
Blanco, Y Cuellar, M Villanueva, A Lacunza, F Cabeza, R Marcotegui, B
Universidad Publica De Navarraica, UPNA, SPAIN
Fr1Q2
Using automatic speech recognition and its possible effects on the voice
de Bruijn, C G Whiteside, S P Cudd, P A Syder, D Rosen, M Nord, L
University of Sheffield, UNITED KINGDOM
Fr1Q3
The Importance of F0 or Voice Pitch for Perception of Tonal Language:
Simulations with Cochlear Implant Speech Processing Strategies
Fearn, R
University of New South Wales, AUSTRALIA
Fr1Q4
Assessing high level language in individuals with multiple sclerosis - a
pilot study
Brunnegård, K Laakso, K Hartelius, L Ahlsén, E
Goteborg University, SWEDEN
Fr1Q5
Design of Cochlear Implant Device for Transmitting Voice Pitch
Information in a Speech Sound of Asian Languages
Hiki, S Imaizumi, K Fukuda, Y
Waseda University, JAPAN
Fr1Q6
Abnormal Volume-Duration Relationship in Parkinsonian Speech
Ho, A K Bradshaw, J L Iansek, R Alfredson, R J
Monash University, AUSTRALIA
Fr1Q7
Analysis of Disordered Speech Signal Using Wavelet Transform
Jo, C W Kim, D H
Changwon National University, KOREA
Fr1Q8
Multi-channel Pulsation Strategy for Electric Stimulation of Cochlea
Kitazawa, S Kirihata, H Kitamura, T
Shizuoka University, JAPAN
Fr1Q9
Synthetic faces as a lipreading support
Agelfors, E Beskow, J Dahlquist, M Granström, B Lundeberg, M Spens,
K-E Öhman, T
KTH, SWEDEN
Fr1Q10
Predicting language scores from speech perception scores of hearing
impaired children
Martin, L Bench, J
La Trobe University, AUSTRALIA
Fr1Q11
Context-Independent Duration Model on Categories of Voice and Unvoice
Segments
Skljarov, O P
Research Institute of Ear Throat Nose & Speech, RUSSIA
Fr1Q12
Dynamical spectrogram, an aid for the deaf
Soltani Farani, A A Chilton, E H S Shirley, R
University of Surrey, UNITED KINGDOM
Fr1Q13
Evidence of dual-route phonetic encoding from apraxia of speech:
Implications for phonetic encoding models
Varley, R A Whiteside, S P
University of Sheffield, UNITED KINGDOM
Fr1Q14
Speech Communication Profiles Across the Adult Lifespan: Persons
Without Self-Identified Hearing Impairment
Cheesman, M F Smilsky, K L Major, T M Lewis, F Boorman, L M
University of Western Ontario, CANADA
Fr1R - Human Speech Production
Time & Place: 0900 - 1230, Rosella Room
Chair: Jan van Doorn
Fr1R1
Time as a factor in the acoustic variation of schwa
Barry, W J
University of the Saarland, GERMANY
Fr1R2
On the Structure of Vowel Space: A Genealogy of General Phonetic
Concepts
Boshoff, H F V Botha, E C
University of Pretoria, SOUTH AFRICA
Fr1R3
The relationship between intensity and subglottal pressure with
controlled pitch
Lecuit, V Demolin, D
Universite Libre De Bruxelles, BELGIUM
Fr1R4
Segmentation of the airway from the surrounding tissues on Magnetic
Resonance Images: A comparative study
Soquet, A Lecuit, V Metens, T Nazarian, B Demolin, D
Universite Libre De Bruxelles, BELGIUM
Fr1R5
Recovering Vocal Tract Shapes from MFCC Parameters
Dusan, S Deng, L
University of Waterloo, CANADA
Fr1R6
Quantification of Pharyngeal Articulations Using Measurements from
Laryngoscopic Images
Esling, J Clayards, J Edmondson, J Fuyuan, Q Harris, J
University of Victoria, CANADA
Fr1R7
Variance and Invariance in Speech Rate as a Reflection of Conceptual
Planning
Fon, Y-J
The Ohio State University, USA
Fr1R8
Correspondence between the Glottal Gesture Overlap Pattern and Vowel
Devoicing in Japanese
Fujimoto, M Murano, E Niimi, S Kiritani, S
University of Tokyo, JAPAN
Fr1R9
Evaluation of Japanese manners of generating word accent of English
based on a stressed syllable detection technique
Fujisawa, Y Minematsu, N Nakagawa, S
Toyohashi University of Technology, JAPAN
Fr1R10
Independence of consonantal voicing and vocoid F0 perturbation in
English and Japanese
Ishihara, S
The Australian National University, AUSTRALIA
Fr1R11
Reduction of English Function Words in Switchboard
Jurafsky, D Bell, A Fosler-Lussier, E Girand, C Raymond, W
University of Colorado, USA
Fr1R12
Duration Compensation in Non-Adjacent Consonant and Temporal
Regularity
Kim, H-S
Stanford University, USA
Fr1R13
Relationship Between Lip Shapes and Acoustical Characteristics During
Speech
Mori, K Sonoda, Y
Kyushu Kyoritsu University, JAPAN
Fr1R14
A model to represent propagation and radiation of higher-order modes
for 3-D vocal-tract configuration
Motoki, K Matsuzaki, H
Hokkai-Gakuen University, JAPAN
Fr1R15
FEM Analysis of aspirated air flow in three-dimensional vocal tract
during fricative consonant phonation
Niikawa,T Matsumura,M Tachimura,T Wada,T
Osaka Electro-Communication University, JAPAN
Fr1R16
Trajectory formation of articulatory movements for a given sequence of
phonemes
Okadome, T Kaburagi, T Honda, M
NTT Basic Research Laboratories, JAPAN
Fr1R17
Contextual Effects on Voicing Profiles of German and Mandarin
Consonants
Shih, C Möbius, B
Bell Labs - Lucent Technologies, USA
Fr1R18
Reconstructing the tongue surface from six cross-sectional contours:
Ultrasound data
Lundberg, A Stone, M
Johns Hopkins University, USA
Fr1R19
Articulability of two consecutive morae in Japanese speech production:
Evidence from sound exchange errors in spontaneous speech
Terao, Y Murata, T
Tokoha Gakuen College, JAPAN
Fr1R20
Coarticulation and degrees of freedom in the elaboration of a new
articulatory plant: Gentiane
Vilain, A Abry, C Badin, P
ICP - Stendhal, FRANCE
Fr1R21
A pressure sensitive palatography: Application of new pressure sensitive
sheet for measuring tongue-palatal contact pressure
Wakumoto, M Masaki, S Honda, K Ohue, T
ATR, JAPAN
Fr1R22
Dual-route phonetic encoding: Some acoustic evidence
Whiteside, S P Varley, R A
University of Sheffield, UNITED KINGDOM
Fr1R23
Fast and Slow Speech Rate: A Characterisation for French
Zellner, B
University of Lausanne, SWITZERLAND
Fr2A - Segmentation, Labelling and Speech Corpora 4
Time & Place: 1100 - 1230, Antechinus Room
Chair: Khalid Choukri
1100
Fr2A1
A multilingual prosodic database
Campione, E Véronis, J
Université De Provence, FRANCE
1115
Fr2A2
The CSLU Speaker Recognition Corpus
Cole, R Noel, M Noel, V
Oregon Graduate Institute, USA
1130
Fr2A3
How effective is unsupervised data collection for children's speech
recognition?
Aist, G Chan, P Huang, X D Jiang, L Kennedy, R Latimer, D Mostow, J
Yueng, C
Carnegie Mellon University, USA
1145
Fr2A4
An Algorithm for Automatic Generation of Mandarin Phonetic Balanced
Corpus
Shyuu, J-S Wang, J-F
Department of Computer Science and Information Engineering, TAIWAN
R.O.C.
1200
Fr2A5
Towards a formal framework for linguistic annotations
Bird, S Liberman, M
University of Pennsylvania, USA
1215
Fr2A6
Forming Generic Models of Speech for Uniform Database Access
Altosaar, T Vainio, M
Helsinki University of Technology, FINLAND
Fr2B - Speaker and Language Recognition 4
Time & Place: 1100 - 1230, Bandicoot Room
Chair: Jesper Olsen
1100
Fr2B1
On the Convergence of Gaussian Mixture Models: Improvements through
Vector Quantization
Moody, J Slomka, S Pelecanos, J Sridharan, S
Queensland University of Technology, AUSTRALIA
1115
Fr2B2
Modeling Dynamic Prosodic Variation for Speaker Verification
Sonmez, M K Shriberg, E Heck, L Weintraub, M
SRI International, USA
1130
Fr2B3
Blind Clustering of Speech Utterances Based on Speaker and Language
Characteristics
Reynolds, D Singer, E Carlson, B O'Leary, G McLaughlin, J Zissman, M
MIT Lincoln Laboratory, USA
1145
Fr2B4
Spoken Language Identification Using the Speechdat Corpus
Antonio Caseiro, D Martins Trancoso, I
INESC, PORTUGAL
1200
Fr2B5
Automatic Language Identification with Perceptually Guided Training
and Recurrent Neural Networks
Braun, J Levkowitz, H
University of Massachusetts Lowell, USA
1215
Fr2B6
On the Importance of Components of the Modulation Spectrum for
Speaker Verification
Van Vuuren, S Hermansky, H
Oregon Graduate Institute, USA
Fr2C - Speech Technology Applications and Human-
Machine Interface 3
Time & Place: 1100 - 1230, Currawong Room
Chair: George Kokkinakis
1100
Fr2C1
Is Speech the Right Thing for your Application?
Bernsen, N O Dybkjaer, L
Centre for Cognitive Science, Odense University, DENMARK
1115
Fr2C2
A PC-Based Tool for Helping in Diagnosis of Pathologic Voice
Godino Llorente, J I Aguilera Navarro, S Palazeulos, S Nieto Altuzarra, A
Gómez Vilda, P
ETSI Telecomunicación, SPAIN
1130
Fr2C3
Web-based educational tools for speech technology
Sjölander, K Beskow, J Gustafson, J Levin, E Carlson, R Granström, B
KTH, SWEDEN
1145
Fr2C4
Universal Speech Tools: The CSLU Toolkit
Sutton, S Cole, R de Villiers, J Schalkwyk, J Vermeulen, P Macon, M
Yan, Y-H Kaiser, E Rundle, B Shobaki, K Hosom, P Kain, A Wouters, J
Massaro, D Cohen, M
Oregon Graduate Institute, USA
1200
Fr2C5
Creating a Mexican Spanish Version of the CSLU Toolkit
Serridge, B Barbosa, A Cole, R Munive, N Vargas, A
Universidad De Las Americas, MEXICO
1215
Fr2C6
A Voice User Interface Demonstration System for Mexican Spanish
Garcia-Mateo, C Zhou, Q Lee, C H Pargellis, A
Universidad De Vigo, SPAIN
Fr2D - Utterance Verification and Word Spotting 2
Time & Place: 1100 - 1230, Dingo Room
Chair: Jean-Luc Gauvain
1100
Fr2D1
Context Dependent Anti Subword Modeling for Utterance Verification
Ramesh, P Lee, C-H Juang, B-H
Lucent Technologies, USA
1115
Fr2D2
Combination of confidence measures in isolated word recognition
Dolfing, J G A Wendemuth, A
Philips Research Laboratories Aachen, GERMANY
1130
Fr2D3
Confidence Measures for HMM-based Speech Recognition
Willett, D Worm, A Neukirchen, C Rigoll, G
Duisburg University, GERMANY
1145
Fr2D4
Vocabulary-Independent Word Confidence Measure using Subword
Features
Jiang, L Huang, X D
Microsoft Research, USA
1200
Fr2D5
A New Confidence Measure Based on Rank-Ordering Subphone Scores
Lin, Q Das, S Lubensky, D Picheny, M
IBM T.J. Watson Research Center, USA
1215
Fr2D6
Speaking-Style Dependent Lexicalized Filler Model for Key-Phrase
Detection and Verification
Kawahara, T Ishizuka, K Doshita, S Lee, C-H
Kyoto University, JAPAN
Fr4A - Large Vocabulary Continuous Speech
Recognition 6
Time & Place: 1330 - 1430, Antechinus Room
Chair: Lori Lamel
1330
Fr4A1
Sharable Software Repository for Japanese Large Vocabulary
Continuous Speech Recognition
Kawahara, T Kobayashi, T Takeda, K Minematsu, N Itou, K
Yamamoto, M Yamada, A Utsuro, T Shikano, K
Kyoto University, JAPAN
1345
Fr4A2
The Design of the Newspaper-Based Japanese Large Vocabulary
Continuous Speech Recognition corpus
Itou, K Yamamoto, M Takeda, K Takezawa, T Matsuoka, T Shikano, K
Kobayashi, T Itahashi, S
Electrotechnical Laboratory, JAPAN
1400
Fr4A3
Indexing and Classification of TV News Articles Based on Speech
Dictation Using Word Bigram
Ogata, J Ariki, Y
Ryukoku University, JAPAN
1415
Fr4A4
Parametric Trajectory Mixtures for LVCSR
Siu, M Iyer, R Gish, H Quillen, C
GTE/BBN Technologies, USA
Fr4B - Neural Networks, Fuzzy and Evolutionary
Methods 3
Time & Place: 1330 - 1430, Bandicoot Room
Chair: Joerg Rottland
1330
Fr4B1
Fuzzy-Integration Based Normalisation for Speaker Verification
Pham, T D Wagner, M
University of Canberra, AUSTRALIA
1345
Fr4B2
Improving the generalization performance of the MCE/GPD learning
Shimodaira, H Rokui, J Nakai, M
Japan Advanced Institute of Science and Technology, JAPAN
1400
Fr4B3
Acoustic Speech Recognition Model by Neural Net Equation with
Competition and Cooperation
Kitazoe, T Ichiki, T Kim, S-I
Miyazaki University, JAPAN
1415
Fr4B4
Improved Surname Pronunciations Using Decision Trees
Ngan, J Ganapathiraju, A Picone, J
Mississippi State University, USA
Fr4C - Speech Processing for the Speech-Impaired and
Hearing-Impaired 2
Time & Place: 1330 - 1430, Currawong Room
Chair: Björn Granström
1330
Fr4C1
A Speechreading Aid Based on Phonetic ASR
Duchnowski, P Braida, L Bratakos, M Lum, D Sexton, M Krause, J
Massachusetts Institute of Technology, USA
1345
Fr4C2
Training Speech through Visual Feedback Patterns
Nouza, J
Technical University of Liberec, CZECH REPUBLIC
1400
Fr4C3
Word sequence pair spotting for synchronization of speech and text in
production of closed-caption TV programs for the hearing impaired
Maruyama, I Abe, Y Sawamura, E Wakao, T Ehara, T Shirai, K
Telecommunications Advancement Organization of Japan, JAPAN
1415
Fr4C4
Volume Regulation in Parkinsonian Speech
Ho, A K Bradshaw, J L Iansek, R Alfredson, R J
Monash University, AUSTRALIA
Fr4D - Prosody and Emotion 7
Time & Place: 1330 - 1430, Dingo Room
Chair: Gosta Bruce
1330
Fr4D1
On the amount and domain of focal lengthening in Swedish
Strangert, E Heldner, M
Department of Phonetics, Umeå University, SWEDEN
1345
Fr4D2
Differential lengthening of syllabic constituents in French: the effect of
accent type and speaking style
Hirst, D J Astesano, C Di Cristo, A
CNRS, Université De Provence, FRANCE
1400
Fr4D3
Prosodic Analysis of Fillers and Self-Repair in Japanese Speech
Quimbo, F Kawahara, T Doshita, S
Kyoto University, JAPAN
1415
Fr4D4
A synthesis-oriented model of phrasal pitch movements in standard
Chinese
Ni, J Kawai, G Hirose, K
University of Tokyo, JAPAN
Plenary Session
Fr5A - Keynote Speech 2
Title: The Use of Linguistic Hierarchies in Speech Understanding
Speaker: Stephanie Seneff
MIT, USA
Time & Place: 1440 - 1540, Antechinus Room
Chair: Michael Wagner
Closing Ceremony
Time & Place: 1545 - 1615, Antechinus Room
Chair: Peter Blamey