PROCEEDINGS - · PDF fileT1A3 Providing Sublexical Constraints for Word Spotting Within the...

EUROPEAN SPEECH COMMUNICATION ASSOCIATION (ESCA)

5th EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY

S2I2 SIse p t e m b e n&l 99,7i

UNDER THE AUSPICES OF

THE MINISTRY OF CULTURETHE MINISTRY OF THE AEGEAN

THE GENERAL SECRETARIAT OF SCIENCE AND RESEARCH

PROCEEDINGSVOLUME 1

ORGANIZER:UNIVERSITY OF PATRASWIRE COMMUNICATIONS LABORATORY261 10 Rion - Patras - Greece

KE¥NOYE SPEECHES !

Keynote Speech 1: Is Syntactic Structure ProsodicallyRecoverable? KN-1Speaker: Mario Rossi, Institutde Phonetique d'Aix-en-Provence, Laboratoire Parole et Langage, FranceChair: Louis Pols, ESCA, Univ. of Amsterdam, TheNetherlands

Keynote Speech 2: Conversational Interfaces: Advancesand Challenges KN-9Speaker: Victor Zue, MIT, USAChair: Paul Daalsgard, Aalborg University, Denmark

Keynote Speech 3: Prosodic Modelling in Text-to-SpeechSynthesis KN-19Speaker: Jan P. H. van Sanlen, Bell Labs-LucentTechnologies, USAChair: Hiroya Fujisaki, Science Univ. of Tokyo, Japan

Keynote Speech 4: Robust Speech Recognition: Review andPerspectivesChair: Joseph Mariani, LIMSI-CNRS, France

Speech 4A (8:35-9:00): Impact of the UnknownCommunication Channel on Automatic SpeechRecognition: A Review KN-29Speaker: Jean-Claude Junqua, Panasonic Technologies Inc.,USA

Speech 4B (9:00-9:25): Statistical Techniques for RobustASR: Review and Perspectives KN-33Speaker: Jerome BeUegarda, Apple Computer, USA

Speech 4C (9:25-9:50): Using Missing Feature Theory toActively Select Features for Robust Speech Recognitionwith Interruptions, Filtering and Noise KN-37Speaker: Richard Uppmann, Lincoln Laboratory MIT, USA

Keynote Speech 5: Perspectives of Speech TechnologyResearch Highlighted in Eurospcech '97Speaker: Wolfgang Hess, Univ. of Bonn, GermanyChair: George Kokkinakis, WCL, Univ. ofPatras, Greece

SESSION: MIA.Acoustic Modelling ICliair: Roger Moore, DRA. UK

M4A.1 Using Multiple Time Scales in a Multi-StreamSpeech Recognition System 3Stephane Dupont, *Herve BourlardFPMs-TCTS, Belgium*IDIAP, Switzerland

M4A.2 Speech Recognition Using HMM-State ConfusionCharacteristics 7Wakita Yurai, Harald Singer, Yoshinori SagisakaATR Interpreting Telecommunications Res. Labs., Japan

M4A3 Bottom-up and Top-down State Clustering forRobust Acoustic Modeling 11Cristina Chesta, Pietro Laface, *Franco RaveraPolitecnico di Torino, Italy*Centro Studi e Laboratori Telecomunicazioni (CSELT), Italy

M4A.4 Comparison of Optimization Methods forDiscriminative Trainining Criteria 15Ralf Schluter, W. Macherey, S. Kanthak, Hermann Ney, LutzWellingRWTH Aachen, Germany

M4A.5 Clustering Beyond Phoneme Contexts for SpeechRecognition 19Clark Z. Lee, Douglas O'ShaughnessyINRS Telecommunications, Canada

M4A.6 Influence of Outliers in Training the ParametricTrajectory Models for Speech Recognition 23Rathinavelu ChengalvarayanBell Labs-Lucent Technologies, USA

SESSION: M4BDynamic Ar^cuiatory MeasurementsCltair; Rene Cam* £NST, France

M4B.1 Adaptation of Natural Articulatory Movementsto the Control of the Command Parameters of a ProductionModel 27Laurence Candille, Henri MeloniCERI, France

M4B.2 Three-Dimensional Coarticulatory Strategies ofTongue Movement 31Maureen Stone, *Andrew Lundberg, *Edward Davis, RaoGullapalli, Moriel NessAiverUniv. of Maryland, USA*John Hopkins Univ., USA

M4B.3 From Laryngographic and Acoustic Signals toVoicing Gestures 35Nathalie Parlangeau, Regine Andre-ObrechtIRTT - Equipe IHMPT, France

M4B.4 Ultrasonographic Measurement of CricothyroidSpace in Speech 39Erkki Vilkman, Raija Takalo, Maatta Taisto, *Anne-MariaLaukkanen, Jaana Nummenranta, Lipponen TeroUniv. ofOulu, Finland*Univ. of Tampere, Finland

M4B.5 Coarticulation and Articulatory CompensationsStudied by Dynamic MRI 43Didier Demolin, M. George, V. Lecuit, *T. Metens, /JA.Soquet, tH. RaeymaekersUniv. Libre de Bruxelles, Belgium^Magnetic Resonance Unit.Hopital ErasmetPhilips Medical Systems, BelgiumtCollaborateur Scientifique-FNRS

SESSION! U4CLanguage IdentificationChair: MarcZisstmtn, MIT Lincoln Laboratory, USA

M4C.1 Predicting, Diagnosing and Improving AutomaticLanguage Identification Performance 51Marc A. ZissmanMIT Lincoln Laboratory, USA

M4C.2 Language Identification with Language-Independent Acoustic Models 55Cristobal Corredor-Ardoy, Jean Luc Gauvain, Marline Adda-Decker, Lori LamelLIMSI, France

M4C.3 Bayesian Methods for Language Verification 59Eluned S. Parris, Harvey Lloyd-Thomas, Michael Carey, *JerryH. WrightEnsigma Ltd, UK*Bristol Univ., UK.

M4C.4 Use of Recurrent Network for Unknown LanguageRejection in Language Identification System 63Hingkeung Kwan, Keikichi HiroseUniv. of Tokyo, Japan

M4C.5 Language-Identification Based on Cross-LanguageAcoustic Models and Optimized Information CombinationOve Andersen, Paul Dalsgaard 67Aalborg Univ., Denmark

M4C.6 Phonetic-Context Mapping in LanguageIdentification 71Jiri Navratil, Werner ZuehlkeTechnische Univ. Ilmenau, Germany

SI'S'Sinv M-H)Neural Network*, for Speech and LanguageProcessingChair Wiil/yang He\s, Univ. of Hewn, Germany

M4D.1 Discriminative Feature and Model Design forAutomatic Speech Recognition 75Mazin Rahim, Yoshua Bengio, Yann LeCunAT&T Labs-Research, USA

M4D.2 Large Vocabulary Speech Recognition with ContextDependent MMI-Connectionist/HMM Systems Using theWSJ Database 79Jorg Rottland, Christoph Neukirchen, Daniel Willett, GerhardRigollGerhard-Mercator-Univ. Duisburg, Germany

M4D J Automatic Selection of Segmental AcousticParameters by Means of Neural-Fuzzy Networks forReordering in N-best HMM Hypotheses 83Thierry Moudenc, Guy MercierFrance Telecom, France

M4D.5 A Connectionist Approach to Machine TranslationAsuncion Castano, *Francisco Casacuberta 91Universitat Jaume I, Spain*Univ. Politecnica de Valencia, Spain

M4D.6 Continuous Speech Recognition Using a ContextSensitive ANN and HMM2s 95Nicolas Pican, Jean-Francois Mari, Dominique FohrCRIN-CNRS & INRIA Lorraine, France

SESSION: MAATraining Techniques, Efficient Decoding in ASRCJiair: Jerome Belkgarda, Apple Computer, USA

MAA.1 Acoustic Modeling Based on the MDL Principle forSpeech Recognition 99Koichi Shinoda, Takao WatanabeNEC Corporation, Japan

MAA.2 Discriminative Utterance Verification UsingMultiple Confidence Measures 103Piyush Modi, Mazin RahimAT&T, USA

MAA.3 Subspace Distribution Clustering for ContinuousObservation Density Hidden Markov Models 107Enrico Bocchieri, Brian MakAT&T Labs-Research, USA

MAA.4 A Comparative Study of Methods for PhoneticDecision-Tree State Clustering I l lHJ. Nock, MJ.F. Gales, Steve YoungCambridge Univ., UK

MAA.5 Comparing Gaussian and Polynomial Classifica-tion in SCHMM-Based Recognition Systems 115Alfred Kaltenmeier, Jurgen FrankeDaimler BenzAG, Germany

MAA.6 Maximum Likelihood Successive State SplittingAlgorithm for Tied-Mixture HMNET 119AlexAndre Girardi, Harald Singer, Kiyohiro Shikano, SatoshiNakamuraNara Institute of Science and Technology, Japan

MAA.7 String-Level MCE for Continuous PhonemeRecognition 123Erik McDermott, Shigeni KatagiriATR Interpreting Telecommunications Res. Labs., Japan

MAA.8 HMM State Clustering Across Allophone ClassBoundaries 127Ze'ev Rivlin, Ananth Sankar, Harry BrattSRI International, USA

MAA.9 Weighted Determinization and Mini miration forLarge Vocabulary Speech Recognition 131Mehryar Mohri, Michael RileyAT&T Labs-Research, USA

MAA.10 Parallel Speech Recognition 135Steven Phillips, Anne RogersAT&T Labs-Research, USA

MAA.ll Fast Likelihood Computation Methods forContinuous Mixture Densities in Large Vocabulary SpeechRecognition 139Stefan Ortmanns, Thorsten Firzlaff, Hermann NeyRWTH Aachen Univ. of Technology, Germany

MAA.12 A Static Lexicon Network Representation forCross-Word Context Dependent Phones 143Kris Derouynck, Jacques Duchateau, Dirk Van CompernolleKU Leuven-ESAT, Belgium

MAA.13 Decision-Tree Based Quantization of the FeatureSpace of a Speech Recognizer 147Mukund Padmanabhan, L.R. Bahl, D. Nahamoo, De Sou2aPeterIBM, USA

MAA.14 Sub-Vector Clustering to Improve the Memoryand Speed Performance of the Acoustic LikelihoodComputation 151Mosur Ravishankar, *R. Bisiani, E. ThayerCarnegie Mellon Univ., USA*Univ. of Milan, Italy

MAA.15 The Incorporation of Path Merging in a DynamicNetwork Recogniser 155Simon HovellBT Laboratories, UK

MAA.16 Improvement on Connected Digits RecognitionUsing Duration Constraints in the Asynchronous DecodingScheme 159Miroslav NovakIBM, USA

MAA.17 Explicit Word Error Minimization in N-Best ListRescoring 163Andreas Stolcke, Yochai Konig, Mitchel WeintraubSRI International, USA

MAA.18 Efficient 2-Pass N-Best Decoder 167Long Nguyen, Richard SchwartzBBN Systems and Technologies, USA

MAA.19 A Memory Management Method for a LargeWord Network 171Tomohiro Iwasaki, Yoshiharu AbeMitsubishi Electric Corporation, Japan

SESSION: MAB -ProsodyCiwir: Nick Campbell, ATRt Japan

MAB.l Persistence of Prosodic Features Between Dialectaland Standard Italian Utterances in Six Sub-Varieties of aRegion of Southern Italy (Salento): First Assessments ofthe Results of a Recognition Test and an InstrumentalAnalysis 175Antonio RomanoUniv. Stendhal, France

MAB.2 Improving the Phonetic Annotation by Means ofProsodic Phrasing 179Halewijn Vereecken, *Annemie Vorstermans, Jean-PierreMartens, *Bert Van CoileUniv. of Gent, Belgium*L&H, Belgium

MAB.3 A Descriptive Study of Prosodic Phenomena inMPUR (West Papuan Phylum) 183Cecilia OdeLeiden Univ., The Netherlands

MAB.4 Automated Quantitative Analysis of Fo Contours ofUtterances From a German ToBI-Labeled SpeechDatabase 187HansJorg Mixdorff, *Hiroya FujisakiTU Dresden, Germany*Science Univ. of Tokyo, Japan

MAB.5 Identification and Automatic Generation ofProsodic Contours for Text-to-Speech Synthesis System inFrench 191De Tournemire StephanieFrance Telecom, France

MAB.6 Quantitative Analysis and Formulation of ToneConcatenation in Chinese FO Contours 195Jin-Fu Ni, Ren-Hua Wang, *Keikichi HiroseUniv. of Science and Technology of China, ROChina*Univ. of Tokyo, Japan

MAB.7 An Environment for the Labelling and Testing ofMelodic Aspects of Speech 199Christel Brindopke, Arno Pahde, Franz Kummert, GerhardSagererUniv. of Bielefeld, Germany

MAB.8 PROPAUSE: A Syntactico-Prosodic SystemDesigned to Assign Pauses 203David Casacuberta, Lourdes Aguilar, Rafael MarinUniversitat Autonoma de Barcelona, Spain

MAB.9 Integrated Dialog Act Segmentation andClassification Using Prosodic Features and LanguageModels 207Volker Warnke, *Ralf Kompe, Heinrich Niemann, Elmar NoethUniv. ofErlangen-NUremburg, Germany*Sony International (Europe), Germany

MAB.10 Evaluation of Prosodic Characteristics hi RetoldStories hi Dutch by Means of Semantic Scales 211Monique E. van Donzel, Florien J. Koopmans-van BeinumUniv. of Amsterdam, The Netherlands

MAB.ll Text-to-intonation in Spontaneous Swedish 215Gosta Bruce, Marcus Filipsson, Johan Frid, *Bjorn Granstr5m,*Kjell Gustafson, Merle Home, David HouseLund Univ., Sweden*KTH, Sweden

MAB.12 Synthesing Attitudes with Global Rhythmic andIntonation Contours 219Yann Morlec, Gerard Bailly, Veronique AubergeInstitut de la Communication Parlee, France

MAB.I3 Prosody-Particle Pairs as Discourse ControlSigns 223Dafydd Gibbon, Claudia SassenUniv. of Bielefeld, Germany

MAB.14 Focus Detection with Additional Information ofPhrase Boundaries and Sentence Mode 227Anja EisnerUniv. of Bonn,Germany

MAB.15 The Role of Prosody in Infants' Native-LanguageDiscrimination Abilities: The Case of Two PhonologicallyClose Languages 231Laura Bosch, Nuria Sebastian-GallesUniversitat de Barcelona, Spain

MAB.16 Prosodic Cycles and Interpersonal Synchrony inAmerican English and Swedish 235Eugene H. Buder, *Anders ErikssonUniv. of Memphis, USA*Ume& Univ., Sweden

MAB.17 Relating Prosody to Syntax: Boundary Signallingin Swedish 239Eva StrangertUmea Univ., Sweden

MAB.18 On Representation of Fundamental Frequency ofSpeech for Prosody Analysis Using Reliability FunctionMitsuru Nakai, Hiroshi Shimodaira 243Japan Advanced Institute of Science and Technology, Japan

MAB.19 Efficient Method of Establishing Words ToneDictionary for Korean TTS System 247Seong-hwan Kim, Jin-young KimChonnam National Univ., South Korea

MAB.20 Perception of Questions and Statements inNeapolitan Italian 251D'Imperio Mariapaola *David HouseOhio State University, USA*Lund Univ., Sweden

SESSION: Tl AKeyword and Topic SpottingChair: Joseph Mariani, LtMSl-CNRS, France

T1A.1 Key-Phrase Spotting Using An Integrated Lan-guage Model of N-Grams and Finite-State Grammar.... 255Lin Qiguang, Dave Lubensky, Michael Picheny, P. SrinivasaRaoIBM, USA

T1A.2 Efficient Methods for Detecting Keywords inContinuous Speech 259Jochen Junkawitsch, *Gunther Ruske, Harald HogeSiemens AG, Germany*Munich Univ.of Technology, Germany

T1A3 Providing Sublexical Constraints for Word SpottingWithin the ANGIE Framework 263Raymond Lau, Stephanie SeneffMIT Laboratory for Computer Science, USA

T1A.4 Usefulness of Phonetic Parameters in a RejectionProcedure of an IIMM Based Speech Recognition SystemKatarina Bartkova, Denis Jouvet 267France Telecom, France

T1A.5 Keyword Spotting Using FO Contour MatchingYoichi Yamashita, *Riichiro Mizoguchi 271Ritsumeikan Univ., Japan*Osaka Univ., Japan

T1A.6 A Frame and Segment Based Approach for TopicSpotting 275Elmar Noeth, Stefan Harbeck, Heinrich Niemann, VolkerWamkeUniv. of Erlangen-Nilremburg, Germany

SESSION; TIBRobustness in Recognition and Signal Processing ICtuiir: Helmut Mangold, Daimler Benz, Germany

Tlli.l Cyclic Autocorrelation-Based Linear PredictionAnalysis of Speech 279K.K. Paliwal, Yoshinori SagisakaATR Interpreting Telecommunications Res. Labs., Japan

TVBJ2 Novel FUler Acoustic Models for Connected DigitRecognition 283Ilija Zeljkovic, Shrikanth NarayananAT&T Labs-Research, USA

T1B3 A Non-Iterative Model-Adaptive E-CMN/PMCApproach for Speech Recognition in Car EnvironmentsMakoto Shozakai, Satoshi Nakamura, Kiyohiro ShikanoNara Institute of Science and Technology, Japan 287

T1B.4 Discriminative Feature Extraction for SpeechRecognition in Noise 291la Torre Angel de, Antonio M. Peinado, Antonio J. Rubio,Pedro GarciaUniversidad de Granada, Spain

T1B.5 Noise Robust Recognition Using Feature SelectiveModeling 295Michael K. Brendborg, Borge LindbergAalborg Univ., Denmark

T1B.6 Mixture Input Transformations for Adaptation ofHybrid Connectionist Speech Recognizers 299Victor AbrashSRI International, USA

SESSION: TICModelling of ProsodyOtair: Hiroya Fujisaki, Science Univ. of Tokyo, Japan

T1C.1 Metrical Representation of Demarcation andConstituency in Long Noun Phrases 303Christos Malliopoulos, *George MikrosNational Technical Univ. of Athens, GreeceVLSP, Greece

T1C.2 A System of Stylized Intonation Contours forGerman 307Hannes Pirker, *Kai Alter, Erhard Rank, John Matiasek, HaraldTrost, tGermnot KubinAustrian Research Institute for Artificial Intelligence (OFAI),Austria*Max-Planck-Institute of Cognitive Neuroscience, GermanyfVienna Univ. of Technology, Austria

T1C.3 A Method of Representing Fundamental FrequencyContours of Japanese Using Statistical Models of MoraicTransition 311Keikichi Hirose, Kouji IwanoUniv. of Tokyo, Japan

T1C.4 Modeling Arbitrarily Long Sentence-Spanning F0Contours by Parametric Concatenation of Word-SpanningPatterns 315Stavroula-Evita Fotinea, *Michael Vlahakis, *GeorgeCarayannisNational Technical Univ. of Athens, Greece*ILSP, Greece

T1C.5 Strong Interaction Between Factors InfluencingConsonant Duration 319R. J. J. H. van Son, *Jan P.H. van SantenUniv. of Amsterdam, The Netherlands*Bell Labs-Lucent Technologies, USA

T1C.6 Speech Tuning in Slovenian TTS 323Jerneja Gros, Nikola Pavesic, France MihelicUniv. of Ljubljana, Slovenia

SESSION: T1X> 'Microphone Arrays for Speech IMianecraetttCJtair: Alan Bradley, RMTT, Australia

T1D.1 Small Microphone Arrays with OptimizedDirectivity for Speech Enhancement 327Matthias DorbeckerAachen Univ. of Technology, Germany

T1D.2 Microphone Array Design Measures for Hands-FreeSpeech Recognition 331Masaaki Inoue, Satoshi Nakamura, Takeshi Yamada, KiyohiroShikanoNam Institute of Science and Technology, Japan

T1D.3 Noise Reduction by Paired Microphones 335Masato Akagi, Mitsunori MizumachiJapan Advanced Institute of Science and Technology, Japan

T1D.4 A Microphone Array for Speech EnhancementUsing Multiresolution Wavelet Transform 339Djamila MahmoudiLTS-DE, EPFL, Switzerland

T1D.5 A Two-Channel Adaptive Microphone Array withTarget Tracking 343Yoshifumi Nagata, Hiroyuki TsuboiToshiba Corporation, Japan

T1D.6 Use of Different Microphone Array Configurationsfor Hands-Free Speech Recognition in Noisy andReverberant Environment 347Diego Giuliani, Marco Matassoni, Maurizio Omologo,Piergiorgio SvaizerIstituto per la Ricerca Scientifica e Tecnologica (IRST), Italy

SBSSION: T2AMultilingual RecognitionChair: Ridtard Uppmatt, MIT Lincoln Lah, USA

T2A.1 YINHE: A Mandarin Chinese Version of the GalaxySystem 351Chao Wang, James R. Glass, Helen Meng, Joe Polifroni,Stephanie Seneff, Victor ZueMIT Laboratory for Computer Science, USA

T2A.2 Multilingual Speech Recognition for FlexibleVocabularies 355Patrizia Bonaventura, *Filippo Gallocchio, tGiorgio MiccaCentro Studi e Laboratori Telecomunicazioni (CSELT),Consultant, Italyf Centro Studi e Laboratori Telecomunicazioni (CSELT),Turin, Italy*Univ.di Padova, Italy

T2A3 A Study of Multilingual Speech Recognition 359Fuliang Weng, Harry Bratt, Leonardo Neumeyer, AndreasStolckeSRI International, USA

T2A.4 Multilingual Speech Recognition: The 1996 ByblosCallhomc System 363Jayadev Billa, Kristine Ma, John W. McDonough, GeorgeZavaliagkos, David R. Miller, Kenneth N. Ross, Amro El-JaroudiBBN Systems and Technologies, USA

T2A.5 Japanese LVCSR on the Spontaneous SchedulingTask with JANUS-3 367Tanja Schultz, Detlef Koll, Alex WaibelUniv. Karlsruhe, Germany

T2A.6 Fast Bootstrapping of LVCSR Systems withMultilingual Phoneme Sets 371Tanja Schultz, Alex WaibelUniv. Karlsruhe, Germany

SESSION: T2BLanguage Specific Speech AnalysisChair: John J. Ohala, Umv uf California, USA

T2B.1 Factors of Variation in the Production of theGerman Dorsal Fricative 375Bemd Pompino-Marschall, Christine MooshammerCenter for General Linguistics, Berlin, Germany

T2B.2 EPG and Aerodynamic Evidence for the Copro-duction and Coarticulation of Clicks in ISIZULU 379Kimberly ThomasUniv. of California, USA

T2B3 Formant Trajectory Dynamics in SwabianDiphthongs 383Anja GeumannMuenchen Univ., Germany

T2B.4 The Gestural Organization of Vowels andConsonants: A Cinefluorographic Study of ArticulatorGestures in Greenlandic 387Sidney A.J. WoodUniv. of Lund, Sweden

T2B.5 The Perception of Coronals in Western ArrernteVictoria B. Anderson 389Univ. of California, USA

T2B.6 Acoustic Modelling of American English M 393Carol Y. Espy-Wilson, *Shrikanth Narayanan, Suzanne E.Boyce, fAbeer AlwanBoston Univ., USA*AT&T Labs-Research, USAfUniv. of California, USA

SESSION! T2CFeature Estimation 1Chair: Paul Daalsgard, Aalborg University, Denmark

T2C.1 Acoustic Parameters Optimised for Recognition ofPhonetic Features 397Anya Vamich HansenAalborg Univ., Denmark

T2C.2 Heterogeneous Acoustic Measurements for PhoneticClassification 401Andrew K. Halberstadt, James R. GlassMIT, USA

T2C3 Cepstral-Time Matrices and LDA for ImprovedConnected Digit and Sub-Word Recognition AccuracyBenMilner 405BT Laboratories, UK

T2C.4 Data-Driven Design of Rasta-Like Filters 409Van Vuuren Sarel, / *Hynek HermanskyOregon Graduate Institute of Science and Technology, USAinternational Computer Science Institute, USA

T2C.5 Evaluating Feature Set Performance Using the F-Ratio and J-Measures 413Simon Nicholson, *Ben Milner, Stephen CoxUEA, UK*BT Laboratories, UK

T2C6 Robust Speech Parameters Located in theFrequency Domain 417Javier Hernando, Climent NadeuUniversitdt Politecnica de Catalunya, Spain

SESSION: T2DSpeech Coding IChair: fcabel Trancoso, tNESC, Portugal

T2D.1 A Simple and Efficient Algorithm for theCompression of MBROLA Segment Databases 421van der Vrecken Olivier, Nicolas Pierret, Thierry Dutoit,Vincent Pagel, Fabrice MalfrereTCTSLab, Belgium

T2D.2 A Segmental Formant Vocoder Based On LinearlyVarying Mixtures of Gaussians 425Parham Zolfaghari, Tony A. RobinsonCambridge Univ., UK

T2D.3 Voice Mimic System Using Articulatory Codebookfor Estimation of Vocal Tract Shape 429Samir Chennoukh, Daniel Sinder, Gael Richard, JamesFlanaganCAIP Center, Rutgers Univ., USA

T2D.4 Adaptive Transform Coding for Linear PredictiveResidual 433Damith J. Mudugamuwa, Alan B. BradleyRMTT, Australia

T2D.5 Performance Evaluation of Objective QualityMeasures for Coded Speech 437AkiraTakahashi, Nobuhiko Kitawaki, *Paolino Usai, t DavidAtkinsonNTT, Human Interface Labs, Japan*Centro Studi e Laboratori Telecomunicazioni (CSELT), ItalytNTIA, USA

T2D.6 Between Recognition and Synthesis - 300 Bits/Second Speech Coding 441Mohamed Ismail, Keith PontingSpeech Research Unit, UK

SESSION: TMAFeature Estimation II, Pitch and ProsodyCtutir: Egidia Giachin, Centro Studi e IxtboraioriTelecomunicazioni (CSELT), Italy

TMA.1 A Modified Zero-Crossing Method for PitchDetection in Presence of Interfering Sources 445Francois Gaillard, Frederic Berthommier, *Gang Feng, •Jean-Luc Schwartz1CP/1NPG, France*Institut de la Communication Parlee, France

TMA.2 Using Simulated Annealing ExpectationMaximization Algorithm for Hidden Markov ModelParameters Estimation 449Jacques Simonin, Chafic MokbelFrance Telecom, France

TMA3 Covariation of Subglottal Pressure, F0 and GlottalParameters 453Gunnar Fant, "Stellan Hertegard, Anita Kruckenberg, JohanIiljencrantsKTH, Sweden*Karolinska Hospital and Huddinge Univ. Hospital, Sweden

TMA.4 The Fractal Behaviour of Unvoiced Plosives: AMeans for Classification 457Anastasios Delopoulos, Maria RangoussiNTUA, Greece

TMA.5 A Method for Analysis of the Local Speech RateUsing an Inventory of Reference Units 461Sumio Ohno, Hiroya Fujisaki, Hideyuki TaguchiScience Univ. of Tokyo, Japan

TMA.6 Analysis and Modeling of Fundamental FrequencyContours of Greek Utterances 465Hiroya Fujisaki, Sumio Ohno, Takashi YagiScience Univ. of Tokyo, Japan

TMA.7 Characteristics of Slow, Average and Fast Speechand their Effects in Large Vocabulary Continuous SpeechRecognition 469Fernando Martinez, Daniel Tapias, Jorge Alvarez, Paloma LeonTelefonica I+D, Spain

TMA.8 Analysis of Children's Speech: Duration, Pitch, andFormants 473Sungbok Lee, Alexandra Potamianos, Shrikanth NarayananAT&T Labs-Research, USA

TMA.9 A Method of Measuring Formant Frequencies atHigh Fundamental Frequencies 477HarimutTraunmuller, *Anders ErikssonStockholm Univ., Sweden*Univ. ofUmea, Sweden

TMA.10 Analysis of Speaking Rate Variation in Stress-Timed Languages 481Tom Brondsted, Jens Printz MadsenAalborg Univ., Denmark

TMA.ll Automatic Identification of the PhonemeBoundaries Using a Mixed Parameter Model 485Paul Micallef, T e d ChiltonUniv. of Malta, Malta*Univ. of Surrey, UK

TMA.12 Pitch Detection Reliability Assessment forForensic Applications 489Sergey Koval, Veronika Bekasova, Michael Khitrov, AndreyRaevSt Petersburg State Univ., Russia

TMA.13 Efficient Estimation of Perceptual Features forSpeech Recognition 493Zhihong Hu, Etienne BarnardOregon Graduate Institute, USA

TMA.14 Towards Decomposing the Sources of Variabilityin Speech 497Narendranath Malayath, Hynek Hermansky, Alexander KainOregon Graduate Institute of Science and Technology, USA

TMA.15 Use of Vector-Valued Dynamic WeightingCoefficients for Speech Recognition: Maximum LikelihoodApproach 501Rathinavelu ChengalvarayanBell Labs-Lucent Technologies, USA

TMA.16 Automatic Segmentation: Data-Driven Units ofSpeech 505S.W. Beet, L. Baghai-RavaryAculab pic, UK

TMA.17 On Robust Time-Varying AR Speech AnalysisBased on T-Distribution 509Dejan BajicInstitute of Applied Mathematics and Electronics, Yugoslavia

TMA.18 A Simple Phoneme Energy Model for the GreekLanguage and its Application to Speech Recognition 513Dimhris Tambakas, Diana Tzima, Nikos Fakotakis, GeorgeKokkinakisUniv. ofPatras, Greece

TMA.19 A Macroscopic Analysis of an Emotional SpeechCorpus 517James E.H. Noad, Sandra P. Whiteside, Phil GreenUniv. of Sheffield, UK

TMA.20 Restoration of Pitch Pattern of Speech Based on aPitch Generation Model 521Hiroshi Shimodaira, Mitsuru Nakai, Akihiro KumataJapan Advanced Institute of Science and Technology, Japan

TMA.21 The Research of Correlation Between Pitch andSkin Galvanic Reaction at Change of Human EmotionalState 525A.V. Agranovski, O.Y. Berg, D.A. LednovSpetsvuzavtomatika Design Bureau, Russia

TMA.22 K-NN Versus Gaussian in HMM-BasedRecognition System 529Claude Montacie, Marie-Jose Caraty, Fabrice LefevreUniv. Pierre et Marie Curie - CNRS, France

TMA.23 Spectral Methods for Voice Source ParametersEstimation 533Boris Doval, Christophe D'Alessandro, Benoit DiardLIMSI-CNRS, France

PROCEEDINGS - · PDF fileT1A3 Providing Sublexical Constraints for Word Spotting Within the...

Documents

Transcript of PROCEEDINGS - · PDF fileT1A3 Providing Sublexical Constraints for Word Spotting Within the...