SP-P14: Acoustic Modeling: Tone, Prosody, and Features

Session Type: Poster
Time: Thursday, May 20, 15:30 - 17:30
Location: Poster Area 3
Chair: Michael Picheny, IBM T. J. Watson Research Center
 
SP-P14.1: INTEGRATING THUMBNAIL FEATURES FOR SPEECH RECOGNITION USING CONDITIONAL EXPONENTIAL MODELS
         Hua Yu; Carnegie Mellon University
         Alex Waibel; Carnegie Mellon University
 
SP-P14.2: DISCRIMINATIVE FEATURE TRANSFORMATION BY GUIDED DISCRIMINATIVE TRAINING
         Roger Hsiao; Hong Kong University of Science and Technology
         Brian Mak; Hong Kong University of Science and Technology
 
SP-P14.3: SEGMENTAL TONAL MODELING FOR PHONE SET DESIGN IN MANDARIN LVCSR
         Chao Huang; Microsoft Research Asia
         Yu Shi; Microsoft Research Asia
         Jian-Lai Zhou; Microsoft Research Asia
         Min Chu; Microsoft Research Asia
         Terry Wang; Microsoft Research Asia
         Eric Chang; Microsoft Research Asia
 
SP-P14.4: DECISION TREE BASED TONE MODELING FOR CHINESE SPEECH RECOGNITION
         Pui-Fung Wong; Hong Kong University of Science and Technology
         Man-Hung Siu; Hong Kong University of Science and Technology
 
SP-P14.5: HIDDEN SPECTRAL PEAK TRAJECTORY MODEL FOR PHONE CLASSIFICATION
         Yiu-Pong Lai; Hong Kong University of Science and Technology
         Man-Hung Siu; Hong Kong University of Science and Technology
 
SP-P14.6: A STUDY ON ROBUST SEGMENTATION AND LOCATION OF TONE NUCLEI IN CHINESE CONTINUOUS SPEECH
         Jinsong Zhang; ATR, Spoken Language Translation Laboratories
         Keikichi Hirose; University of Tokyo
 
SP-P14.7: CHINESE-ENGLISH BILINGUAL PHONE MODELING FOR CROSS-LANGUAGE SPEECH RECOGNITION
         Shengmin Yu; Chinese Academy of Sciences
         Shuwu Zhang; Chinese Academy of Sciences
         Bo Xu; Chinese Academy of Sciences
 
SP-P14.8: VOICING FEATURE INTEGRATION IN SRI'S DECIPHER LVCSR SYSTEM
         Martin Graciarena; SRI International
         Horacio Franco; SRI International
         Jing Zheng; SRI International
         Dimitra Vergyri; SRI International
         Andreas Stolcke; SRI International
 
SP-P14.9: PARSING SPEECH INTO ARTICULATORY EVENTS
         Kadri Hacioglu; University of Colorado, Boulder
         Bryan Pellom; University of Colorado, Boulder
         Wayne Ward; University of Colorado, Boulder
 
SP-P14.10: PROSODY-BASED RECOGNITION OF SPOKEN GERMAN VARIETIES
         Vedran Dizdarevic; Graz University of Technology
         Martin Hagmüller; Graz University of Technology
         Gernot Kubin; Graz University of Technology
         Franz Pernkopf; Graz University of Technology
         Micha Baum; SPEX
 
SP-P14.11: TONE VARIATION MODELING FOR FLUENT MANDARIN TONE RECOGNITION BASED ON CLUSTERING
         Wan-Yi Lin; National Taiwan University
 
SP-P14.12: MINIMUM CLASSIFICATION ERROR TRAINING OF LANDMARK MODELS FOR REAL-TIME CONTINUOUS SPEECH RECOGNITION
         Erik McDermott; NTT Corporation
         Timothy Hazen; Massachusetts Institute of Technology
 
ICASSP 2003 Paper
SP-P14.13: A PHONE RECOGNIZER HELPS TO RECOGNIZE WORDS BETTER
         Georg Stemmer; Universität Erlangen-Nürnberg
         Viktor Zeissler; Universität Erlangen-Nürnberg
         Christian Hacker; Universität Erlangen-Nürnberg
         Elmar Nöth; Universität Erlangen-Nürnberg
         Heinrich Niemann; Universität Erlangen-Nürnberg
 


Last updated Monday, May 17, 2004