参考文献_语音信号处理(第3版)-QQ阅读男生科幻网

上QQ阅读APP看书，第一时间看更新

参考文献

［1］Flanagan J L. Speech analysis, synthesis, and perception[M]. 2nd ed. New York: Springer-Verlag, 1972.

［2］Ramakrishnan B R. Reconstruction of incomplete spectrograms for robust speech recognition[D]. Ph. D. dissertation. CMU, 2000.

［3］Kandel E R, Schwartz J H, Jessell T M, et al. Principles of neural science[M], 3rd ed. Amsterdam: Elsevier Science Publishing, 1991.

［4］Morgan D P, Scofield C L. Neural networks and speech processing[M]. Amsterdam: Kluwer Academic Publishers, 1991.

［5］Painter T, Spanias A. Perceptual Coding of Digital Audio[J]. Proceedings of the IEEE, 2000, 88(4):451-513.

［6］鲁瑞华．听觉特性在数字音频压缩编码中的应用[J]．电声技术，1998，（5）：6-11．

［7］何冬梅．低码率高质量音频压缩算法研究[D]．哈尔滨：哈尔滨工业大学，2000．

［8］Teager H M, Teager S M. Some observation on oral airflow during phonation[J]. IEEE Trans on ASSP, 1980, 28(5):599-601.

［9］Teager H M, Teager S M. Evidence for nonlinear production mechanisms in vocal tract[J]. In: Speech Production and Speech Modeling, vol 55. Boston: Kluwer Academic Publishers, 1990:241-261.

［10］Thomas T J. A finite element model of fluid flow in the vocal tract[J]. Computer Speech and Language, 1986, 1:131-151.

［11］McGowan R S. An aero acoustic approach to phonation[J]. The Journal of the Acoustical Society of America, 1988, 83(2):696-704.

［12］Maragos P, Kaiser J F, Quatieri T F. Energy separation in signal modulation with application to s peech analysis[J]. IEEE Trans. Signal Processing, 1993, 41(10):3024-3051.

［13］Kaiser J F. Some useful properties of Teager energy operators[J]. In Sullivan B J. ICASSP 93, vol 3. Minnesota, USA: IEEE Press, 1993:149-152.

［14］Kaiser J F. On a simple algorithm to calculate the“energy”of a signal[J]. In Ludeman L. ICASSP, vol 1. Albuquerque, New Mexico: IEEE Press, 1990:381-384.

［15］Hanson H M, Maragos P, Potamianos A. Finding speech formants and modulations via energy separation: with application to a vocoder[J]. In Sullivan B J. ICASSP 93, vol 2. Minnesota, USA: IEEE Press, 1993:716-719.

［16］Potamianos A, Maragos P. Speech formant frequency and bandwidth tracking using multiband energy demodulation[J]. In Drago D. ICASSP 95, vol 1. Michigan, USA: IEEE Press, 1995, 1:784-787.

［17］Potamianos A, Maragos P. Speech analysis and synthesis using an AM-FM modulation model[J]. Speech Communication, 1999, 28(3):195-209.

［18］Foote J T, Mashao D J, Silverman H F. Stop classification using DESA-1 high-resolution formant Tracking[J]. In Sullivan B J. ICASSP 93, vol 2. Minnesota, USA: IEEE Press, 1993, 720-723.

［19］Guojun Z, Hansen J H L, Kaiser J F. Classification of speech under stress based on features derived from the nonlinear Teager energy operator[J]. In Acero A. ICASSP 98, vol 1. Seattle, Washington, USA: IEEE Press, 1998, 549-552.

［20］Ying G S, Mitchell C D, Jamieson L H. Endpoint detection of isolated utterances based on a modified Teager energy measurement[J]. In Sullivan B J. ICASSP 93, vol 2. Minnesota, USA: IEEE Press, 1993, 732-735.

［21］Cairns D A, Hansen J H L. Nonlinear analysis and classification of speech under stressed conditions[J]. The Journal of the Acoustical Society of America, 1994, 96(6):3392-3400.

［22］Guojun Z, Hansen H J L, Kaiser J F. Methods for stress classification: nonlinear TEO and linear s peech based features[J]. In Rodriguez J. ICASSP 99, vol 4. Phoenix, Arizona, USA: IEEE Press, 1999, 2087-2090.

［23］马永林，韩纪庆，张磊，等．应力影响下的变异语音分类[M]//怀进鹏，等．智能计算机研究进展．北京：清华大学出版社，2001．

［24］张磊，韩纪庆，等．声道的调频-调幅模型及其在语音分析中的应用[J]．计算机研究与发展，2002，39（6）：684-688．

［25］陈永彬，王仁华．语言信号处理[M]．合肥：中国科技大学出版社，1990．

［26］易克初，田斌，付强．语音信号处理[M]．北京：国防工业出版社，2000．

［27］罗宾纳．语音识别基本原理[M]．北京：清华大学出版社，1999．

［28］Michael Konerner．最新语音识别技术[M]．李逸波，郭天杰，王华驹，等译．北京：电子工业出版社，1998．

［29］杨行峻，迟惠生，等．语音信号数字处理[M]．北京：电子工业出版社，1995．

(1)　本书中的对数函数，除明确标注了底数的部分外，其他形如log表述的部分底数均可取任意值。因为语音信号处理中，取对数运算主要有两个用途：一是压缩数据的动态范围；二是将诸如x，y两变量的乘积部分通过取对数运算转化为两变量的相加，即logxy＝logx+logy。

本周热推：

Arm Helium技术指南：Cortex-M系列处理器的矢量运算扩展被动雷达宽带数字接收机技术 Photoshop智能手机APP界面设计之道电视机原理与实训实战无线通信应知应会：新手入门，老手温故（第二版）