Search for: [Authors = "Zhao, He\-ming"]

Search results

Search for: [Authors = "Zhao, He\-ming"]

Filters

Journals
- Archives of Acoustics (2)

Search results

Number of results: 2

items per page: 25 50 75

Sort by:

of 1

Speech Enhancement Based on the Multi-Scales and Multi-Thresholds of the Auditory Perception Wavelet Transform

Zhi Tao He-Ming Zhao Xiao-Jun Zhang Di Wu

Archives of Acoustics | 2011 | vol. 36 | No 3 | 519-532 | DOI: 10.2478/v10168-011-0037-5

Keywords speech enhancement low SNR auditory perception wavelet transform unvoiced enhancement masking effect

Download PDF Download RIS Download Bibtex

Abstract

This paper proposes a speech enhancement method using the multi-scales and multi-thresholds of the auditory perception wavelet transform, which is suitable for a low SNR (signal to noise ratio) environment. This method achieves the goal of noise reduction according to the threshold processing of the human ear's auditory masking effect on the auditory perception wavelet transform parameters of a speech signal. At the same time, in order to prevent high frequency loss during the process of noise suppression, we first make a voicing decision based on the speech signals. Afterwards, we process the unvoiced sound segment and the voiced sound segment according to the different thresholds and different judgments. Lastly, we perform objective and subjective tests on the enhanced speech. The results show that, compared to other spectral subtractions, our method keeps the components of unvoiced sound intact, while it suppresses the residual noise and the background noise. Thus, the enhanced speech has better clarity and intelligibility.

Go to article

Authors and Affiliations

Zhi Tao

He-Ming Zhao

Xiao-Jun Zhang

Di Wu

A Lifting Wavelet Domain Audio Watermarking Algorithm Based on the Statistical Characteristics of Sub-Band Coefficients

Zhi Tao He-ming Zhao Jun Wu Ji-hua Gu Yi-shen Xu Di Wu

Archives of Acoustics | 2010 | vol. 35 | No 4 | 481-491 | DOI: 10.2478/v10168-010-0037-x

Keywords audio watermarking lifting wavelet transform statistical characteristics sub-band coefficients

Download PDF Download RIS Download Bibtex

Abstract

In this paper, a new lifting wavelet domain audio watermarking algorithm based on the statistical characteristics of sub-band coefficients is proposed. First of all, an original audio signal was segmented and each segment was divided into two sections. Then, the Barker code was used for synchronization, the LWT (lifting wavelet transform) was performed on each section, a synchronization code and a watermark were embedded into the first section and the second section, respectively, by modifying the statistical average value of the sub-band coefficients. The embed strength was determined adaptively according to the auditory masking property. Experiments show that the embedded watermark has better robustness against common signal processing attacks than present algorithms based on LWT and can resist random cropping in particular.

Go to article

Authors and Affiliations

Zhi Tao

He-ming Zhao

Jun Wu

Ji-hua Gu

Yi-shen Xu

Di Wu