The purpose of the study was to compare auditory judgments of sound clarity of music examples recorded in a concert hall with predictions of clarity made from the impulse response signal recorded in the same hall. Auditory judgments were made with the use of two methods: by rating sound clarity on a numerical scale with two endpoints, and by absolute magnitude estimation. Results obtained by both methods were then compared against the values of clarity indices, C80 and C50, determined from the impulse response of the concert hall, measured in places in which the microphone was located during recording of music examples. Results show that auditory judgments of sound clarity and predictions made from the C80 index yield a similar rank order of data, but the relation between the C80 scale and perceived sound clarity is nonlinear. The data also show that the values of C80 and C50 indices are in very close agreement.
Psychoacoustics is traditionally based on a world model that assumes a physical world existing independently of any observer - the so-called objective world. Being exposed to this world, an observer is impinged upon by a variety of stimuli reaching his/her sensory organs. These stimuli, if physiologically adequate, may cause biological transduction and signal processing in the sensory organs and its afferent pathways in such a way that finally a specific excitation of the cortex takes place, which results in sen-sations to appear in the observer’s perceptual world. The sensations are understood as being subjective, since they require an observer to exist. This world model - also known as (objectivistic) realism - reaches its limits when it comes to explaining more complex phenomena of perception. Thereupon, in this paper, an alternative world model is emphasized and applied to psychoacoustics, namely the perceptionist’s model. Like realism, perceptionism has a long tradition in epistemology. It appears to be suitable to improve our understanding of perceptual organization.
In this paper, a modified sound quality evaluation (SQE) model is developed based on combination of an optimized artificial neural network (ANN) and the wavelet packet transform (WPT). The presented SQE model is a signal processing technique, which can be implemented in current microphones for predicting the sound quality. The proposed method extracts objective psychoacoustic metrics including loudness, sharpness, roughness, and tonality from sound samples, by using a special selection of multi-level nodes of the WPT combined with a trained ANN. The model is optimized using the particle swarm optimization (PSO) and the back propagation (BP) algorithms. The obtained results reveal that the proposed model shows the lowest mean square error and the highest correlation with human perception while it has the lowest computational cost compared to those of the other models and software.
This paper reviews parametric audio coders and discusses novel technologies introduced in a low-complexity, low-power consumption audio decoder and music synthesizer platform developed by the authors. The decoder uses parametric coding scheme based on the MPEG-4 Parametric Audio standard. In order to keep the complexity low, most of the processing is performed in the parametric domain. This parametric processing includes pitch and tempo shifting, volume adjustment, selection of psychoacoustically relevant components for synthesis and stereo image creation. The decoder allows for good quality 44.1 kHz stereo audio streaming at 24 kbps. The synthesizer matches the audio quality of industry-standard sample-based synthesizers while using a twenty times smaller memory footprint soundbank. The presented decoder/synthesizer is designed for low-power mobile platforms and supports music streaming, ringtone synthesis, gaming and remixing applications.