Jump to content

Hearing-Aid Speech Quality Index

From Wikipedia, the free encyclopedia

Hearing-Aid Speech Quality Index (HASQI) is a measure of audio quality originally designed for the evaluation of speech quality for those with a hearing aid,.[1][2] It has also been shown to be able to gauge audio quality for non-speech sounds and for listeners without a hearing loss.[3]

Background

[edit]

While the perception of audio quality can be gauged through perceptual measurements, the testing is time-consuming to undertake. Consequently, a number of metrics have been developed to allow audio quality to be evaluated without the need for human listening. Standardized examples from telephony include PESQ, POLQA, PEVQ and PEAQ. HASQI was originally developed by Kates and Arehart to evaluate how the distortions introduced by hearing aids degrade quality.[1] They also produced a new version in 2014.[2]

Kressner et al.[3] tested a speech corpus different from the dataset used to develop HASQI and showed that the index generalizes well for listeners without a hearing loss with a performance comparable to PESQ. Kendrick et al.[4] showed that HASQI can grade the audio quality of music and geophonic, biophonic, and anthrophonic quotidian sounds, although their study used a more limited set of degradations.

Method

[edit]

HASQI and its 2014 revision are double-ended methods requiring both a clean reference and the degraded signal to allow evaluation. The index attempts to capture the effects of noise, nonlinear distortion, linear filtering and spectral changes, by computing the difference or correlation between key audio features. This is done by examining short-time signal envelopes to quantify the degradation caused by noise and nonlinear filtering, and long-time signal envelopes to quantify the effects of linear filtering. Version 2 of HASQI includes a model to capture some aspects of the peripheral auditory system for both normal and hearing impaired listeners.

Kendrick et al. developed a blind (single-ended) method, bHASQI, using machine learning. This enables the audio quality to be evaluated from just the degraded signal without needing the clean reference.[4]

See also

[edit]

References

[edit]
  1. ^ a b Kates, James; Arehart, Kathryn (2010). "The hearing-aid speech quality index (HASQI)". Journal of the Audio Engineering Society. 58 (5): 363–381.
  2. ^ a b Kates, James; Arehart, Kathryn (2014). "The hearing-aid speech quality index (HASQI) version 2". Journal of the Audio Engineering Society. 62 (3): 99–117. doi:10.17743/jaes.2014.0006.
  3. ^ a b Kressner, Abigail A.; Anderson, David V.; Rozell, Christopher J. (2013). "Evaluating the Generalization of the Hearing Aid Speech Quality Index (HASQI)". IEEE Transactions on Audio, Speech, and Language Processing. 21 (2): 407. doi:10.1109/TASL.2012.2217132. S2CID 2722337.
  4. ^ a b Kendrick, Paul; Li, Francis; Fazenda, Bruno; Jackson, Iain; Cox, Trevor (2015). "Perceived Audio Quality of Sounds Degraded by Nonlinear Distortions and Single-Ended Assessment Using HASQI". Journal of the Audio Engineering Society. 63 (9): 698–712. doi:10.17743/jaes.2015.0068.
[edit]