Supplementary material for the article


Determining local transientness of audio signals
by
S. Molla and B. Torrésani





site under construction
(site under construction)


Abstract

Using information theoretic criteria and MDCT and wavelet transforms, a time dependent transientness index and a time-dependent tonality index may be associated with audio signals. This finds applications in several contexts, in particular hybrid audio coders, where dynamic bit budget allocation to tonal and transient layers is necessary. Extensions to similar ideas in image modeling and coding are also possible.


Transientness and tonality indices

Given a signal model of the form
signal = tonal + transient + residual
in mathematical terms, a (sparse) sum of wavelets and windowed cosine functions, with independent normally distributed coefficients, it may be shown that the sum of logarithms of squares of the wavelet coefficients of such a compount signal provides estimates on the "size" of the tonal (i.e. local sine) layer. Similarly, the sum of logarithms of squares of the local cosine coefficients of the signal yields estimates on the "size" of the transient (i.e. wavelet) layer. From this, the relative importances of tonal and transient layers may be computed.

Real sound examples from the paper:

The castagnette signal:


The castagnette signal, and the corresponding time-varying transientness index

glockenspiel signal



The jazz signal


The jazz signal, and the corresponding time-varying transientness index

funky



Corresponding time-frequency representation

time-frequency representation




Transient detection by envelope of high-pass filtered version

envelope