Informed Multiple-F0 Estimation Applied to Monaural Audio Source Separation



Experimental results are obtained from a single-channel 44.1kHz-sampled mixture. The score of each instrument is estimated at the coder and used for the separation at the decoder as described in Fig.1 and Fig.2.
Fig. 1 Coder
Fig. 2 Decoder


Musical piece 1 (3 instruments): flute, piano and bass
Amout of information used: 2415 bits (0.4 kbs), original MIDI file size: 5424 bits (0.9 kbs)
Original mixtureOriginal sources Separated sources (NMF)Separated sources (filtering)
mix.wav s1.wav s2.wav s3.wav mix.wav s1.wav s2.wav s3.wav mix.wav s1.wav s2.wav s3.wav


Musical piece 2 (4 instruments): B3, piano, bass, drum
Amout of information used: 2353 bits (0.39 kbs), original MIDI file size: 6120 bits (1.02 kbs)
Original mixtureOriginal sources Separated sources (NMF)Separated sources (filtering)
mix.wav s1.wav s2.wav s3.wav mix.wav s1.wav s2.wav s3.wav mix.wav s1.wav s2.wav s3.wav