MP3 to WAV for processing with Vogone [Archive]

Clawfoot

November 14th, 2000, 10:14 PM

I have tried twice to process a WAV file I converted from MP3 and both times resulted in no sound at all when played back. The MTU information indicates that WAV files from MP3 will yield unpredictable results. Has anyone had luck processing WAV files converted from MP3?

Starlips

November 15th, 2000, 12:25 AM

I am having the same problem. I am using the convert feature in MusicMatch beta version 6.

Cid_cat

November 15th, 2000, 09:06 PM

I've had varied luck removing vocals from files I've converted from MP3 to WAV. Your luck might depend on how you are converting your files.

Of the few dozen I've tried, only 4 or 5 have worked...only 1 with exceptional results.

admin

November 16th, 2000, 06:32 PM

In case you are interested...

When a PCM (Pulse Code Modulated) linear audio file that was mastered in true stereo is converted from PCM to an MP3 compressed file, to my understanding, there are changes made in both channels that are not identical. Each channel is processed with the compression algorithm separately.

Like all psychoacoustic compression algorithms, MP3 removes audio frequencies and other components that are not "perceived" by the human ear. However, this changes the resulting bits in a 16-bit audio sample (or digital word) from what the original file contained. If it didn't change, there was no information removed. It is clear that MP3 definitely removes material to bring about the data reduction (128Kbit MP3 is 11:1 compressed).

Now, in the vocal elimination process, it is very critical that the true stereo channel data be exactly preserved. MP3 compression changes this data, and thus reduces, if not altogether prevents, the effect of vocal elimination. The fact that Cid_cat has found one MP3 file that had
"exceptional results" is very surprising. In this case, the processing of both channels must have been near identical.

For vocal elimination, the words in both channels for the frequencies of the voice (and any other sounds/music that is center panned) must be inverted, channel swapped and summed. When inverted, they become the digital opposite (or negative in mathematical terms). When two identical, digitally opposite words are summed, the result is zero. Thus, if one channel's sample word is changed slightly in the MP3 compression process, when inverted and mixed with the other channel, there will not be an exact cancelling to zero. What is left is a highly variable artifact, that is in reality, the difference between the MP3 processed audio samples from the two channels.