It might also be helpful to know the source of the file and which particular system, if any, it was designed for (CAVS, etc.).
My knee-jerk reaction is that this looks a lot like an audio file that is not sampled at 16-bit,
44.1 kHz.
I could be wrong, but the behavior is strikingly similar.
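
If the file happens to be a plain PCM WAV (that is an assumption on my part, since the format was not stated), a quick way to test this guess is to read the header and see what bit depth and sample rate it reports. A rough sketch using Python's standard `wave` module; the function names `describe_wav` and `is_cd_format` are just made up for illustration:

```python
import wave


def describe_wav(path):
    """Return (bits_per_sample, sample_rate_hz, channels) from a WAV header.

    Assumes an uncompressed PCM WAV; the wave module raises wave.Error
    for files it cannot parse.
    """
    with wave.open(path, "rb") as wav:
        bits = wav.getsampwidth() * 8  # sample width is reported in bytes
        return bits, wav.getframerate(), wav.getnchannels()


def is_cd_format(path):
    """True if the file reports 16-bit samples at 44100 Hz."""
    bits, rate, _channels = describe_wav(path)
    return bits == 16 and rate == 44100
```

Calling `describe_wav("yourfile.wav")` (hypothetical filename) on the problem file and getting something other than 16 and 44100 back would support the theory. If the file is not a WAV at all, this check does not apply and a different tool would be needed.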