3 minutes into the audio, the mp3 is already several seconds ahead of the wav. This is a problem because I want to mux the mp3 into a video and things need to be in sync.
The original is 48 kHz, 16bit mono which Lame downsamples to 44100 Hz.
The settings I used were -V6 --vbr-new -q0, I am using Lame version 3.97.
I also tried -abr 64 instead of -V6 which didn't help.
The wav is only spoken words with a lot of silence in between, plus the first 26 seconds are silent. I wonder if the silent parts could be causing this?
Has anyone experienced this? Am I doing something wrong?