After searching around the forums for a while and such, I came up with some settings for trying to encode audiobooks. Unfortunately, when I tried them, the results where not at all what I expected, so I'm looking for help to understand them and refine my options.
The HA Wiki. says that -abr is best for the lower bitrates/voice, and that the "--preset voice" maps to "-abr 56 -mm".
Most of the forum posts about this are pretty old and focus either on older options, or options to work around bugs in older versions. There are also several suggestions about hardware limitations that require specific settings. Still, they almost all seem to suggest a -vbr where possible. There are also various --lowpass, mono, and resample suggestions.
I am using an MP3 player, but I have yet to come upon any limitation of it playing files, so that isn't a concern to me. However, it will only play MP3, no Vorbis, Speex, etc. My audiobooks are just people talking, so they are pretty simple. As it is a single person, I'm perfectly fine with mono. The time to encode is not a concern to me, and they are coming from CD so the starting sample rate is 44.1kHz. I want the audio to be as close to the audio on the CD as possible, where distinguishing the two is difficult under normal circumstances, but have the space as small as possible (within that constraint).
I downloaded and installed foobar2000_0.9.4.2.exe and the 3.97 LAME binary from Rarewares. CDs were ripped using fb2k strait into LAME. I used the ABX utility in fb2k to confirm what I heard easily with my ears.
I used a base LAME command line of:
-S --noreplaygain -V 3 --vbr-new --lowpass 8 -mm --resample 22.05 -q 0 - %d
This resulted in a somewhat muffled sound. Where the man's voice hissed, such as where the letter "S" was used, was much less 'hissy'. I noticed it right away. I also tried the -abr option listed in the wiki, but it sounded the same.
I found that both the resample option and the lowpass option could both individually create that muffled sound. If I used neither, the sound went away. I did not try varying their values though. I also found that after removing those options, I could not tell the difference between -V0 and -V6. I found that I could use this command line:
-S --noreplaygain -V6 --vbr-new -mm -q 0 - %d
And I can't tell the difference between that and the CD; and strangely it is even almost the same bitrate (57 kbps versus 55 kbps from my original command line). I'm looking for ways to improve this though. Unfortunately the permutations of various switches is to great for me to just try them all so I need suggestions of what should work best.
Here is a sample of audio from my CD (4MB). You could see what I mean in the first 15 seconds of that.
Also, is there a way to just set a quality setting? I know with Vorbis you have the option of setting a specific quality, and it will encode to that quality no matter what the resulting bitrate.