Having some previous experiences with cdparanoia I have been thinking about a 2 pass strategy using 2 plextor drives.
My first drive I've been trusting for some time is a Plextor PX-W4012A. Today I acquired a Plextor Premium to test a theory :
Basic ideas :
This is a similar assumption as EAC test & copy : if you read the data twice it should be identical. If there's a read error data it spits out random data so both reads don't compare, then retry until you have consistent results.
- even though both plextors do caching and this is an issue with cdparanoia, reading drives from two different drives to two independent output directories should make the verification quite easy as the cache is not shared across drives
I've modified the "abcde" tool which is a frontend to cdparanoia and flac. By default it bails out if there's no CD in the drive. I changed this script so that it waits until there's a cd in the drive. I created another wrapper script around the modified "abcde" bash script so I can mass rip. It also contains the correct offset values for both drives.
When a cd is finished it is ejected and when a new one is inserted it starts without having to press a key or click.
The idea is to rip a batch in drive A and another in drive B simultaniously, when both batches are finished, swap batches across drives. Then run the rsync tool on both output dirs to spot differences e.g.
Plextor PX-W4012A rips to /data/plex
Plextor PREMIUM rips to /data/prem
comparing is done using
rsync -avnrcI /data/plex/ /data/plex/prem
and will show if there's any differences between both drives. This is the same idea as manually comparing all files using the diff command (but automated
Those problematic CD's will probably need manual intervention using EAC for the reread strategy.
This is because cdparanoia will not retry as the data is probably cached in each drive, so if there's a read error, and the sector is reread it will produce the same data from the cache.
The idea is that problem cd's will occur infrequently and as every cd has to be read at least twice, 2 drives will save time and the problem cd's can be solved using EAC.
Caching sidenote :
If there's a way in Linux to disable the drive's caching then both dirs should be always identical unless you'll have an error which can only recovered by one drive (e.g. some nasty scratch which is recoverable on only one drive). Being able to disable the cache would also increase the speed as two drives can securely RIP simultaniously without having to swap batches and the EAC step is avoided even more.
Comments are really appreciated on my first post at hydrogen
