Raspberry Pi often produces smaller FLACs than PC; why?

Topic: Raspberry Pi often produces smaller FLACs than PC; why? (Read 10730 times) previous topic - next topic

0 Members and 1 Guest are viewing this topic.

Raspberry Pi often produces smaller FLACs than PC; why?

2013-10-15 08:02:46

I have couple of Raspberry Pi's and I noticed an interesting effect.

Compared encoding the same wav file on a PC vs the Raspberry Pi (arm)
95% the time the Raspberry one would be make a smaller file. Decoded both version of flac back to wav file, and compared md5sum's and they where the same.
Guessing arm version flac encodes a tighter file.

Wolf

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #1 – 2013-10-15 08:08:07

It depends which compression settings of FLAC is being used, -0 (bigger file) to -8 (smaller file).

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #2 – 2013-10-15 08:10:01

Quote from: eahm on 2013-10-15 08:08:07

It depends which compression settings in FLAC is being used -0 to -8.

My bad, both systems used the -8 option for the smallest possible file.

Wolf

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #3 – 2013-10-15 09:44:57

Displayed file size may vary because of different disk formats or format parameters, having less to do with how many bytes of data are in the file. Copy files from one device to the other and compare them side by side.

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #4 – 2013-10-15 10:30:51

To get the actual size in bytes on Linux:

Code: [Select]

$ stat -c '%s' file

On OS X (and possibly UNIX in general):

Code: [Select]

$ stat -f '%z' file

There's also "du -b", but that prints the "apparent size", which "may be larger due to holes in sparse files, internal fragmentation, indirect blocks, and the like".

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #5 – 2013-10-15 13:08:04

Alternatively you can use:

Code: [Select]

$ wc -c file

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #6 – 2013-10-15 15:01:31

Quote from: Werewolf6851 on 2013-10-15 08:02:46

Decoded both version of flac back to wav file, and compared md5sum's and they where the same.

If the encoders are not seriously flawed the two wav files must obviously be bit identical. Did you also checksum the two flac files (assumed tags and everything else being equal between them)?

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #7 – 2013-10-15 16:24:04

Quote from: Nessuno on 2013-10-15 15:01:31

Did you also checksum the two flac files (assumed tags and everything else being equal between them)?

While this may no longer be true, different processors ~~could~~ would yield different flac files when using the same command line arguments.

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #8 – 2013-10-15 16:44:22

https://xiph.org/flac/faq.html#tools__different_sizes

Quote

Why doesn't the same file compressed on different machines with the same options yield the same FLAC file?

It's not supposed to, and neither does it mean either encoding was bad. There are many variations between different machines or even different builds of flac on the same machine that can lead to small differences in the FLAC file, even if they have the exact same final size. This is normal.

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #9 – 2013-10-15 21:05:21

Quote from: Nessuno on 2013-10-15 15:01:31

If the encoders are not seriously flawed the two wav files must obviously be bit identical. Did you also checksum the two flac files (assumed tags and everything else being equal between them)?

No, I don't think so. The compiler, the instruction set architecture (which may work with slightly different precision arithmetic) and such may lead to small differences.

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #10 – 2013-10-15 23:44:58

Quote from: aztec_mystic on 2013-10-15 21:05:21

Quote from: Nessuno on 2013-10-15 15:01:31
If the encoders are not seriously flawed the two wav files must obviously be bit identical. Did you also checksum the two flac files (assumed tags and everything else being equal between them)?

No, I don't think so. The compiler, the instruction set architecture (which may work with slightly different precision arithmetic) and such may lead to small differences.

These matter for lossy, but should not for lossless unless the format is very very broken.

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #11 – 2013-10-16 00:02:10

...but flac isn't broken.

http://www.hydrogenaudio.org/forums/index....st&p=847309

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #12 – 2013-10-16 07:02:51

Now, that's interesting!

I'm fine with different filesystems using or showing different space for the same raw data, but intuitively I gave for granted that a lossless algorithm is univocal, deterministic and implementation independent.

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #13 – 2013-10-16 16:55:39

I guess this is the consequence of using floating point based analysis and prediction, resulting slight difference in rice parameters or something ... but I don't know for sure.
Yeah, it's not intuitive without looking at FLAC code doing fp math.

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #14 – 2013-10-16 17:31:49

Quote from: nu774 on 2013-10-16 16:55:39

I guess this is the consequence of using floating point based analysis and prediction, resulting slight difference in rice parameters or something ... but I don't know for sure.
Yeah, it's not intuitive without looking at FLAC code doing fp math.

That is the issue. It was explained by Josh starting here.

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #15 – 2013-10-16 17:32:39

Its also pretty common that the assembly version of an algorithm is implemented slightly or even significantly differently than the c version. If you want to compare between platforms, disabling assembly so that each device runs the same code often gets rid of some or even all of the difference. Of course it also makes it much slower.

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #16 – 2013-10-16 18:08:21

Quote from: Nessuno on 2013-10-16 07:02:51

I'm fine with different filesystems using or showing different space for the same raw data, but intuitively I gave for granted that a lossless algorithm is univocal, deterministic and implementation independent.

Maybe this an easy explanation: FLAC uses 'unreliable' (but easy) floating-point math to do the modeling and approximation, but that model is stored and reconstructed with integer-math only. The residual is integer-math only too. The FP-math stuff is just to point the 'real' encoder in the right direction.

You can compile FLAC using integer-math only, but the files it creates are much larger.

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #17 – 2013-10-17 08:05:38

Quote from: saratoga on 2013-10-15 23:44:58

Quote from: aztec_mystic on 2013-10-15 21:05:21
No, I don't think so. The compiler, the instruction set architecture (which may work with slightly different precision arithmetic) and such may lead to small differences.

These matter for lossy, but should not for lossless unless the format is very very broken.

I don't understand your point. Of course, the audio after decoding should be identical regardless of the platform you encoded the file on. This thread, however, is about the file size of the FLAC.

Again, I am a layman when it comes to codecs. But it is not plausible to me why binaries must yield FLACs of identical file size in every case. I don't think it's implausible that a given compression code will yield slightly different compression ratios depending on compiler flags, instruction set architecture, and similar factors.

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #18 – 2013-10-17 18:41:25

For me this raises an interesting theoretical question:

Does this mean there might be an "ideal" platform which would yield consistently better compression?
(Possibly where FP math is done with more precision)

Again this is more of a curiosity. At the practical level, my guess is the answer would be irrelevant.

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #19 – 2013-10-17 18:57:26

Possibly relevant: deterministic building process

Raspberry Pi often produces smaller FLACs than PC; why?

Reply #20 – 2013-10-18 06:16:19

Quote from: Makaki on 2013-10-17 18:41:25

Does this mean there might be an "ideal" platform which would yield consistently better compression?
(Possibly where FP math is done with more precision)

Probably not. The cases where the details of the FP math (precision, order of operations, rounding direction) would make a difference in an integer result are going to be cases where the "true" result is very close to the decision threshold. The two choices are likely to be equally efficient near that threshold. If a difference in FP math did make a consistent improvement, then that would indicate a area of possible optimization.

Notice