I have been getting some anomalous results with replaygain. Mostly it works as expected: albums are perceived to be of equal loudness. However, some highly dynamic instrumental albums, and some heavy metal albums are respectively too loud and too soft.
Replaygain auto on Evelyn Glennie - Drumming (Avant-garde classical / drumming / percussion):
Level Adjustment | Peak Level (Adjst)| Filename
----------------------+-----------------------+-------------------------------
+0.00 dB =>+22.97 dB | => -13.2 (+9.74)| Evelyn Glennie - Drumming - 01 - Entrances.mpc
+0.00 dB => +4.39 dB | => -1.71 (+2.68)| Evelyn Glennie - Drumming - 02 - Halasana.mpc
+0.00 dB => +3.54 dB | => -1.22 (+2.32)| Evelyn Glennie - Drumming - 03 - Sorbet No. 1- Latin American Interlude.mpc
+0.00 dB => +6.57 dB | => -0.45 (+6.12)| Evelyn Glennie - Drumming - 04 - Bongo-O.mpc
+0.00 dB => +2.02 dB | => -0.30 (+1.72)| Evelyn Glennie - Drumming - 05 - Sorbet No. 2- Chinese Cymbals.mpc
+0.00 dB => +3.13 dB | => -0.35 (+2.78)| Evelyn Glennie - Drumming - 06 - Prim.mpc
+0.00 dB => +3.78 dB | => -4.92 (-1.14)| Evelyn Glennie - Drumming - 07 - Sorbet No. 3- UDU Trail.mpc
+0.00 dB => +4.45 dB | => -0.28 (+4.17)| Evelyn Glennie - Drumming - 08 - The Anvil Chorus.mpc
+0.00 dB => +6.77 dB | => -3.40 (+3.37)| Evelyn Glennie - Drumming - 09 - Sorbet No. 4- Woodblocks and Falling Instruments.mpc
+0.00 dB =>+10.12 dB | => -0.46 (+9.66)| Evelyn Glennie - Drumming - 10 - To the Earth.mpc
+0.00 dB => +9.35 dB | => -5.06 (+4.29)| Evelyn Glennie - Drumming - 11 - Sorbet No. 5- Wood and metal Chimes.mpc
+0.00 dB => +3.66 dB | => -0.45 (+3.21)| Evelyn Glennie - Drumming - 12 - Pezzo Da Concerto No. 1, Op. 15, for Snare Drum.mpc
+0.00 dB => +1.66 dB | => +0.01 (+1.67)| Evelyn Glennie - Drumming - 13 - Sorbet No. 6- Simtak Debut.mpc
+0.00 dB => +4.28 dB | => -0.45 (+3.83)| Evelyn Glennie - Drumming - 14 - Matre's Dance.mpc
+0.00 dB => +0.00 dB | => -13.5 (-13.5)| Evelyn Glennie - Drumming - 15 - Exits.mpc
+0.00 dB => +3.27 dB | => -0.77 (+2.50)| Evelyn Glennie - Drumming - 16 - Sorbet No. 7- Hi-Hat Playout.mpc
=> +4.37 dB | => +0.01 (+4.38)|
The +4.37 dB album gain makes this drumming album too loud. Does the RMS peak calculation fall short when it encounters this type of highly dynamic music?
Conversely:
Replaygain auto on Creed - Human clay (Post-grunge):
Title | Album |
Level- | (Peak+)| Level- | (Peak+)|
Adjustment| Peak (Adjst)|Adjustment| Peak (Adjst)| Filename
----------+--------------+----------+--------------+---------------------------
-8.18 dB | 40133 (15650)| -8.00 dB | 42725 (17009)| Creed - Human clay - 01 - Are you ready .mpc
-8.07 dB | 40149 (15855)| -8.00 dB | 42725 (17009)| Creed - Human clay - 02 - What if.mpc
-7.87 dB | 41195 (16647)| -8.00 dB | 42725 (17009)| Creed - Human clay - 03 - Beautiful.mpc
-7.14 dB | 42725 (18779)| -8.00 dB | 42725 (17009)| Creed - Human clay - 04 - Say I.mpc
-9.00 dB | 40503 (14371)| -8.00 dB | 42725 (17009)| Creed - Human clay - 05 - Wrong way.mpc
-8.89 dB | 39643 (14245)| -8.00 dB | 42725 (17009)| Creed - Human clay - 06 - Faceless man.mpc
-8.68 dB | 41197 (15166)| -8.00 dB | 42725 (17009)| Creed - Human clay - 07 - Never die.mpc
-6.83 dB | 38461 (17519)| -8.00 dB | 42725 (17009)| Creed - Human clay - 08 - With arms wide open.mpc
-7.71 dB | 38969 (16041)| -8.00 dB | 42725 (17009)| Creed - Human clay - 09 - Higher.mpc
-6.77 dB | 38803 (17798)| -8.00 dB | 42725 (17009)| Creed - Human clay - 10 - Wash away those years.mpc
-6.43 dB | 38953 (18580)| -8.00 dB | 42725 (17009)| Creed - Human clay - 11 - Inside us all.mpc
-7.98 dB | 38057 (15186)| -8.00 dB | 42725 (17009)| Creed - Human clay - 12 - Young grow old.mpc
The –8.00 dB album gain makes this a little too soft, considering that this type of music has to be listened to fairly loud.
I suspect that the RMS energy setting of 95% is what is causing this effect. With loud, continuous energy, highly [studio] compressed pop/rock etc, the replaygain implementation is “turning down” the energy level too much.
With highly staccato, dynamic, percussive solo instrumental, the RMS energy level is not representing the true perceived loudness of the music, and replaygain tends to “push” it too high.
If you were to analyse the histograms of the RMS energy levels for “Drumming” you would probably find the most common RMS value being quite soft, and even the 95% RMS value would be quite low.
What I’m suggesting is that the RMS value needs to be set on an album by album basis. in the range of ~70% up to ~99%. Perhaps the spread function of the histogram could be used to set this value. (The histograms for three types of sound can be seen here: http://privatewww.essex.ac.uk/~djmrob/repl...al_process.html.)
I’m not sure how you would do this, but some analysis of the RMS histogram might be used to calculate a value for RMS peak.