New Lossy Audio Codec, JPEG is not an ideal audio codec |
- No Warez. This includes warez links, cracks and/or requests for help in getting illegal software or copyrighted music tracks!
- No Spamming or Trolling on the boards, this includes useless posts, trying to only increase post count or trying to deliberately create a flame war.
- No Hateful or Disrespectful posts. This includes: bashing, name-calling or insults directed at a board member.
- Click here for complete Hydrogenaudio Terms of Service
![]() ![]() |
New Lossy Audio Codec, JPEG is not an ideal audio codec |
Sep 16 2005, 20:17
Post
#51
|
|
![]() Group: Members (Donating) Posts: 799 Joined: 12-September 03 Member No.: 8821 |
QUOTE (Klyith @ Sep 16 2005, 06:27 AM) I wonder what effects would give different sampling-rates and lowpass values, and turning on/off fb2k's dithering... QUOTE (XoR @ Sep 16 2005, 04:34 PM) WV - I see no diffrence from oryginal picture (best audio picture codec :] ) OFS - the are any diffrences too !!!!!!!!!!!!!!!!! Because they're lossless codecs, I wonder how their hybrid modes would perform though... This post has been edited by rutra80: Sep 16 2005, 20:19 |
|
|
|
Sep 16 2005, 20:52
Post
#52
|
|
![]() Group: Members Posts: 474 Joined: 1-December 02 Member No.: 3940 |
QUOTE (XoR @ Sep 16 2005, 06:34 AM) MPC - increase contrast and saturation of images MP3 - on the contrary decrease contrast a bit OGG - do perfectly from all above and show oryginal colors WV - I see no diffrence from oryginal picture (best audio picture codec :] ) OFS - the are any diffrences too !!!!!!!!!!!!!!!!! I wonder how these results correlate to real double blind listening tests. Can anybody ABX samples? May be we have got a new lossy audio quality measuring method -------------------- Ogg Vorbis for music and speech [q-2.0 - q6.0]
FLAC for recordings to be edited Speex for speech |
|
|
|
Sep 16 2005, 21:35
Post
#53
|
|
|
Group: Members Posts: 14 Joined: 18-August 05 Member No.: 24000 |
QUOTE ("rutra80") Because they're lossless codecs, I wonder how their hybrid modes would perform though... I tested lossy mode To me WM and OFS sound better even than same bitrate OGG so that's not surprise to me that they "look" better This post has been edited by XoR: Sep 16 2005, 21:46 |
|
|
|
Sep 17 2005, 01:04
Post
#54
|
|
|
Group: Members Posts: 289 Joined: 12-May 03 From: The Hague Member No.: 6555 |
I think its quite a surprise the relative order of encoders seems to hold on images.
-------------------- Veni Vidi Vorbis.
|
|
|
|
Sep 17 2005, 02:29
Post
#55
|
|
![]() Group: Members Posts: 1189 Joined: 19-May 05 From: Montreal, Canada Member No.: 22144 |
QUOTE (HbG @ Sep 16 2005, 06:04 PM) A greater surprise would be to see the compression gain factor they actually reach on such samples. These bitmaps are coded as 22050Hz sampling rate / 8 bit per sample, correct? what compression ratio is reached, while maintaining discernable image quality? Such an analysis would be interesting, as well as seeing what type of image compresses the most (gradients, checkered pattern, etc..) |
|
|
|
Sep 28 2005, 10:47
Post
#56
|
|
![]() Group: Members Posts: 650 Joined: 28-July 02 From: B'ham UK Member No.: 2828 |
You can get some nice visual effects by applying your own distortion instead of encoding to mp3.. flanging and multi-tap delay are quite nice
-------------------- < w o g o n e . c o m / l o l >
|
|
|
|
Apr 2 2007, 00:24
Post
#57
|
|
|
Group: Members Posts: 6 Joined: 18-April 06 Member No.: 29719 |
QUOTE While studying for a Fourier Analysis test some of my flatmates and I were discussing how well JPEG would encode music. Since both lossy audio codecs (MP3, Vorbis, etc) and JPEG operate on the same basic idea (discarding unimportant data in the frequency domain) we decided it would be an interesting thing to test. That simply won't work well - because audio coders exploit the irrelevancy according to the human psychoacoustics, adding noise in frequency regions that are masked by outer-inner ear transfer and inner-ear processing. Good audiovisual coders exploit the visual irrelevancy - so, you will end up with noise allocated in regions that do not correspond to psychovisual masking critereia. yes, but the objective is the same in either compressors. the goal is to look as close to the original picture or to sound as good as the original sound. With this technique only the methods are switched. By using a picture compressor in a sound file, you're trying to find which is the compressor that makes the file looks as good as the original, depending on the settings you use. By using a sound compressor in a picture file, you're trying to find which is the compressor that make the file sound as good as the original (which i think hasn't been done yet) When settings are specified to the compressors to work (or compress) the file is reconstructed based on those same settings. how come an audio compressor knows that an given file is an audio file? the irrelevancy model are part of a tentative of recreating the file. All the compressors do is try to reproduce the original file. To make an analogy, its like modern painters trying to reproduce a famously known piece of art, which reproduction stays closer to the original one. The compressors aren't reasonful, they don't make decisions, they use algorithms to work, regardless of its content/data. With this I think theres no reason not to think that an jpeg compressor can't be looked as an audio compressor. In the end it only shows how good it is at reproducing the original file. |
|
|
|
Apr 2 2007, 19:41
Post
#58
|
|
![]() Group: Members Posts: 1455 Joined: 22-November 05 From: Jakarta Member No.: 25929 |
Holy thread resurrection!
You know that the thread is 2005, don't you? But... I do wonder how the result would be with today's hunked up lossy compressors... And no one tested AAC yet... -------------------- Nobody is Perfect.
I am Nobody. http://pandu.poluan.info |
|
|
|
Apr 2 2007, 20:39
Post
#59
|
|
|
Group: Members Posts: 54 Joined: 29-May 05 Member No.: 22386 |
Maybe we can post the results on Head-Fi and then people there can use the quality of pictures to decide which encoder to use for audio.
|
|
|
|
Apr 3 2007, 12:04
Post
#60
|
|
![]() Group: Members Posts: 446 Joined: 13-August 06 Member No.: 34002 |
THIS IS ART.
-------------------- err... i'm not using windows any more ;)
|
|
|
|
Apr 6 2007, 18:40
Post
#61
|
|
![]() Group: Members Posts: 1355 Joined: 9-January 05 From: JJ's office. Member No.: 18957 |
Holy thread resurrection! You know that the thread is 2005, don't you? But... I do wonder how the result would be with today's hunked up lossy compressors... And no one tested AAC yet... Somebody ought to test AAC for image coding. After all, look for Kuo's paper on using SNS (Spatial Noise Shaping) in image coding -------------------- -----
J. D. (jj) Johnston |
|
|
|
Apr 10 2007, 10:04
Post
#62
|
|
![]() LAME developer Group: Developer Posts: 2950 Joined: 1-October 01 From: Nanterre, France Member No.: 138 |
|
|
|
|
Apr 10 2007, 11:08
Post
#63
|
|
![]() Group: Members Posts: 127 Joined: 11-April 06 Member No.: 29396 |
Too bad Nero discounted the use of PNS without testing its usefulness for images.
Anyway, here are some results from the current NeroAAC encoder: ![]() (crops taken from a 3mp image (original), size refers to the whole picture (original is 1322KB)) |
|
|
|
Apr 10 2007, 18:58
Post
#64
|
|
![]() Group: Members Posts: 1355 Joined: 9-January 05 From: JJ's office. Member No.: 18957 |
Too bad Nero discounted the use of PNS without testing its usefulness for images. Anyway, here are some results from the current NeroAAC encoder: ![]() (crops taken from a 3mp image (original), size refers to the whole picture (original is 1322KB)) :rollingonthefloorlaughing: There was an irony, I'm told, to the review of Kuo's paper. One of the reviewers mistook the use of a full-image FFT (for SNS purposes) as an argument to use a full image FFT for coding. Talk about missing the point completely... -------------------- -----
J. D. (jj) Johnston |
|
|
|
Apr 20 2007, 00:57
Post
#65
|
|
![]() Group: Members Posts: 484 Joined: 8-January 06 From: Earth Member No.: 26978 |
Wow, I have got to try this at home
-------------------- Vorbis-q0-lowpass99
lame3.93.1-q5-V9-k-nspsytune |
|
|
|
Apr 20 2007, 09:19
Post
#66
|
|
![]() Group: Members Posts: 127 Joined: 11-April 06 Member No.: 29396 |
|
|
|
|
Nov 22 2007, 02:44
Post
#67
|
|
|
Group: Members Posts: 17 Joined: 2-July 04 Member No.: 15023 |
Can't let this topic die, so I wrote yet another one!
http://astrange.ithinksw.net/tools/imgpcm.c This is fancier, not that it helps anything; it converts RGB to YUV, with Y in the center and U-V (the hue, sort of) as the stereo difference, then lays the image out along a Hilbert curve. Lossless: ![]() (The colors are already wrong since it pretty much stores the image by dropping the saturation) lame 32kbit: ![]() (the mp3 is 235kb) And, uh, aotuv b5: ![]() Apparently it can't handle the signal phase changing. No idea how to save the image saturation without something gross, like an NTSC signal... |
|
|
|
Nov 22 2007, 15:29
Post
#68
|
|
![]() Group: Members Posts: 1018 Joined: 27-September 03 From: Cape Town Member No.: 9042 |
Can't let this topic die, so I wrote yet another one! That's a really cool tool, thanks.http://astrange.ithinksw.net/tools/imgpcm.c This is fancier, not that it helps anything; it converts RGB to YUV, with Y in the center and U-V (the hue, sort of) as the stereo difference, then lays the image out along a Hilbert curve. I can't believe this thread is alive after nearly four years -------------------- Simulate your radar: http://www.brooker.co.za/fers/
|
|
|
|
Nov 30 2007, 16:48
Post
#69
|
|
![]() Group: Members Posts: 127 Joined: 11-April 06 Member No.: 29396 |
|
|
|
|
![]() ![]() |
|
Lo-Fi Version | Time is now: 23rd May 2013 - 06:23 |