IPB

Welcome Guest ( Log In | Register )

> Hydrogenaudio Forum Rules

- No Warez. This includes warez links, cracks and/or requests for help in getting illegal software or copyrighted music tracks!
- No Spamming or Trolling on the boards, this includes useless posts, trying to only increase post count or trying to deliberately create a flame war.
- No Hateful or Disrespectful posts. This includes: bashing, name-calling or insults directed at a board member.
- Click here for complete Hydrogenaudio Terms of Service

3 Pages V  < 1 2 3  
Reply to this topicStart new topic
New Lossy Audio Codec, JPEG is not an ideal audio codec
rutra80
post Sep 16 2005, 20:17
Post #51





Group: Members (Donating)
Posts: 799
Joined: 12-September 03
Member No.: 8821



QUOTE (Klyith @ Sep 16 2005, 06:27 AM)
3. Save as 16bit 44khz wav files
4. Convert to lossy
5. Convert back to wav (using foobar)
*

I wonder what effects would give different sampling-rates and lowpass values, and turning on/off fb2k's dithering...
QUOTE (XoR @ Sep 16 2005, 04:34 PM)
WV - I see no diffrence from oryginal picture (best audio picture codec :] )
OFS - the are any diffrences too !!!!!!!!!!!!!!!!!
*

Because they're lossless codecs, I wonder how their hybrid modes would perform though...

This post has been edited by rutra80: Sep 16 2005, 20:19
Go to the top of the page
+Quote Post
de Mon
post Sep 16 2005, 20:52
Post #52





Group: Members
Posts: 474
Joined: 1-December 02
Member No.: 3940



QUOTE (XoR @ Sep 16 2005, 06:34 AM)
MPC - increase contrast and saturation of images blink.gif
MP3 - on the contrary decrease contrast a bit
OGG - do perfectly from all above and show oryginal colors
WV - I see no diffrence from oryginal picture (best audio picture codec :] )
OFS - the are any diffrences too !!!!!!!!!!!!!!!!!
*


I wonder how these results correlate to real double blind listening tests.
Can anybody ABX samples? May be we have got a new lossy audio quality measuring method laugh.gif


--------------------
Ogg Vorbis for music and speech [q-2.0 - q6.0]
FLAC for recordings to be edited
Speex for speech
Go to the top of the page
+Quote Post
XoR
post Sep 16 2005, 21:35
Post #53





Group: Members
Posts: 14
Joined: 18-August 05
Member No.: 24000



QUOTE ("rutra80")
Because they're lossless codecs, I wonder how their hybrid modes would perform though...

I tested lossy mode smile.gif
To me WM and OFS sound better even than same bitrate OGG so that's not surprise to me that they "look" better cool.gif

This post has been edited by XoR: Sep 16 2005, 21:46
Go to the top of the page
+Quote Post
HbG
post Sep 17 2005, 01:04
Post #54





Group: Members
Posts: 289
Joined: 12-May 03
From: The Hague
Member No.: 6555



I think its quite a surprise the relative order of encoders seems to hold on images.


--------------------
Veni Vidi Vorbis.
Go to the top of the page
+Quote Post
Shade[ST]
post Sep 17 2005, 02:29
Post #55





Group: Members
Posts: 1189
Joined: 19-May 05
From: Montreal, Canada
Member No.: 22144



QUOTE (HbG @ Sep 16 2005, 06:04 PM)
I think its quite a surprise the relative order of encoders seems to hold on images.
*

A greater surprise would be to see the compression gain factor they actually reach on such samples. These bitmaps are coded as 22050Hz sampling rate / 8 bit per sample, correct? what compression ratio is reached, while maintaining discernable image quality? Such an analysis would be interesting, as well as seeing what type of image compresses the most (gradients, checkered pattern, etc..)
Go to the top of the page
+Quote Post
Mac
post Sep 28 2005, 10:47
Post #56





Group: Members
Posts: 650
Joined: 28-July 02
From: B'ham UK
Member No.: 2828



You can get some nice visual effects by applying your own distortion instead of encoding to mp3.. flanging and multi-tap delay are quite nice smile.gif


--------------------
< w o g o n e . c o m / l o l >
Go to the top of the page
+Quote Post
JB_
post Apr 2 2007, 00:24
Post #57





Group: Members
Posts: 6
Joined: 18-April 06
Member No.: 29719



QUOTE (Ivan Dimkovic @ Oct 12 2003, 10:21) *
QUOTE
While studying for a Fourier Analysis test some of my flatmates and I were discussing how well JPEG would encode music. Since both lossy audio codecs (MP3, Vorbis, etc) and JPEG operate on the same basic idea (discarding unimportant data in the frequency domain) we decided it would be an interesting thing to test.


That simply won't work well - because audio coders exploit the irrelevancy according to the human psychoacoustics, adding noise in frequency regions that are masked by outer-inner ear transfer and inner-ear processing.

Good audiovisual coders exploit the visual irrelevancy - so, you will end up with noise allocated in regions that do not correspond to psychovisual masking critereia.


yes, but the objective is the same in either compressors. the goal is to look as close to the original picture or to sound as good as the original sound.

With this technique only the methods are switched.

By using a picture compressor in a sound file, you're trying to find which is the compressor that makes the file looks as good as the original, depending on the settings you use. By using a sound compressor in a picture file, you're trying to find which is the compressor that make the file sound as good as the original (which i think hasn't been done yet)

When settings are specified to the compressors to work (or compress) the file is reconstructed based on those same settings. how come an audio compressor knows that an given file is an audio file? the irrelevancy model are part of a tentative of recreating the file. All the compressors do is try to reproduce the original file. To make an analogy, its like modern painters trying to reproduce a famously known piece of art, which reproduction stays closer to the original one.

The compressors aren't reasonful, they don't make decisions, they use algorithms to work, regardless of its content/data. With this I think theres no reason not to think that an jpeg compressor can't be looked as an audio compressor. In the end it only shows how good it is at reproducing the original file.
Go to the top of the page
+Quote Post
pepoluan
post Apr 2 2007, 19:41
Post #58





Group: Members
Posts: 1455
Joined: 22-November 05
From: Jakarta
Member No.: 25929



Holy thread resurrection! biggrin.gif

You know that the thread is 2005, don't you? wink.gif

But... I do wonder how the result would be with today's hunked up lossy compressors...

And no one tested AAC yet...


--------------------
Nobody is Perfect.
I am Nobody.

http://pandu.poluan.info
Go to the top of the page
+Quote Post
Febs
post Apr 2 2007, 20:39
Post #59





Group: Members
Posts: 54
Joined: 29-May 05
Member No.: 22386



Maybe we can post the results on Head-Fi and then people there can use the quality of pictures to decide which encoder to use for audio.

rolleyes.gif
Go to the top of the page
+Quote Post
...Just Elliott
post Apr 3 2007, 12:04
Post #60





Group: Members
Posts: 446
Joined: 13-August 06
Member No.: 34002



THIS IS ART.


--------------------
err... i'm not using windows any more ;)
Go to the top of the page
+Quote Post
Woodinville
post Apr 6 2007, 18:40
Post #61





Group: Members
Posts: 1355
Joined: 9-January 05
From: JJ's office.
Member No.: 18957



QUOTE (pepoluan @ Apr 2 2007, 11:41) *
Holy thread resurrection! biggrin.gif

You know that the thread is 2005, don't you? wink.gif

But... I do wonder how the result would be with today's hunked up lossy compressors...

And no one tested AAC yet...


Somebody ought to test AAC for image coding.

After all, look for Kuo's paper on using SNS (Spatial Noise Shaping) in image coding smile.gif


--------------------
-----
J. D. (jj) Johnston
Go to the top of the page
+Quote Post
Gabriel
post Apr 10 2007, 10:04
Post #62


LAME developer


Group: Developer
Posts: 2950
Joined: 1-October 01
From: Nanterre, France
Member No.: 138



QUOTE (Woodinville @ Apr 6 2007, 18:40) *
Somebody ought to test AAC for image coding.

After all, look for Kuo's paper on using SNS (Spatial Noise Shaping) in image coding smile.gif


Sure, and PNS would reproduce film grain laugh.gif
Go to the top of the page
+Quote Post
gaekwad2
post Apr 10 2007, 11:08
Post #63





Group: Members
Posts: 127
Joined: 11-April 06
Member No.: 29396



Too bad Nero discounted the use of PNS without testing its usefulness for images. sad.gif

Anyway, here are some results from the current NeroAAC encoder:


(crops taken from a 3mp image (original), size refers to the whole picture (original is 1322KB))
Go to the top of the page
+Quote Post
Woodinville
post Apr 10 2007, 18:58
Post #64





Group: Members
Posts: 1355
Joined: 9-January 05
From: JJ's office.
Member No.: 18957



QUOTE (gaekwad2 @ Apr 10 2007, 03:08) *
Too bad Nero discounted the use of PNS without testing its usefulness for images. sad.gif

Anyway, here are some results from the current NeroAAC encoder:


(crops taken from a 3mp image (original), size refers to the whole picture (original is 1322KB))


:rollingonthefloorlaughing:

There was an irony, I'm told, to the review of Kuo's paper. One of the reviewers mistook the use of a full-image FFT (for SNS purposes) as an argument to use a full image FFT for coding. Talk about missing the point completely...


--------------------
-----
J. D. (jj) Johnston
Go to the top of the page
+Quote Post
gameplaya15143
post Apr 20 2007, 00:57
Post #65





Group: Members
Posts: 484
Joined: 8-January 06
From: Earth
Member No.: 26978



Wow, I have got to try this at home smile.gif


--------------------
Vorbis-q0-lowpass99
lame3.93.1-q5-V9-k-nspsytune
Go to the top of the page
+Quote Post
gaekwad2
post Apr 20 2007, 09:19
Post #66





Group: Members
Posts: 127
Joined: 11-April 06
Member No.: 29396



Vorbis -q-2 looks nice as well:



At higher bitrates AAC, Vorbis and mp3 look fairly similar.
Go to the top of the page
+Quote Post
Mr VacBob
post Nov 22 2007, 02:44
Post #67





Group: Members
Posts: 17
Joined: 2-July 04
Member No.: 15023



Can't let this topic die, so I wrote yet another one!
http://astrange.ithinksw.net/tools/imgpcm.c

This is fancier, not that it helps anything; it converts RGB to YUV, with Y in the center and U-V (the hue, sort of) as the stereo difference, then lays the image out along a Hilbert curve.

Lossless:

(The colors are already wrong since it pretty much stores the image by dropping the saturation)

lame 32kbit:

(the mp3 is 235kb)

And, uh, aotuv b5:


Apparently it can't handle the signal phase changing.

No idea how to save the image saturation without something gross, like an NTSC signal...
Go to the top of the page
+Quote Post
cabbagerat
post Nov 22 2007, 15:29
Post #68





Group: Members
Posts: 1018
Joined: 27-September 03
From: Cape Town
Member No.: 9042



QUOTE (Mr VacBob @ Nov 21 2007, 17:44) *
Can't let this topic die, so I wrote yet another one!
http://astrange.ithinksw.net/tools/imgpcm.c

This is fancier, not that it helps anything; it converts RGB to YUV, with Y in the center and U-V (the hue, sort of) as the stereo difference, then lays the image out along a Hilbert curve.
That's a really cool tool, thanks.

I can't believe this thread is alive after nearly four years smile.gif


--------------------
Simulate your radar: http://www.brooker.co.za/fers/
Go to the top of the page
+Quote Post
gaekwad2
post Nov 30 2007, 16:48
Post #69





Group: Members
Posts: 127
Joined: 11-April 06
Member No.: 29396



QUOTE (Mr VacBob @ Nov 22 2007, 02:44) *
Can't let this topic die, so I wrote yet another one!

[...]

No idea how to save the image saturation without something gross, like an NTSC signal...

How about multichannel?

Instead of Lame you'd have to use fhg mp3surround, but apart from that it should work.
Go to the top of the page
+Quote Post

3 Pages V  < 1 2 3
Reply to this topicStart new topic
1 User(s) are reading this topic (1 Guests and 0 Anonymous Users)
0 Members:

 



RSS Lo-Fi Version Time is now: 23rd May 2013 - 06:23