If this is implemented handbrake speed can bewonderful

You'd have to talk with the x264 guys about implementing this, as Handbrake uses their library for encoding.jeylm wrote:Hallo, AMD has released the instructions set architecture for R700 today http://developer.amd.com/gpu_assets/R70 ... ecture.pdf
If this is implemented handbrake speed can bewonderfulis it possible ?
No-one said that.I think GPU Acceleration under When Hell Freezes Over section is naive.
I think your implying that I'm saying CUDA is useless for everything. I'm not.CUDA is a bust? I think not.
I did, in the requests forum readme.s55 wrote:No-one said that.I think GPU Acceleration under When Hell Freezes Over section is naive.
I stand corrected.I did, in the requests forum readme.
Most enormous [Censored] i read in the past 2 years. A GPU is capable of doing a [Censored] of atomic (!) operations per second, MUCH more than a standard CPU will ever dream of - hence EVERY CODE YOU CAN THINK OF fits in a GPU - especially (!) with frameworks like OpenCL, Brook+, Stream, CUDA and whatnot.jbrjake wrote:s55 wrote:There is practically no code capable of being accelerated that way in our project,
And yet all they've produced are a couple mediocre encoders. You can get the same speed and quality on your CPU using settings similar to x264cli's superfast preset.tha_specializt wrote:companys have already catched on, CUDA is very well supported
I don't even know what that means. 1000 seconds per 1080p frame?tha_specializt wrote:how does "1080p / ~1000s" sound to you? Yes, thats what i thought.
I'm kind of confused why you would bring up atomic operations here. Maybe you misunderstand what an atomic operation is? They're really rather slow on a GPU, because they usually require access to global memory. This gives you a penalty of several hundred clock cycles. It really would be more beneficial to avoid atomic operations if you can.tha_specializt wrote:A GPU is capable of doing a [Censored] of atomic (!) operations per second, MUCH more than a standard CPU will ever dream of - hence EVERY CODE YOU CAN THINK OF fits in a GPU - especially (!) with frameworks like OpenCL, Brook+, Stream, CUDA and whatnot.
I'm not quite sure what you mean by the "1080p / ~1000s" here.tha_specializt wrote:One would have to break down the code into atomic operations - which is a [Censored] of annoying work but a not-so-small team of developers (therefore : $1 != megalomaniac scriptkiddies) could transpose the essential code in a few weeks - months, maybe. Oh and before you start crying : companys have already catched on, CUDA is very well supported - how does "1080p / ~1000s" sound to you?
Wow. As everyone else has noted, you are a moron.tha_specializt wrote:Most enormous [Censored] i read in the past 2 years. A GPU is capable of doing a [Censored] of atomic (!) operations per second, MUCH more than a standard CPU will ever dream of - hence EVERY CODE YOU CAN THINK OF fits in a GPU - especially (!) with frameworks like OpenCL, Brook+, Stream, CUDA and whatnot.jbrjake wrote:s55 wrote:There is practically no code capable of being accelerated that way in our project,
One would have to break down the code into atomic operations - which is a [Censored] of annoying work but a not-so-small team of developers (therefore : $1 != megalomaniac scriptkiddies) could transpose the essential code in a few weeks - months, maybe. Oh and before you start crying : companys have already catched on, CUDA is very well supported - how does "1080p / ~1000s" sound to you? Yes, thats what i thought. For the future : stop trolling, stop assuming and start KNOWING things, thank you.
lol ... comparing pixels in hardware ... what a brilliant evidence of the fact that you dont even know what you're doing.First off, coalesced loads are a pain in the ass when you need to do things like compare a pixel in one line against a pixel in another line/frame that's got a horizontal offset from the first
I suggest you get in touch with the x264 project team (again, you're on the wrong forum) and enlighten them on how to properly support GPU acceleration. Until I see a x264 commit thanking you for showing them the way, I think I'll trust those who have a proven track record and have demonstrated their abilities.tha_specializt wrote:in this thread : 99,999 of the people dont know how CPUs and/or GPUs nor do most people know what gates are and how they work .... well i refuse to trolls and wannabes who start their "arguments" (if appliable) with insults, you kids need to learn about discussions.
There WAS _one_ real argument, though - the increased amount of cycles needed to access the storage ... but even this problem can be reduced a quite significant amount - with asynchronous data-distribution, inline-structures which are able to wait on a event (even if that means waiting 99% of the time), probability-distribution in terms of accessing parts of the memory (where COULD my data be? Where IS it? Where CAN it be in the next cycle?) and many more techniques which are rather advanced and ... well ... are way too complex to explain them to a few trolls(yes, i dont understand some of them myself - yes, i am no superhuman; thank you cpt. obvious)
lol ... comparing pixels in hardware ... what a brilliant evidence of the fact that you dont even know what you're doing.First off, coalesced loads are a pain in the ass when you need to do things like compare a pixel in one line against a pixel in another line/frame that's got a horizontal offset from the first
You are actually using large structures, references and the like ON HARDWARE?? Lololololol no wonder why you guys never managed it to harness the power of GPUs
by the way : "1000s / 1080p" = a whole movie, transcoded in ~1000s; 1080p. I thought that would be obvious ....