Cuberite Forum

Full Version: GPU code
Two things. First, did you install the AMD SDK? Second, is the value of OPENCL_INCLUDE_DIR set to NOT-FOUND? If so, you need to set it.
Do I need the AMD SDK if I have an NVIDIA card?
If you've got an NVIDIA card you need the CUDA Toolkit, but I haven't tried building with that.
Now doing Biomal without the linear interpolation at 1 ms a chunk.
Cool. Is that better than before?
Haven't checked. One thing is that it is only using 65% of the GPU's capability. If we implemented batching it would double throughput.

What I'm trying to do is implement OpenCL for all the height generators where it makes sense, so I can change the interface when using OpenCL.
Did a test with the mountains height gen. Using OpenCL is 75% slower than using the CPU generator. I suspect that this is due to the huge latency at start-up, about 450 microseconds, or roughly 500,000 CPU cycles. If I can move more things onto the GPU and make the generator asynchronous, I should be able to remove that from the throughput though.
But the HeightGen isn't really the bottleneck, is it?
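To put rough numbers on the latency and batching points above, here is a toy cost model, not a measurement. It assumes each GPU dispatch pays the start-up latency once (the ~450 microseconds quoted above) and then a per-chunk compute cost. The struct, the function names, and the per-chunk and CPU timings in the usage note are all hypothetical, picked only for illustration.

```cpp
#include <cassert>
#include <cstddef>

// Toy cost model (an assumption, not profiled data): every GPU dispatch
// pays a fixed latency once, plus a compute cost for each chunk in it.
struct DispatchCost
{
    double latencyUs;   // fixed cost per dispatch, e.g. the ~450 us above
    double perChunkUs;  // hypothetical per-chunk compute time on the GPU
};

// Average time per chunk when batchSize chunks share one dispatch.
double AvgUsPerChunk(const DispatchCost & cost, std::size_t batchSize)
{
    assert(batchSize > 0);
    return cost.latencyUs / static_cast<double>(batchSize) + cost.perChunkUs;
}

// Smallest batch size whose average per-chunk time beats a CPU generator
// that takes cpuUs per chunk. Returns 0 if the GPU can never win.
std::size_t BreakEvenBatch(const DispatchCost & cost, double cpuUs)
{
    if (cost.perChunkUs >= cpuUs)
    {
        return 0;
    }
    // latency / n + perChunk < cpu  <=>  n > latency / (cpu - perChunk)
    const double n = cost.latencyUs / (cpuUs - cost.perChunkUs);
    return static_cast<std::size_t>(n) + 1;
}
```

With made-up values of 50 us of GPU compute per chunk and 200 us per chunk on the CPU, BreakEvenBatch({450.0, 50.0}, 200.0) gives 4: in this model the GPU only wins once four or more chunks share a dispatch. Making the generator asynchronous attacks the same fixed cost from the other side, by overlapping it with other work instead of amortising it.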
Can't build your branch with the NVIDIA CUDA SDK, because it doesn't include "cl.hpp". :(
Not even the CUDA SDK 3.1 has it (with OpenCL 1.1 support, I think).
Did you set the OPENCL_INCLUDE_DIR and OPENCL_LIBRARY_PATH vars? It may be that CMake can't find CUDA.
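If it helps narrow down the cl.hpp problem, a small probe like the one below can tell you which OpenCL headers the compiler actually sees on its include path. This is only a sketch: the function name is hypothetical, it relies on `__has_include` (standard since C++17, available earlier as an extension in GCC and Clang), and the header names are the ones Khronos has used for its C++ bindings over time. Which of them a given SDK bundles varies, so treat the result as a hint. Compile it with the same include directory you pass as OPENCL_INCLUDE_DIR.

```cpp
#include <cassert>
#include <cstdio>

// Reports which OpenCL header the compiler can find on its include path.
// Checks the C++ binding headers first, then the plain C API header.
const char * FindOpenCLHeader()
{
#if defined(__has_include)
#  if __has_include(<CL/opencl.hpp>)
    return "CL/opencl.hpp";
#  elif __has_include(<CL/cl2.hpp>)
    return "CL/cl2.hpp";
#  elif __has_include(<CL/cl.hpp>)
    return "CL/cl.hpp";
#  elif __has_include(<CL/cl.h>)
    return "CL/cl.h (C API only, no C++ bindings found)";
#  else
    return "none";
#  endif
#else
    return "unknown (compiler lacks __has_include)";
#endif
}

// Prints the result so it can be compared against the CMake variables.
void PrintOpenCLHeaderReport()
{
    std::printf("OpenCL header found: %s\n", FindOpenCLHeader());
}
```

If this reports only CL/cl.h, the SDK ships the C API without the C++ wrapper, and the wrapper header would have to come from somewhere else, such as the Khronos headers.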

No, but if part of the pipeline is on the CPU and part on the GPU, it will cause latencies, so I need to transfer most parts of the pipeline.