More to the point, you didn’t actually mention the ATi card you tested against. You might want to retest that code because I’d imagine the GFLOPS between your Macbook Pro and the 8600GT will differ wildly as well. It would take quite time to catch on, but I think it might hit the sweet spot. Put the management decisions of what work need to be done in the high-level language (it would be easier to organize such tasks there), and then write directly the low-level workers in OpenCL – most likely all OpenCL implementations would always have the compiler loaded so you can even change on the fly, instead of recompiling – much like GLSL in OpenGL. the garbage collector should not move it) and accessing it later. From that perspective, you can forget all your worries about using C++ as dominantly performance language (through boost, and other template libraries), and use your favourite high-level language (javascript, C#, java, lisp, rub, python, perl, etc.) as long as it has some form of freezing foreign array data (e.g. The beauty of it, is that it offers a more restricted “C”-based language, where you can still program normally (not an assembler), but it would still run efficiently. And OpenCL would be there for the PS3 SPE’s… You can head over to the GeekBench website, and download the latest release for Linux. GeekBench automatically puts your system through a battery of tests and produces a complete set of results as well as an overall score. VNC is too slow for solution (maybe only HP RGS). GeekBench is another complete test suite that’s available for Linux. There are lots of servers, where graphics cards are not present, also if you have Remote-Desktop’d into such machine, the graphics driver is replaced and you can’t use CUDA (won’t comment on OpenCL). That to be said, CPU based OpenCL is exactly what we need. GeForce 8600M GT (PCI-Express x16 width) 256MBīootcampe’d Windows XP 32 bit Service Pack 3 using 190.38 nvidia drivers (bit modified to install on the 8600M – thanks to laptop2go) Intel Core 2 Duo 2.6 Ghz, 1 Processor, 2 cores, 4MB L2 Cache, 4GB Memory, Bus Speed 800Mhz 28GFLOPS CUDA (can’t comment on NVIDIA OpenCL). I can’t comment on NVIDIA OpenCL benchmarks as I’m under NDA, but compared to the same CUDA sample it was 1.3GFLOPS for AMD/OpenCL (CPU) vs. I’ve “ported” the nvidia nbody sample from their OpenCL package, to the AMD cpu based one. Besides HPC Challenge in the Phoronix Test Suite, a test profile I did this week in an overall OpenCL push not associated with any specific customer needs (its my birthday today and I just enjoy working on Linux benchmarks and playing with hardware, so heres a present to you all, but theres also more coming later today.
0 Comments
Leave a Reply. |