GPU computing Stay up to date in OpenCL, DirectCompute, CUDA, CAL and OpenGL information

  • Subscribe to our RSS feed.
  • Twitter
  • StumbleUpon
  • Reddit
  • Facebook
  • Digg

Friday, 6 November 2009

Naive OpenCL benchmarks..

Posted on 23:23 by Unknown
Used for selecting GPU devices for ATI
export DISPLAY=:0.0
export DISPLAY=:0.1
For switching Nvidia and ATI see previous blog ..
For fixing MersenneTwister on Nvidia see my blog

Timing:Assure to use
clEnqueueNDRangeKernel(queue, kernel, 1, NULL, global_work_size, NULL, 0, NULL, &event);
clWaitForEvents(1, &event);
    long long profiling_start,profiling_end;
    clGetEventProfilingInfo(event, CL_PROFILING_COMMAND_START, sizeof(long long), &profiling_start, NULL);
    clGetEventProfilingInfo(event, CL_PROFILING_COMMAND_END, sizeof(long long), &profiling_end, NULL);

My tests
=====
core i7 920 gtx 275 ati 5850 ati 4850
linux
./MersenneTwister -q -t --device cpu -x 1000000
cpu
2000000                  0.259                    7.72201e+06
w7
2000000                  0.328564                 6.08709e+006  
gpu
2000000                  1.582                    1.26422e+06 
w7
2000000                  0.66877                  2.99057e+006    
r770
2000000                  2.911                    687049
gt200 (190)
2000000                  1.371                    1.43962e+06
195
2000000                  0.675                    2.96296e+06
195 w7
2000000             0.733789                 2.72558e+006
./MatrixMultiplication --device gpu -x 2048 -y 2048 -z 2048 -t -q

cpu
44sec
cpu w7
2048x2048                2048x2048                56.4574
gpu
2048x2048                2048x2048                1.34713 
w7
2048x2048                2048x2048                0.960548
rv770
2048x2048                2048x2048                3.80943   
gt200 (190)
2048x2048                2048x2048                1.77975
195
2048x2048                2048x2048                0.641499
195 w7
2048x2048                2048x2048                0.60038
w7 oc
2048x2048                2048x2048                0.55

Updated for 195 drivers

ATI FORUMS
========

2.66GHz core 2 Duo running Ubuntu 9.04, and I have a Radeon 4350 GPU.  I'v
ati.txt
$ ./MatrixMultiplication --device gpu -x 2048 -y 2048 -z 2048 -t -q
MatrixA                  MatrixB                  Time(sec)               
2048x2048                2048x2048                95.0832                 

$ ./MatrixMultiplication --device cpu -x 2048 -y 2048 -z 2048 -t -q
MatrixA                  MatrixB                  Time(sec)               
2048x2048                2048x2048                81.8752                 

$ ./MersenneTwister -q -t --device cpu -x 1000000
Generated Numbers        Time(sec)                Numbers/sec             
2000000                  0.506                    3.95257e+06             

$ ./MersenneTwister -q -t --device gpu -x 1000000
Generated Numbers        Time(sec)                Numbers/sec             
2000000                  4.133                    483910  

Phenom X4 9650 and Radeon 5770 on vista 32bit SP2.
MatrixMultiplication.exe --device cpu -x 2048 -y 2048 -z 2048 -t -q
MatrixA             MatrixB            Time          KernelTime
2048x2048      2048x2048        140.612        139.926

MatrixMultiplication.exe --device gpu -x 2048 -y 2048 -z 2048 -t -q
MatrixA             MatrixB           Time          KernelTime
2048x2048      2048x2048         3.3639         0.826539

MersenneTwister.exe -q -t --device cpu -x 1000000
Generated Numbers     Time            kernelTime    Numbers/sec
2000000                      0.976359     0.1577           2.04843e+006

MersenneTwister.exe -q -t --device gpu -x 1000000
Generated Numbers     Time             kernelTime   Numbers/sec
2000000                      1.89666        0.0705628    1.05449e+006
Email ThisBlogThis!Share to XShare to FacebookShare to Pinterest
Posted in | No comments
Newer Post Older Post Home

0 comments:

Post a Comment

Subscribe to: Post Comments (Atom)

Popular Posts

  • Porting CUDA to OpenCL!
    Well so you want to port CUDA code to OpenCL: you are in AMD GPU competition of porting Cuda codes to opencl (see previous post) or you are ...
  • Megapost!
    Today fools{ *GTX 485 is 512 cores 3gbytes gddr5 and 850/1750 shaders.. *ati 5990 has 4 gpus in board.. *bulldozer benchmarks }end fools.. A...
  • About ATI and Nvidia drivers (OCL included)!
    Hi I have been investigating AMD and Nvidia drivers.. for 10.3 there are 3d hooks support for 120hz monitors but is d3d9 d3d10 or d3d11 enab...
  • things found in CUDA forums
    Also some CUDA news: Mandelbulb stereo angalyph -> have to port to 3D Vision http://forums.nvidia.com/index.php?showtopic=150985&st=2...
  • opencl/opengl linux interop! seen in opencl cuda 3.0 sdk samples
    Following my OpenCL/OpenGL Window interop work: now has come to Linux  for Nvidia GPU computing registered developers via 195.17 driver! Als...
  • State of the blog..
    Sorry for the delay guys of posting code of Apple OpenCL demos port.. the blog has been with no updated for more than 2 weeks in this rapid ...
  • Optix and OpenCL SDKs with Visual Studio 2010
    Optix 1.0 ========= install cg download Cmake 2.80 cmake says error dumpbin not found and it is cuda doesn't work with vc2010 so copy pt...
  • CUDA 3.0 forums stuff!
    1.Getting CUBIN instead of ELF If you need the older text format, you can disable ELF cubins in nvcc.profile by changing "CUBINS_ARE_EL...
  • News from the web!
    Some things learned in AMD forums: 1.Why 3xxx no OpenCL: Compute shader mode is a hardware feature that did not exist in the HD38XX line of ...
  • Shaders: measuring perf, source translation and parsing different languages!
    Hi, I hope to be pretty exhaustive of options for parsing and translating between graphics and compute shaders ( some open source) For DX sh...

Blog Archive

  • ►  2013 (5)
    • ►  September (1)
    • ►  March (3)
    • ►  February (1)
  • ►  2012 (1)
    • ►  December (1)
  • ►  2010 (46)
    • ►  July (4)
    • ►  May (1)
    • ►  April (3)
    • ►  March (9)
    • ►  February (15)
    • ►  January (14)
  • ▼  2009 (125)
    • ►  December (51)
    • ▼  November (53)
      • Two big games coming today: State of the art Direc...
      • News from the web (IV) (big compilation)
      • Wishes in GPU drivers before Q2 2009!
      • CUDA Atomics perf!
      • GPU Compute benchmark results!
      • Interesting AMD Stream forums posts! (old posts)
      • Testing my apps with 8600GTS and WinXP!
      • A lot of Catalyst AMD drivers!
      • News from the web III
      • News from the web II (big compilation)
      • News from OpenCL forums!
      • Bugs in OpenGL AMD drivers: Geometry shader and te...
      • Testing LDS perf in OpenCL!
      • OpenCL bugs!
      • Benchmarking OpenCL and DirectCompute!
      • Benchmarking stientific kernels on OpenCL!
      • News from the web!
      • OpenCL learning and tutorials!
      • Porting CUDA to OpenCL!
      • GPU computing programming contests..
      • AMD 5xxx series overclocking..
      • OpenCL on Apple: update!
      • State of the blog..
      • Places where OpenCL shines!
      • Running Optix with Geforce in Linux
      • New exciting soft and info coming this year!
      • Matmul bench for CUDA, CAL, and MultiCore CPUs!
      • More than 10 places where DX Compute 5.0 is better...
      • CUDA 3.0 has CUBLAS functions for MAGMA with compl...
      • About IBM OpenCL
      • OpenGL interop perf in CUDA and OCL in Linux
      • Fraps like for Linux and for Windows DX11!
      • opencl/opengl linux interop! seen in opencl cuda 3...
      • AMD OpenCl forums (I)
      • About CUDA 3.0 (II)
      • About CUDA 3.0 (I)
      • CAL 2.0 vs 1.4 API
      • Naive OpenCL benchmarks..
      • Managing AMD OpenCL GPU devices and OpenCL backend...
      • About Xvba VAAPI backend..
      • CUDA 3.0 released
      • About Khronos ICD model..
      • Exploring Nvidia OpenCL 195.39 drivers:Bugs , perf...
      • Nvidia OpenCL samples with AMD OpenCL drivers!
      • Nvidia OpenCL samples on Nvidia 195 OpenCL drivers!!
      • AMD OpenCL samples on Nvidia 195 OpenCL drivers!!
      • Optix and OpenCL SDKs with Visual Studio 2010
      • OpenCL on AMD GPUs!
      • Dreaming about Ubuntu 10.04
      • News from the web!
      • OpenCL-z is here!
      • Port of Apple demos to Windows..
      • Shared memory names..
    • ►  October (21)
Powered by Blogger.

About Me

Unknown
View my complete profile