GPU computing Stay up to date in OpenCL, DirectCompute, CUDA, CAL and OpenGL information

  • Subscribe to our RSS feed.
  • Twitter
  • StumbleUpon
  • Reddit
  • Facebook
  • Digg

Sunday, 8 November 2009

Matmul bench for CUDA, CAL, and MultiCore CPUs!

Posted on 07:23 by Unknown
Test Nvidia, AMD GPUS and CPUs (CUDA, CAL, CPUs)

Key points of benchmark:

*Uses very high performance codes:
*Win x32,x64! (big win in x64 in multicore CPUs)
*Single precision and double precision!
*Test any size!
Download!

Also Works on Wine! (at least it worked with the 32 binaries with Wine Wrappers around CUDA and CAL I did a year ago but which I think I sadly lost! anyway where based on ideas around CUDA Wine wrapper for Folding@home)
There were issues in CPU using only one thread altough I forced using it via OpenMP env variables and Intel MKL env variables..

Also I tested x64 with a CAL Wine64 wrapper.. but crashed..


Uses very high performance codes:

*For Nvidia GPus uses CUBLAS fast matmul (volkovs code)
*CAL matmul uses AMD IL from AMD SDK
(anyway better is known see beyon3d 1tflop matmul)
*Uses Intel MKL libs (hopefully also using new dll's you get AVX acceleration (at least I hope so..))..

Also assuming Larrabe executes normal executables and has a intel MKL library with binary compatibilty then
also that..


I done it past year!
for testing a 8800GT and 4850!
now with Windows 7 you can use to test simultaneuos perf on ATI and Nvidia and CPUs!

Still working only need to use HxD to change references from amdcal to atical DLL's
 There I can remove the dll's as a part of AMD driver and I get also new GPU support automatically..
tested on ATI 5850..

 Source code: sorry guys but I would have to search very hard for finding it! it's old cde..
Results:
Core i7 920
ATI 5850
GTX 275

Build info: X86 Release
Build date: Jul 31 2008 20:45:28
Machine arch: little endian.
Number of CPU cores: 8
Timer used: MULTIMEDIA TIMER
Freq. of TIMER: : 2.63554 Mhz.
CPU: 2716.93 Mhz.

Size: 4096

CAL tests:
=========

Float
Tam: 4096 Time running: 0.369012 s. Gflops: 744.902950 Gflops.
Testing CAL matmul  double
Double
Tam: 2992 Time running: 0.483039 s. Gflops: 221.800549 Gflops.

CUDA tests:
==========

There is 1 device supporting CUDA
GPU 1: "GeForce GTX 275" SM:1.3
 MP: 30 Cores: 240 Freq:1.40 GHz
Mem: 896Mbytes BW (Pinned) H2D: 2.65GB/s D2H:2.81GB/s D2D:55.3

CUDA floating code
Time running: 0.432000 s. Gflops: 318.106889 Gflops.
CUDA no IO floating code
Time running: 0.339000 s. Gflops: 405.375151 Gflops.
CUDA double code
Time running: 1.844000 s. Gflops: 74.523957 Gflops.
CUDA no IO floating code
Time running: 1.682000 s. Gflops: 81.701651 Gflops.

CPU tests:
==========

Intel MKL code
Time running: 2.573000 s. Gflops: 53.409318 Gflops.
Double
Time running: 5.825000 s. Gflops: 23.591790 Gflops.
OK.

Email ThisBlogThis!Share to XShare to FacebookShare to Pinterest
Posted in | No comments
Newer Post Older Post Home

0 comments:

Post a Comment

Subscribe to: Post Comments (Atom)

Popular Posts

  • Porting CUDA to OpenCL!
    Well so you want to port CUDA code to OpenCL: you are in AMD GPU competition of porting Cuda codes to opencl (see previous post) or you are ...
  • Megapost!
    Today fools{ *GTX 485 is 512 cores 3gbytes gddr5 and 850/1750 shaders.. *ati 5990 has 4 gpus in board.. *bulldozer benchmarks }end fools.. A...
  • About ATI and Nvidia drivers (OCL included)!
    Hi I have been investigating AMD and Nvidia drivers.. for 10.3 there are 3d hooks support for 120hz monitors but is d3d9 d3d10 or d3d11 enab...
  • things found in CUDA forums
    Also some CUDA news: Mandelbulb stereo angalyph -> have to port to 3D Vision http://forums.nvidia.com/index.php?showtopic=150985&st=2...
  • opencl/opengl linux interop! seen in opencl cuda 3.0 sdk samples
    Following my OpenCL/OpenGL Window interop work: now has come to Linux  for Nvidia GPU computing registered developers via 195.17 driver! Als...
  • State of the blog..
    Sorry for the delay guys of posting code of Apple OpenCL demos port.. the blog has been with no updated for more than 2 weeks in this rapid ...
  • Optix and OpenCL SDKs with Visual Studio 2010
    Optix 1.0 ========= install cg download Cmake 2.80 cmake says error dumpbin not found and it is cuda doesn't work with vc2010 so copy pt...
  • CUDA 3.0 forums stuff!
    1.Getting CUBIN instead of ELF If you need the older text format, you can disable ELF cubins in nvcc.profile by changing "CUBINS_ARE_EL...
  • News from the web!
    Some things learned in AMD forums: 1.Why 3xxx no OpenCL: Compute shader mode is a hardware feature that did not exist in the HD38XX line of ...
  • Shaders: measuring perf, source translation and parsing different languages!
    Hi, I hope to be pretty exhaustive of options for parsing and translating between graphics and compute shaders ( some open source) For DX sh...

Blog Archive

  • ►  2013 (5)
    • ►  September (1)
    • ►  March (3)
    • ►  February (1)
  • ►  2012 (1)
    • ►  December (1)
  • ►  2010 (46)
    • ►  July (4)
    • ►  May (1)
    • ►  April (3)
    • ►  March (9)
    • ►  February (15)
    • ►  January (14)
  • ▼  2009 (125)
    • ►  December (51)
    • ▼  November (53)
      • Two big games coming today: State of the art Direc...
      • News from the web (IV) (big compilation)
      • Wishes in GPU drivers before Q2 2009!
      • CUDA Atomics perf!
      • GPU Compute benchmark results!
      • Interesting AMD Stream forums posts! (old posts)
      • Testing my apps with 8600GTS and WinXP!
      • A lot of Catalyst AMD drivers!
      • News from the web III
      • News from the web II (big compilation)
      • News from OpenCL forums!
      • Bugs in OpenGL AMD drivers: Geometry shader and te...
      • Testing LDS perf in OpenCL!
      • OpenCL bugs!
      • Benchmarking OpenCL and DirectCompute!
      • Benchmarking stientific kernels on OpenCL!
      • News from the web!
      • OpenCL learning and tutorials!
      • Porting CUDA to OpenCL!
      • GPU computing programming contests..
      • AMD 5xxx series overclocking..
      • OpenCL on Apple: update!
      • State of the blog..
      • Places where OpenCL shines!
      • Running Optix with Geforce in Linux
      • New exciting soft and info coming this year!
      • Matmul bench for CUDA, CAL, and MultiCore CPUs!
      • More than 10 places where DX Compute 5.0 is better...
      • CUDA 3.0 has CUBLAS functions for MAGMA with compl...
      • About IBM OpenCL
      • OpenGL interop perf in CUDA and OCL in Linux
      • Fraps like for Linux and for Windows DX11!
      • opencl/opengl linux interop! seen in opencl cuda 3...
      • AMD OpenCl forums (I)
      • About CUDA 3.0 (II)
      • About CUDA 3.0 (I)
      • CAL 2.0 vs 1.4 API
      • Naive OpenCL benchmarks..
      • Managing AMD OpenCL GPU devices and OpenCL backend...
      • About Xvba VAAPI backend..
      • CUDA 3.0 released
      • About Khronos ICD model..
      • Exploring Nvidia OpenCL 195.39 drivers:Bugs , perf...
      • Nvidia OpenCL samples with AMD OpenCL drivers!
      • Nvidia OpenCL samples on Nvidia 195 OpenCL drivers!!
      • AMD OpenCL samples on Nvidia 195 OpenCL drivers!!
      • Optix and OpenCL SDKs with Visual Studio 2010
      • OpenCL on AMD GPUs!
      • Dreaming about Ubuntu 10.04
      • News from the web!
      • OpenCL-z is here!
      • Port of Apple demos to Windows..
      • Shared memory names..
    • ►  October (21)
Powered by Blogger.

About Me

Unknown
View my complete profile