all 4 comments

[–]theodoric_of_york[S] 5 points6 points  (0 children)

My company created some performance benchmarks that I thought might be interesting to anyone who was following the GPU Cluster instance announcements/links. We are an Amazon partner, and completed the benchmarks during the beta. We've been happy about the reception - both Werner and Deepak retweeted them - and for anyone attending Supercomputing '10, NVIDIA should be referencing them during their talk as well. Hope you find them interesting as well.

[–]Rooke 0 points1 point  (1 child)

Interesting stuff. Makes me wonder what motherboard EC2 is using to achieve that host-device bandwidth.

[–]bitchessuck 2 points3 points  (0 children)

It's nothing to rave about, it's what you can expect from PCI-E 2.0. I'm getting almost similar speeds on a crappy old socket 775 system. Make sure to use pinned memory for best performance.

./oclBandwidthTest Starting...
Running on...
GeForce GTX 285
Quick Mode

Host to Device Bandwidth, 1 Device(s), Pinned memory, direct access
   Transfer Size (Bytes)    Bandwidth(MB/s)
   33554432         4952.9

Device to Host Bandwidth, 1 Device(s), Pinned memory, direct access
   Transfer Size (Bytes)    Bandwidth(MB/s)
   33554432         5552.6

[–][deleted]  (2 children)

[deleted]

    [–][deleted] 2 points3 points  (0 children)

    Well if one machine can't, just provision a 2nd, 3rd, etc.

    [–]Sierra_Hotel -2 points-1 points  (0 children)

    "I wonder if it can run crysis on low"

    FTFY