
One thing that Nvidia seems to be doing right these days is embracing OpenCL, with the Santa Clara company now bragging with being the first to provide public OpenCL GPU drivers for both Windows and Linux.
In addition to the OpenCL v1.0-compliant drivers (which are available
here), Nvidia has also let loose the OpenCL Visual Profiler, a tool which offers developers with insight into performance bottlenecks and opportunities for optimization. The OCL VP has support for all CUDA-ready GPUs and features:
- Profiling of actual hardware signals, kernel efficiency, and instruction issue rate
- Timing of memory copies between system memory and GPU dedicated memory
- Customizable graphs to help developers focus in on problem areas
- Basic auto-analysis to reveal warp serialization problems
- Easy import/export to CSV for custom analysis
Last but not least, Nvidia has also compiled a OpenCL Best Practices Guide to help OpenCL developers programming for the CUDA architecture implement high performance parallel algorithms and understand best practices for GPU Computing.
For more info see
Nvidia's CUDA Zone.