A C language development environment for CUDA-enabled GPUs

CUDA Ranking & Summary


  • Rating:
  • License: Freeware
  • Price: FREE
  • Publisher Name: NVIDIA Corporation
  • Publisher web site:
  • Operating Systems: Mac OS X 10.5.2 or later
  • File Size: 33.8 MB


CUDA Description

The CUDA Toolkit is a C language development environment for CUDA-enabled GPUs. In just a few years, the programmable graphics processing unit (GPU) has developed into a computing workhorse. With multiple cores driven by very high memory bandwidth, today's GPUs offer incredible resources for both graphics and non-graphics processing. The main reason behind this evolution is that the GPU is specialized for compute-intensive, highly parallel computation, which is exactly what graphics rendering is about. It is therefore designed so that more transistors are devoted to data processing than to data caching and flow control.

The CUDA development environment includes:
· nvcc C compiler
· CUDA FFT and BLAS libraries for the GPU
· Profiler
· gdb debugger for the GPU (alpha available in March, 2008)
· CUDA runtime driver (now also available in the standard NVIDIA GPU driver)
· CUDA programming manual

The CUDA Developer SDK provides examples with source code to help you get started with CUDA. Examples include:
· Parallel bitonic sort
· Matrix multiplication
· Matrix transpose
· Performance profiling using timers
· Parallel prefix sum (scan) of large arrays
· Image convolution
· 1D DWT using Haar wavelet
· OpenGL and Direct3D graphics interoperation examples
· CUDA BLAS and FFT library usage examples
· CPU-GPU C- and C++-code integration
· Binomial Option Pricing
· Black-Scholes Option Pricing
· Monte-Carlo Option Pricing
· Parallel Mersenne Twister (random number generation)
· Parallel Histogram
· Image Denoising
· Sobel Edge Detection Filter
· MathWorks MATLAB Plug-in

Here are some key features of "CUDA":
· Standard C programming language enabled on a GPU.
· Unified hardware and software solution for parallel computing on CUDA-enabled NVIDIA GPUs.
· CUDA-compatible GPUs range from lower-power notebook GPUs to high-performance, multi-GPU systems.
· CUDA-enabled GPUs support the Parallel Data Cache and Thread Execution Manager.
· Standard numerical libraries for FFT (Fast Fourier Transform) and BLAS (Basic Linear Algebra Subprograms).
· Dedicated CUDA driver for computing.
· Optimized direct upload and download path from the CPU to the CUDA-enabled GPU.
· CUDA driver interoperates with OpenGL and DirectX graphics drivers.
· Support for Linux 32/64-bit and Windows XP 32/64-bit operating systems.
· Direct driver and assembly-level access through CUDA for research and language development.

What's New in This Release:

New Features:
· CUFFT features: performance enhancements and double precision.
· CUFFT now supports double-precision transforms, with types and functions analogous to the existing single-precision versions. Similarly, the "cufftType" enumeration (used in calls like cufftPlan1d) has expanded to include double-precision identifiers.
· The double-precision versions are invoked in the same manner as the single-precision ones, with the arguments changed from the single-precision to the double-precision types. See "cufft.h" for the exact definitions.
· Separate Packaging: the CUDA Driver and the CUDA Toolkit are now available as separate packages.
· Double Handling by the Compiler: when a ptx file with an sm version prior to sm_13 contains double-precision instructions, ptxas now emits a warning that double-precision instructions are demoted to single precision. ptxas has a new option, --suppress-double-demote-warning, to suppress this warning.

Major Bug Fixes:
· C++ Support for Device Emulation: support is restored for using C++ code in device emulation mode.
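The double-precision CUFFT interface mentioned in the release notes can be sketched roughly as follows. This is a minimal illustration, not code from the SDK. It assumes the double-precision names declared in "cufft.h" (the cufftDoubleComplex type, the CUFFT_Z2Z plan type, and cufftExecZ2Z) and a GPU that supports double precision (sm_13 or later). The helper z2z_roundtrip_error and its test signal are hypothetical, chosen only for this example.

```cuda
#include <math.h>
#include <stdlib.h>
#include <cuda_runtime.h>
#include <cufft.h>

/* Hypothetical helper (not part of CUFFT): runs a forward and an inverse
 * double-precision complex-to-complex transform on n points and returns the
 * largest absolute round-trip error, or -1.0 if any step fails. */
double z2z_roundtrip_error(int n)
{
    size_t bytes = sizeof(cufftDoubleComplex) * (size_t)n;
    cufftDoubleComplex *host = (cufftDoubleComplex *)malloc(bytes);
    cufftDoubleComplex *dev = NULL;
    cufftHandle plan;
    double max_err = -1.0;
    int i;

    if (host == NULL) return -1.0;

    /* Illustrative input signal: a real-valued sine wave. */
    for (i = 0; i < n; ++i) {
        host[i].x = sin(0.1 * i);
        host[i].y = 0.0;
    }

    if (cudaMalloc((void **)&dev, bytes) != cudaSuccess) {
        free(host);
        return -1.0;
    }

    /* Same call as the single-precision case, but with the double-precision
     * plan type CUFFT_Z2Z in place of CUFFT_C2C. */
    if (cufftPlan1d(&plan, n, CUFFT_Z2Z, 1) == CUFFT_SUCCESS) {
        if (cudaMemcpy(dev, host, bytes, cudaMemcpyHostToDevice) == cudaSuccess &&
            cufftExecZ2Z(plan, dev, dev, CUFFT_FORWARD) == CUFFT_SUCCESS &&
            cufftExecZ2Z(plan, dev, dev, CUFFT_INVERSE) == CUFFT_SUCCESS &&
            cudaMemcpy(host, dev, bytes, cudaMemcpyDeviceToHost) == cudaSuccess) {
            max_err = 0.0;
            for (i = 0; i < n; ++i) {
                /* CUFFT transforms are unnormalized, so divide by n. */
                double re = fabs(host[i].x / n - sin(0.1 * i));
                double im = fabs(host[i].y / n);
                double err = re > im ? re : im;
                if (err > max_err) max_err = err;
            }
        }
        cufftDestroy(plan);
    }

    cudaFree(dev);
    free(host);
    return max_err;
}
```

Calling z2z_roundtrip_error(256) from a small main function and building with nvcc, linking against the CUFFT library (for example with -lcufft), should return an error close to double-precision rounding noise. A negative return value means an allocation, copy, plan, or transform step failed.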

CUDA Related Software