The evolution of GPUs (graphics processing units) has been enormous in the past few years. Their calculation power has improved exponentially, while the range of the tasks computable on GPUs has got significantly wider. The milestone of GPU development of the recent years is the appearance of the unified architecture-based devices. These GPUs implement a massively parallel design, which led them be capable not only of processing the common computer graphics tasks, but qualifies them for performing highly parallel mathematical algorithms effectively. Recognizing this availability GPU providers have issued developer platforms, which let the programmers manage computations on the GPU as a data-parallel computing device without the need of mapping them to a graphics API. Researchers salute this initiative, and the application of the new technology is quickly spreading in various branches of science
NVIDIA CUDA Compute unified device architecture programming guide , NVIDIA Corporation, Version 1.1, 2007.
Triolet D. NVIDIA CUDA, preview, 2007, http://www.behardware.com/art/lire/659
NVIDIA Corporation, GeForce 8800 & NVIDIA CUDA, A new architecture for computing on the GPU, 2006, http://developer.nvidia.com/CUDA
NVIDIA Corporation, Technical Brief, NVIDIA GeForce 8800 GPU, Architecture Overview , 2006
Triolet D. NVIDIA CUDA, practical uses, 2007, http://www.behardware.com/art/lire/678
Michalakes J., Vachharajani M. GPU Acceleration of numerical weather prediction, submitted to 2008 Workshop on Large-Scale Parallel Processing (LSPP), April 2008. http://www.mmm.ucar.edu/wrf/WG2/michalakes_lspp.pdf
Lietsch S., Marquardt O. A CUDA-supported approach to remote rendering, Proc. of the 3rd International Symposium on Visual Computing (ISVC2007), 2007, pp. 724–733.
Liu W., Schmidt B., Voss G., Müller-Wittig W. Molecular dynamics simulations on commodity GPUs with CUDA, Lecture Notes in Computer Science , Vol. 4873/2007, pp. 185–196.
Manavski S. A. CUDA compatible GPU as an efficient hardware accelerator for AES cryptography, Proc. of IEEE International Conference on Signal Processing and Communication , ICSPC 2007, pp. 65–68.
Manavski S. A., Mariano A., Valle G. CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment, Italian Bioinformatics Society (BITS) Meeting, 2007, http://www.manavski.com/downloads/SWcuda01-11-pics.pdf
Gumerov N. A., Duraiswami R. Fast multipole methods on graphics processors, numerical analysis seminar, 2007, UMD, Maryland, USA, https://wikio.nrao.edu/pub/HPC/AstroGPU07NotesShelton/AstroGPU07.pdf
Gumerov N. A. , '', in Duraiswami R. Fast multipole methods on graphics processors, numerical analysis seminar , (2007 ) -.
Schive H., Chien C., Wong S., Tsai Y., Chiueh T. Graphic-card cluster for astrophysics (GraCCA), Performance tests, (Accepted for publication in New Astronomy, 2008), http://arxiv.org/abs/0707.2991