Programming Massively Parallel Processors,
Edition 4 A Hands-on ApproachEditors: By Wen-mei W. Hwu, David B. Kirk and Izzat El Hajj, Ph.D.
Conformance
-
PDF/UA-1
-
The publication was certified on 20250728
-
For queries regarding accessibility information, contact [email protected]
Ways Of Reading
-
This e-publication is accessible to the full extent that the file format and types of content allow, on a specific reading device, by default, without necessarily including any additions such as textual descriptions of images or enhanced navigation.
Navigation
-
The contents of the PDF have been tagged to permit access by assistive technologies as per PDF-UA-1 standard.
-
Page breaks included from the original print source
Additional Accessibility Information
-
The language of the text has been specified (e.g., via the HTML or XML lang attribute) to optimise text-to-speech (and other alternative renderings), both at the whole document level and, where appropriate, for individual words, phrases or passages in a different language.
Note
-
This product relies on 3rd party tooling which may impact the accessibility features visible in inspection copies. All accessibility features mentioned would be present in the purchased version of the title.
Programming Massively Parallel Processors: A Hands-on Approach shows both students and professionals alike the basic concepts of parallel programming and GPU architecture. Concise, intuitive, and practical, it is based on years of road-testing in the authors' own parallel computing courses. Various techniques for constructing and optimizing parallel programs are explored in detail, while case studies demonstrate the development process, which begins with computational thinking and ends with effective and efficient parallel programs. The new edition includes updated coverage of CUDA, including the newer libraries such as CuDNN. New chapters on frequently used parallel patterns have been added, and case studies have been updated to reflect current industry practices.
Key Features
- Parallel Patterns Introduces new chapters on frequently used parallel patterns (stencil, reduction, sorting) and major improvements to previous chapters (convolution, histogram, sparse matrices, graph traversal, deep learning)
- Ampere Includes a new chapter focused on GPU architecture and draws examples from recent architecture generations, including Ampere
- Systematic Approach Incorporates major improvements to abstract discussions of problem decomposition strategies and performance considerations, with a new optimization checklist
About the author
By Wen-mei W. Hwu, CTO, MulticoreWare and professor specializing in compiler design, computer architecture, microarchitecture, and parallel processing, University of Illinois at Urbana-Champaign, USA; David B. Kirk, NVIDIA Fellow and Izzat El Hajj, Ph.D., Assistant Professor, Department of Computer Science, American University of Beirut, Lebanon
1. Introduction
Part I Fundamental Concepts
2. Heterogeneous data parallel computing
3. Multidimensional grids and data
4. Compute architecture and scheduling
5. Memory architecture and data locality
6. Performance considerations
Part II Parallel Patterns
7. Convolution: An introduction to constant memory and caching
8. Stencil
9. Parallel histogram
10. Reduction And minimizing divergence
11. Prefix sum (scan)
12. Merge: An introduction to dynamic input data identification
Part III Advanced patterns and applications
13. Sorting
14. Sparse matrix computation
15. Graph traversal
16 Deep learning
17. Iterative magnetic resonance imaging reconstruction
18. Electrostatic potential map
19. Parallel programming and computational thinking
Part IV Advanced Practices
20. Programming a heterogeneous computing cluster: An introduction to CUDA streams
21. CUDA dynamic parallelism
22. Advanced practices and future evolution
23. Conclusion and outlook
Appendix A: Numerical considerations
Reviews
The book is an added value to the field. Thanks
The book is a great resource. My only feedback is that there are some errors in mathematical formulas throughout the book.
Es una referencia en la programación de GPU