
Heterogeneous Parallel Programming

  • Indexed: 2018-03-23 13:32:57
  • File size: 910MB
  • Downloads: 114
  • Last downloaded: 2021-01-05 21:55:41

File list

  1. 4 - 7 - 4.7- Parallel Computation Patterns - More on Parallel Scan.mp4 40MB
  2. 4 - 5 - 4.5- Parallel Computation Patterns - A Work-Inefficient Scan Kernel.mp4 38MB
  3. 4 - 6 - 4.6- Parallel Computation Patterns - A Work-Efficient Parallel Scan Kernel.mp4 38MB
  4. 2 - 6 - 2.6- Tiled Matrix Multiplication Kernel.mp4 36MB
  5. 3 - 6 - 3.6- Parallel Computation Patterns - Data Reuse in Tiled Convolution.mp4 32MB
  6. 4 - 1 - 4.1- Parallel Computation Patterns - Reduction.mp4 31MB
  7. 5 - 3 - 5.3- Parallel Computation Patterns - Atomic Operations in CUDA.mp4 31MB
  8. 3 - 1 - 3.1- Performance Considerations - DRAM Bandwidth.mp4 30MB
  9. 2 - 3 - 2.3- Memory Model and Locality -- CUDA Memories.mp4 29MB
  10. 5 - 4 - 5.4- Parallel Computation Patters - Atomic Operations Performance.mp4 28MB
  11. 1 - 4 - 1.4- Introduction to CUDA, Data Parallelism and Threads.mp4 27MB
  12. 4 - 4 - 4.4- Parallel Computation Patterns - Scan (Prefix Sum).mp4 27MB
  13. 2 - 5 - 2.5- Tiled Matrix Multiplication.mp4 26MB
  14. 1 - 1 - 1.1- Course Overview.mp4 26MB
  15. 2 - 1 - 2.1- Kernel-based Parallel Programming - Thread Scheduling.mp4 26MB
  16. 3 - 5 - 3.5- Parallel Computation Patterns - 2D Tiled Convolution Kernel.mp4 25MB
  17. 4 - 2 - 4.2- Parallel Computation Patterns - A Basic Reduction Kernel.mp4 25MB
  18. 2 - 4 - 2.4- Tiled Parallel Algorithms.mp4 25MB
  19. 3 - 4 - 3.4- Parallel Computation Patterns - Tiled Convolution.mp4 25MB
  20. 1 - 5 - 1.5- Introduction to CUDA, Memory Allocation and Data Movement API.mp4 24MB
  21. 1 - 6 - 1.6- Introduction to CUDA, Kernel-Based SPMD Parallel Programming.mp4 24MB
  22. 5 - 5 - 5.5- Parallel Computation Patterns - A Privatized Histogram Kernel.mp4 23MB
  23. 3 - 2 - 3.2- Performance Considerations - Memory Coalescing in CUDA.mp4 23MB
  24. 5 - 1 - 5.1- Parallel Computation Patterns - Histogramming.mp4 22MB
  25. 5 - 2 - 5.2- Parallel Computation Patterns - Atomic Operations.mp4 22MB
  26. 1 - 8 - 1.8- Kernel-based Parallel Programming, Basic Matrix-Matrix Multiplication.mp4 22MB
  27. 1 - 7 - 1.7- Kernel-based Parallel Programming, Multidimensional Kernel Configuration.mp4 22MB
  28. 2 - 8 - 2.8- A Tiled Kernel for Arbitrary Matrix Dimensions.mp4 21MB
  29. Рекомендуемая литература David B. Kirk, Wen-mei W. Hwu Programming Massively Parallel Processors, Second Edition.pdf 21MB
  30. 3 - 3 - 3.3- Parallel Computation Patterns - Convolution.mp4 21MB
  31. 4 - 3 - 4.3- Parallel Computation Patterns - A Better Reduction Kernel.mp4 20MB
  32. 2 - 2 - 2.2- Control Divergence.mp4 20MB
  33. 2 - 7 - 2.7- Handling Boundary Conditions in Tiling.mp4 18MB
  34. 1 - 2 - 1.2- Introduction to Heterogeneous Parallel Computing.mp4 18MB
  35. 1 - 3 - 1.3- Portability and Scalability in Heterogeneous Parallel Computing.mp4 10MB
  36. hetero-lecture_slides_002-Lecture 1-Lecture-1-5-cuda-API.pdf 893KB
  37. Lecture-5-3-CUDA-atomic.pdf 770KB
  38. hetero-lecture_slides_002-Lecture 1-Lecture-1-4-cuda-intro.pdf 593KB
  39. Lecture-4-7-more-on-scan.pdf 580KB
  40. Lecture-5-5-privatized-histogram.pdf 541KB
  41. Lecture-4-4-scan.pdf 526KB
  42. Lecture-3-2-memory-coalescing.pdf 513KB
  43. Lecture-3-6-convolution-reuse.pdf 506KB
  44. Lecture-5-1-histogram.pdf 502KB
  45. Lecture-3-3-convolution.pdf 500KB
  46. Lecture-3-1-dram-bandwidth.pdf 498KB
  47. Lecture-4-6-work-efficient-scan-kernel.pdf 492KB
  48. hetero-lecture_slides_002-Lecture 1-Lecture-1-6-cuda-kernel.pdf 491KB
  49. Lecture-3-5-2D-convolution-kernel.pdf 477KB
  50. Lecture-3-4-tiled-convolution.pdf 452KB
  51. Lecture-5-4-atomic-performance.pdf 444KB
  52. hetero-lecture_slides_002-Lecture 2-Lecture-2-2-control-divergence.pdf 442KB
  53. Lecture-5-2-atomic-operations.pdf 437KB
  54. hetero-lecture_slides_002-Lecture 2-Lecture-2-1-transparent-scaling.pdf 430KB
  55. Lecture-4-3-better-reduction-kernel.pdf 414KB
  56. Lecture-4-2-reduction-kernel.pdf 363KB
  57. hetero-lecture_slides_002-Lecture 1-Lecture-1-7-kernel-multidimension.pdf 343KB
  58. Lecture-4-5-naive-scan-kernel.pdf 339KB
  59. Lecture-4-1-reduction.pdf 294KB
  60. hetero-lecture_slides_002-Lecture 2-Lecture-2-3-cuda-memories.pdf 294KB
  61. hetero-lecture_slides_002-Lecture 1-Lecture-1-3-software-cost.pdf 280KB
  62. hetero-lecture_slides_002-Lecture 1-Lecture-1-2-heterogeneous.pdf 272KB
  63. hetero-lecture_slides_002-Lecture 1-Lecture-1-8-kernel-matrix-multiplication.pdf 270KB
  64. hetero-lecture_slides_002-Lecture 1-Lecture-1-1-Overview.pdf 243KB
  65. hetero-lecture_slides_002-Lecture 2-Lecture-2-8-boundary-condition-kernel.pdf 233KB
  66. hetero-lecture_slides_002-Lecture 2-Lecture-2-6-tiled-kernel.pdf 231KB
  67. hetero-lecture_slides_002-Lecture 2-Lecture-2-4-tiled-algorithms.pdf 222KB
  68. hetero-lecture_slides_002-Lecture 2-Lecture-2-7-boundary-condition.pdf 176KB
  69. hetero-lecture_slides_002-Lecture 2-Lecture-2-5-tiled-matrix-multiplication.pdf 162KB
  70. 2 - 6 - 2.6- Tiled Matrix Multiplication Kernel.srt 33KB
  71. 4 - 1 - 4.1- Parallel Computation Patterns - Reduction.srt 28KB
  72. 3 - 1 - 3.1- Performance Considerations - DRAM Bandwidth.srt 28KB
  73. Гетерогенное параллельное программирование.docx 28KB
  74. 3 - 6 - 3.6- Parallel Computation Patterns - Data Reuse in Tiled Convolution.srt 27KB
  75. 1 - 4 - 1.4- Introduction to CUDA, Data Parallelism and Threads.srt 26KB
  76. 2 - 3 - 2.3- Memory Model and Locality -- CUDA Memories.srt 26KB
  77. 4 - 5 - 4.5- Parallel Computation Patterns - A Work-Inefficient Scan Kernel.srt 26KB
  78. 1 - 1 - 1.1- Course Overview.srt 26KB
  79. 4 - 7 - 4.7- Parallel Computation Patterns - More on Parallel Scan.srt 25KB
  80. 2 - 5 - 2.5- Tiled Matrix Multiplication.srt 25KB
  81. 4 - 6 - 4.6- Parallel Computation Patterns - A Work-Efficient Parallel Scan Kernel.srt 24KB
  82. 4 - 4 - 4.4- Parallel Computation Patterns - Scan (Prefix Sum).srt 24KB
  83. 1 - 6 - 1.6- Introduction to CUDA, Kernel-Based SPMD Parallel Programming.srt 24KB
  84. 2 - 4 - 2.4- Tiled Parallel Algorithms.srt 23KB
  85. 1 - 5 - 1.5- Introduction to CUDA, Memory Allocation and Data Movement API.srt 23KB
  86. 3 - 5 - 3.5- Parallel Computation Patterns - 2D Tiled Convolution Kernel.srt 23KB
  87. 2 - 1 - 2.1- Kernel-based Parallel Programming - Thread Scheduling.srt 23KB
  88. 4 - 2 - 4.2- Parallel Computation Patterns - A Basic Reduction Kernel.srt 22KB
  89. 5 - 3 - 5.3- Parallel Computation Patterns - Atomic Operations in CUDA.srt 22KB
  90. 3 - 4 - 3.4- Parallel Computation Patterns - Tiled Convolution.srt 21KB
  91. 2 - 8 - 2.8- A Tiled Kernel for Arbitrary Matrix Dimensions.srt 20KB
  92. 1 - 8 - 1.8- Kernel-based Parallel Programming, Basic Matrix-Matrix Multiplication.srt 20KB
  93. 2 - 6 - 2.6- Tiled Matrix Multiplication Kernel.txt 20KB
  94. 5 - 4 - 5.4- Parallel Computation Patters - Atomic Operations Performance.srt 20KB
  95. 1 - 2 - 1.2- Introduction to Heterogeneous Parallel Computing.srt 19KB
  96. 1 - 7 - 1.7- Kernel-based Parallel Programming, Multidimensional Kernel Configuration.srt 19KB
  97. 4 - 3 - 4.3- Parallel Computation Patterns - A Better Reduction Kernel.srt 19KB
  98. 3 - 3 - 3.3- Parallel Computation Patterns - Convolution.srt 19KB
  99. 3 - 2 - 3.2- Performance Considerations - Memory Coalescing in CUDA.srt 18KB
  100. 2 - 2 - 2.2- Control Divergence.srt 18KB
  101. 4 - 1 - 4.1- Parallel Computation Patterns - Reduction.txt 18KB
  102. 3 - 1 - 3.1- Performance Considerations - DRAM Bandwidth.txt 17KB
  103. 5 - 5 - 5.5- Parallel Computation Patterns - A Privatized Histogram Kernel.srt 17KB
  104. 3 - 6 - 3.6- Parallel Computation Patterns - Data Reuse in Tiled Convolution.txt 17KB
  105. 2 - 7 - 2.7- Handling Boundary Conditions in Tiling.srt 17KB
  106. 2 - 3 - 2.3- Memory Model and Locality -- CUDA Memories.txt 16KB
  107. 4 - 5 - 4.5- Parallel Computation Patterns - A Work-Inefficient Scan Kernel.txt 16KB
  108. 1 - 4 - 1.4- Introduction to CUDA, Data Parallelism and Threads.txt 16KB
  109. 1 - 1 - 1.1- Course Overview.txt 16KB
  110. 2 - 5 - 2.5- Tiled Matrix Multiplication.txt 15KB
  111. 5 - 1 - 5.1- Parallel Computation Patterns - Histogramming.srt 15KB
  112. 4 - 7 - 4.7- Parallel Computation Patterns - More on Parallel Scan.txt 15KB
  113. 5 - 2 - 5.2- Parallel Computation Patterns - Atomic Operations.srt 15KB
  114. 4 - 6 - 4.6- Parallel Computation Patterns - A Work-Efficient Parallel Scan Kernel.txt 15KB
  115. 4 - 4 - 4.4- Parallel Computation Patterns - Scan (Prefix Sum).txt 15KB
  116. 1 - 6 - 1.6- Introduction to CUDA, Kernel-Based SPMD Parallel Programming.txt 14KB
  117. 2 - 4 - 2.4- Tiled Parallel Algorithms.txt 14KB
  118. 1 - 5 - 1.5- Introduction to CUDA, Memory Allocation and Data Movement API.txt 14KB
  119. 3 - 5 - 3.5- Parallel Computation Patterns - 2D Tiled Convolution Kernel.txt 14KB
  120. 2 - 1 - 2.1- Kernel-based Parallel Programming - Thread Scheduling.txt 14KB
  121. 4 - 2 - 4.2- Parallel Computation Patterns - A Basic Reduction Kernel.txt 13KB
  122. 3 - 4 - 3.4- Parallel Computation Patterns - Tiled Convolution.txt 13KB
  123. 5 - 3 - 5.3- Parallel Computation Patterns - Atomic Operations in CUDA.txt 13KB
  124. 1 - 8 - 1.8- Kernel-based Parallel Programming, Basic Matrix-Matrix Multiplication.txt 13KB
  125. 2 - 8 - 2.8- A Tiled Kernel for Arbitrary Matrix Dimensions.txt 13KB
  126. 1 - 2 - 1.2- Introduction to Heterogeneous Parallel Computing.txt 12KB
  127. 5 - 4 - 5.4- Parallel Computation Patters - Atomic Operations Performance.txt 12KB
  128. 1 - 7 - 1.7- Kernel-based Parallel Programming, Multidimensional Kernel Configuration.txt 12KB
  129. 4 - 3 - 4.3- Parallel Computation Patterns - A Better Reduction Kernel.txt 12KB
  130. 3 - 3 - 3.3- Parallel Computation Patterns - Convolution.txt 11KB
  131. 3 - 2 - 3.2- Performance Considerations - Memory Coalescing in CUDA.txt 11KB
  132. 2 - 2 - 2.2- Control Divergence.txt 11KB
  133. 1 - 3 - 1.3- Portability and Scalability in Heterogeneous Parallel Computing.srt 11KB
  134. 5 - 5 - 5.5- Parallel Computation Patterns - A Privatized Histogram Kernel.txt 10KB
  135. 2 - 7 - 2.7- Handling Boundary Conditions in Tiling.txt 10KB
  136. 5 - 1 - 5.1- Parallel Computation Patterns - Histogramming.txt 9KB
  137. 5 - 2 - 5.2- Parallel Computation Patterns - Atomic Operations.txt 9KB
  138. 1 - 3 - 1.3- Portability and Scalability in Heterogeneous Parallel Computing.txt 7KB