Contact APC LLC for event and ticket information.
Education and Research Center APPLIED PARALLEL COMPUTING

Looks like this event has already ended.

Check out upcoming events by this organizer, or organize your very own event.

View upcoming events Create an event

GPU Computing and CUDA.

Monday, January 16, 2012 at 10:00 AM - Wednesday, January 18, 2012 at 4:00 PM (GMT+0100)

Munich, Germany

GPU Computing and CUDA.

Ticket Information

Ticket Type Remaining Sales End Price Fee Quantity
Special price offer 5 tickets Ended €600.00 €7.50
SHARE THIS EVENT

Event Details

Education and Research Center APPLIED PARALLEL COMPUTING

NVIDIA Advanced CUDA Programming Course Plan

  1. From GPU to GPGPU
    • Performance and parallelism
    • GPU evolution
    • Parallel systems: multicore and clustering
  2. CUDA programming model
    • Key principles
    • Threads and blocks
    • Language extensions
      • Attributes
      • Builtin types and variables
      • Kernel invocation operator
    • CUDA runtime API
      • Asynchronous execution
      • Handling runtime errors in CUDA
      • Querying GPU capabilities 
  3. Memory hierarchy
    • Global memory
      • Example: matrix multiplication
      • Optimizing global memory usage
    • Block-shared memory
      • Example: matrix multiplication
      • Shared memory access patterns
    • Constant memory
    • Texture memory
    • Unified virtual address space (UVA)
  4. Implementing basic data processing
    • Parallel reduction
    • Prefix sum (scan)
      • CUDA implementation
      • CUDPP implementation
  5. CUDA Libraries
    • CUBLAS
    • CUSPARSE
    • CUFFT
    •  CURAND
  6. CUDA Fortran Overiew
  7. Using multiple GPUs
    • CUDA context
    • fork
    • MPI
    •  POSIX-threads
    • OpenMP
    • Boost.Threads
  8. CUDA Streams
    • Example: concurrent kernels execution
    • Example: matrix multiplication
    • Example: Multi-GPU Async Copy
  9. Debugging
    • Principles and terminology
    • gdb
    • cuda-gdb
    • Nsight
    • CUDA (Visual) Profiler
    • cuda-memcheck
  10. OpenCL Overview
    • Simple example
    • OpenCL host API
    •  Developing and deploying OpenCL kernels
    • Comparison with CUDA
  11. Optimization Techniques

Hands-ons

  1. Parallel sine function computation.
  2.  Matrix-matrix multiply with shared memory.
------------------------------------------------------------
CUDA course will be conducted in English.
 

When & Where



Leibniz-Rechenzentrum der Bayerischen Akademie der Wissenschaften
Boltzmannstraße, 1, H.U.010.
Garching bei München
85748 Munich
Germany

Monday, January 16, 2012 at 10:00 AM - Wednesday, January 18, 2012 at 4:00 PM (GMT+0100)


  Add to my calendar

Hosted By

APC LLC



APC LLC
  "Applied Parallel Computing Education and Research Center" specializes in:

  • CUDA training, consulting and education
  • CUDA and parallel computing remote education
  • Mathematical models and computational algorithms
  • Porting various computational tasks to parallel architectures with full adaptation of code, optimization of computing tasks, training and consulting of local staff
http://parallel-compute.com