Introduction to GPU Programming Training Course

GPU programming harnesses the parallel processing capabilities of Graphics Processing Units to accelerate applications demanding high-performance computing, including artificial intelligence, gaming, graphics rendering, and scientific computation. Various frameworks and tools facilitate GPU programming, each presenting distinct advantages and limitations. Among the most widely used are OpenCL, CUDA, ROCm, and HIP.

This instructor-led, live training (available online or onsite) is designed for developers at the beginner to intermediate level who wish to master the fundamentals of GPU programming and familiarise themselves with the primary frameworks and tools for developing GPU-based applications.

By the end of this training, participants will be able to:
Grasp the differences between CPU and GPU computing, as well as the benefits and challenges associated with GPU programming.
Select the most appropriate framework and tool for their specific GPU application.
Develop a foundational GPU program that performs vector addition using one or more of the taught frameworks and tools.
Utilise the relevant APIs, languages, and libraries to query device information, allocate and deallocate device memory, transfer data between host and device, launch kernels, and synchronise threads.
Leverage distinct memory spaces—such as global, local, constant, and private—to optimise data transfers and memory access patterns.
Employ specific execution models, including work-items, work-groups, threads, blocks, and grids, to manage parallelism effectively.
Debug and test GPU programs using tools such as CodeXL, CUDA-GDB, CUDA-MEMCHECK, and NVIDIA Nsight.
Optimise GPU programs through techniques such as coalescing, caching, prefetching, and profiling.

Format of the Course

Interactive lectures and discussions.
Extensive exercises and practical practice.
Hands-on implementation in a live-lab environment.

Course Customisation Options

To request a tailored training session for this course, please contact us to arrange details.

This course is available as onsite live training in Malaysia or online live training.

Thank you for sending your enquiry! One of our team members will contact you shortly.

Thank you for sending your booking! One of our team members will contact you shortly.

Upcoming Courses

Introduction to GPU Programming

2026-08-31 09:30

21 hours

Kuala Lumpur KL Sentral

21661 MYR (Online)

22636 MYR (Classroom)

Introduction to GPU Programming

2026-09-14 09:30

21 hours

Bayan Lepas, iDEAL

21661 MYR (Online)

22396 MYR (Classroom)

Introduction to GPU Programming

2026-09-28 09:30

21 hours

Kuala Lumpur KL Sentral

21661 MYR (Online)

22636 MYR (Classroom)

Introduction to GPU Programming

2026-10-12 09:30

21 hours

Bayan Lepas, iDEAL

21661 MYR (Online)

22396 MYR (Classroom)

Introduction to GPU Programming

2026-10-26 09:30

21 hours

Kuala Lumpur KL Sentral

21661 MYR (Online)

22636 MYR (Classroom)

Introduction to GPU Programming Training Course

Course Outline

Requirements

Upcoming Courses

Introduction to GPU Programming

Introduction to GPU Programming

Introduction to GPU Programming

Introduction to GPU Programming

Introduction to GPU Programming

Related Categories

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites

Introduction to GPU Programming Training Course

Course Outline

Requirements

Upcoming Courses

Introduction to GPU Programming

Introduction to GPU Programming

Introduction to GPU Programming

Introduction to GPU Programming

Introduction to GPU Programming

Related Courses

Developing AI Applications with Huawei Ascend and CANN

Deploying AI Models with CANN and Ascend AI Processors

AI Inference and Deployment with CloudMatrix

GPU Programming on Biren AI Accelerators

Cambricon MLU Development with BANGPy and Neuware

Introduction to CANN for AI Framework Developers

CANN for Edge AI Deployment

Understanding Huawei’s AI Compute Stack: From CANN to MindSpore

Optimizing Neural Network Performance with CANN SDK

CANN SDK for Computer Vision and NLP Pipelines

Building Custom AI Operators with CANN TIK and TVM

Migrating CUDA Applications to Chinese GPU Architectures

Performance Optimization on Ascend, Biren, and Cambricon

Related Categories

GPU

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites