Paul Graham

Accelerating Python on GPUs

Is your Python code hitting a performance wall? Learn how to leverage the massive parallelism of GPUs with minimal code changes.

Accelerating Python on GPUs
#1about 2 minutes

The rise of general-purpose GPU computing

NVIDIA's evolution from a graphics hardware company to a leader in general-purpose computing was accelerated by the use of GPUs for AI with models like AlexNet.

#2about 4 minutes

Why GPUs outperform CPUs for parallel tasks

As single-threaded CPU performance plateaued, GPUs offered a path forward with their massively parallel architecture designed for simultaneous computation.

#3about 6 minutes

Understanding modern GPU architecture and operation

GPUs work with CPUs by offloading compute-intensive code and use thousands of threads to hide memory latency, leveraging streaming multiprocessors and high-bandwidth memory.

#4about 7 minutes

Introducing the CUDA parallel computing platform

The CUDA platform is a complete ecosystem with compilers, libraries, and frameworks that enables developers to program GPUs using various languages and abstraction levels.

#5about 3 minutes

Leveraging specialized hardware like Tensor Cores

Specialized hardware like Tensor Cores can be used transparently through high-level libraries like cuDNN or programmed directly with low-level APIs for maximum performance.

#6about 6 minutes

High-level frameworks for domain-specific acceleration

Frameworks like Rapids provide GPU-accelerated, drop-in replacements for popular data science libraries such as Pandas (cuDF) and NetworkX (cuGraph) with minimal code changes.

#7about 10 minutes

A progressive approach to programming GPUs in Python

Developers can choose from a spectrum of Python libraries, from simple drop-in replacements like CuNumeric and CuPy to JIT compilers like Numba and direct kernel programming with PyCUDA.

#8about 6 minutes

Developer tools and learning resources for GPUs

NVIDIA offers a comprehensive suite of developer tools for profiling and debugging, along with learning resources like the NGC repository, DLI courses, and community events.

Related jobs
Jobs that call for the skills explored in this talk.
SabIna compys

SabIna compys
Vienna, Austria

Remote
20-100K
Intermediate
JavaScript
.NET
+1

Featured Partners

Related Articles

View all articles
CH
Chris Heilmann
All the videos of Halfstack London 2024!
Last month was Halfstack London, a conference about the web, JavaScript and half a dozen other things. We were there to deliver a talk, but also to record all the sessions and we're happy to share them with you. It took a bit as we had to wait for th...
All the videos of Halfstack London 2024!
AV
Admir Video
gif
gif
DC
Daniel Cranney
How to Use Generative AI to Accelerate Learning to Code
It’s undeniable that generative-AI and LLMs have transformed how developers work. Hours of hunting Stack Overflow can be avoided by asking your AI-code assistant, multi-file context can be fed to the AI from inside your IDE, and applications can be b...
How to Use Generative AI to Accelerate Learning to Code

From learning to earning

Jobs that call for the skills explored in this talk.

Python Engineer

Usersnap
Vienna, Austria

Intermediate
Python
Amazon Web Services (AWS)