Svetlin Penkov
Debugging Machine Learning Code
#1about 6 minutes
The core challenge of debugging machine learning code
Machine learning models are defined by complex computations on high-dimensional data, making traditional debugging methods ineffective.
#2about 4 minutes
Why you should verify code correctness before redesigning models
Poor model performance is often caused by simple code bugs rather than flawed model architecture, a common oversight in the R&D cycle.
#3about 4 minutes
Distinguishing between semantic and runtime bugs in development
The development process involves two distinct feedback loops for handling semantic bugs from model translation and runtime bugs from data issues.
#4about 9 minutes
Limitations of traditional debugging methods for ML
Standard techniques like printing variables, plotting, and custom dashboards fail to provide insight into the complex, high-dimensional state of modern ML models.
#5about 5 minutes
Introducing FMRI for interactive 3D data visualization
The FMRI debugger allows you to inspect high-dimensional tensors visually in 3D, making it easy to understand complex data structures with a single line of code.
#6about 8 minutes
Visualizing a CNN's computational graph with FMRI scan
By wrapping a training loop with the scan function, FMRI automatically generates an interactive 3D computational graph of a PyTorch model.
#7about 3 minutes
Scaling visual debugging and using automated assertions
FMRI handles large-scale models like VGG19 and includes a library of assertions to automatically detect common issues like vanishing gradients or invalid inputs.
#8about 6 minutes
Live demo of debugging a CNN with FMRI assertions
A live demonstration shows how to inspect a 3D tensor and use FMRI's built-in assertions to instantly find the root cause of NaN errors in a CNN.
#9about 3 minutes
Exploring the full computational graph of ResNet-101
This demonstration visualizes the entire ResNet-101 model, showcasing the tool's ability to handle massive computational graphs and explore learned features.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
14:46 MIN
Key takeaways and Q&A on debugger internals
Debugging Unveiled: Exploring Debugger Internals and Hidden Gems
01:39 MIN
Using LLMs to understand and navigate codebases
How E.On productionizes its AI model & Implementation of Secure Generative AI.
03:01 MIN
Common challenges in developing machine learning applications
Data Fabric in Action - How to enhance a Stock Trading App with ML and Data Virtualization
05:25 MIN
Modern developer tools and debugging workflows
WeAreDevelopers LIVE - Whats Nuxt and Next for app development, 20 years AJAX and more
01:47 MIN
Using AI for root cause analysis and fixes
Debugging in the Dark
05:56 MIN
Developer tools and learning resources for GPUs
Accelerating Python on GPUs
05:14 MIN
Challenge three: Ensuring machine learning models are robust
How Machine Learning is turning the Automotive Industry upside down
03:55 MIN
The future of developer tools in an AI-driven world
Are frameworks like React redundant in an AI world?
Featured Partners
Related Videos
Getting Started with Machine Learning
Alexandra Waldherr
Overview of Machine Learning in Python
Adrian Schmitt
The pitfalls of Deep Learning - When Neural Networks are not the solution
Adrian Spataru & Bohdan Andrusyak
Machine Learning for Software Developers (and Knitters)
Kris Howard
From ML to LLM: On-device AI in the Browser
Nico Martin
Debugging Unveiled: Exploring Debugger Internals and Hidden Gems
Johannes Bechberger
Machine learning in the browser with TensorFlowjs
Håkan Silfvernagel
30 Golden Rules of Deep Learning Performance
Anirudh Koul
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

Samsung Group
Cambridge, United Kingdom
Remote
PyTorch
Tensorflow
Computer Vision
Machine Learning

Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
Berlin, Germany
GIT
MySQL
Keras
DevOps
Docker
+4

Squarepoint Capital
London, United Kingdom
Intermediate
gRPC
Linux
Kotlin
PyTorch
PostgreSQL
+3

Understanding Recruitment Group
Barcelona, Spain
Remote
Node.js
Computer Vision
Machine Learning

ImFusion GmbH
München, Germany
Remote
€35K
PyTorch
Machine Learning


Sony Europe Bv
Schlieren, Switzerland
Tensorflow
Machine Learning

BMW
GIT
OpenCL
Docker
PyTorch
Tensorflow
+2

Infosupport
Utrecht, Netherlands
€0K
Data analysis
Machine Learning
Software Architecture