Tobias Dunn-Krahn
Applying Agile Principles to Incident Management
#1about 6 minutes
Defining digital service incidents and key stakeholders
An incident is any interruption to a digital service, from a full outage to an SLO breach, involving service teams, IT, support, and management.
#2about 4 minutes
Applying agile and SRE principles to incident response
Improve incident management by adopting agile principles like iterative mitigation, DevOps culture-bridging, Scrum retrospectives, and SRE-driven automation.
#3about 3 minutes
Using Failure Friday to practice incident management
Regularly practicing incident response through simulated outages, known as Failure Friday, builds team confidence and refines resolution processes.
#4about 2 minutes
Demo setup of a company's modern and legacy toolchains
The demo scenario involves a company with an agile team using tools like Slack and Jira, and a major incident team using ServiceNow.
#5about 5 minutes
Demo of receiving an alert and initiating an incident
An automated workflow enriches an incoming alert with diagnostic data and, upon escalation, creates linked artifacts in Slack, Jira, and ServiceNow.
#6about 6 minutes
Using an incident console to manage response and resolvers
The incident console provides a central hub for tracking status, managing on-call resolvers, and accessing collaboration channels to streamline remediation.
#7about 2 minutes
Conducting a post-incident review to drive improvement
After resolution, a post-incident review helps analyze the timeline, document learnings, and create trackable action items to prevent future occurrences.
#8about 9 minutes
Building custom automation with a low-code flow designer
The low-code flow designer allows teams to build custom automation workflows by connecting triggers and steps to integrate with any tool, including on-premise systems.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
05:39 MIN
Q&A on agile development, tooling, and observability
Why shifting left is so important for software developers
Unlock Moments
Create a free account to watch a limited number of Moments each month.
Upgrade to PRO for unlimited access to the full archive.
Upgrade to PRO for unlimited access to the full archive.
You have an account? Log in
01:12 MIN
Improving incident response to make on-call less painful
What Developers Get Wrong About Application Quality
Unlock Moments
Create a free account to watch a limited number of Moments each month.
Upgrade to PRO for unlimited access to the full archive.
Upgrade to PRO for unlimited access to the full archive.
You have an account? Log in
01:56 MIN
Shifting from a waterfall to an agile NetDevOps workflow
How Cisco embraced a DevOps culture within its network engineering team
Unlock Moments
Create a free account to watch a limited number of Moments each month.
Upgrade to PRO for unlimited access to the full archive.
Upgrade to PRO for unlimited access to the full archive.
You have an account? Log in
09:21 MIN
Navigating the common pitfalls of DevOps adoption
Demystifying DevOps—Pros, cons, dos & don'ts
Unlock Moments
Create a free account to watch a limited number of Moments each month.
Upgrade to PRO for unlimited access to the full archive.
Upgrade to PRO for unlimited access to the full archive.
You have an account? Log in
03:13 MIN
Applying an agile mindset to an infrastructure project
AWS Migration within 3 months
Unlock Moments
Create a free account to watch a limited number of Moments each month.
Upgrade to PRO for unlimited access to the full archive.
Upgrade to PRO for unlimited access to the full archive.
You have an account? Log in
04:03 MIN
Why agile teams struggle with adopting new technologies
Retooling and refactoring - an investment in people.
Unlock Moments
Create a free account to watch a limited number of Moments each month.
Upgrade to PRO for unlimited access to the full archive.
Upgrade to PRO for unlimited access to the full archive.
You have an account? Log in
01:18 MIN
Making time for transformation amid constant firefighting
How Cisco embraced a DevOps culture within its network engineering team
Unlock Moments
Create a free account to watch a limited number of Moments each month.
Upgrade to PRO for unlimited access to the full archive.
Upgrade to PRO for unlimited access to the full archive.
You have an account? Log in
04:41 MIN
Actionable takeaways for SREs on incident management
Serverless Observability: where SLOs meet transforms
Unlock Moments
Create a free account to watch a limited number of Moments each month.
Upgrade to PRO for unlimited access to the full archive.
Upgrade to PRO for unlimited access to the full archive.
You have an account? Log in
Featured Partners
Related Videos
The user in the eye of the Cargo1492 storm
Martin Nader
DevOps Maturity Check – a way to balance autonomy and alignment
Martin Thalmann
Handling incidents collaboratively is like solving a rubix cube
Nele Uhlemann
SRE Methods In an Agency Environment
Martin Beránek
Navigating the Future of Junior Developers in Tech
Chris Heilmann
3 Key Steps for Optimizing DevOps Workflows
Daniel Tao
Enabling automated 1-click customer deployments with built-in quality and security
Christoph Ruggenthaler
Retooling and refactoring - an investment in people.
Andrew Holway
Related Articles
View all articles
.webp?w=240&auto=compress,format)


From learning to earning
Jobs that call for the skills explored in this talk.




Peter Park System GmbH
München, Germany
Senior
Python
Docker
Node.js
JavaScript

IKEA
Amsterdam, Netherlands
Intermediate
Azure
Terraform
Google Cloud Platform
Amazon Web Services (AWS)
Scripting (Bash/Python/Go/Ruby)


forty-five Personalberatung Wiesbaden GmbH & Co. KG
GIT
HTML
JIRA
DevOps
Docker
+2

