How AI is Helping Site Reliability Engineers Automate Incident Response



Broadcom Infrastructure Software

Subscribe to our YouTube channel to stay up to date on all of our world-class products and exciting updates: https://goo.gl/YhZF9h

In this AIOps Virtual Summit session, Kieran Taylor, Sr. Director of Product Marketing at CA Technologies, is joined by Todd Palino, a Senior SRE at LinkedIn, and David Blank-Edelman, an SRE author and SRECon founder.

Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to IT operations problems. SRE strives to bring an engineering mindset to the operational challenges of running production systems at scale. By focusing on reliability as a primary property of the complex systems we run, SRE enables the business to navigate the tricky path between feature velocity and operational stability.

As the systems we operate grow more complex, thanks to scale and the adoption of modern software architectures, so too does the difficulty of the work. People with the skill set necessary to deal with these challenges are scarce and in high demand. The resulting skills gap presents an opportunity to leverage machine learning and artificial intelligence to augment SRE teams. Watch this video to learn about Site Reliability Engineering and how SREs are automating incident response to improve user experience.

Learn more at: www.ca.com/AIOps
