LLMs and Agents in DevOps Workflows Training Course
Large language models (LLMs) and autonomous agent frameworks such as AutoGen and CrewAI are transforming how DevOps teams automate tasks like change tracking, test generation, and alert triage by mimicking human collaboration and decision-making.
This instructor-led, live training (available online or onsite) is tailored for advanced-level engineers who want to design and implement DevOps automation workflows driven by large language models (LLMs) and multi-agent systems.
By the end of this training, participants will be able to:
- Integrate LLM-based agents into CI/CD workflows for intelligent automation.
- Automate test generation, commit analysis, and change summaries using agents.
- Coordinate multiple agents to triage alerts, generate responses, and provide DevOps recommendations.
- Build secure and maintainable agent-powered workflows using open-source frameworks.
Format of the Course
- Interactive lecture and discussion.
- Extensive exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request customized training for this course, please contact us.
Course Outline
Introduction to LLMs and Agent Frameworks
- Overview of large language models in infrastructure automation
- Key concepts in multi-agent workflows
- AutoGen, CrewAI, and LangChain: use cases in DevOps
Setting Up LLM Agents for DevOps Tasks
- Installing AutoGen and configuring agent profiles
- Using OpenAI API and other LLM providers
- Setting up workspaces and CI/CD-compatible environments
Automating Test and Code Quality Workflows
- Prompting LLMs to generate unit and integration tests
- Using agents to enforce linting, commit rules, and code review guidelines
- Automated pull request summarization and tagging
LLM Agents for Alert Handling and Change Detection
- Designing responder agents for pipeline failure alerts
- Analyzing logs and traces using language models
- Proactive detection of high-risk changes or misconfigurations
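As an illustration of the log-analysis topic above, here is a minimal sketch, assuming a simple regex pre-filter runs before any model is consulted. The pattern set, the function names, and the `ask_model` callable are all hypothetical; `ask_model` stands in for whichever LLM provider is used and is not part of any specific framework.

```python
import re
from collections import Counter
from typing import Callable

# Hypothetical failure patterns; a real pipeline would tune these to its own logs.
FAILURE_PATTERNS = {
    "timeout": re.compile(r"timed? ?out", re.IGNORECASE),
    "oom": re.compile(r"out of memory|oomkilled", re.IGNORECASE),
    "test_failure": re.compile(r"\bFAILED\b|AssertionError"),
}


def classify_lines(log_text: str) -> Counter:
    """Count how many log lines match each failure category."""
    counts: Counter = Counter()
    for line in log_text.splitlines():
        for label, pattern in FAILURE_PATTERNS.items():
            if pattern.search(line):
                counts[label] += 1
    return counts


def build_triage_prompt(log_text: str, max_lines: int = 20) -> str:
    """Assemble a compact prompt from the category counts and the log tail."""
    counts = classify_lines(log_text)
    summary = ", ".join(f"{k}={v}" for k, v in sorted(counts.items()))
    tail = "\n".join(log_text.splitlines()[-max_lines:])
    return (
        f"Pipeline failure. Heuristic matches: {summary or 'no known pattern'}.\n"
        f"Log tail:\n{tail}\n"
        "Suggest a likely cause."
    )


def triage(log_text: str, ask_model: Callable[[str], str]) -> str:
    """Send the prompt to a caller-supplied model function (an assumption)."""
    return ask_model(build_triage_prompt(log_text))
```

Keeping the model behind a plain callable means the heuristic part can be tested without network access, and only the log tail is sent out rather than the full log.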
Multi-Agent Coordination in DevOps
- Role-based agent orchestration (planner, executor, reviewer)
- Agent messaging loops and memory management
- Human-in-the-loop design for critical systems
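The planner, executor, and reviewer roles and the human-in-the-loop gate listed above can be sketched in plain Python. This is a framework-agnostic illustration, not the AutoGen or CrewAI API: `Step`, `RunLog`, `run_workflow`, and the three callables are hypothetical names chosen for the example.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Step:
    action: str
    critical: bool = False  # critical steps need human approval first


@dataclass
class RunLog:
    accepted: List[str] = field(default_factory=list)  # executed and passed review
    rejected: List[str] = field(default_factory=list)  # executed but failed review
    blocked: List[str] = field(default_factory=list)   # never executed


def run_workflow(
    plan: List[Step],
    execute: Callable[[Step], str],
    review: Callable[[Step, str], bool],
    approve: Callable[[Step], bool],
) -> RunLog:
    """Run each planned step through approval, execution, and review."""
    log = RunLog()
    for step in plan:
        # Human-in-the-loop gate: skip critical steps that are not approved.
        if step.critical and not approve(step):
            log.blocked.append(step.action)
            continue
        result = execute(step)
        if review(step, result):
            log.accepted.append(step.action)
        else:
            log.rejected.append(step.action)
    return log
```

In practice `execute` and `review` would wrap agent calls and `approve` would prompt an operator, for example through a chat message or a pipeline approval step.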
Security, Governance, and Observability
- Handling data exposure and LLM safety in infrastructure
- Auditing agent actions and restricting scope
- Tracking pipeline behavior and model feedback
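One way to picture scope restriction and auditing is an allowlist check that records every decision. The sketch below assumes agents propose shell commands as strings; the allowlist contents and the names `is_allowed` and `audit` are hypothetical and would differ per environment.

```python
import json
import shlex
from datetime import datetime, timezone
from typing import List

# Hypothetical policy: which executables an agent may call, and which
# subcommands stay off-limits even for an allowed executable.
ALLOWED_COMMANDS = {"git", "pytest", "kubectl"}
BLOCKED_SUBCOMMANDS = {("kubectl", "delete"), ("git", "push")}


def is_allowed(command: str) -> bool:
    """Return True only if the command passes the allowlist policy."""
    try:
        parts = shlex.split(command)
    except ValueError:
        return False  # unbalanced quotes and similar input is refused
    if not parts or parts[0] not in ALLOWED_COMMANDS:
        return False
    subcommand = parts[1] if len(parts) > 1 else ""
    return (parts[0], subcommand) not in BLOCKED_SUBCOMMANDS


def audit(agent: str, command: str, audit_log: List[str]) -> bool:
    """Record the decision as a JSON line and return whether it was allowed."""
    allowed = is_allowed(command)
    audit_log.append(json.dumps({
        "time": datetime.now(timezone.utc).isoformat(),
        "agent": agent,
        "command": command,
        "allowed": allowed,
    }))
    return allowed
```

Denied requests are logged as well as permitted ones, so a later review can see what an agent attempted, not just what it ran.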
Real-World Use Cases and Custom Scenarios
- Designing agent workflows for incident response
- Integrating agents with GitHub Actions, Slack, or Jira
- Best practices for scaling LLM integration in DevOps
Summary and Next Steps
Requirements
- Experience with DevOps tooling and pipeline automation
- Working knowledge of Python and Git-based workflows
- Understanding of LLMs or exposure to prompt engineering
Audience
- Innovation engineers and AI-integrated platform leads
- LLM developers working in DevOps or automation
- DevOps professionals exploring intelligent agent frameworks
Open Training Courses require 5+ participants.
Related Courses
Agentic Development with Gemini 3 and Google Antigravity
21 Hours
Google Antigravity serves as an agentic development environment specifically crafted to create autonomous agents capable of planning, reasoning, coding, and executing tasks through the multimodal capabilities of Gemini 3.
This instructor-led live training, available both online and onsite, is tailored for advanced technical professionals seeking to design, build, and deploy autonomous agents using Gemini 3 and the Antigravity ecosystem.
Upon completing this training, participants will be equipped to:
- Construct autonomous workflows that leverage Gemini 3 for reasoning, planning, and execution.
- Develop agents within Antigravity that can analyze tasks, generate code, and interact with various tools.
- Integrate Gemini-powered agents with enterprise systems and APIs.
- Enhance agent behavior, safety, and reliability in complex operational environments.
Course Format
- Expert demonstrations paired with interactive discussions.
- Hands-on experimentation focused on autonomous agent development.
- Practical implementation utilizing Antigravity, Gemini 3, and complementary cloud tools.
Course Customization Options
- If your team requires domain-specific agent behaviors or custom integrations, please contact us to tailor the program to your needs.
Advanced Antigravity: Feedback Loops, Learning & Long-Term Agent Memory
14 Hours
Google Antigravity serves as an advanced framework designed for experimenting with long-lived agents and emergent interactive behaviors.
This instructor-led training, available both online and onsite, targets advanced-level professionals seeking to design, analyze, and optimize agents that can retain memories, improve via feedback, and evolve over extended operational periods.
Upon completing this course, participants will acquire the ability to:
- Design memory structures for agent persistence.
- Implement effective feedback loops to influence agent behavior.
- Evaluate learning trajectories and model drift.
- Integrate memory mechanisms into complex multi-agent ecosystems.
Course Format
- Expert-led discussions complemented by technical demonstrations.
- Hands-on exploration through structured design challenges.
- Application of concepts within simulated agent environments.
Customization Options
- Should your organization require tailored content or case-specific examples, please contact us to arrange a customized training session.
Advanced Mastra Integrations: APIs, Tools, Enterprise Data & External Systems
21 Hours
Mastra is a framework designed to facilitate deep integration between AI agents, APIs, enterprise applications, and external data systems.
This instructor-led live training, available online or onsite, targets intermediate-level engineers looking to build reliable, secure, and scalable integrations between Mastra agents and the broader enterprise ecosystem.
Upon completing this training, participants will be equipped to:
- Implement API-driven integrations between Mastra agents and external services.
- Connect enterprise data systems and tools to automated agent workflows.
- Apply secure data exchange and authentication best practices.
- Design integration layers that are scalable, maintainable, and ready for production.
Course Format
- Interactive lectures and discussions.
- Hands-on integration engineering and API exercises.
- Live-lab implementation using real-world enterprise scenarios.
Course Customization Options
- Custom API scenarios, enterprise system mappings, or data-integration workshops are available upon request.
AIOps in Action: Incident Prediction and Root Cause Automation
14 Hours
AIOps (Artificial Intelligence for IT Operations) is increasingly being used to predict incidents before they occur and automate root cause analysis (RCA) to minimize downtime and accelerate resolution.
This instructor-led, live training (online or onsite) is aimed at advanced-level IT professionals who wish to implement predictive analytics, automate remediation, and design intelligent RCA workflows using AIOps tools and machine learning models.
By the end of this training, participants will be able to:
- Build and train ML models to detect patterns leading to system failures.
- Automate RCA workflows based on multi-source log and metric correlation.
- Integrate alerting and remediation processes into existing platforms.
- Deploy and scale intelligent AIOps pipelines in production environments.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request customized training for this course, please contact us.
AIOps Fundamentals: Monitoring, Correlation, and Intelligent Alerting
14 Hours
AIOps (Artificial Intelligence for IT Operations) is a discipline that leverages machine learning and analytics to automate and enhance IT operations, with a specific focus on monitoring, incident detection, and response.
This instructor-led live training, available both online and onsite, targets intermediate-level IT operations professionals seeking to implement AIOps techniques. The course aims to help participants correlate metrics and logs, reduce alert noise, and improve observability through intelligent automation.
Upon completion of this training, participants will be able to:
- Grasp the core principles and architecture of AIOps platforms.
- Correlate data across logs, metrics, and traces to pinpoint root causes.
- Mitigate alert fatigue via intelligent filtering and noise suppression.
- Employ open-source or commercial tools to monitor incidents and trigger automated responses.
Course Format
- Interactive lectures and group discussions.
- Extensive exercises and practical applications.
- Hands-on implementation within a live laboratory environment.
Course Customization Options
- To arrange a customized training session for this course, please get in touch with us.
Building an AIOps Pipeline with Open Source Tools
14 Hours
Leveraging exclusively open-source tools to build an AIOps pipeline enables teams to create scalable and cost-efficient solutions for observability, anomaly detection, and intelligent alerting within production environments.
This instructor-led training, available both online and onsite, is designed for advanced engineers aiming to implement an end-to-end AIOps pipeline. Key tools covered include Prometheus, ELK, Grafana, and custom machine learning models.
Upon completion of this course, participants will be able to:
- Architect an AIOps infrastructure using only open-source components.
- Gather and standardize data from logs, metrics, and traces.
- Utilize machine learning models to identify anomalies and forecast incidents.
- Automate alerting and remediation processes using open-source tooling.
Course Format
- Engaging lectures and discussions.
- Extensive exercises and practice.
- Hands-on implementation within a live laboratory environment.
Customization Options
- For customized training arrangements, please contact us directly.
Antigravity for Developers: Building Agent-First Applications
21 Hours
Antigravity serves as a development platform specifically crafted for creating AI-driven, agent-first applications.
This instructor-led live training, available either online or on-site, targets intermediate developers keen on constructing real-world applications using autonomous AI agents within the Antigravity ecosystem.
Upon completing this training, participants will gain the ability to:
- Build applications that leverage autonomous and coordinated AI agents.
- Utilise the Antigravity IDE, editor, terminal, and browser for end-to-end development.
- Manage multi-agent workflows via the Agent Manager.
- Integrate agent capabilities into production-grade software systems.
Format of the Course
- A blend of presentations with in-depth demonstrations.
- Extensive hands-on practice and guided exercises.
- Real implementation work inside the Antigravity live environment.
Course Customization Options
- For tailored content aligned with your development stack, please contact us to arrange a customized version of this training.
Getting Started with Antigravity: An Introduction to Agent-First IDEs
14 Hours
Google Antigravity is an agent-first development environment designed to streamline engineering workflows through intelligent automation.
This instructor-led, live training (online or onsite) is aimed at beginner-level practitioners who wish to explore the fundamentals of Antigravity and understand how agent-driven coding environments enhance productivity.
Upon completion of this training, participants will be able to:
- Install and configure Google Antigravity.
- Navigate and understand both the Editor View and Manager View.
- Work effectively with agents to automate simple development tasks.
- Use Antigravity to generate, refine, and manage project files.
Format of the Course
- Instructor explanations supported by real-time demonstrations.
- Guided exercises focused on hands-on use of agents.
- Practical exploration of core Antigravity features in a controlled lab environment.
Course Customization Options
- If you require a tailored version of this training, please contact us to arrange a customized program.
Antigravity for Web Automation & Browser-Based Tasks
21 Hours
Google Antigravity serves as a platform designed for creating agents that can interact with web applications, browser environments, and workflows spanning multiple surfaces.
This instructor-led training, available both online and onsite, targets intermediate-level professionals aiming to build, automate, and test browser-based workflows using Google Antigravity.
After completing the training, participants will be equipped to:
- Develop agents capable of interacting with web applications within the browser interface.
- Automate complete workflows across different browser contexts.
- Validate and resolve issues related to agent behavior in user interface-driven environments.
- Deploy cross-surface automation strategies leveraging Antigravity.
Course Format
- Structured instruction accompanied by live demonstrations.
- Practical, hands-on activities and scenario-based exercises.
- Implementation of agent workflows within an interactive lab environment.
Customization Options
- For tailored training needs, please reach out to us to adapt the course content to your specific objectives.
Enterprise AIOps with Splunk, Moogsoft, and Dynatrace
14 Hours
Enterprise-grade AIOps platforms such as Splunk, Moogsoft, and Dynatrace offer robust capabilities for identifying anomalies, correlating alerts, and automating responses across expansive IT environments.
This instructor-led training, available online or onsite, is designed for intermediate-level enterprise IT teams looking to incorporate AIOps tools into their current observability frameworks and operational workflows.
Upon completing this training, participants will be equipped to:
- Configure and integrate Splunk, Moogsoft, and Dynatrace into a cohesive AIOps architecture.
- Correlate metrics, logs, and events across distributed systems using AI-driven analysis.
- Automate incident detection, prioritisation, and response through built-in and custom workflows.
- Enhance performance, reduce MTTR, and boost operational efficiency at an enterprise scale.
Course Format
- Interactive lectures and discussions.
- Numerous exercises and practical sessions.
- Hands-on implementation within a live-lab environment.
Customisation Options
- To request a tailored training session for this course, please get in touch to make arrangements.
Implementing AIOps with Prometheus, Grafana, and ML
14 Hours
Prometheus and Grafana are industry-standard tools for ensuring observability within modern infrastructure. By integrating machine learning, these platforms gain the ability to deliver predictive and intelligent insights, thereby automating operational decision-making.
This instructor-led live training, available either online or onsite, is designed for observability professionals with intermediate-level expertise. It aims to help participants modernise their monitoring infrastructure by incorporating AIOps practices using Prometheus, Grafana, and machine learning techniques.
Upon completion of this training, participants will be equipped to:
- Configure Prometheus and Grafana to provide observability across various systems and services.
- Collect, store, and visualise high-quality time series data.
- Apply machine learning models for anomaly detection and forecasting.
- Develop intelligent alerting rules driven by predictive insights.
Course Format
- Interactive lectures and discussions.
- Ample exercises and practical application.
- Hands-on implementation within a live-lab environment.
Course Customisation Options
- To arrange a customised training session for this course, please contact us.
AI Agent Development with Mastra
14 Hours
This instructor-led, live training (available online or onsite) is designed for intermediate-level software developers and engineering teams aiming to build scalable, observable AI systems using Mastra.
By the end of this training, participants will be able to:
- Understand Mastra’s architecture and how it integrates with LLMs and external APIs.
- Design and implement AI agents and workflows using TypeScript.
- Use Mastra’s observability and memory tools to monitor and improve agent performance.
- Deploy production-ready AI applications leveraging Mastra’s framework features.
Mastra Debugging, Evaluation & Quality Assurance for AI Agents
21 Hours
Mastra serves as a framework offering structured tools to evaluate, debug, and ensure the reliability of AI agents operating within complex workflows.
This instructor-led live training, available online or onsite, is designed for intermediate-level practitioners who want to rigorously test agent behaviour, enhance reliability, and implement measurable evaluation processes.
By the end of this training, participants will be able to:
- Apply debugging techniques to identify and rectify issues with agent behaviour.
- Evaluate agents using structured metrics, benchmarks, and quality scores.
- Implement tooling and workflows that monitor reliability, drift, and hallucinations.
- Design QA strategies that guarantee consistent and predictable agent performance.
Course Format
- Interactive lectures and discussions.
- Hands-on exercises focused on debugging and evaluation.
- Live-lab analysis of agent behaviours using observability tools.
Course Customization Options
- Customised reliability testing scenarios and industry-specific QA methods can be arranged upon request.
Managing Agent Workflows in Google Antigravity: Orchestration, Planning and Artifacts
14 Hours
Google Antigravity serves as an agent-centric development platform designed to orchestrate, supervise, and coordinate AI-driven coding and automation workflows.
This instructor-led live training, available online or onsite, targets intermediate-level professionals aiming to design, manage, and optimize multi-agent workflows within the Google Antigravity environment.
Upon completing this training, participants will acquire the following skills:
- Configure agent responsibilities and orchestration pipelines using the Manager interface.
- Generate and interpret Antigravity artifacts, such as task lists, plans, logs, and browser recordings.
- Implement verification strategies to ensure that agent actions remain transparent and auditable.
- Optimize collaboration among multiple agents for complex development and operational tasks.
Course Format
- Guided presentations combined with practical demonstrations.
- Scenario-based exercises addressing real-world workflow challenges.
- Hands-on experimentation within a live Antigravity workspace.
Customization Options
- For a tailored version of this course, please contact us to discuss customization possibilities.
Testing & Verifying Agent-Driven Code: Quality Assurance in Antigravity
14 Hours
Antigravity is a framework that embodies advanced agent-driven development workflows.
This instructor-led, live training (available online or onsite) is designed for intermediate to advanced professionals who wish to verify, validate, and secure the output produced by AI agents working within Antigravity-driven environments.
Upon completing this training, participants will be able to:
- Assess the accuracy and safety of agent-generated code artifacts.
- Use structured techniques to verify agent-executed tasks.
- Analyze browser recordings and trace agent activity effectively.
- Apply QA and security principles to ensure the reliability of agent workflows.
Format of the Course
- Instructor-guided technical briefings and discussions.
- Practical exercises focused on verifying real agent workflows.
- Hands-on testing and validation within a controlled lab environment.
Course Customization Options
- Adaptation of scenarios, workflows, and testing examples is available upon request.