Topic:
Site Reliability Engineer Training
Training Details:
What will I Learn by the end of this course?
- You can start playing the SRE professional role, managing tasks for the Development and Operations functions.
- You will be able to provide cloud architectural solutions indispensable to maintaining the reliability of the system
- You can set up a Monitoring solution and an efficient Alerting Mechanism required for the Production environment.
- You will be able to perform infrastructure automation and other mundane tasks and eliminate toil
- You will be able to get a perfect view of new performance engineering trends – DevOps, cloud and resiliency
- You will be able to confidently talk to developers & architects on the production issues, remediation & recommendations

Course Duration: 50 hours
Demo Session: 16 Jun 2025
Class Timing:
For Participants in India: Every @ Monday, Wednesday & Friday, 7:30 AM – 9 AM (IST)
For Participants in US: Every @ Monday, Wednesday & Friday, 10 PM – 11:30 PM (EST)
For Participants in UK: Every @ Monday, Wednesday & Friday, 3 AM – 4:30 AM (BST)
Salient Features:
- Approximately 50 Hours of Live Training, along with recorded videos
- Lifetime access to the recorded videos
- Course Completion Certificate
Course Content:
- Cloud Computing – AWS
- Cloud Computing – Introduction
- Why Cloud Computing?
- What is Cloud Computing?
- Service Models – Iaas, PaaS, SaaS
- Deployment Models – Public, Private, Hybrid, Community
- Major Cloud Providers
- Create an AWS account
- AWS Budget Setup
- Core Cloud Concepts
- AWS Regions & AZs
- AWS Services
- Security – IAM
- Compute Services – EC2
- Provision EC2 in AWS Cloud
- Private vs Public vs Elastic IP
- Connect to your instance remotely
- Security Groups
- Launch your website
- Storage Services – EBS, EFS, S3
- Database Services: RDS – MySQL, Oracle
- Networking – VPC
- Serverless Services – Lambda, API Gateway, SNS
- AWS CloudFormaiton
- AWS Beanstalk
- Elastic Load Balancing & Auto Scaling Groups
- Cloud Monitoring – AWS CloudWatch
- Metrics
- Dashboards
- Logs
- Alarms
- Events
- AWS X-Ray – Troubleshooting performance issues
- Linux Command Line – ‘Get as comfortable with Linux OS as you are with Windows’
- Shell Scripting for Performance Monitoring (Only recorded videos provided)
- Directory Navigation
- System Information
- Installing Packages
- Hardware Information
- User Information & Management
- File & Directory Commands
- Process Management
- File Permissions
- Networking
- Archives (TAR Files)
- Search
- SSH Logins
- File Transfers
- Disk Usage
- Performance Monitoring & Stats
- Linux Performance Monitoring
- top
- uptime
- vmstat
- free
- iostat
- sar
- mpstat
- Site Reliability Engineering – Introduction
- What is Reliability of a system mean?
- What is Fault Tolerance?
- What id High Availability?
- What is Disaster Recovery
- SRE Vs DevOps (class SRE implements DevOps)
- SLIs, SLOs, SLAs, Error Budget
- Incident Management
- Change Management
- Roles and Responsibilities of SRE
- Software Architectures
- What is Monolithic?
- What is Monolithic Architecture?
- Advantages & Disadvantages
- Monolithic Architecture – The Big problem
- Why & What MicroService?
- Characteristics of MicroService Architecture
- Advantages & Disadvantages
- MicroService vs. Monolithic
- Runtime Environment for MicroServices
- Container Engine – Docker
- Containers in General
- Containers in IT
- Container Architecture
- Advantages
- Container Engines
- Docker – What and why?
- Introduction to Docker Hub
- Docker Installation
- Common Docker Operations
- Build docker images using Dockerfile
- Docker Volumes
- Install Docker-Compose
- Build and run images using ‘docker-compose.yaml’ file
- Container Orchestration Engine – Kubernetes
- What is Container Orchestration Engine (COE)?
- COE Features
- Top 3 Container Orchestration
- What is Kubernetes?
- Why Kubernetes?
- Kubernetes Architecture
- AWS ECS (Elastic Container Service)
- AWS EKS (Elastic Kubernetes Service)
- Difference between ECS and EKS
- EKS Continued (EKS Cluster Setup)
- GKE (Google Kubernetes Engine) Cluster
- What is Pod?
- Pod Deployment
- Multi-container
- Pod Networking
- Inter-Pod & Intra-Pod Communication
- Pod Lifecycle
- Services – NodePort, ClusterIP, LoadBalancer
- ReplicaSet
- CloudWatch Container Insights
- Horizontal Pod AutoScaler (HPA)
- Cluster AutoScaler
- Performance Engineering in DevOps
- Trasitional approach
- Before DevOps
- What is DevOps?
- Why DevOps?
- DevOps Lifecycle
- DevOps Phases
- DevOps Phases – CI/CD
- How Performance Engineering fits in DevOps
- Elements of Performance Engineering
- Implement Continuous Performance Testing
- What is “Shift Left” approach in Performance Testing?
- What is “Shift Right” approach in Performance Monitoring?
- Feedback Loop in the Performance Engineering Process
- What is Continuous Integration
- Jenkins Introduction
- Features of Jenkins
- Installing Jenkins on EC2 – Commands
- Jenkins Architecture
- Managing Nodes on Jenkins
- Jenkins Integration with DevOps tools – GIT, DockerHub
- What are CI/CD Pipelines
- JMeter – Get Started
- Jenkins Integration with JMeter
- Automated Feedbacks – Email notifications
- Creating Jenkins (Freestyle) Jobs
- Creating Pipeline
- Scheduling the Pipeline Jobs with Dependencies
- Introduction to AWS Services for CI/CD pipeline
- Gaining Resilience and Reliability on AWS
- AWS Global, Regional, and Zonal Architecture Design
- Amazon’s Global Storage Services – S3
- Running Resilient Databases On AWS – RDS and DynamoDB
- Fault Tolerant Computation On AWS – Lambda and EC2
- Core Resilience Principles for AWS – Load Balancing and Auto Scaling
- Using Kubernetes and EKS On AWS
- Observability – Issue Debugging, Monitoring and Alerting
- Linux Monitoring
- Cloud Watch
- Prometheus and Grafana
- ELK (Elastic Search, Log Stash, Kibana)
- Troubleshooting Performance Bottlenecks (8 hours Java Performance Engineering Self-paced videos will be provided)
- JMeter – Performance Testing tool overview
- Java Performance Tuning
- Database Performance
- Eliminating toil and Automation
- Infrastructure Automation using Terraform
- Configuration Management using Ansible
- CI/CD Pipeline using Jenkins
- Automating Mundane tasks using Shell-Scripting and Python Scripting
- Shell Scripting (3+ hours Self-paced videos will be provided)
- Directory Navigation
- System Information
- Installing Packages
- Hardware Information
- User Information & Management
- File & Directory Commands
- Process Management
- File Permissions
- Networking
- Archives (TAR Files)
- Search
- SSH Logins
- File Transfers
- Disk Usage
- Core Java (32+ hours Self-paced videos will be provided)
- Installation of jdk , setting up environment variables and eclipse
- Java – Classes/objects/methods/ packages
- Java programs for creating class/objects/packages
- Basics of Strings & arrays
- Java Programs with strings and arrays
- Single Thread Vs Multi Threading concepts
- Java – File handling and sample programs
- Installing mysql database and integration with java program
- Basics of html
- Basic of JSP
- Basics of Servlets
- Compilation and creating jar/war file
- Installation of Tomcat
- Deployment of Java application
- Python (30+ hours Self-paced videos will be provided)
- An Introduction to Python
- What can Python do?
- Why Python?
- Python Syntax compared to other programming languages
- Python and Anaconda Installation
- Data Types and Operations
- Prints and Comments
- Numbers, Strings, List
- Tuple, Dictionary
- String Operations in Python
- Simple Input & Output
- Simple Output Formatting
- Operators in python
- Python Program Flow
- Indentation
- The If statement and its’ related statement
- An example with if and it’s related statement
- The while loop
- The for loop
- The range statement
- Break &Continue
- Assert
- Examples for looping
- Functions& Modules
- Create your own functions
- Functions Parameters
- Variable Arguments
- Scope of a Function
- Function Documentations
- Lambda Functions & map
- Exercise with functions
- Create a Module
- Standard Modules
- Exceptions Handling
- Errors
- Exception handling with try
- handling Multiple Exceptions
- Writing your own Exception
- File Handling
- File handling Modes
- Reading Files
- Writing & Appending to Files
- Handling File Exceptions
- The with statement
- Classes in Python
- New Style Classes
- Creating Classes
- Instance Methods
- Inheritance
- Polymorphism
- Exception Classes & Custom Exceptions
- Generators and iterators
- Iterators
- Generators
- The Functions any and all
- With Statement
- Data Compression
- Data Structures
- List Comprehensions
- Nested List Comprehensions
- Dictionary Comprehensions
- Functions
- Default Parameters
- Variable Arguments
- Specialized Sorts
- Collections
- namedtuple()
- ChainMap
- Counter
- OrderedDict
- defaultdict
- UserDict
- UserList
- UserString
- Data Modification and Analysis with Pandas
- Defining Pandas
- Pandas – Creating and Manipulating Data
- How to Create Data Frames?
- Importance of Grouping and Sorting
- Plotting Data
- Selenium Python Basics
- Introduction to Selenium Webdriver
- Basic Navigation using selenium on website
- Automation tips
How to register?
Contact Details: +91 91331 90573 / 8977922427 (WhatsApp/Regular Call)
Organized By: Isha Training Solutions
Disclaimer: PerfMatrix does not conduct any classroom or online training/classes. The training/demo details given on this page are either taken from the social media source or provided by the individual trainer or institute. Request you to please verify the authenticity of the training (and trainer) before providing your details. PerfMatrix will not take any responsibility in case of false knowledge, waste of time, or financial losses.