AWS SRE (NM+)
cgi | 196 days ago | Bangalore

AWS -Site Reliability Engineer (SRE) is responsible for maintaining and improving the operational efficiency of systems.Some of their responsibilities include:

• System optimization: Designing and implementing systems that can handle increasing loads and user demands without compromising performance
• Automation: Automating processes and tasks through tools and DevOps solutions
• Dev: Developing and maintaining CI/CD pipelines, enhancing the consistency and speed of software deployment
• Monitoring and alerting: Setting up monitoring tools and alerts to detect potential issues and take action before they affect users
• Incident response: Documenting incidents, understanding root causes, and implementing preventive actions – available for P1/P2s & any major incident management issues.
• Performance metrics: Developing and maintaining metrics to monitor system and application performance
• Disaster recovery: Developing and testing plans to ensure data integrity and system resilience
• Cloud migration: Performing risk analysis and identifying mitigation plans and
• Collaboration: Working with other teams to ensure system and application reliability
• Research and evaluation: Researching and evaluating new technologies and tools to improve system reliability

The ideal candidate will have robust problem-solving skills and a strong desire to implement scalable and sustainable technological solutions. Some projects this role will work on include:

• Scalability projects: Designing and implementing scalable, highly available system architectures to handle increasing loads and user demands without compromising performance.
• Continuous integration/continuous deployment (CI/CD) pipelines: Creating and optimizing CI/CD pipelines to automate testing and deployment processes, reducing the time from development to production and ensuring consistent quality control.
• Disaster recovery planning: Developing and testing disaster recovery plans to guarantee data integrity, system resilience, and swift restoration of services in case of critical incidents.

Skills
Kubernetes
Terraform
Networking
CI/CD
DevOps

Official notification
Contact US

Let's work laptop charging together

Any question or remark? just write us a message

Send a message

If you would like to discuss anything related to payment, account, licensing,
partnerships, or have pre-sales questions, you’re at the right place.