Site Reliability Engineer (NM+)
Natwest | 72 days ago | Gurugram

Your role will also involve:

 

  • Anchor & provide strategic direction regarding technologies & solutions in Digital operations. Lead infrastructure & application builds & technical maintenance along with the core engineering & delivery teams.
  • Act as custodian of SRE SLOs, SLIs and error budgets.
  • Application scalability & optimization: Assist in designing and implementing scalable, highly available system architectures to handle increasing loads and user demands without compromising performance.
  • Creating and optimizing CI/CD pipelines to automate testing and deployment processes, reducing the time from development to production and ensuring consistent quality control.
  • Designing, monitoring and responding to system alerts; monitoring system performance, identifying bottlenecks, and implementing optimizations and permanent fixes.
  • Managing incident response protocols, including on-call rotations.
  • Conducting post-incident reviews to prevent recurrence and refine the system reliability framework.
  • Provide primary operational support and engineering for multiple large-scale distributed software applications.
  • Collaborate with development operations staff to create, monitor, and troubleshoot the system infrastructure.
  • Increase system resilience and serve larger customer volumes with expert-level coding, bulletproof release, and change management skills.
  • Improve automation and increase the system’s self-healing capability.
  • Collect operating system data and report performance metrics to stakeholders.
  • Manage cloud and database system maintenance, debugging production issues as they arise.
  • Ensuring the effective and seamless integration of security policies and practices into DevOps workflows to reduce overall risk and deliver products and services on time.
  • Implement end-to-end automated VAPT (vulnerability assessment and penetration testing) for any new or existing application.
  • Reduce planned deployment downtime by 50% through a robust CI/CD setup.
  • Bring MTTR (mean time to recovery) below 2 hours for any major issue and MTTD (mean time to detect) below 5 minutes, with the help of automated tools and methods.

The skills you'll need

 

We’re looking for someone with technical knowledge and experience spanning platforms, technologies, products and domains. Along with this, you’ll bring 7+ years of strong experience in DevSecOps and SRE, including production support. This is an individual contributor role, so you must be capable of running independent POCs and working with cross-functional departments, and you’ll need the technical skills below.

 

You'll also have the ability to communicate at all levels, proven experience in managing large-scale distributed systems, and an understanding of the principles of scalability and reliability. You'll take ownership of DevOps DORA metrics and of SRE toil reduction through automation.

 

We’re also looking for:

 

  • Experience with security tooling such as SAST, DAST and container security.

  • Understanding of Node.js, React.js, Java, Oracle and IDMC.

  • Experience in Infrastructure as Code tools such as Terraform and CloudFormation, and in container technologies such as Docker, Kubernetes and OpenShift.

  • Must have knowledge of DevSecOps tools such as Git, Maven, Selenium, Jenkins, Ansible and security tools; any one of the monitoring tools Geneos, Nagios, Prometheus, DynaTrace, AppDynamics, DX-APM or Splunk; and scripting in UNIX shell (Python, Groovy and YAML are good to have).

  • Experience with and understanding of at least one cloud provider, such as AWS or Azure.

  • On-demand infrastructure provisioning, including environment spin-offs and environment cloning (EKS, IaC).

  • Hands-on working knowledge of configuring SLAs, SLOs and SLIs, plus infrastructure and business rules/logic, in tools such as AppDynamics, AWS CloudWatch, Pingdom, Datadog and Tivoli (preferably APM tools).

  • Understanding of network protocols, load balancing, and firewall management for secure and efficient network operations.

     
