Site Reliability Engineer (5+)
hpe | 15 days ago | Bangalore

What you’ll do:

  • Enable SRE support and monitoring for HPE Networking SASE products to ensure that applications are running as per their requirements.
  • Create strategies to detect issues, address those issues, and design systems to troubleshoot automatically using tools like Prometheus, Grafana, or Datadog.
  • Ensure high availability and performance of cloud-based applications and services.
  • Design, implement, and maintain scalable infrastructure using Infrastructure as Code (IaC) tools such as Terraform or CloudFormation.
  • Collaborate with development teams to improve application performance and reliability from design through production.
  • Gain insights from the data fetched from monitoring tools to enhance the product's performance.
  • Drive automation for deployment, monitoring, scaling, and incident response.
  • Manage and optimize Kubernetes clusters and containerized applications.
  • Define and implement SLOs/SLIs and continuously improve observability and monitoring practices.
  • Lead and participate in incident management and root cause analysis to prevent recurrence.

 

What you need to bring:

  • Bachelor's or Master’s degree in Computer Science, Information Systems, or equivalent.
  • 4-7 years of overall experience in DevOps or SRE.
  • 5+ years programming experience in Python is a must.
  • 5+ years of experience in developing Cloud native applications using Kubernetes, Helm, or Docker container environments is a must.
  • Expertise in automation and CI-CD pipeline tools like Terraform, Ansible, Jenkins, and/or Git is a must.
  • Expertise in monitoring tools like Grafana, Datadog, or Prometheus is a must.
  • Experience in developing, deploying, and maintaining applications for Public Cloud environments (AWS, Azure, GCP, etc).
  • Knowledge of networking protocols and concepts such as routing, TCP/IP, BGP, OSPF/ISIS, NetFlow, SNMP, and Internet Traffic Engineering techniques.
  • Good communication skills, written and verbal, along with ability to communicate complex procedures.
  • A desire to constantly grow and learn new skills. In this team you will be exposed to new technologies and new problems. Ability to assimilate new ideas and tackle tough problems as they arise will be the key to success.
Official notification
Contact US

Let's work laptop charging together

Any question or remark? just write us a message

Send a message

If you would like to discuss anything related to payment, account, licensing,
partnerships, or have pre-sales questions, you’re at the right place.