Senior Site Reliability Engineer (NM+)
adp | 132 days ago | Chennai

Qualifications you'll need:

Education: Bachelor's degree

Experience:

  • Expert in Windows systems administration – system design, upgrades, migrations, patching, maintenance, application deployment, capacity planning, etc.
  • Extensive hands-on experience in Citrix administration and management
  • Hands on experience in Active Directory administration and management, DHCP, DNS, File Shares, NFS.
  • Good experience with one of the scripting tools like Python, Bash, PowerShell or similar.
  • Exposure to one of the CI/CD tools like Jenkins, TeamCity, etc. for continuous integration, continuous delivery.

Technical skills (preferred – must have at least one of the below)

  • Knowledge on Linux server administration
  • Experience with cloud-based Infrastructure including plan, design, setup, migration to and ongoing support for such environments.
  • AWS services experience – VPC, S3, Lambda, ALB, Route53, API Gateway, EC2, EBS, AWS FSX, Eventbridge, Oracle RDS, AWS Backup, Cloudwatch, etc.
  • Experience in Infrastructure As a Code (IaaC), Cloud formation, Ansible (or similar) platforms to automate infrastructure deployments and configuration.
  • Experience with Docker, and deployment of the Docker applications in Kubernetes, Docker swarm or similar.
  • Administration/operations experience with Oracle, Postgres, Mongo database technologies.
  • Experience in monitoring and logging tools Dynatrace, Prometheus/Grafana, Splunk.
  • Consistent application of good source code and configuration management practices

Other skills

  • Experience with agile product teams developing and supporting solutions in production
  • Experience with leading incident response and conducting post-mortems/root cause analysis.
  • Strong problem solving and troubleshooting skills.

Your contribution will include (but not limited to):

  • Support services before they go live through activities such as system design consulting, automating environment provisioning and configuration management, capacity planning and operational readiness reviews.
  • Drive full automation of deployment pipelines for both platform and application changes
  • Monitor and measure availability, latency and overall system health across infrastructure, database, container and application levels.
  • Apply infrastructure as code practices, via use of tools such as git/Bitbucket, Jenkins, Ansible or similar.
  • Ensure systems are kept up to date and are compliant with ADP security standards
  • Scale systems sustainably through automation to improve reliability and velocity.
  • Lead on-call support and incident management, blameless post-mortems, and continuous improvement activities
  • Proactively ensure the highest levels of systems and infrastructure availability.
Official notification
Contact US

Let's work laptop charging together

Any question or remark? just write us a message

Send a message

If you would like to discuss anything related to payment, account, licensing,
partnerships, or have pre-sales questions, you’re at the right place.