What Site Reliability Engineering contributes to Cardinal Health
The eCommerce Site Reliability Engineering (SRE) Team is responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of eCommerce applications. Site Reliability Engineers (SREs) are expected to use a software development mindset to solve operational and platform challenges through automation.
- Demonstrates general understanding of hardware, software, and cloud platforms including but not limited to operating systems, databases, JVMs, application servers, web servers and integration technologies.
- Plans executive implementations that ensure success and minimize risk of system outages or other negative production impacts.
- Demonstrates conceptual knowledge of modern architecture standards and technologies.
- Demonstrates problem solving ability that allows for effective and timely resolution of system issues including but not limited to production outages.
- Analyzing production system operations using tools such as monitoring, capacity analysis and outage root cause analysis to identify and drive change that ensures continuous improvement in system stability and performance.
- Utilizes an ownership mindset when working on supported tools and applications.
Qualifications
- 12-14 Years of relevant experience in Java Application Development, Architecting DevOps Lifecyle
- Engineer in Computer Science / Engineering in related field. Advance Degree preferred Ideally
Capabilities What is expected of you and others at this level
- Experience with Cloud native technology and processes, including CI/CD pipelines and supporting technologies like Cloud Foundry, Concourse, dockers and Kubernetes is preferred.
- Good to have Complex SQL queries, HCL/IBM WebSphere Products like Portal, Application server, WebSphere Commerce
- Build, maintain, administer, and continuously evolve Ecommerce CI/CD capabilities, supporting tools including Kubernetes, Concourse and Spinnaker build and deployment pipelines.
- Become our internal technical subject matter expert and face of the Ecommerce Site Reliability Engineering team supporting the AEM platform, Analytics, including Search
- Hands-on experience with API gateways (APIGEE preferred)
- Applies comprehensive knowledge and a thorough understanding of concepts, principles, and technical capabilities to perform varied tasks and projects
- May contribute to the development of policies and procedures, Works on complex projects of large scope
- Create strategies, Milestone, generate general guidance on new projects
- Develops technical solutions to a wide range of difficult problems. Solutions are innovative and consistent with organization objectives
- Exposure to modern web technologies such as Angular/React, Java, Spring Boot, Node.js
- Plan and execute infrastructure projects that improve observability tools and platforms, including metrics, logging, distributed tracing, dashboarding, alerting, and application performance management.
- Strong desire to learn new technologies, solve problems through automation, and improve relentlessly.
- Experience with Cloud native technology and processes, including CI/CD pipelines and supporting technologies in Cloud Foundry, Concourse, dockers, Kubernetes and Adobe Experience Manager
- SRE /Team player to work across Technical Stack in enterprise eCommerce platforms (Medical/Corp and Pharma).
- Assist and work on other key operational objectives with other SRE Team Members and Application Development including cloud, application platforms, deployment pipelines, monitoring, alerting, reporting, change management.
- Key player to help in Application Restoration process in Service Restoration Team conversation.
- Working experience in or strong understating of DevSecOps and Agile development processes
Official notification