Manager, HPC Cloud Engineer (3+)
msd | 112 days ago | Hyderabad

The Opportunity

  • Based in Hyderabad, join a global healthcare biopharma company and be part of a 130-year legacy of success backed by ethical integrity, forward momentum, and an inspiring mission to achieve new milestones in global healthcare.

  • Be part of an organisation driven by digital technology and data-backed approaches that support a diversified portfolio of prescription medicines, vaccines, and animal health products.

  • Drive innovation and execution excellence. Be part of a team with a passion for using data, analytics, and insights to drive decision-making, and which builds custom software that allows us to tackle some of the world's greatest health threats.

 

Our Technology Centers focus on creating a space where teams can come together to deliver business solutions that save and improve lives. An integral part of our company’s IT operating model, Tech Centers are globally distributed locations where each IT division has employees to enable our digital transformation journey and drive business outcomes. These locations, in addition to the other sites, are essential to supporting our business and strategy.

 

A focused group of leaders in each Tech Center helps to ensure we can manage and improve each location, from investing in the growth, success, and well-being of our people, to making sure colleagues from each IT division feel a sense of belonging, to managing critical emergencies. Together, we must leverage the strength of our team to collaborate globally, optimize connections, and share best practices across the Tech Centers.

 

Role Overview

 

As Manager of HPC Cloud Engineering, you will modernize, transform, and maintain HPC solutions in the cloud using IaC, CI/CD, job schedulers, researcher pipelines, and cloud-native services. You will automate low-value tasks so that scientists can focus on high-value work, leverage elastic cloud resources to accelerate workflows, and optimize HPC performance and cloud costs. You will collaborate with senior HPC SMEs, Application Support Engineers, and research stakeholders to reduce cycle times, boost efficiency, and enhance the user experience, increasing researcher capacity.

Ultimately, you will play a key role in implementing scalable and efficient solutions on AWS to support compute-intensive applications.

 

What you will do in this role:

  • Deploy and manage high-performance computing clusters, platforms, and AWS managed services in a DevOps environment.

  • Monitor and maintain the performance of HPC resources, ensuring high availability and reliability.

  • Optimize compute and data workflows for performance and cost-efficiency in cloud-based HPC environments.

  • Create scripts and tools for automating cluster deployment, monitoring, and management (e.g., using CloudFormation, Terraform, Ansible, or the AWS CLI).

  • Analyze and tune the HPC cloud environment, including network performance and parallel file systems.

  • Diagnose and resolve issues in complex HPC environments, including job scheduling, software, and cloud infrastructure.

  • Document processes, configurations, and best practices for HPC workloads in AWS and Linux environments.

  • Work within a matrix organizational structure, reporting to both the functional manager and the Hyderabad-based Director, and collaborating with other teams.

  • Participate in project planning, execution, and delivery, ensuring alignment with both functional and project goals.

 

What you should have:

  • Bachelor’s degree in Computer Science, Engineering, or a related field (with relevant experience).

  • 3+ years of hands-on experience with Linux operating systems, shell scripting, and HPC architecture in a DevOps environment.

  • Experience with job schedulers and HPC workload managers such as Slurm, Grid Engine, or PBS Pro, and with cluster tooling such as AWS ParallelCluster.

  • Knowledge of network architectures and high-speed interconnects.

  • Strong analytical skills to assess and interpret complex data.

  • Excellent verbal and written communication skills to convey findings and recommendations.

  • Strong problem-solving skills.

  • Proactive approach to identifying and addressing potential risks.

  • Meticulous attention to detail to ensure accurate assessments and reporting.

  • Understanding of relevant regulations and compliance standards.
