Responsible for designing and developing a cutting-edge AI and Generative AI infrastructure on the AWS Cloud platform and in colocation (COLO) facilities, tailored to pharmaceutical business use cases. The platform will support biomedical research scientists and other business users in early molecule development and other research activities by providing robust, scalable, and secure computing resources.
- Architect and Design: Lead the design and architecture of a GPU-based AI infrastructure platform, with a focus on supporting Generative AI workloads and advanced analytics for pharma business use cases such as BioNeMo, AlphaFold, ESMFold, OpenFold, ProtGPT2, and the NVIDIA Clara suite.
- Platform Development: Work with biomedical research scientists to develop and implement technical solutions for MLOps (Run:AI) hosted on a Kubernetes (Amazon EKS) cluster.
- Data Management: Oversee the design and implementation of data storage, retrieval, and processing pipelines, ensuring the efficient handling of large datasets, including genomics and chemical compound data.
- Security and Compliance: In collaboration with cloud domain security architects, implement robust security measures for the multi-cloud environment and ensure compliance with relevant industry standards, particularly in handling business-sensitive data.
- Collaboration: Work closely with biomedical research scientists, data scientists, and other business stakeholders to understand their needs and translate them into technical solutions.
- Performance Optimization: Optimize the performance and cost-efficiency of the platform, including monitoring and scaling resources as needed.
- Innovation: Stay updated with the latest trends and technologies in AI and cloud infrastructure, continuously exploring new ways to enhance the platform's capabilities.
Additional Specifications Required for the Position:
- Bachelor’s degree in Information Technology, Computer Science, or Engineering.
- AWS Certified Solutions Architect - Professional certification.
- 8+ years of strong, hands-on technical experience delivering infrastructure and platform services across geographic and business boundaries.
- Experience working on GPU-based AI infrastructure. Experience with NVIDIA DGX infrastructure is highly preferred.
- Deep understanding of the architecture and design of Platform Engineering products, with a focus on data science, MLOps, and bioscience or pharma Gen AI foundational models. Experience with NVIDIA BioNeMo or Clara is highly preferred.
- Extensive experience building infrastructure solutions on AWS, particularly with services such as Amazon Bedrock, Amazon Q, SageMaker, and ECS/EKS.
- Knowledge of containerization and orchestration technologies, such as Docker and Kubernetes.
- Experience with DevOps practices and tools, including CI/CD pipelines, infrastructure as code (IaC), and monitoring solutions.
- Excellent skills in collaborating with business users and the Product team, operationalizing delivered products, and working closely with Security to implement compliance.
- Good knowledge of implementing a well-defined, industry-standard change management process for the platform and its products, maintaining a well-structured use-case onboarding process, and ensuring that platform products and implementations are documented.
- Experience with DevOps orchestration, configuration management, and continuous integration technologies.
- Good understanding of high availability and disaster recovery concepts for infrastructure.
- Ability to analyze and resolve complex infrastructure resource and application deployment issues.