Staff DevOps - 1 (7+)
coindcx | 93 days ago | Bengaluru

You need to be a HODLer of these

  • 7+ years of hands-on experience in DevOps/SRE with a deep focus on Kubernetes cluster design, cloud-native application deployment, and ultra low latency systems.
  • Extensive experience with Kubernetes, container orchestration, and advanced networking (e.g., custom resource definitions, operators, service meshes).
  • Proven expertise in ultra low latency network design, including direct cloud interconnects, low-latency load balancing, and optimization of data paths.
  • In-depth understanding of security best practices in cloud environments, including container security, encryption, and access control.
  • Proficiency with CI/CD, GitOps, and Infrastructure-as-Code tools (Terraform, CloudFormation, Pulumi).
  • Strong programming and scripting skills (Python, Go) for automation and tooling.
  • Proven experience in senior or leadership roles, mentoring and guiding teams through complex technical challenges.
  • Excellent problem-solving, analytical, and communication skills, with the ability to drive consensus across diverse teams.
  • Experience in architecting solutions that balance ultra low latency performance with robust security and operational stability.
  • A strong background in building and managing cloud-native, high-performance systems in demanding environments.
  • Prior experience in industries that require ultra low latency systems (e.g., finance, trading) is highly desirable

 

You will be mining through these tasks

  • Kubernetes Architecture & Operations:
  • Design, deploy, and manage high-availability, scalable Kubernetes clusters (EKS, GKE, or AKS) powering production-grade applications.
  • Optimize cluster performance with advanced scheduling, resource management, and autoscaling techniques.
  • Drive best practices for container orchestration, network policies, persistent storage, and service mesh integration (Istio/Linkerd).
  • Open Application Model (OAM) Implementation:
  • Champion and implement OAM to standardize and simplify the deployment of cloud-native applications in a declarative, platform-agnostic manner.
  • Develop and refine OAM component and trait definitions that support rapid application updates and portability.
  • Integrate OAM with GitOps workflows and CI/CD pipelines to enable seamless, automated deployments.
  • Ultra Low Latency Network & Systems Design:
  • Architect and optimize ultra low latency network topologies for data centers and cloud infrastructures, focusing on minimizing network hops, optimizing routing paths, and leveraging specialized load balancing solutions.
  • Collaborate with network engineers to implement technologies such as AWS Global Accelerator, low-latency load balancers, direct connect solutions.
  • Design systems that prioritize real-time data processing and response times, ensuring that microservices, APIs, and data pipelines meet ultra low latency requirements.
  • Evaluate and integrate hardware accelerators and specialized networking protocols when needed to achieve minimal latency.
  • Security Best Practices & Compliance:
  • Implement and enforce robust security measures across the entire infrastructure, including container and network security best practices, encryption (in transit and at rest), and secure configuration management.
  • Develop and maintain strict access control policies using RBAC, network segmentation, and automated compliance checks.
  • Collaborate with security teams to conduct regular vulnerability assessments, penetration tests, and audits, ensuring adherence to industry standards and regulatory requirements.
  • Integrate security into the CI/CD pipeline (DevSecOps) to identify and remediate risks early in the development lifecycle.
  • CI/CD, GitOps & Infrastructure Automation:
  • Lead the design, development, and optimization of CI/CD pipelines using Kubernetes-native tools (ArgoCD, GitHub Actions) to ensure rapid, reliable deployments.
  • Drive Infrastructure-as-Code (IaC) initiatives using Terraform, CloudFormation, and Pulumi, ensuring consistent, automated, and reproducible infrastructure deployments.
  • Advocate and implement GitOps best practices to manage Kubernetes configurations and application deployments.
  • Observability, Monitoring & Incident Response:
  • Develop comprehensive monitoring, logging, and alerting systems (using Prometheus, Grafana, ELK, Datadog, etc.) that provide deep insights into system performance, including detailed latency metrics.
  • Establish and refine SLOs/SLIs for ultra Official notification
Contact US

Let's work laptop charging together

Any question or remark? just write us a message

Send a message

If you would like to discuss anything related to payment, account, licensing,
partnerships, or have pre-sales questions, you’re at the right place.