Roles and Responsibilities
Google Cloud Platform (GCP) Management:
- Manage and maintain infrastructure services hosted on GCP.
- Optimize GCP resources for performance, scalability, and cost efficiency.
- Implement GCP IAM policies, security best practices, and resource governance.
Kubernetes (GKE) Administration:
- Deploy, monitor, and maintain containerized applications in Google Kubernetes Engine (GKE).
- Implement best practices for Kubernetes clusters, including resource allocation, scaling, and reliability.
- Troubleshoot issues with deployments, networking, and pods.
Cloudflare Management:
- Oversee Cloudflare configurations for DNS management, security, and CDN optimization.
- Optimize website performance and security settings through Cloudflare.
Heroku Services:
- Manage and monitor services hosted on Heroku.
- Ensure proper scaling, uptime, and cost optimization for applications.
Monitoring and Incident Management:
- Set up and manage monitoring tools (e.g., Prometheus, Grafana, Stackdriver (now Google Cloud Monitoring), or Datadog) to ensure system health and performance.
- Respond to incidents promptly, perform root cause analysis (RCA), and implement preventive measures.
- Document incident management processes and write incident reports.
Cost Management:
- Monitor and optimize cloud costs across GCP, Kubernetes, and Heroku environments.
- Proactively identify opportunities to reduce operational expenses without compromising performance.
On-Call Rotation:
- Participate in a rotational on-call schedule to address incidents, system outages, and performance issues.
- Ensure quick resolution and follow escalation processes as needed.
Collaboration and Documentation:
- Collaborate with development, operations, and product teams to improve deployment pipelines and workflows.
- Create and maintain detailed technical documentation for configurations, processes, and incident reports.
Key Skills and Qualifications
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- 8+ years of experience as a DevOps Engineer, Cloud Engineer, or in a related role.
- Strong hands-on experience with Google Cloud Platform (GCP) services.
- Proven experience managing applications in Kubernetes (GKE).