We are looking for a talented DevOps Engineer with a strong Site Reliability Engineering (SRE) background to join our engineering and infrastructure team. The ideal candidate will be responsible for improving system reliability, scalability, performance, and security across cloud and on-premises environments. You will work closely with software engineers, QA, and operations to implement DevOps practices and deliver stable, high-performance systems.
Mandatory Skill(s)
- Must have 4+ years of experience in DevOps and Site Reliability Engineering;
- Must have experience in automating infrastructure provisioning, configuration management, and CI/CD pipeline implementation;
- Must have hands-on experience with containerization and orchestration technologies such as Docker and Kubernetes;
- Experience with monitoring and observability tools such as Prometheus, Grafana, ELK, or Datadog;
- Familiarity with Infrastructure as Code (IaC) using tools like Terraform, Ansible, or Helm;
- Proficient in scripting or programming languages such as Bash, Python, or Go;
- Solid understanding of cloud platforms (AWS, Azure, or GCP);
- Knowledge of system performance tuning and incident response.
Desirable Skill(s)
- Has experience implementing SRE best practices (SLIs, SLOs, error budgets, chaos engineering);
- Has worked with service mesh technologies (e.g., Istio, Linkerd);
- Understands GitOps workflows using tools like ArgoCD or Flux.
Responsibilities
- Ensure high availability, performance, and scalability of production systems;
- Design and maintain CI/CD pipelines, automated deployments, and rollback strategies;
- Implement observability and monitoring solutions to proactively detect and resolve issues;
- Collaborate with engineering teams to improve reliability and service ownership;
- Drive incident management processes, perform root cause analysis, and implement postmortems;
- Apply DevOps and SRE best practices to reduce toil and improve mean time to recovery (MTTR);
- Build self-healing infrastructure and automate operational processes;
- Participate in on-call rotations and provide support for production systems.
If you are interested in this role, click on the “Apply to this job” button below or you could also write in with your CV to Kiran Kumar Pandity at kiran.kp@sciente.com quoting the job title.
