Site Reliability Engineer at SixteenFifty, England, £350 per day

Duration: not stated
Sector: Information Technology

Contract Description

My client are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join their growing team. The ideal candidate will have hands-on experience creating and managing YAML pipelines, automating tasks with Python, and deploying and managing containerized applications on Kubernetes.


You will be responsible for ensuring the reliability, scalability, and performance of our production systems while also improving the efficiency of our development pipelines and operations.


The role pays £350 per day, is outside IR35, and is fully remote.


Key Responsibilities of the Site Reliability Engineer


  • Build and Maintain CI/CD Pipelines: Design, implement, and optimize YAML-based pipelines for continuous integration and continuous deployment (CI/CD) across various environments.
  • Automation & Scripting: Write Python scripts to automate operational tasks, monitor system health, and improve development workflows.
  • Kubernetes Management: Deploy, manage, and scale containerized applications using Kubernetes. Ensure optimal performance, security, and reliability in production and staging environments.
  • Monitoring and Incident Management: Develop and implement monitoring systems to ensure the uptime, reliability, and performance of production services. Participate in incident response and troubleshooting.
  • System Scalability and Performance Tuning: Identify and address performance bottlenecks and scalability challenges in both infrastructure and applications.
  • Collaboration & Mentoring: Work closely with software engineers to ensure smooth deployment cycles, high availability, and performance standards. Mentor junior engineers on SRE practices, Kubernetes usage, and automation techniques.
  • Infrastructure as Code (IaC): Use tools such as Terraform, Ansible, or CloudFormation to manage and provision infrastructure resources in cloud environments.
  • Continuous Improvement: Continuously improve system reliability and performance through proactive monitoring, testing, and implementing best practices in DevOps and SRE methodologies.


Required Skills and Experience:


  • Experience with CI/CD Pipelines: Hands-on experience with YAML-based pipeline configuration and CI/CD tools such as Jenkins, GitLab CI, or Azure DevOps.
  • Proficiency in Python: Strong experience writing Python scripts to automate tasks, manage infrastructure, or integrate with APIs. Understanding of Python libraries and frameworks relevant to system automation (eg, requests, boto3, flask).
  • Kubernetes: Extensive experience with Kubernetes clusters, including container orchestration, deployment strategies, Helm charts, and managing services in a cloud-native environment.
  • Cloud Infrastructure: Experience with cloud platforms (eg, AWS, GCP, Azure) and cloud-native tools, including managing and deploying infrastructure.
  • Monitoring & Observability: Familiarity with monitoring tools (eg, Prometheus, Grafana, ELK stack, Datadog) and logging systems to monitor the health of applications and infrastructure.
  • Linux/Unix Systems: Solid experience with Linux-based systems and troubleshooting system performance issues.
  • Version Control Systems: Proficient in Git-based version control systems and workflows.


Preferred Skills:


  • Infrastructure as Code (IaC): Experience using tools like Terraform, Ansible, or CloudFormation for provisioning cloud resources.
  • Containerization & Orchestration: Knowledge of Docker and Kubernetes beyond basic usage, such as setting up multi-cluster environments or creating custom Helm charts.
  • Cloud Platforms: Familiarity with multi-cloud environments (AWS, GCP, Azure) and using cloud-native services for deployment and monitoring.
  • SRE/DevOps Certifications: Certifications like Certified Kubernetes Administrator (CKA), AWS Certified DevOps Engineer, or other relevant DevOps certifications are a plus.
  • Networking: Understanding of network protocols and security practices (eg, TCP/IP, HTTP, SSL/TLS, VPN, DNS).