Site Reliability Engineer / Automation Engineer
Location Madrid
Hybrid working (2 days per week in office)
Our client is global consultancy that works globally but operates locally. They are currently recruiting for an experienced Automation Engineer to work in their small but perfectly formed team.
The SRE Automation Engineer is an experienced engineer with strong coding skills, in charge of implementing, maintaining and supporting solutions for infrastructure integration based in software, in order to meet business requirements.
The required solutions might involve the integration and customisation of a wide range of infrastructure components, ranging from private cloud clusters (for instance OpenStack) to VPSs hosted in different cloud providers, networks components (both physical and virtualised), cloud-native services deployed on Kubernetes, and many more.
The role also will collaborate closely with the Architecture and Engineering layers in the design of those solutions, according to the business requirements, company guidelines and best practices. Additionally, you will be in charge of creating and maintaining Standard Operational Procedures (SOPs) for Tiers 1 and 2 of support, as well as participating in ROTA for on-call tier 3 support of the developed services.
You will be with an array of clients, platforms and applications.
Responsibilities:
* Implement, maintain and support solutions for infrastructure integration based in software in order to meet business requirements.
* Collaborate with the Architecture and Engineering layers in the design of those solutions according to the business requirements, company guidelines and best practices
* Create and maintain Standard Operational Procedures (SOPs) for Tiers 1 and 2 of support.
* Participate in ROTA for on-call tier 3 support.
* Take ownership of the projects assigned and strive to drive the project goals forward.
* Effectively share information to collaborate with teammates and to keep everyone up to date.
* Communicate risks and blockers using the appropriate channels and take initiative to remediate them.
* Actively contribute to the definition of the ways of working of the team and honor them.
* When tasks are large, work on scoping into smaller actionable parts and prioritize to deliver tasks effectively.
* Actively participate in Engineering efforts and discussions to:
* Refine and improve the team’s products
* Strive to reduce technical debt
* Improve the efficiency of the team’s processes
* Complete tasks assigned to them based on standards, with the agreed technical approach and within agreed timelines.
Requirements:
* At least 5 years of experience developing applications using one or more general programming languages utilizing git-based source control management tools
* Knowledgeable in Javascript and Python programming languages, including the corresponding ecosystem and best practices.
* Back-end skills (multithreading, OOP, etc.)
* Understanding of web based applications (HTTP protocol, Web Servers, Caching, etc.)
* Good testing practices (unit, component and E2E testing)
* Skilled in publishing and consuming services (REST, web services, etc)
* Confident in the use of Linux, shell scripting, Git, and other open-source development tooling.
* Skilled in authoring and using containers, including building new container images.
* Experienced including observability in the developed applications (metrics, logs, tracing).
* Skilled in Ansible (creating and evolving Ansible playbooks, roles and custom modules).
* Experienced with deploying and troubleshooting applications in Kubernetes.
* Experience providing on-call support.
* Experience working with critical systems that require high availability levels and support high loads.
* Experience in standardization and systems automation.
* Experience working with standard procedures, change control and under Agile frameworks.
* Extremely organized with a strong attention to detail.
* Ability to work well under pressure.
* Demonstrated ability to manage multiple tasks and competing priorities.
* Great communication, presentation, interpersonal and teamwork skills.
* Must be able to communicate fluently in written and spoken English
* Bonus points:
* Familiar with haproxy, Varnish, CDNs and other load balancing and caching solutions.
* Familiar with k3s.
* Familiar with Cilium and Cilium Cluster Mesh.
* Familiar with OpenStack.
* Familiar with ArgoCD.
If you would like to have a confidential chat about the vacancy please do not hesitate in applying