Job Description
We’re PayRetailers, and we offer cutting‑edge payment solutions that empower businesses to succeed in Latin America & Africa. Our collaborative and inclusive work environment encourages creativity and growth, where every employee’s contribution is valued.
Our Technology team is looking for a new Site Reliability Engineer in São Paulo to help us expand into new markets and make a meaningful impact in the world of payments.
About the Role
Site Reliability Engineers are the guardians of our reliability promise. They deliver a highly reliable, resilient, and cost‑efficient platform that consistently meets business and customer expectations for availability and performance.
Job Requirements
* Proactive attitude, always on the lookout for improvement opportunities.
* Expert knowledge of Grafana, Application Insights, OpenTelemetry, and Prometheus.
* Experience with non‑functional and production testing.
* Analytical mindset, able to connect the dots and establish cause and effect.
* Software engineering skills in .NET 8 (C#), Go, Java, or TypeScript.
* Expert‑level experience with containers and container orchestration platforms.
* Understanding of APIs and asynchronous distributed software architectures.
* Working knowledge of AI‑enabled tools such as VS Code and Claude Code.
* Demonstrable experience applying AI to Site Reliability Engineering.
* Knowledge of process automation tools such as n8n.
* Knowledge of Azure and AWS.
* Working experience with chaos engineering.
Job Responsibilities
* Increase automation of operational activities to reduce downtime risk, in collaboration with Platform Engineering and Domain Squads.
* Drive systemic improvements across engineering teams based on incident RCAs and telemetry insights.
* Implement non‑functional improvements (resilience, performance, reliability) directly in code, with Domain Squads reviewing and approving changes.
* Promote adoption of SRE best practices across development teams (integration patterns, monitoring, alerting, real‑time tracing).
* Provide cross‑platform observability capabilities above and beyond what the Domain Squads provide.
* Investigate issues and incidents and propose/implement changes as deemed necessary.
* Continuously review logs, metrics, and alerts to identify and implement improvements.
* Design non‑functional tests and run them continuously to verify quality at every stage, up to and including production.
* Create dashboards and alerts in Grafana to guarantee the observability of the platform.
Job Benefits
* CAJU Flex Card (meal voucher): R$880.00 per month.
* Health insurance with co‑payment for the employee.
* Dental insurance without co‑payment for the employee.
* Personal life insurance without payroll deduction.
* Wellhub/Gympass access.
* Three days off per year, following internal policy guidelines.
* Parking allowance for commuting to the São Paulo/SP office.
* Childcare allowance of R$474.50 per month for parents with children up to 1 year old who are enrolled in a private school or daycare.
Equal Employment Opportunity
At PayRetailers, diversity, equity, and inclusion aren’t just values – they’re fundamental to who we are. We’re dedicated to fostering an environment where every individual feels valued, respected, and empowered to bring their authentic selves to work. We welcome applicants from all backgrounds and identities, recognizing that diversity drives innovation and strengthens our team.
Please feel free to include your pronouns in your application (e.g., she/her, he/him, they/them, etc.).