In order to cultivate relationships and promote team spirit, we require your presence in the office for a minimum of 8 days per month.
As a NoSQL Database Platform Engineer, you are responsible for the engineering, reliability, security, performance, and lifecycle management of enterprise NoSQL database platforms. You will operate and evolve MongoDB, Azure Cosmos DB, and DocumentDB services across multiple environments (DEV/TEST/UAT/PROD), ensuring they meet business requirements for availability, resilience, scalability, and compliance.
This role combines hands-on operations (incident support, upgrades, backups, monitoring) with engineering ownership (automation, Infrastructure as Code, standardization, and design guidance). You will partner closely with application teams, cloud/platform engineers, security, and architects to deliver robust solutions and continuously improve service quality.
This role follows a hybrid working model, with an expected minimum on‑site presence of 8 days per month. It is not a fully remote position, as close collaboration with operations, infrastructure, and project teams is essential.
What you’ll do (Key responsibilities)
1. Platform Engineering & Operations
2. Own the operational health and engineering roadmap of NoSQL platforms (MongoDB/CosmosDB/DocumentDB).
3. Provision, configure, and maintain database services across environments, aligning with internal standards.
4. Execute and continuously improve operational routines: health checks, maintenance, patch coordination, and environment consistency.
5. Lead technical troubleshooting and root-cause analysis for complex platform issues.
6. Participate in a 24/7 on‑call rotation (approximately 7–10 days per month).
7. High Availability, Resilience & Disaster Recovery
8. Design and maintain HA and resilience mechanisms: MongoDB replica sets, sharding, and cluster topology optimization; Cosmos DB multi-region replication, failover, and consistency configurations; DocumentDB cluster configurations, replication behaviour, and scaling patterns.
9. Ensure recovery readiness via periodic restore drills and evidence collection.
10. Backup, Restore & Data Protection Engineering
11. Implement and manage backup strategies (retention policies, encryption, access controls).
12. Engineer restore automation and validation pipelines where possible.
13. Maintain “audit-ready” documentation: backup schedules, restore results.
14. Performance Engineering & Capacity Planning
15. Establish baselines and continuously optimize: index strategy, query patterns, and schema design guidance (document modelling best practices); connection management, throughput tuning, and latency optimization.
16. Drive capacity planning for growth and cost: MongoDB storage/IO, shard balancing, and index growth; Cosmos DB RU/s capacity planning, partition key strategy, and hot partition mitigation; DocumentDB sizing, scaling, and throughput optimization.
17. Provide actionable recommendations and standards to application teams (indexing, partitioning...).
18. Security, Governance & Compliance
19. Apply least-privilege and secure-by-default principles: RBAC, authentication mechanisms, secret management, and secure connectivity patterns
20. Maintain auditing/logging, periodic access reviews, and compliance evidence.
21. Partner with security teams for hardening baselines, vulnerability remediation, and secure configuration patterns.
22. Observability, Monitoring & Incident Management
23. Implement and maintain end-to-end observability: metrics, logs, and alerts.
24. Integrate platform alerting into enterprise incident workflows (on-call, ticketing, escalation).
25. Automation & Infrastructure as Code (IaC)
26. Automate provisioning, configuration drift control, user management, backups/restores, reporting, and routine checks.
27. Contribute to or maintain IaC modules/templates (e.g., Terraform / ARM, as applicable).
28. Standards, Documentation & Knowledge Sharing
29. Produce and maintain platform documentation (architecture, operational procedures).
30. Define platform standards: naming/tagging, parameter baselines, maintenance policies, patch strategies, and supportability requirements.
31. Mentor and enable engineering teams through best practices, reviews, and knowledge transfer.
32. Project Support & Continuous Improvement
33. Support migrations and modernization initiatives (e.g., on-prem to cloud).
34. Participate in design reviews and provide guidance on resiliency patterns and non-functional requirements.
35. Drive continuous improvement initiatives: reduce recurring incidents, improve automation coverage, and optimize costs.
Required qualifications
36. Proven experience (3+ years) as an administrator supporting NoSQL databases in production.
37. Strong hands-on expertise with Azure Cosmos DB: partitioning strategy, RU/s management, consistency levels, multi-region design, and operational monitoring.
38. Strong hands-on expertise with Azure DocumentDB: cluster operations, scaling, backups/restores, and performance behaviour and limitations compared to MongoDB.
39. Practical knowledge of MongoDB: replica sets, sharding, backup/restore, upgrades, index and query optimization, and performance troubleshooting.
40. Solid understanding of: HA/DR concepts (RTO/RPO) and incident/problem/change management; security hardening (RBAC, encryption, key management integration, auditing); networking fundamentals (private connectivity, firewalls/security groups, routing concepts).
41. Strong scripting/automation skills in at least one of: Python, PowerShell, Bash, Ansible.
42. Experience working with cloud services (Azure), CI/CD practices, and infrastructure automation
43. Nice to Have / Assets
44. Familiarity with ServiceNow and Azure DevOps
45. Experience with other database platforms (PostgreSQL)
46. Microsoft Azure certifications such as AZ-900
Please note that this is a permanent position; we do not offer freelance or contract arrangements for this role.