Description
We are seeking a highly skilled Lead OpenTelemetry Developer to develop and maintain OpenTelemetry-based solutions. You will work on instrumentation, data collection, and observability tools to ensure seamless integration and monitoring of applications.
This role involves code development, instrumentation of systems using both code-based and zero-code solutions, writing documentation, and promoting best practices around OpenTelemetry.
Primary Responsibilities
* Partner with and support application developers by instrumenting systems using code-based or zero-code solutions.
* Design and build OpenTelemetry-based solutions that integrate with various observability platforms.
* Create clear documentation to guide developers on instrumenting applications with OpenTelemetry.
* Improve application observability by building dashboards and providing guidance on monitoring technologies.
* Educate teams on best practices related to OpenTelemetry, semantic conventions, and supported frameworks.
Monitoring/Observability Specialty
* Develop best-in-class monitoring frameworks that cover end-to-end flows and produce low-noise, actionable alerts.
* Create automated solutions for upgrades, change management, and release management to ensure stability, speed, and reliability.
Skills & Qualifications
* Programming Expertise: Proficiency in at least three of the following languages and frameworks, and familiarity with the others: Python, Java, Go, .NET, PHP, React.
* Technical Skills:
  * Strong experience with Prometheus, Grafana, and observability platforms (e.g., Dynatrace, AppDynamics, Splunk, Amazon CloudWatch, Azure Monitor, Honeycomb).
  * Hands-on knowledge of Java instrumentation techniques (e.g., bytecode manipulation, JVM internals, Java agents).
Business Knowledge
* Strong understanding of reliability and production management, ensuring high availability and stability.
* Risk-aware mindset, with an understanding of key operational risks in financial services or large-scale enterprises.
* Commitment to continuous improvement through proactive enhancement of processes and systems.
Experience
* Strong background in system and software security (SSO, Kerberos, LDAP, Windows AD).
* Experience applying engineering principles to support scalable, efficient production management.
* Proven experience using automation to reduce manual work and improve workflow consistency.