Job Description
JOB SUMMARY:
- Plays a key role in operating, observing, and improving the reliability of existing distributed systems.
- Concentrates on understanding how services behave in production and detecting when they are not operating correctly.
- Partners closely with backend and platform teams to evolve observability practices and define reliability signals.
RESPONSIBILITIES:
- Designs, implements, and continuously improves observability strategies across services, including metrics, logs, traces, alerts, and dashboards.
- Focuses on understanding system behavior in production, identifying failure modes, performance bottlenecks, and reliability risks.
WHAT WE OFFER:
- 100% Remote Work: Enjoy the freedom to work from the location that helps you thrive.
- Highly Competitive USD Pay: Earn excellent compensation in USD that goes beyond typical market offerings.
- Paid Time Off: We value your well-being. Our paid time off policies ensure you have the chance to unwind and recharge when needed.
About Truelogic
Truelogic is a leading provider of nearshore staff augmentation services. Its team of 600+ highly skilled tech professionals, based in Latin America, partners with U.S. companies on impactful projects. Truelogic values the expertise and aspirations of its people.