Core Responsibilities:
- Modernize and maintain the foundational EKS infrastructure, managed with Terraform, to support elastic scaling and reliability.
- Deep-dive into database performance and queue systems to plan for 10x to 100x growth.
- Proactively identify scaling issues using telemetry tools like Datadog and Honeycomb to uphold service standards.
Operational Excellence:
- Uphold and improve upon the platform's track record of >99.95% uptime.
- Support the product engineering team by enhancing developer experience with faster deployment cycles and canary releases.
- Participate in on-call rotations alongside the engineering team to ensure system reliability.
Team and Culture:
- Operate with high autonomy and accountability, driving improvements while documenting changes for team-wide adoption.
- Work effectively in a fully distributed, remote-first team, valuing strong written communication and collaborative problem-solving.
- Contribute to an inclusive and equitable team culture, with a commitment to diversity and continuous learning.
About Knock:
Knock builds APIs for product notifications that help software communicate more thoughtfully with its users. It is a remote-first Series A startup with more than 20 employees, backed by top investors, and its culture is built on a belief in the power of great software and the API-first movement.