Configure/operate monitoring, logging, and tracing tools for application performance.
Build dashboards and automation workflows for system reliability and uptime.
Collaborate with software engineering teams to design and implement robust systems.
Jobgether is a platform that uses AI-powered matching to connect job seekers with employers. They ensure applications are reviewed quickly and fairly, then share a shortlist with the hiring company for final decisions.
Cloud Engineering experience with AWS, GCP, and/or Azure.
Designed and maintained CI/CD process and tools.
In-depth experience with orchestration and config management tools.
Bluelight is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. They focus on quality and customer satisfaction, fostering a collaborative and enriching work environment where each team member can grow and thrive.
Build evaluation infrastructure to measure AI system speed and accuracy.
Create observability tooling and dashboards that surface quality metrics week-over-week.
Prototype and validate improvements to our RAG pipeline.
Circle is building an all-in-one platform for online communities, enabling creators and businesses to bring together their audience with discussions, live streams, events, chat, courses, and payments. They are a fully remote company of around 200 team members from 30+ countries, valuing autonomy, trust, and collaboration across time zones.
Optimizing how the team produces code and collaborates to build WorkOS.
Identifying pain points and recommendations to improve how the company builds software internally.
Serving as a bridge between infrastructure, product, and leadership to ensure the tools and systems are maturing alongside the product.
WorkOS builds tools and services for developers to help them implement authentication, identity, authorization, and overall enterprise readiness. They are a fully distributed team with employees across North American time zones and are well-funded by top investors.
You'll own infrastructure as a product, serving Atticus's product engineering teams as your customers.
Shaping our infrastructure roadmap — developing a clear vision for where our infra needs to go and driving progress toward it
Empowering product teams — you'll build the platforms and tools that let them own their systems end-to-end
Atticus makes it easy for any sick or injured person in crisis to get the life-changing aid they deserve. In 2025, their team grew to 210, and they will grow again in 2026; they have ambitions to create a category-defining business assisting needy Americans.
Design, implement, and manage AI Platform architecture.
Control AI-related costs, including models, GPUs, and other resources.
Collaborate with ML teams to operationalize AI models and integrate them into systems.
Docplanner empowers patients by giving them access to leave and read reviews about their visit and provides doctors with the technology to manage bookings easily and save time. They are leaders in 13 countries with 2,500+ employees globally and maintain a startup-mindset.
Design, build, and maintain efficient and reliable software and infrastructure delivery pipelines on AWS
Recommend upgrades to services as/when new features on the underlying platform (AWS) are built and functioning
Implement and maintain infrastructure as code (IaC) using tools like Terraform
They build and deploy software and infrastructure delivery pipelines. They optimize and maintain production systems and services, set up, monitor and observe key alerts, and balance service reliability with delivery speed.
Designs and maintains CI/CD pipelines using GitLab CI/CD.
Implements Infrastructure as Code (IaC) with tools like Terraform.
Automates complex workflows and enhances infrastructure scalability.
Everseen is a vision AI solutions provider for global retailers. They have over 900 employees globally, with headquarters in Cork, Ireland, European headquarters in Cork, Ireland, and a U.S. headquarters in Miami, with hubs in Romania, Serbia, India, Australia, and Spain.
Respond to production incidents and contribute to post-incident analysis.
Identify and automate manual processes to improve efficiency and reduce risk.
Enhance monitoring tools and platforms to improve observability.
Restaurant365 is a SaaS company that provides a unique, centralized solution for accounting and back-office operations for restaurants. They focus on empowering team members to produce top-notch results while elevating their skills.
Leverage infrastructure as code (Terraform) to build and maintain complex production and analytics workflows including networking and containerized services.
Rapidly diagnose and resolve faults in system services as part of a 24/7 on-call rotation focused on actionable alerting and eliminating toil.
Improve speed of delivery by developing and maintaining CI/CD pipelines.
Linus Health is a Boston-based digital health company transforming brain health worldwide. They combine cutting-edge neuroscience, clinical expertise, and AI to advance early detection and intervention for cognitive and brain disorders, empowering people to live longer, healthier lives. With 100+ team members and growing, they’re entering a phase of accelerated growth and looking for top talent to help shape their future.