Lead platform engineering initiatives using Kubernetes (EKS), Helm, and Infrastructure as Code.
Design and operate CI/CD platforms and deployment strategies to enable safe, low-risk releases.
Build and maintain strong observability foundations, including metrics, logging, alerting, and dashboards tied to service health.
Patriot Software is a remote-first, product-led tech company with a mission to make accounting and payroll fast, simple, and affordable for millions of American businesses. With 175+ team members across the U.S. and a collaborative office hub in Canton, Ohio, we’re building software that empowers the backbone of the American economy.
Helping improve the infrastructure and data platform using a lean approach.
Creating a data platform and infrastructure optimized for developments using Machine Learning and massive data processing.
Improving the development experience and spreading the DevOps culture in the company.
Clarity AI is a global tech company founded in 2017 with a mission to bring societal impact to markets. They leverage AI and machine learning to provide data, methodologies, and tools to investors, governments, companies, and consumers for informed decisions; they are a team of over 300 individuals with offices in New York, Madrid, London, Paris, and Abu Dhabi, backed by investors like BlackRock and SoftBank. .
Lead and contribute to projects focused on enhancing system reliability, release processes, developer experiences, cost optimizations, observability, and security.
Collaborate with various engineering teams to solve reliability, performance, and security issues.
Implement and manage infrastructure-as-code (IaC) strategies.
AllTrails is the world’s most popular and trusted platform for outdoor exploration, connecting people to the outdoors. They have a global community of millions of trailgoers and an inclusive workplace that values diversity.
Ensure the smooth operation and high availability of Clarifai's core services
Monitor system performance, identify bottlenecks, and implement optimizations to enhance reliability and efficiency
Design and implement scalable, secure, and cost-effective infrastructure solutions
Clarifai is a leading AI platform specializing in computer vision and generative AI, empowering organizations to transform unstructured data into actionable insights. Founded in 2013, they have a diverse, globally distributed team with $100M in funding and are committed to building a diverse and inclusive team.
Conduct pre-study and analysis of technical requirements.
Manage and optimize GitLab infrastructure for scalability and security.
Automate deployment, monitoring, and maintenance workflows.
Tietoevry Create is a digital accelerator for innovation and sustainable value creation. It combines business design with software engineering to bring digital business to life and is one of the largest tech companies in the Nordics. They provide a flexible hybrid work model as part of their culture and way of working.
Improve and maintain Xapo's cloud-native platform.
Enhance self-serve functionality and drive automation.
Empower developers by making the platform more efficient and reliable.
Xapo Bank is committed to changing the way things are done in the financial world. They are a fully distributed team of over 130 Xapiens that work remotely from 40+ countries around the world.
Design, implement, and maintain scalable, high-availability cloud infrastructure for Twilio’s microservices.
Operate and maintain highly available services handling billions of weekly requests.
Manage Infrastructure as Code (IaC) using tools like Terraform, ensuring operational best practices.
Twilio is shaping the future of communications, delivering innovative solutions to hundreds of thousands of businesses and empowering millions of developers worldwide. They are dedicated to remote-first work, with a culture of connection and global inclusion.
Design, build, and maintain secure, scalable cloud infrastructure
Own CI/CD pipelines and deployment workflows across services and environments
Improve reliability, availability, and performance through monitoring, alerting, and incident response practices
Wizard is revolutionizing the shopping experience using the power of generative AI and rich messaging technologies to build a personalized shopping assistant for every consumer. We scour the entire internet of products and ratings across brands and retailers to find the best products for every consumer’s personalized needs.
Own and maintain the incident response process, including defining procedures, tools, and best practices
Guide teams in establishing and monitoring Service Level Objectives (SLOs), including setting up alerts and reporting systems
Lead capacity planning initiatives, focusing on both short and long-term scalability while optimizing costs
Underdog makes sports more fun by building the best products for sports fans. They are a fast-growing sports company valued at $1.3B with a focus on a seamless, simple, easy to use, intuitive and fun app.
Maintaining and updating Glia’s core infrastructure.
Troubleshooting and resolving infrastructure-related issues.
Improving our security posture.
Glia provides an AI customer service solution for banks and credit unions, unifying AI and human agents across every voice and digital conversation through its ChannelLess® Architecture. Valued at over $1 billion, Glia powers over 700 financial institutions and is certified as a Great Place to Work, with 98% employee satisfaction.