Source Job

Europe

  • Evolve our kernel language into something usable by developers both inside and outside the compiler team and the company
  • Design and implement backend compiler optimizations to efficiently map workloads onto heterogeneous architectures (CPU, NPU, and specialized accelerators)
  • Implement advanced optimization strategies across the compiler stack based on your experience, e.g., memory planning, tiling, vectorization, task partitioning, and concurrency optimizations (compute and memory)

Python C++

20 jobs similar to Senior/Staff Backend Compiler Engineer

Jobs ranked by similarity.

Europe North America 7w PTO

  • Profile large-scale training workloads and identify communication and computation bottlenecks.
  • Custom kernel development to improve training performance.
  • Enhance and maintain our training and inference codebases.

Poolside aims to be the company that builds a world where AI powers economically valuable work and scientific progress. Their team is a multidisciplinary blend of research, engineering, and business experts distributed across Europe and North America, fostering a culture of collaboration and hard work.

Global

  • Architect and ship new backend capabilities that integrate AI-adjacent functionality into Kraken’s core systems.
  • Design distributed services that meet high standards for reliability, performance, and correctness.
  • Own end-to-end technical design, from protocol and service boundaries through production deployment.

Kraken is a mission-focused company rooted in crypto values. It aims to accelerate the global adoption of crypto, so that everyone can achieve financial freedom and inclusion. As a fully remote company, Kraken has Krakenites in 70+ countries who speak over 50 languages.

Global

  • Design and build the infrastructure layer powering AI agent systems in production
  • Develop high-performance Rust services that handle model inference, orchestration, and execution
  • Architect scalable systems capable of supporting millions of users and high request throughput

Kraken is a mission-focused company rooted in crypto values, aiming to accelerate the global adoption of crypto so that everyone can achieve financial freedom and inclusion. As a fully remote company, Kraken has employees in 70+ countries and is committed to industry-leading security, crypto education, and client support.

$225,000–$315,000/yr
US 20w maternity 12w paternity

  • Architect and optimize distributed training and inference systems for large-scale AI models
  • Design and deliver customer-focused solutions that maximize performance and business value
  • Lead the transition of ML pipelines from POC to scalable production systems

The company offers an AI-centric cloud platform reshaping the landscape of artificial intelligence. They provide infrastructure, tools, and services for developers to service the explosive growth of the global AI industry, catering to Fortune 1000 companies, startups, and AI researchers.

  • Design and deploy high-performance agentic systems that leverage Fastino’s optimized model architectures.
  • Collaborate with engineering teams to turn novel architectural breakthroughs into scalable solutions for enterprise customers.
  • Drive rapid, iterative prototyping of AI functionalities, refining model performance and task-accuracy based on real-world telemetry.

Fastino is building the next generation of LLMs with a team of alumni from Google Research, Apple, Stanford, and Cambridge, and has developed the GLiNER family of open-source models. Fastino has raised $25M in a seed round and is backed by leading investors including Microsoft, Khosla Ventures, and Insight Partners.

$135,000–$175,000/yr
US

  • Architect and scale the core intelligence behind our platform.
  • Design, build, and optimize the pipelines and agent systems that drive live customer interactions.
  • Build real-time and batch pipelines for ingestion, training, and inference.

Raynmaker is building RaynBrain, an agentic AI platform for complex conversations grounded in machine learning, neuroscience, and forensic linguistics. They empower autonomous systems that interpret, adapt, and act in real time, turning raw leads into revenue without scripts or human handoffs. Raynmaker is a small team helping other small teams move faster and convert more leads.

Global

  • Design and build robust backend services and microservices that power the DevX platform ecosystem.
  • Integrate Large Language Models (LLMs) and custom AI models to enable features like semantic code search, automated refactoring, and natural language infrastructure provisioning.
  • Act as a technical liaison and co-developer with our India-based engineering team, participating in daily stand-ups and code reviews to ensure architectural alignment.

They are developing the DevX platform, a next-generation engineering platform designed to accelerate time-to-market and improve code quality through intelligence. The company seems to be focused on developer tools and AI-driven solutions to enhance the software development lifecycle.

$180,000–$200,000/yr
US Unlimited PTO 20w maternity 12w paternity

  • Work directly with CV researchers to understand their goals, review their code, and engineer it for reliability and performance at scale.
  • Profile and optimize performance-sensitive code across both training and real-time inference.
  • Identify patterns across research efforts and propose standardized, composable abstractions.

GameChanger believes in the life-changing impact youth sports have on and off the field. By building the first and best place to experience the youth sports moments important to their community, they are helping families elevate the next generation through youth sports. They are a remote-first, dynamic tech company based in New York City, and they are solving some of the biggest challenges in youth sports today.

US

  • Analyze requirements and propose innovative AI-native solutions to technical problems
  • Write clean, scalable code
  • Test and deploy features & services

WorkHero is building the AI-powered back office for the skilled trades, starting with the $50B+ HVAC industry. They have exciting traction and just closed a $5M seed round to expand their engineering and product organization, as well as add additional services.

Europe

  • Architect an entirely new networking operating system.
  • Use a unique multi-process state-sharing architecture.
  • Use an unmodified Linux kernel, maintaining full, secured access to the Linux shell and utilities.

Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. They leverage the latest advancements in cloud computing, artificial intelligence, and software-defined networking to provide their clients with a competitive edge. At Arista they value the diversity of thought and perspectives that each employee brings to the table and believe that fostering an inclusive environment is essential for driving creativity and innovation.

EMEA

  • Design and implement tooling that enables researchers to quickly deploy and evaluate new models in production
  • Design, build, and maintain high-performance, cost-efficient inference pipelines, making architectural decisions about scaling, reliability, and cost trade-offs
  • Proactively identify and resolve infrastructure bottlenecks, proposing and scoping improvements to iteration speed and production reliability

AssemblyAI builds best-in-class Speech AI models that power the next generation of voice applications. They are a remote team building one of the next great AI companies where teammates define and build their company culture.

US

  • Bridge the gap between the market and our internal teams by actively translating client feedback and usage scenarios into actionable product improvements.
  • Handle technical questions and issues from customers, and help customers accelerate their development with our technology.
  • Partner with Sales to identify opportunities and articulate the value of Innatera technology to secure wins

Innatera is a rapidly growing Dutch semiconductor company that develops ultra-efficient neuromorphic processors for AI at the edge. We are committed to building a diverse, inclusive, and respectful workplace.

Europe

  • Research, prototype, develop and optimize solutions, tools, and libraries.
  • Analyse, influence, and improve deep learning libraries and frameworks standards and APIs.
  • Collaborate with team members and other partners.

Jobgether is a platform that connects job seekers with companies. They leverage AI to match candidates with roles.

Europe

  • Own organization-wide, multi-year technical direction across teams and silicon programs
  • Drive system and micro-architecture specs through high-quality RTL implementation
  • Partner across architecture, verification, physical design, software, product, and DFT

Axelera AI is creating the next-generation AI platform to support anyone who wants to help advance humanity and improve the world around us. The company has a world-class team of 220+ employees and is headquartered at the High Tech Campus in Eindhoven, Netherlands.

US Europe

  • Conduct fundamental and innovative development in low-cost yet powerful vision-language models (VLMs), unified models, automatic model compression, optimization, and deployment on cloud and edge.
  • Design or implement state-of-the-art techniques for model compression, inference speedup, deployment on hardware, and tool automation.
  • Contribute to library and tool development to support the business, or publish influential research in top-tier conferences and journals.

Sony Corporation of America is the U.S. headquarters of Sony Group Corporation, based in Tokyo, Japan. Sony creates and delivers more entertainment experiences to more people than anyone else on earth.

$131,473–$180,628/yr
North America 5w PTO

  • Architect and implement engine enhancements to meet performance and functionality requirements.
  • Identify rendering limitations in Unreal Engine and design solutions that support our visual identity and fidelity goals.
  • Optimize memory usage, GPU and CPU workloads, and rendering performance across platforms.

They create revolutionary, story-driven RPGs which go straight to the hearts of gamers. They share behind-the-scenes insights and stories direct from their team members on their social media, YouTube channel, and Beyond the Game Blog.

$93,153–$153,419/yr

  • Work closely with designers and animators to fulfill the vision of the game
  • Build and maintain solid and flexible gameplay systems that work in both singleplayer and multiplayer
  • Write high-quality code that is optimized, bug-free, and aligned with project goals

CD PROJEKT RED creates story-driven RPGs. They share behind-the-scenes insights and stories directly from their team members via their social media, YouTube channel, and Beyond the Game Blog.

US

  • Design, build, and operate the network fabric that interconnects our GPU fleet.
  • Develop and maintain network automation using Ansible, Terraform, and custom tooling.
  • Drive incident response and root-cause analysis for network-related production issues.

Fal's platform orchestrates AI inference workloads across thousands of GPUs spread over multiple data centers and cloud providers. They offer visa sponsorship and will help you relocate to San Francisco.

$200,000–$245,000/yr
US

  • Serve as an expert Rust engineer who sets standards, mentors others, and elevates the team’s Rust maturity.
  • Contribute to new product development of complex in-browser applications.
  • Architect and implement complex applications that apply ML techniques to large volumes of data.

Machinify is a healthcare intelligence company delivering value, transparency, and efficiency to health plan clients. They deploy an AI-powered platform to over 85 health plans, representing more than 270 million lives, to maximize financial outcomes and reduce healthcare costs.

Global

  • Design and build scalable serving infrastructure for video generation models.
  • Build robust APIs and SDKs that enable customers and partners to integrate video generation into their products.
  • Develop compelling demo applications that showcase our platform's capabilities.

EnCharge AI is building the next generation AI platform with in-memory-computing architecture that delivers a 10x improvement in compute energy efficiency and performance for AI inference workloads. They are an experienced team of AI researchers, silicon & systems engineers, and architects backed by leading investors.