Lead Machine Learning Engineer, Inference & Performance

Egen

Benefits

Similar Jobs

See all

Technical Toolkit:

  • Must have mastery of Python and shell scripting, with comfort in CUDA-adjacent performance code.
  • Hands-on experience with vLLM or SGLang for high-performance inference serving.
  • Solid grasp of GPU architecture, LLM inference bottlenecks, and FlashAttention techniques.

Basic Qualifications:

  • Bachelor's or Master's degree in CS or related field with 5+ years of ML/AI engineering experience.
  • Proven track record of deploying and optimizing models in production environments.
  • Experience with profiling tools and GPU utilization improvement is essential.

Personal Attributes:

  • Ownership mentality—you take pride in seeing optimizations through from profile to production.
  • Curiosity and lifelong learning to stay ahead in fast-changing hardware and serving frameworks.
  • Consultative spirit to translate technical complexity into business value for clients.

Egen

Egen is a fast-growing technology company with a data-first mindset, partnering with clients on Google Cloud and Salesforce to drive action through data and insights. We are a team of dedicated engineers who thrive on solving tough problems and continually innovate to achieve fast, effective results.

Apply for This Position