Similar Jobs

See all

The Role:

  • We are looking for a systems-level engineer to own Fastino’s model platform end-to-end.
  • You will design and build training pipelines.
  • You will own the platform that turns models into production systems.

What You’ll Work On:

  • Architect distributed fine-tuning pipelines for small encoder and decoder models
  • Implement LoRA, adapters, distillation, and compression workflows
  • Design experiment tracking, reproducibility, and dataset versioning systems

Strong candidates will have:

  • Deep experience with PyTorch and transformer architectures
  • Experience building production ML systems end-to-end
  • Experience with distributed training and inference

Bonus:

  • Experience with RL or RLHF
  • Experience with distillation and compression

Fastino

Fastino is building the next generation of LLMs, with a team of alumni from Google Research, Apple, Stanford, and Cambridge. Fastino's GLiNER family of open source models has been downloaded more than 5 million times and is used by companies such as NVIDIA, Meta, and Airbnb.

Apply for This Position