Responsibilities:

  • Train and fine-tune language models powering our AI companions.
  • Own and improve agent harnesses, agentic loops, and the chat interface algorithm.
  • Build and maintain the full LLM stack — from model training to production deployment.

Requirements:

  • Deep experience training and fine-tuning large language models.
  • Hands-on experience building agent harnesses and agentic pipelines.
  • Strong understanding of RLHF, DPO, and alignment/post-training techniques.

Benefits:

  • Full-time remote position.
  • 28 vacation days and 7 wellness days per year.
  • Health benefits and support for setting up your workplace.
  • Referral bonuses of up to $5,000 and 50% coverage of professional training costs.

Social Discovery Group

Social Discovery Group is one of the world's largest groups of social discovery companies, addressing loneliness and disconnection through social entertainment platforms such as DateMyAge and Dating.com. Its international team of 1,000+ professionals works remotely worldwide, and the company is a two-time 'Great Place to Work' winner.
