Source Job

$35–$100/hr

  • Build sandbox UIs that agents and RL actors will interact with.
  • Create tasks for built environments and programmatically validate task completion.
  • Take ownership of the entire task creation process for a given environment.

React.js Python Docker UI/UX API

20 jobs similar to RL Environments Specialist

Jobs ranked by similarity.

  • Design and implement interfaces across the platform for compute orchestration and RL training.
  • Translate complex backend systems into intuitive, production-ready product experiences.
  • Build for technical audiences, including AI and general software engineers.

Prime Intellect makes frontier AI accessible to everyone and enables individuals/organizations to train models using their agentic training infrastructure.

Canada 3w PTO

  • Design, develop, and maintain end-to-end features for Cresta’s no-code processing platform.
  • Build intuitive UI components and visual editors for configuring conversation logic and workflows.
  • Architect and implement backend services and APIs to power a dynamic no-code interface.

Cresta is on a mission to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center.

US Unlimited PTO

  • Build AI-focused user interfaces.
  • Own the frontend architecture and component library.
  • Create responsive, accessible experiences.

Mento is a career technology company that helps people be exceptional and thrive at work through human, AI, and software-based coaching. They strive to create a fun, conscientious, collaborative, and supportive work environment and are a US-based, remote-first company, backed by leading VCs and angel investors.

$72–$72/hr
US

Design clean, intuitive, responsive interfaces using modern web standards. Use AI tools to accelerate layout generation, component creation, content drafting, and workflow optimization. Partner with developers to ensure designs translate accurately into production-ready components.

1840 & Company is a global leader in Business Process Outsourcing (BPO) and remote talent solutions, dedicated to propelling businesses forward through our comprehensive suite of services.

Join Zapier’s AI Tasks team, where you’ll own the core AI surfaces that make our customers’ workflows smarter. Collaborate with AI experts, product managers, designers, and platform engineers to bring LLM-native workflows to life. Build and refine user-facing features and interfaces for initiating, monitoring, and guiding agent behavior.

At Zapier, we build and use automation every day to make work more efficient, creative, and human.

US

  • Build early creator onboarding, profile, and campaign-related flows.
  • Implement frontend components and backend endpoints for first-party workflows, supporting the core experiences needed for the Alpha Launch.
  • Support API integration and early BFF patterns for the platform services.

Beast Industries is a multifaceted media and entertainment company founded by Jimmy Donaldson, popularly known as MrBeast, the most watched person in the world.

Build resilient AI Agents using LangGraph and microservices. Develop complex automation workflows in n8n. Collaborate with Internal Business Analysts to focus on coding, not guessing requirements.

At Gcore, you’ll help design and deliver that foundation for an AI-driven world, being a global provider of infrastructure and software solutions for AI, cloud, network, and security.

Design, develop, and maintain responsive, scalable web applications using React.js and Python/Django. Implement complex application state management using Redux, RTK Query, or Context API. Build robust and secure back-end APIs and business logic following best practices.

Rightshero is a spin-off company from DigiSay Group, a UAE leading media-tech enabling group based and active in MENA, founded in 2016.

$40–$60/hr

  • Design responsive user interfaces, graphs, and visualizations.

Epoch AI is a research institute that investigates trends in machine learning and the economic consequences of AI. Their mission is to develop a comprehensive, publicly accessible knowledge base on AI that informs policymakers, industry leaders, and society at large.

$104,000–$166,000/yr
US

  • Develop technical solutions to complex problems.
  • Design, develop, document, tests and debugs applications software.
  • Develop and implement user interfaces for web applications using ReactJS.

Peraton is a next-generation national security company that drives missions of consequence spanning the globe and extending to the farthest reaches of the galaxy.

  • Build scalable backend services and APIs.
  • Create responsive dashboards and tools to manage automation systems.
  • Implement automation workflows to power real-time operations.

Base360.ai is an intelligent automation engine that connects data, orchestrates workflows, and powers large-scale operational systems.

Global

  • Work with teams to modernize platform components and refactor legacy code.
  • Improve APIs, frontend, backend, and CI/CD; ensure observability and security.
  • Establish AI-assisted development guardrails and champion these practices.

Gcore provides infrastructure and software solutions for AI, cloud, network, and security. They power real-time communication, streaming, enterprise AI, and secure web applications, with over 550 professionals globally and 210+ edge locations.

US

  • Develop complex React applications.
  • Craft amazing user experiences.
  • Solve problems in creative ways while balancing tradeoffs and timelines.

Flocknote is a small, startup engineering team with a passion for the Church; striving to create a place where every developer can build the best work of their lives.

US Unlimited PTO

  • Build core user-facing systems that power renovation financing in the U.S.
  • Partner with product and design to solve real operational and financial problems and improve existing flows.
  • Take end-to-end responsibility for the quality, reliability and maintainability of your code.

RenoFi's platform enables homeowners to borrow funds from RenoFi approved lenders in the form of the first home equity loan purpose-built for renovations!

US

  • Build and ship high-quality product features in an Agile team.
  • Translate designs and requirements into performant React experiences.
  • Optimize for real-world performance at scale.

As the industry-leading, human-focused engagement platform, we deliver powerful online communities and communication tools to organizations looking to build, retain, and grow.

As a Front-End Software Engineer on the P2D team, you’ll play a vital role in developing user-facing features that are performant, accessible, and delightful to use. Help design seamless UI experiences within our existing frameworks. Collaborate closely with product designers, backend engineers, and other cross-functional teams to deliver high-impact improvements.

Turnitin partners with educational institutions to promote honesty, consistency, and fairness across all subject areas and assessment types.

$97,920–$146,880/yr
US

Develop innovative UI/UX features that help users. Operate in a collaborative, agile environment. Create proofs-of-concept and prototypes to quickly test ideas.

Two Six Technologies builds, deploys, and implements innovative products that solve the world’s most complex challenges today.

$104,000–$156,000/hr

  • Design, build, and optimize high-performance systems in Python supporting AI data pipelines and evaluation workflows
  • Develop full-stack tooling and backend services for large-scale data annotation , validation, and quality control
  • Improve reliability, performance, and safety across existing Python codebases

Alignerr connects top technical experts with leading AI labs to build, evaluate, and improve next-generation models. They work on real production systems and high-impact research workflows across data, tooling, and infrastructure.

  • Design and build intuitive, user-facing features.
  • Leverage modern JavaScript frameworks and CSS design systems.
  • Write clean, maintainable, and efficient code for frontend components.

At G2i, we connect subject-matter experts with flexible, remote opportunities in AI development and training.

Europe

  • Design, implement, and maintain SFT and RL post-training pipelines for multi-step coding agents.
  • Train and adapt LLMs for agent workflows, including planning, tool use, and multi-step interactions inside JetBrains IDEs.
  • Build and develop evaluation and simulation environments where coding agents can act, be measured, and compared on realistic developer tasks.

At JetBrains, code is their passion and they strive to make the strongest, most effective developer tools on earth. Today, AI-powered assistance and agents are becoming a core part of how developers work in their IDEs.