Challenge advanced language models on software engineering tasks.
Verify logical accuracy and coding fluency in German.
Capture reproducible error traces and suggest improvements.
Project World Wide is shaping the future of AI through high-quality training data. They appear to be a technologically advanced organization focused on evolving language models into powerful engines.
Challenge advanced language models on topics like sentence structure and idiomatic expressions.
Verify factual accuracy and logical soundness of the AI's responses.
Suggest improvements to the model's prompt engineering and evaluation metrics.
Invisible Technologies makes AI work by structuring messy data, automating digital workflows, deploying agentic solutions, and integrating human expertise. They reached $134M in revenue and ranked as the second fastest-growing AI company on the 2024 Inc. 5000.
Define and enforce enterprise architecture standards, patterns, and conventions across all product domains.
Lead technical design sessions and author Architectural Decision Records (ADRs) for major system changes.
Champion integration of AI-assisted coding tools into the SDLC.
They are seeking an accomplished Software Architect to lead the design, implementation, and governance of enterprise-grade, cloud-native Java applications. They are a growing company that values innovation and collaboration, and they want someone to help them build the future of AI-augmented development.
Review and analyze generated code against the original software engineering prompt
Evaluate whether the coding task itself is clearly and correctly defined
Validate that tests accurately reflect whether the problem has been solved
Vetto is a tech company focused on building and scaling high-quality datasets for artificial intelligence systems. They work at the intersection of human expertise and AI, ensuring that models are trained on technically accurate, well-defined, and realistic data.
Completing AI training tasks such as analyzing, editing, and writing Python
Judging how well AI responds to Python-related prompts
Improving cutting-edge AI models
Prolific is building the biggest pool of quality human data in the world. Over 35,000 AI developers, researchers, and organizations use Prolific to gather data from paid study participants with a wide variety of experiences, knowledge, and skills.
Serve as the technical point of reference for the squad and broader engineering team.
Lead the design and architectural discussions for systems.
Advocate for and ensure the delivery of high-quality, maintainable, and scalable code.
Nerdy, the company behind Varsity Tutors, is redrawing the blueprint of learning. Their Live + AI™ platform fuses real-time human expertise with proprietary generative-AI systems, setting a new bar for measurable academic impact at global scale.
Converse with the model on language scenarios, verifying factual accuracy and logical soundness.
Capture reproducible error traces and suggest improvements to prompt engineering and evaluation metrics.
Challenge advanced language models on topics like verb conjugation, sentence structure, and the nuances of Japanese writing systems.
They are shaping the future of AI by providing high-quality training data to large-scale language models. Because this is a contractor role, company-sponsored benefits such as health insurance and PTO do not apply.
Reviewing, annotating, and testing AI outputs for grammatical accuracy, naturalness, and strict cultural context.
Acting as a primary quality check during production to proactively identify and correct subtle cultural errors or awkward phrasing in the target language.
Analyzing task quality trends and autonomously developing educational resources and feedback documentation to increase alignment between AI task outputs and campaign expectations.
Greenhouse provides recruiting software. No information about company size or culture is available in the job description.
Lead the design of distributed systems and data models, ensuring the Java Spring Boot environment can support rapid scaling and feature expansion.
Take full accountability for the end-to-end delivery of major cross-functional product updates, managing the release lifecycle and ensuring high-impact features are deployed reliably.
Proactively leverage AI tools (e.g., Claude) to complement your workflow and increase velocity, while retaining total ownership over the logic, security, and architectural implications of the code.
Archy is transforming the way dental practices operate with its vertical SaaS solution that provides cutting-edge tools. They are looking for passionate engineers to join their growing team.