Design, build, and iterate on machine learning models and LLM-based systems that power critical decisions across fraud, compliance, growth, and operations
Work with messy, real-world data to identify signals, build features, and continuously improve model performance
Make practical tradeoffs between model performance, interpretability, and operational cost
River is building the world’s most trusted financial institution to empower people to take ownership of their financial lives through Bitcoin. River is growing quickly and has raised more than $50 million from leading investors.
Conduct fundamental LLM research using our SOTA story engine.
Create a benchmark for evaluating LLM behavior.
Deliver a benchmark library and a written report of compiled results.
Latitude is building the future of AI-native games by creating a platform where developers and creators can build entirely new kinds of interactive worlds. Latitude is a team of high-agency builders and storytellers who thrive on craft, curiosity, and community.
Review physics papers alongside LLM-generated graphical abstracts that summarize them.
Check and identify issues where the graphical abstract doesn’t accurately represent the scientific paper.
Influence the AI models of the future using professional legal expertise.
Prolific is building the biggest pool of quality human data in the world. Over 35,000 AI developers, researchers, and organizations use Prolific to gather data from paid study participants with a wide variety of experiences, knowledge, and skills.
Lead end-to-end onboarding projects for enterprise customers.
Drive adoption, expansion, and measurable value realization.
Enable agentic workflows for audience ideation, campaign optimization, and performance analysis.
GrowthLoop is rebuilding how enterprise marketing teams operate. They drive compound growth by accelerating the marketing cycle with agentic AI powered by customers’ enterprise cloud data. Hundreds of marketers at enterprises like Google, Costco, and Albertsons rely on GrowthLoop.
Build and maintain context infrastructure for AI tools.
Design and run evaluation frameworks for AI-generated insights.
Build and orchestrate AI agent systems for analytics tools.
Airtable is a no-code app platform empowering people to accelerate critical business processes. More than 500,000 organizations rely on Airtable to transform how work gets done.
Develop the logic to detect incomplete or failed API requests.
Build a reasoning layer utilizing LLMs/VLMs to process unstructured documents.
Aledade empowers independent primary care and is the largest such network in the country. They help practices, health centers, and clinics deliver better care to their patients and thrive in value-based care. Aledade has a collaborative, inclusive, and remote-first culture.
Write creative prompts and responses on a wide variety of topics
Perform LLM annotation and evaluation tasks (ranking, scoring, labeling, tagging)
Evaluate model outputs for accuracy, relevance, and instruction-following
Welo Data is an AI services company that specializes in data annotation. They deliver high-quality training data transformation solutions for NLP-enabled machine learning by blending technology and human intelligence to collect, annotate, and evaluate all content types.
Creatively writing prompts and responses on a wide variety of topics.
Leading labeling initiatives with third-party firms and internal customers.
Creating and updating detailed guidelines and specifications for stakeholders.
Welo Data provides AI services, specifically data annotation. They enable brands and companies to reach, engage, and grow international audiences, delivering multilingual content transformation services in translation, localization, and adaptation.