Assess the factual accuracy, relevance, and quality of AI-generated Computer Science content
Craft and answer domain-specific questions related to Computer Science and adjacent technical disciplines
Evaluate and rank AI-generated responses based on technical correctness and reasoning quality
The company is seeking Computer Science Experts with PhDs to support the training and evaluation of advanced AI models. This initiative focuses on improving the accuracy, reasoning, and domain expertise of generative AI systems through expert human feedback.
Update and populate E Source data tools with thorough, accurate information after receiving training.
Support research requests from utility clients and internal teams, including contributions to reports, articles, and webinars.
Organize, evaluate, and interpret quantitative and qualitative data drawn from primary and secondary sources.
E Source helps utilities make sense of complexity in a rapidly changing landscape. It is a research, data/analytics, and technology-focused professional services firm focused exclusively on the Utility industry in North America and has 450+ employees across the US and Canada.
Be the Analytics Engineering lead within the Sales and Marketing organization.
Be the data steward for Sales and Marketing: architect and improve the collection of underlying data.
Develop and maintain robust data pipelines and workflows for data ingestion, processing, and transformation.
Reddit is a community of communities, built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. With 100,000+ active communities and millions of daily active unique visitors, Reddit is one of the internet’s largest sources of information.
Evaluate and improve model safety: Label, rank, audit, and refine human- and model-generated text to improve safety, quality, and policy alignment.
Apply nuanced safety judgment: Assess model outputs against detailed safety guidelines, rubrics, and style standards, making consistent decisions across ambiguous, sensitive, and context-dependent cases.
Create prompts and safety test cases: Write realistic prompts, user scenarios, and adversarial examples that help evaluate model behavior across safety categories and uncover unsafe, evasive, over-refusing, or policy-inconsistent responses.
Cohere's mission is to scale intelligence to serve humanity by training and deploying frontier models for developers and enterprises. They are a team of researchers, engineers, and designers passionate about their craft, believing that a diverse range of perspectives is a requirement for building great products.
Develop AI systems that automate dispute and chargeback handling using structured evidence and business logic, creating a better experience for our customers.
Build models that automate refunds, getting money back to our customers faster.
Build and maintain evidence extraction pipelines that process unstructured data using LLM-powered workflows to produce structured, actionable outputs.
Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest. They are a remote-first company with competitive benefits and focus on an inclusive interview experience.
Design, implement, and maintain a streaming-first data platform.
Build and operate a hybrid Databricks and Snowflake architecture.
Champion platform standards, tooling, and best practices across the data organization.
1Password is building a foundation for a safe, productive digital future to unleash employee productivity without compromising security. They have over 180,000 businesses trusting their teams; its culture prioritizes collaboration, clear communication, and alignment with core values.
Own and maintain data pipeline architectures, ensuring reliability and monitoring.
Manage and evolve data modeling environments for analysts and engineers.
Implement observability for data systems, detecting issues early and continuously monitoring data quality.
Voltus unlocks the full value of distributed energy resources for customers and the grid. They are a fast-growing climate-tech company with a bright, gritty, and good team that values innovation, impact, and integrity.
Data-driven Culture: you will be an ambassador for our data-driven culture, taking data and using it to answer business questions and drive strategy & decision making
Data Management: you will access, clean and organize raw data in a way that makes sense for you, your team and your stakeholders
Visualizations: you will help us to see and understand our data in a way that makes sense, creating automated dashboards and funnels for various internal teams
Super.com helps people save more, earn more, and get more out of life. They invest in learning, celebrate bold ideas, and create pathways for career growth.
Design experiments, prioritize hypotheses, and analyze what moved the needle in partnership with our Acquisition and Activation Growth squads.
Own the experimentation framework end-to-end: design, statistical rigor, readouts, and the decisions that come out of them.
Build out our semantic layer in Omni so PMs, designers, and Sales can self-serve trustworthy answers without pinging you.
Instrumentl is a YC-backed SaaS platform helping nonprofits discover, track, and win grant funding. They are a profitable, hypergrowth company with over 5,500 nonprofits using their platform and are hiring people who want to build something that matters.
Design and build an integrated data platform, unifying existing tools and pipelines into a cohesive, scalable architecture.
Own data pipelines and SLAs end to end, ensuring reliable data movement between systems with clear expectations.
Shape the data strategy and platform roadmap, researching new technologies and introducing tools as the platform evolves.
Wrapbook is a vertical fintech platform that enables companies to seamlessly onboard, pay, and insure their workforces, operating in the entertainment industry. They are at an exciting stage of growth, having raised over 30M from investors like Andreessen Horowitz.