Development of various services in Python: integration with marketing partners, obtaining data from various sources.
Creation and support of processes on Airflow.
Supporting the migration of marketing data pipelines and DWH components from MS SQL to Google Cloud Platform (including BigQuery), contributing to architecture decisions and best practices.
Social Discovery Group (SDG) is one of the world's largest groups of social discovery companies, uniting millions of users on dozens of products. Our international team of 1000+ professionals and digital nomads works all over the world and we are proud to be a two-time “Great Place to Work” winner.
Execute and advance the enterprise data science and AI strategy aligned to organizational goals.
Design, develop, and deploy advanced machine learning models including predictive modeling and LLMs.
Partner with engineering teams to operationalize MLOps practices and productionize models.
Pyramid Systems is an award-winning technology leader driving digital transformation across federal agencies. Voted a Top Workplace both regionally and nationally, the company values flexible work, employee voice, and development.
Design and maintain data pipelines and auto-labeling systems to support ML model training from multimodal data.
Write and optimize SQL queries for data extraction, analysis, and ingestion from various sources.
Develop and prototype learning-based models using a data-centric approach with techniques like active learning and fine-tuning.
Serve Robotics is reimagining urban delivery with sidewalk robots, aiming to reduce congestion and support local businesses. The team is an agile, diverse group of tech industry veterans focused on robotics, machine learning, and end-to-end user experience.
Design, develop, and maintain scalable data pipelines and infrastructure.
Build and optimize data warehouses, databases, and data models.
Implement and maintain data governance and security practices.
Jobgether is a company that uses an AI-powered matching process to ensure applications are reviewed quickly, objectively, and fairly. They connect candidates with companies; their culture is collaborative and inclusive, focused on innovation and growth.
Build, define, and activate Appfire’s company-wide data asset (Snowflake) from ground up data sources.
Architect, build, and launch efficient & reliable data models and pipelines in partnership with Data Engineering, leveraging DBT and Snowflake.
Design and implement metrics and dimensions enabling self-service analysis and structured views of Appfire’s KPI’s that scale trust, accountability and decision impact.
Appfire empowers teams to break silos and collaborate seamlessly with its software. They are a remote-first company with 850+ employees (called "fireflies") across 28 countries, fostering a culture of respect and growth.
Deliver high impact code to create pipelines, visualizations, and dashboards that enable informed decision-making at scale.
Operate as a senior technical expert in data and analytics engineering, leading architecture discussions and implementing robust, scalable data models.
Collaborate cross-functionally with Product Managers, Engineers, and Business Stakeholders to capture requirements and translate them into technical solutions.
Lovevery is a fast-growing digitally native brand co-founded by successful serial entrepreneurs and based in Boise, Idaho, taking a science-based approach to help parents give their children meaningful development experiences. Named one of Fast Company's 10 most innovative companies of 2024, Lovevery has received awards from TIME, Red Dot, Good Housekeeping and Forbes.
Proactively explore data to identify customer problems and product bets.
Translate ambiguous problems into clear analyses and recommendations.
Partner with Product Managers to shape roadmaps.
RevenueCat removes the headaches of building and scaling in-app subscriptions. We're a remote-first crew of 120+, spread across 25 countries, helping everyone from solo devs to the OpenAI mobile team understand and grow their revenue.
Independently deliver analytical projects across the consumer credit lifecycle, including acquisition, account management and collections
Build statistical and machine learning models through all phases of development, from design through training, evaluation, validation and implementation
Use a broad set of technologies: SQL, PySpark, Python, AWS and more to obtain insights from large volumes of data
Experian is a global data and technology company, powering opportunities for people and businesses around the world. We operate across a range of markets, from financial services to healthcare, automotive, agribusiness, insurance, and many more. They have an amazing team of 25,200 people in 32 countries.
Lead the design and evolution of the data platform architecture, establishing patterns and standards the team builds on.
Build and operate production-grade data pipelines that ingest and transform high-variance, real-world clinical data reliably and at scale.
Contribute to quarterly data product releases, working closely with product, clinical, and customer success teams to meet commitments.
Verantos is the market leader in high-accuracy real-world evidence (RWE) generation. The Verantos RWE platform integrates heterogeneous real-world data sources and generates evidence with the accuracy necessary for regulatory and reimbursement use, serving some of the largest biopharma companies globally.
Design, build, and maintain scalable data pipelines using Python, Spark, and Airflow.
Collaborate cross-functionally with AI/ML and Product teams to implement new features.
Proactively identify and resolve bottlenecks in our complex ETL processes.
Sayari provides judgment infrastructure for trustworthy AI in economic security and commercial risk. They resolve primary-source records forming the ground truth of global commerce, and are headquartered in Washington, D.C., with offices in London, Singapore, Tokyo, and Tel Aviv.
Be the Analytics Engineering lead within the Sales and Marketing organization.
Be the data steward for Sales and Marketing: architect and improve the collection of underlying data.
Develop and maintain robust data pipelines and workflows for data ingestion, processing, and transformation.
Reddit is a community of communities, built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. With 100,000+ active communities and millions of daily active unique visitors, Reddit is one of the internet’s largest sources of information.
Assess current pipelines and data architecture to produce a prioritized plan for change.
Design durable data and ML systems grounded in customer needs with documented tradeoffs.
Harden pipelines, upgrade data architecture, and raise standards for observability and reliability.
FutureFit AI's core mission is to help more people get to better jobs faster and cheaper, with a focus on those facing barriers to opportunity. Their team of 30-50 across the US and Canada fosters a high trust, high intensity culture with a will to win.
Own the model lifecycle: requirements, experimentation, model development, evaluation.
Translate complex fraud patterns into well-framed ML solutions: defining what to model, what success looks like.
Monitor model quality in production, tracking performance over time, detecting data drift, and determining when to retrain.
Extend is revolutionizing the post-purchase experience for retailers and their customers by providing merchants with AI-driven solutions. They work with more than 1,000 leading merchant partners across industries and are backed by some of the most prominent technology investors in the industry.
Design and maintain production-grade ETL/ELT pipelines in a multi-hundred terabyte Snowflake environment.
Translate client loyalty program requirements into dimensional models and platform tables.
Build reliable, event-driven data architecture to support AI-powered loyalty products.
Kobie is a leader in loyalty solutions, helping brands build lasting emotional connections with consumers. Named a Top Workplace in the USA, the company fosters a collaborative, growth-focused culture with a diverse suite of benefits and flexible work arrangements.
Architect and lead the design of data systems serving both operational business stakeholders and product/engineering teams.
Extend internal AI platform and bring software engineering rigor to data work including testing, CI/CD, and code review.
Build and own data models, partner with product engineering, and mentor teammates to raise the technical bar.
ZipRecruiter is a leading online employment marketplace powered by AI-driven intelligent matching technology. The company has the #1 rated job search app on iOS & Android and connects job seekers with millions of businesses.
Build, maintain, and operate data pipelines and curated data products across Snowflake, Airflow (MWAA), AWS, Python, and SQL.
Implement observability and data quality controls and build monitoring for freshness, volume, schema, distribution, and lineage.
Define and enforce data platform standards, establish orchestration patterns, DAG anti-patterns, deployment practices, observability standards, data quality patterns, and operational runbooks used across the organization.
Design, develop, and maintain robust, scalable ETL/ELT data pipelines using Python, SQL, and data processing frameworks.
Implement data quality checks, monitoring, and alerting across all data pipelines to ensure data integrity and reliability.
Work closely with data analysts, data scientists, and business intelligence engineers to understand their data requirements and deliver reliable, high-quality data access.
InStride Health delivers specialty anxiety and OCD care. They focus on expanding access to insurance-based care, increasing engagement, and improving treatment outcomes by combining clinical care and innovative technology. They are a mission-driven company.
Take ownership of the ML API serving NBA recommendations and harden it for low-latency production traffic.
Ship your first agent tool contract end-to-end: schema design, handler implementation, and unit tests.
Set up the eval foundation for agents with golden transcripts, rubric-based judges, and regression suites.
Clutch is a vertical SaaS company backed by Andreessen Horowitz that helps credit unions become fintech lenders, providing affordable lending solutions to over 130 million Americans. The team is small, ambitious, and shipping fast with a culture that values pragmatism and real autonomy.
Develop and implement scalable AI/ML solutions for generative AI models including large language models and multimodal architectures.
Design multi-year vision and shape the direction of crucial generative AI areas such as text generation, image synthesis, and personalized content.
Partner with product management and stakeholders to identify use cases, analyze patterns, and maintain compliance in healthcare AI.
Aledade is a healthcare technology company that builds web applications and data pipelines to support primary care. They are a large organization with a culture focused on engineering excellence, observability, and incremental delivery.
Own end-to-end analytical problems from framing through modeling and stakeholder rollout.
Build predictive and inferential models to improve product and operational decisions.
Partner with Product, Engineering, and Operations to define metrics and drive insights.
Focal Systems is the industry leader in retail AI solutions, using deep learning computer vision to automate and optimize brick and mortar retail. They are a tight-knit team with an ambitious mission, deployed at scale with top retailers worldwide.