Similar Jobs

See all

Role and Responsibilities:

  • Construct infrastructure for data ingestion, transformation, and loading from diverse sources using technologies like Spark and Databricks.
  • Build an entity resolution framework to merge vast numbers of individual entities into clean, usable datasets.
  • Develop CI/CD pipelines and systems for anomaly detection to enhance ongoing data quality in production.

Required Skills and Experience:

  • Possess 1-2+ years of industry experience demonstrating strategic technical problem-solving and implementation.
  • Exhibit strong software development fundamentals and experience with scalable data processing systems.
  • Have familiarity with tools such as Python, SQL, Apache Spark, Airflow, and cloud services like AWS.

Work Environment and Traits:

  • Balance high ownership and autonomy with effective collaboration in a remote-first setting.
  • Demonstrate strong written communication skills on platforms like Slack and in technical documents.
  • Scope projects, communicate progress, and manage blockers proactively with managers and team stakeholders.

People Data Labs

People Data Labs is the provider of people and company data, integrating thousands of compliantly sourced datasets into a single, developer-friendly source of truth. The company culture is focused on mentorship and collaboration, with engineers described as thoughtful and quirky, working to support a data-as-a-service business.

Apply for This Position