Similar Jobs

See all

Responsibilities:

  • Own the gold data layer: transform messy silver tables into curated, semantically rich, clean, and documented gold datasets for AI model development.
  • Bridge semantics with AI needs: design and build gold data products that meet researcher data needs for efficient AI-first model R&D.
  • Build pipelines for reuse: develop transformations inside Databricks/Spark as scheduled, observable workloads that allow researchers to iterate on new features.

Required Skills and Experience:

  • 5+ years building production data systems, with at least 2 supporting ML or AI workloads.
  • Advanced Python, SQL, and PySpark/Databricks for working with large, messy data.
  • Databricks ecosystem depth: Delta Lake, Unity Catalog, Spark/PySpark tuning, MLflow.

Preferred Qualifications:

  • Hands-on EHR data experience, ideally in skilled nursing, long-term care, post-acute care, or senior living.
  • Working knowledge of clinical terminologies (ICD-10, SNOMED CT, LOINC) and data standards (HL7v2, FHIR, CCDA).
  • Familiarity with training-side ML frameworks (e.g., PyTorch) sufficient to debug data-side bottlenecks.

PointClickCare

PointClickCare is a leading health tech company that helps providers deliver exceptional care through a platform serving over 30,000 provider organizations. Founder-led and privately held, the company reinvests in R&D and has been recognized by Forbes as a top private cloud company and one of Canada's Most Admired Corporate Cultures.

Apply for This Position