Back to the board

Data Engineer, Databricks, Python, Azure

100% remote Flexible hours Hiring now

Job Description:

  • Design, develop, and maintain scalable data pipelines using Databricks (PySpark) and Python.
  • Build and optimize ETL/ELT processes within Azure cloud environments.
  • Implement data models following modern Data Lakehouse principles (e.g., Medallion architecture).
  • Ensure data quality, consistency, and performance across ingestion, staging, and curated layers.
  • Collaborate with data architects, analysts, and business stakeholders to translate healthcare data requirements into technical solutions.
  • Develop reusable data transformation logic and modular processing components.
  • Support deployment processes following CI/CD and DevOps best practices.
  • Monitor and optimize data workflows for performance, scalability, and reliability.
  • Contribute to data governance, security, and compliance practices relevant to healthcare environments.

Requirements:

  • Current knowledge of an using modern data tools like (Databricks, FiveTran, Data Fabric and others); Core experience with data architecture, data integrations, data warehousing, and ETL/ELT processes
  • Applied experience with developing and deploying custom whl and or in session notebook scripts for custom execution across parallel executor and worker nodes
  • Applied experience in SQL, Stored Procedures, and Pyspark based on area of data platform specialization.
  • Strong knowledge of cloud and hybrid relational database systems, such as MS SQL Server, PostgresSQL, Oracle, Azure SQL, AWS RDS, Aurora or a comparable engine.
  • Strong experience with batch and streaming data processing techniques and file compactization strategies.
  • Strong analytical and problem-solving skills.
  • Ability to work effectively in cross-functional and distributed teams.
  • Clear communication skills, with the ability to explain technical concepts to non-technical stakeholders.
  • Proactive mindset with a strong sense of ownership.
  • Commitment to delivering high-quality, reliable data solutions.

Benefits:

  • Allata is an equal opportunity employer
  • We celebrate diversity and are committed to creating an inclusive environment for all employees

Apply tot his job Apply To this Job

Keep exploring

Lead Engineer, Applications - Azure/Container/.Net Core/ADF - Remote

100% remote Flexible hours

Azure Integration & Logic Apps

100% remote Flexible hours

GCP Data Quality Test Engineer – Retail Domain

100% remote Flexible hours

Data Engineer (Azure, Fabric, Databricks)

100% remote Flexible hours

GCP Security SecDevOps Engineer-NYC, NY or Alpharetta, GA

100% remote Flexible hours

Google Cloud DevOps Engineer

100% remote Flexible hours

Healthcare Data Software Engineer (Azure, Kafka, Databricks, Coding, Healthcare Data)

100% remote Flexible hours

Senior Data Engineer (GCP, BigQuery, dbt)

100% remote Flexible hours

DevOps Engineer job at EverDriven Technologies in Greenwood Village, CO

100% remote Flexible hours

Senior Azure Systems & Platform Engineer

100% remote Flexible hours

Interface Analyst

100% remote Flexible hours

[Remote] Mid/Senior - Data Scientist

100% remote Flexible hours

Senior/Principal Statistician - Medical Affairs (Remote)

100% remote Flexible hours

Experienced Customer Service Representative - Hybrid - Must be Located in OK - $16.25 / hour, Monday - Friday!

100% remote Flexible hours

Customer Success Manager

100% remote Flexible hours

Account Manager, Affiliate Marketing

100% remote Flexible hours

Remote Customer Support Associate – arenaflex – Delivering Exceptional Service for Food Delivery Platform

100% remote Flexible hours

Experienced Part-Time Data Entry Specialist – Flexible Hours – arenaflex

100% remote Flexible hours

Remote Customer Service Jobs $25 An Hour - CA

100% remote Flexible hours

[Remote] Manager - Business Development (Virtual - Miami)

100% remote Flexible hours