Job Title Databricks Data Lead Location San Rafael, CA - Bay Area, California (Local candidates only) Hybrid: 3 days/week onsite at client office in San Rafael, CA Experience Level 8-12 years in data engineering, analytics engineering, or Distributed data systems Role Overview We are seeking a Databricks Data Lead to support the design, implementation, and optimization of cloud-native data platforms built on the Databricks Lakehouse Architecture . This is a hands-on, engineering-driven role requiring deep experience with Apache Spark, Delta Lake, and scalable data pipeline development , combined with early-stage architectural responsibilities. The role involves close onsite collaboration with client stakeholders , translating analytical and operational requirements into robust, high-performance data architectures , while adhering to best practices for data modeling, governance, reliability, and cost efficiency . Key Responsibilities Design, develop, and maintain batch and near-real-time data pipelines using Databricks, PySpark, and Spark SQL Implement Medallion (Bronze/Silver/Gold) Lakehouse architectures , ensuring proper data quality, lineage, and transformation logic across layers Build and manage Delta Lake tables , including schema evolution, ACID transactions, time travel, and optimized data layouts Apply performance optimization techniques such as partitioning strategies, Z-Ordering, caching, broadcast joins, and Spark execution tuning Support dimensional and analytical data modeling for downstream consumption by BI tools and analytics applications Assist in defining data ingestion patterns (batch, incremental loads, CDC, and streaming where applicable) Troubleshoot and resolve pipeline failures, data quality issues, and Spark job performance bottlenecks Collaborate onsite with client data engineers, analysts, and business stakeholders to: Gather technical requirements Review architecture designs Validate implementation approaches Maintain technical documentation covering data flows, transformation logic, table designs, and architectural decisions Contribute to code reviews, CI/CD practices, and version control workflows to ensure maintainable and production-grade solutions Required Skills
Read Less