← back to jobs
> job detail
C
👽Other

Principal Data Engineer

Cedar ¡ USA
// classified as
Other (Adjacent or hard to classify.)
posted
<1d ago
location
USA
languages
python, sql
tools
dbt, fivetran, kafka
> stack
pythonsqldbtfivetrankafkasnowflakeairflowdbt
> description
<div class="content-intro"><p>Our healthcare system is the leading cause of personal bankruptcy in the U.S. Every year, over 50 million Americans suffer adverse financial consequences as a result of seeking care, from lower credit scores to garnished wages. The challenge is only getting worse, as high deductible health plans are the fastest growing plan design in the U.S.</p> <p>Cedar’s mission is to leverage data science, smart product design and personalization to make healthcare more affordable and accessible. Today, healthcare providers still engage with its consumers in a “one-size-fits-all” approach; and Cedar is excited to leverage consumer best practices to deliver a superior experience.</p></div><p><strong>Principal Data Architect &amp; Engineer</strong></p> <h2>Why this Job?</h2> <p>Cedar is at a pivotal moment in its technical evolution. We are evolving towards a brand-new, enterprise data platform and model, architected specifically to power the future of Cedar’s product offerings. As the Principal Data Architect, you will be the senior technical authority leading this transition, ensuring our new data foundation is clean, scalable, and cleanly separated from legacy debt.</p> <p>This is a hands-on technical leadership role for an engineer who loves building production systems at scale and views AI as a massive accelerator for both personal and team productivity.</p> <h2>What You’ll Do</h2> <ul> <li>Architect the Future-State Data Model &amp; Storage: Lead the design and execution of a new, high-scale data model and storage architecture that lives outside our legacy monolith. You will drive decisions on service boundaries, data ownership, and storage patterns—implementing Medallion architecture (Bronze, Silver, Gold layers) to ensure progressive data quality refinement—while defining the API contracts that will anchor Cedar’s data strategy for years to come. This includes real-time event-driven pipelines and streaming architectures alongside batch analytics.</li> <li>Ensure Metrics Reliability: Own the accuracy and reliability of Cedar’s core business metrics including collection rate, days in AR, and invoice balance tracking. Design validation frameworks that ensure our source of truth remains consistent across all reporting and product surfaces.</li> <li>Power Intelligent Agents: Build the real-time data foundations that power Cedar’s AI-driven products, ensuring agents, like <a href="https://www.cedar.com/solutions/kora-ai">Kora</a>, have the context, entity graphs, and metrics they need to act autonomously in revenue cycle workflows and transform the patient experience.</li> <li>Design for the Future: Design data models for new product lines that expand Cedar’s presence across the full revenue cycle from pre-service through post-service collections.</li> <li>Build Production Code: You are a hands-on builder. You will write, ship, and maintain high-quality production code alongside our team of Data Engineers. You will personally model critical domains and implement the core patterns that the rest of the team will follow.</li> <li>Guide and Mentor Engineers: Act as the technical role model for a team of Data Engineers. You will raise the bar for engineering excellence through rigorous code reviews, design guidance, and technical mentorship, turning every challenge into a growth opportunity for the team.</li> <li>Collaborate: Drive technical alignment across data engineering, platform, and product teams, set priorities, unblock dependencies, and ensure delivery against customer-facing commitments.</li> <li>Steward Data Integrity: Define and enforce standards for Cedar's enterprise Data Dictionary and metadata strategy. You will partner with engineering and product to ensure data is accurate, discoverable, and synchronized across all environments.</li> </ul> <h2>What You Bring</h2> <ul> <li>Strategic Technical Authority: 10+ years of experience in data engineering and backend systems at scale. You have experience leading large technical projects with a high accountability mentality, focused on delivering value, with a proven track record of shipping production enterprise systems that customers depend on.</li> <li>Modern Stack Mastery: Deep, hands-on proficiency with a wide range of data and engineering technologies and patterns such as Snowflake, dbt, Liquibase, Fivetran, Airflow, OpenMetadata, Kafka, SQL, Python, Kafka, streaming/event driven architectures, CDC (Change Data Capture), and real-time data processing. You have built production-grade ELT/ETL pipelines and managed complex data transformations in cloud-first environments.</li> <li>Metrics &amp; Integrity Focus: Experience building "source of truth" systems and a passion for data accuracy. You understand how to define and audit key business metrics to ensure they remain resilient to system changes. You have experience with data validation and reconciliation, comparing pipeline output against source systems and methodically diagnosing discrepancies.</li> <li>ML/AI Fuel: You have experience building data APIs, context systems, or feature stores that power downstream ML/AI applications.</li> <li>Architectural Judgment: You can spot over-engineering and under-engineering with equal confidence. You know how to design for resilience, modularity, and backwards compatibility. You prefer incremental milestones over big bang deliverables.</li> <li>Clear Communication: You can translate complex technical risks and trade-offs into business-relevant terms for senior stakeholders and cross-functional partners.</li> <li>AI-Native Workflow: You are an avid learner in general, and an adopter of AI engineering tools like Claude Code and Codex. You have experience directing AI agents to generate complete, high-quality solutions and apply your judgment to test and own the output.</li> <li>[highly preferred] Healthcare Domain Expertise: Demonstrable understanding of healthcare concepts, including revenue cycle management (RCM), billing, and insurance adjudication.</li> </ul> <p><strong>Compensation Range and Benefits</strong></p> <ul> <li>Salary/Hourly Rate Range*: $246,500 - $311,750</li> <li>This role is equity eligible</li> <li>This role offers a competitive benefits and wellness package</li> </ul> <p>*Subject to location, experience, and education</p><div class="content-conclusion"><p><strong>What do we offer to the ideal candidate?</strong></p> <ul> <li>A chance to improve the U.S. healthcare system at a high-growth company! Our leading healthcare financial platform is scaling rapidly, helping millions of patients per year</li> <li>Unless stated otherwise, most roles have flexibility to work from home or in the office, depending on what works best for you</li> <li>For exempt employees: Unlimited PTO for vacation, sick and mental health days–we encourage everyone to take at least 20 days of vacation per year to ensure dedicated time to spend with loved ones, explore, rest and recharge</li> <li>16 weeks paid parental leave with health benefits for all parents, plus flexible re-entry schedules for returning to work</li> <li>Diversity initiatives that encourage Cedarians to bring their whole selves to work, including three employee resource groups: be@cedar (for BIPOC-identifying Cedarians and their allies), Pridecones (for LGBTQIA+ Cedarians and their allies) and Cedar Women+ (for female-identifying Cedarians)&nbsp;</li> <li>Competitive pay, equity (for qualifying roles), and health benefits, including fertility &amp; adoption assistance, that start on the first of the month following your start date (or on your start date if your start date coincides with the first of the month)</li> <li>Cedar matches 100% of your 401(k) contributions, up to 3% of your annual compensation</li> <li>Access to hands-on mentorship, employee and management coaching, and a team discretionary budget for learning and development resources to help you grow both professionally and personally</li> </ul> <p><strong>About us&nbsp;</strong></p> <p>Cedar was co-founded by Florian Otto and Arel Lidow in 2016 after a negative medical billing experience inspired them to help improve our healthcare system. With a commitment to solving billing and patient experience issues, Cedar has become a leading healthcare technology company fueled by remarkable growth. "Over the past several years, we've raised more than $350 million in funding &amp; have the active support of Thrive and Andreessen Horowitz (a16z).</p> <p>As of November 2024, Cedar is engaging with 26 million patients annually and is on target to process $3.5 billion in patient payments annually. Cedar partners with more than 55 leading healthcare providers and payers including Highmark Inc., Allegheny Health Network, Novant Health, Allina Health and Providence.</p></div>