> job detail
C
🧪Data Scientist
Data Engineer, Human Cohorts
Calico · South San Francisco, CA
// classified as
Data Scientist (Modeling, experiments, research.)
posted
2d ago
location
South San Francisco, CA
languages
python, sql
tools
aws, kubernetes
> stack
pythonsqlawskubernetesfastapi
> education
phd
> description
<h3><strong>Who We Are:</strong></h3>
<p>Calico (Calico Life Sciences LLC) is an Alphabet-founded research and development company whose mission is to harness advanced technologies and model systems to increase our understanding of the biology that controls human aging. Calico will use that knowledge to devise interventions that enable people to lead longer and healthier lives. Calico’s highly innovative technology labs, its commitment to curiosity-driven discovery science, and, with academic and industry partners, its vibrant drug-development pipeline, together create an inspiring and exciting place to catalyze and enable medical breakthroughs.</p>
<h3><strong>Position Description:</strong></h3>
<p>Calico is seeking a Data Engineer to join our highly collaborative Engineering team and focus on developing high-performance research data infrastructure for large human cohorts. To succeed, you will need to be an enthusiastic team player, detail-oriented, extremely organized, and comfortable working on complex data, software, and scientific problems.</p>
<p>In this position, you will be the engineering lead for data infrastructure to support our human biology teams. You will drive projects from requirements-gathering to production deployment, engineering high-performance data systems that integrate with our internal data systems and our internally-developed AI platform.</p>
<h3><strong>Position Responsibilities:</strong></h3>
<ul>
<li><strong>End-to-End Project Ownership:</strong> Collaborate with data scientists and bench scientists to gather requirements, architect solutions, and deploy production-grade software that facilitates data movement, transformation, analysis, and visualization</li>
<li><strong>Data Flow Architecture:</strong> Define and optimize data flows across the organization</li>
<li><strong>Full-Stack Tool Development:</strong> Develop data systems and internal web applications (using React and Python) that allow stakeholders to review, visualize, and communicate complex scientific data</li>
<li><strong>Mentorship & Leadership:</strong> Serve as a strong technical voice within a larger Engineering team; provide mentorship to junior engineers across Calico and help onboard future hires</li>
<li><strong>Engineering Excellence:</strong> Champion best practices for infrastructure-as-code, CI/CD, and containerization while helping to set standards for data engineering at Calico</li>
</ul>
<h3><strong>Position Requirements:</strong></h3>
<ul>
<li>BS/MS/PhD in Computer Science, Data Science, or a related technical field, or equivalent practical experience</li>
<li>4+ years (for BS/MS) or 1-2 years (for PhD) of professional software or data engineering experience developing robust, production-grade, and high-performance R&D-focused information systems</li>
<li>Experience working with large-scale biological datasets</li>
<li>Fluency in Python and SQL with a strong grasp of software and data engineering principles (testing, modularity, design patterns, data modeling)</li>
<li>Demonstrated experience developing and deploying cloud-based applications on Google Cloud Platform (GCP) (preferred), AWS, or Azure</li>
<li>Strong experience with modern web frameworks and infrastructure, specifically FastAPI, React, Kubernetes, and Terraform</li>
<li>Proven ability to lead complex projects involving diverse stakeholders (e.g., ML engineers, computational biologists, bench scientists) from concept to production</li>
<li>Experience enforcing robust data governance policies and compliance with internal information security standards and best practices</li>
<li>Must be willing to work onsite at least four days per week</li>
</ul>
<p>The estimated base salary range for this role is $191,000 - $195,000. Actual pay will be based on a number of factors including experience and qualifications. This position is also eligible for two annual cash bonuses.</p>
<h2 id="ojxVC" class="wLzlc"></h2>
<p> </p>