(Senior) Data Engineer

Flagship Pioneering, Inc. · Boston, MA USA
// classified as
Data Engineer (Pipelines, infra, ingestion, ETL.)
posted
37d ago
location
Boston, MA USA
languages
tools
aws, dbt, dynamodb
> stack
aws, dbt, dynamodb, iceberg, nosql, redshift, s3, dagster
> education
ms, phd
> description
<div class="" data-block="true" data-editor="c12n7" data-offset-key="dhkj5-0-0"> <div class="" data-block="true" data-editor="c12n7" data-offset-key="6ni4e-0-0"> <div class="public-DraftStyleDefault-block public-DraftStyleDefault-ltr" data-offset-key="6ni4e-0-0"> <p><strong>About ProFound Therapeutics</strong></p> <p>ProFound Therapeutics is pioneering the discovery of the expanded human proteome to unlock a new universe of potential therapeutics. By integrating multi-omics, advanced computation, and translational biology, we aim to reveal and characterize thousands of previously uncharted proteins and systematically explore their role in health and disease.</p> <p><strong>The Role</strong></p> <p>We're seeking a (Senior) Data Engineer to join our data team. This individual will play a central role in building the data foundation that powers ProFound's drug discovery platform. You'll architect systems that integrate diverse biological data—from genomics and proteomics to imaging and perturbation experiments—enabling our scientists to make breakthrough discoveries and our ML models to identify novel therapeutic targets.</p> <p>This role offers the unique challenge of working at the intersection of computational biology, machine learning, and modern data engineering, with the impact of accelerating life-saving therapeutics.</p> <p><strong>Key Responsibilities</strong></p> <ul> <li>Contribute to design and scaling of our multi-modal data platform that integrates public and proprietary biological data (genomics, transcriptomics, proteomics, imaging, perturbation data) across data lakes, graph databases, relational and NoSQL databases, and data warehouses, enabling ML training, computational biology pipelines, and scientific exploration.</li> <li>Build production data pipelines and workflows that automate data ingestion and transformation, working with domain experts to optimize analysis pipelines for scientific discovery.</li> <li>Partner with computational and wet-lab 
scientists to model experimental data, manage instrument outputs and electronic lab notebook data, and ensure seamless integration into our data platform.</li> <li>Develop and manage cloud infrastructure on AWS following best practices and the Well-Architected Framework, with a focus on scalability, security, and cost optimization.</li> <li>Contribute to the data engineering team’s best practices, including comprehensive documentation, monitoring and observability, and robust testing frameworks.</li> <li>Collaborate with external partners, including CROs, vendors, and consultants, to coordinate data transfers and support platform integrations.</li> </ul> <p><strong>Required Qualifications</strong></p> <ul> <li>BS, MS, or PhD in Computer Science, Bioinformatics, or related field with 0-4 years of professional data engineering experience.</li> <li>Background in scientific domains (biology, chemistry, or related fields).</li> <li>Python expertise, including data science libraries and testing frameworks.</li> <li>AWS experience with storage, database, compute, and analytics services (S3, RDS, DynamoDB, Redshift, Lambda, EC2, Batch, ECS, Glue, Athena).</li> <li>Proven experience designing, deploying, and maintaining production data pipelines at scale.</li> <li>Hands-on experience with workflow orchestration systems (AWS Step Functions, Nextflow, dbt, Dagster) and event-driven architectures.</li> <li>Working knowledge of CI/CD frameworks, infrastructure-as-code (CloudFormation or AWS CDK), and containerization (Docker).</li> <li>Strong technical communication skills, with the ability to translate complex technical concepts for scientific audiences and collaborate effectively across disciplines.</li> <li>Demonstrated ability to thrive in dynamic environments, prioritize competing demands, and make pragmatic trade-offs in a fast-paced startup setting.</li> </ul> <p><strong>Preferred Qualifications</strong></p> <ul> <li>Experience with data lakes and open table formats (Iceberg
preferred).</li> <li>Experience with knowledge graph technologies and graph databases (Neo4j).</li> <li>Familiarity with lab data management systems (LIMS, ELN, integrated data lakes).</li> <li>Experience with MLOps practices and tools for model training pipelines, experiment tracking, and model deployment.</li> <li>AWS certification (Associate or Professional level).</li> </ul> <p><strong>Why ProFound</strong></p> <p>This role offers the opportunity to shape the technical and scientific foundation of a next-generation drug discovery platform. You will operate with real ownership, influence core architectural decisions, and work directly with leaders in AI, human genetics, and computational biology to expand the human proteome and uncover new therapeutic opportunities.</p> <p><strong>ABOUT FLAGSHIP PIONEERING:</strong></p> <p>Flagship Pioneering invents and builds platform companies, each with the potential for multiple products that transform human health, sustainability and beyond. Since its launch in 2000, Flagship has originated more than 100 companies. Many of these companies have addressed humanity’s most urgent challenges: vaccinating billions of people against COVID-19, curing intractable diseases, improving human health, preempting illness, and feeding the world by improving the resiliency and sustainability of agriculture.</p> <p>Flagship has been recognized twice on FORTUNE’s “Change the World” list, an annual ranking of companies that have made a positive social and environmental impact through activities that are part of their core business strategies, and has been twice named to Fast Company’s annual list of the World’s Most Innovative Companies.
Learn more about Flagship at <a href="http://www.flagshippioneering.com/">www.flagshippioneering.com</a>.</p> <p>At Flagship, we accept impossible missions to enable bigger leaps. Our <a href="https://www.flagshippioneering.com/values">core values</a> guide us through uncertainty and toward lasting impact.</p> <p><strong>We are an equal opportunity employer</strong>. All qualified applicants will be considered for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic protected by law.</p> <p>We recognize that great candidates often bring unique strengths <strong>without fulfilling every qualification</strong>. If you have some of the experience listed above but not all, please apply anyway. We are dedicated to building diverse and inclusive teams and look forward to learning more about your background<strong> and interest in Flagship.</strong></p> <p><strong><em>Recruitment &amp; Staffing Agencies</em></strong><em>: Flagship Pioneering and its affiliated Flagship Lab companies (collectively, “FSP”) do not accept unsolicited resumes from any source other than candidates. The submission of unsolicited resumes by recruitment or staffing agencies to FSP or its employees is strictly prohibited unless contacted directly by Flagship Pioneering’s internal Talent Acquisition team.
Any resume submitted by an agency in the absence of a signed agreement will automatically become the property of FSP, and FSP will not owe any referral or other fees with respect thereto.</em></p> </div> </div> </div> <p id="pay-transparency" style="margin: 0 !important; padding: 0 !important; float: none !important; clear: both !important; display: block;">The salary range for this role is $74,000 - $176,000. Compensation for the role will depend on a number of factors, including a candidate’s qualifications, skills, competencies, and experience. ProFound Therapeutics, Inc. currently offers healthcare coverage, an annual incentive program, retirement benefits, and a broad range of other benefits. Compensation and benefits information is based on ProFound Therapeutics, Inc.'s good faith estimate as of the date of publication and may be modified in the future.</p><div class="content-conclusion"><p></p> <p style="text-align: left;"><strong>Privacy Notice for Applicants:&nbsp;</strong>When you apply for a role at Flagship Pioneering or one of its portfolio companies, we collect and use personal information you provide (such as your name, contact details, work history, and application materials) to evaluate your application, communicate with you, and comply with legal obligations. Your application data is processed through Greenhouse, our applicant tracking system, and may also be reviewed using AI-assisted screening tools. We do not sell your personal information. California residents have rights under the CCPA/CPRA including to know, delete, and opt out of the sharing of their personal information. If you are located in the EU or UK, we process your data under GDPR and you have rights to access, rectify, and erase your data. To exercise your rights or for questions, contact privacy@flagshippioneering.com.
Full Applicant Privacy Notice: flagshippioneering.com/privacy-policy.</p></div>