← back to jobs
> job detail
G
👽Other

Staff Engineer (ML Engineer)

Graphcore · London, UK
// classified as
Other (Adjacent or hard to classify.)
posted
1d ago
location
London, UK
languages
python
tools
> stack
pythontensorflow
> description
<h2><strong>About Graphcore&nbsp;</strong></h2> <p>At Graphcore, we’re building the future of AI compute.</p> <p>We’re a team of semiconductor, software and AI experts, with deep experience in creating the complete AI compute stack - from silicon and software to infrastructure at datacenter scale.</p> <p>As part of the SoftBank Group, backed by significant long-term investment, we are delivering key technology into the fast-growing SoftBank AI ecosystem.</p> <p>To meet the vast and exciting AI opportunity, Graphcore is expanding its teams around the world.</p> <p>We are bringing together the brightest minds to solve the toughest problems, in a place where everyone has the opportunity to make an impact on the company, our products and the future of artificial intelligence</p> <h2><strong>Job Summary </strong></h2> <p data-start="120" data-end="486">Applicants for this role should have strong experience working with machine learning systems and frameworks, along with a solid understanding of core AI concepts and model behaviour. The role centres on testing, validating, and benchmarking a complex ML software stack, with a particular focus on performance, reliability, and correctness across modern AI workloads.</p> <p data-start="488" data-end="1009">The ideal candidate is an experienced ML engineer who understands how contemporary models are trained and executed, and who has hands-on experience debugging functional and performance issues in ML systems. This person will be comfortable working with industry-standard frameworks and state-of-the-art models, bringing them up on internal infrastructure, and collaborating closely with software and hardware teams in a technically demanding environment spanning ML frameworks, infrastructure, and AI accelerator hardware.</p> <h2><strong>The Team</strong></h2> <p><span data-contrast="auto">The&nbsp;ML&nbsp;QA team is composed of highly skilled software engineers with a strong focus on automation, software quality, and data-driven validation. The team works closely&nbsp;with&nbsp;</span><span data-contrast="none">industry-standard machine learning frameworks and models</span><span data-contrast="auto">, contributing to upstream open-source projects and collaborating across the wider software organization.</span><span data-ccp-props="{&quot;134233117&quot;:true,&quot;134233118&quot;:true,&quot;201341983&quot;:0,&quot;335559740&quot;:240}">&nbsp;</span></p> <p><span data-contrast="auto">Operating in a fast-paced environment, the team plays a critical role in ensuring reliability, performance, and maintainability across the ML software stack, helping to deliver robust and high-quality products to customers.</span><span data-ccp-props="{&quot;134233117&quot;:true,&quot;134233118&quot;:true,&quot;201341983&quot;:0,&quot;335559740&quot;:240}">&nbsp;</span></p> <h2><strong>Responsibilities and Duties</strong></h2> <p><span data-contrast="auto"><span class="TextRun SCXW130785288 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW130785288 BCX4">• Benchmark ML models and frameworks, analysing results to<span class="Apple-converted-space">&nbsp;</span></span><span class="NormalTextRun SCXW130785288 BCX4">identify</span><span class="NormalTextRun SCXW130785288 BCX4"><span class="Apple-converted-space">&nbsp;</span>regressions,<span class="Apple-converted-space">&nbsp;</span></span><span class="NormalTextRun SCXW130785288 BCX4">performance bottlenecks, and correctness issues.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW130785288 BCX4"><span class="SCXW130785288 BCX4">&nbsp;</span><br class="SCXW130785288 BCX4"></span><span class="TextRun SCXW130785288 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW130785288 BCX4">• Work hands-on with industry-standard ML frameworks to<span class="Apple-converted-space">&nbsp;</span></span><span class="NormalTextRun SCXW130785288 BCX4">validate</span><span class="NormalTextRun SCXW130785288 BCX4"><span class="Apple-converted-space">&nbsp;</span>functionality and performance across different execution environments.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW130785288 BCX4"><span class="SCXW130785288 BCX4">&nbsp;</span><br class="SCXW130785288 BCX4"></span><span class="TextRun SCXW130785288 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW130785288 BCX4">• Build and<span class="Apple-converted-space">&nbsp;</span></span><span class="NormalTextRun SCXW130785288 BCX4">maintain</span><span class="NormalTextRun SCXW130785288 BCX4"><span class="Apple-converted-space">&nbsp;</span>automated testing and benchmarking pipelines targeting simulators, emulators, and physical hardware.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW130785288 BCX4"><span class="SCXW130785288 BCX4">&nbsp;</span><br class="SCXW130785288 BCX4"></span><span class="TextRun SCXW130785288 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW130785288 BCX4">• Collaborate closely with software</span><span class="NormalTextRun SCXW130785288 BCX4"><span class="Apple-converted-space">&nbsp;</span>teams<span class="Apple-converted-space">&nbsp;</span></span><span class="NormalTextRun SCXW130785288 BCX4">to<span class="Apple-converted-space">&nbsp;</span></span><span class="NormalTextRun SCXW130785288 BCX4">ensure<span class="Apple-converted-space">&nbsp;</span></span><span class="NormalTextRun SCXW130785288 BCX4">adequate test coverage for new and existing features.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW130785288 BCX4"><span class="SCXW130785288 BCX4">&nbsp;</span><br class="SCXW130785288 BCX4"></span><span class="TextRun SCXW130785288 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW130785288 BCX4">• Develop tooling and scripts (primarily in Python) to support testing, benchmarking, and<span class="Apple-converted-space">&nbsp;</span></span><span class="NormalTextRun SCXW130785288 BCX4">functional reporting.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW130785288 BCX4"><span class="SCXW130785288 BCX4">&nbsp;</span><br class="SCXW130785288 BCX4"></span><span class="TextRun SCXW130785288 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW130785288 BCX4">•<span class="Apple-converted-space">&nbsp;</span></span><span class="NormalTextRun SCXW130785288 BCX4">Take ownership over aspects of our testing and infrastructure, owning the roadmap and driving innovation independently.</span></span></span></p> <h2><strong>Candidate Profile</strong></h2> <p><strong>Essential:</strong></p> <p><span data-contrast="auto"><span class="TextRun SCXW40476868 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW40476868 BCX4">• Experience working in Machine Learning or ML-adjacent engineering roles.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW40476868 BCX4"><span class="SCXW40476868 BCX4">&nbsp;</span><br class="SCXW40476868 BCX4"></span><span class="TextRun SCXW40476868 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW40476868 BCX4">•<span class="Apple-converted-space">&nbsp;</span></span><span class="NormalTextRun SCXW40476868 BCX4">Strong foundation</span><span class="NormalTextRun SCXW40476868 BCX4"><span class="Apple-converted-space">&nbsp;</span>in core AI and ML concepts (e.g. neural networks, training vs inference, numerical precision, performance trade-offs).</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW40476868 BCX4"><span class="SCXW40476868 BCX4">&nbsp;</span><br class="SCXW40476868 BCX4"></span><span class="TextRun SCXW40476868 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW40476868 BCX4">• Hands-on experience with one or more major ML frameworks such as<span class="Apple-converted-space">&nbsp;</span></span><span class="NormalTextRun SpellingErrorV2Themed SCXW40476868 BCX4">PyTorch</span><span class="NormalTextRun SCXW40476868 BCX4">, TensorFlow, JAX, or similar.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW40476868 BCX4"><span class="SCXW40476868 BCX4">&nbsp;</span><br class="SCXW40476868 BCX4"></span><span class="TextRun SCXW40476868 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW40476868 BCX4">• Strong<span class="Apple-converted-space">&nbsp;</span></span><span class="NormalTextRun SCXW40476868 BCX4">proficiency</span><span class="NormalTextRun SCXW40476868 BCX4"><span class="Apple-converted-space">&nbsp;</span>in Python for ML workflows, experimentation, and automation.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW40476868 BCX4"><span class="SCXW40476868 BCX4">&nbsp;</span><br class="SCXW40476868 BCX4"></span><span class="TextRun SCXW40476868 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW40476868 BCX4">• Experience designing, running, and analysing ML benchmarks or experiments.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW40476868 BCX4"><span class="SCXW40476868 BCX4">&nbsp;</span><br class="SCXW40476868 BCX4"></span><span class="TextRun SCXW40476868 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW40476868 BCX4">• Experience working in Linux environments.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW40476868 BCX4"><span class="SCXW40476868 BCX4">&nbsp;</span><br class="SCXW40476868 BCX4"></span><span class="TextRun SCXW40476868 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW40476868 BCX4">• Strong analytical and debugging skills, with the ability to reason about model behaviour and system performance.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW40476868 BCX4"><span class="SCXW40476868 BCX4">&nbsp;</span><br class="SCXW40476868 BCX4"></span><span class="TextRun SCXW40476868 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW40476868 BCX4">• Bachelor/Master's/PhD or equivalent experience in Computer Science, Maths, Machine Learning, Data Science, or related field.</span></span><span class="EOP SCXW40476868 BCX4" data-ccp-props="{&quot;134233117&quot;:true,&quot;134233118&quot;:true,&quot;201341983&quot;:0,&quot;335559740&quot;:240}">&nbsp;</span></span></p> <p><strong>Desirable</strong></p> <p><span data-contrast="none"><span class="TextRun SCXW213961340 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW213961340 BCX4">• Experience with<span class="Apple-converted-space">&nbsp;</span></span><span class="NormalTextRun SpellingErrorV2Themed SCXW213961340 BCX4">MLOps</span><span class="NormalTextRun SCXW213961340 BCX4"><span class="Apple-converted-space">&nbsp;</span>pipelines, model deployment, or production ML systems.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW213961340 BCX4"><span class="SCXW213961340 BCX4">&nbsp;</span><br class="SCXW213961340 BCX4"></span><span class="TextRun SCXW213961340 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW213961340 BCX4">• Familiarity with performance analysis, profiling tools, or numerical accuracy validation.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW213961340 BCX4"><span class="SCXW213961340 BCX4">&nbsp;</span><br class="SCXW213961340 BCX4"></span><span class="TextRun SCXW213961340 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW213961340 BCX4">• Exposure to distributed training or inference systems.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW213961340 BCX4"><span class="SCXW213961340 BCX4">&nbsp;</span><br class="SCXW213961340 BCX4"></span><span class="TextRun SCXW213961340 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW213961340 BCX4">• Experience with hardware-accelerated ML, compilers, or system-level performance considerations.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW213961340 BCX4"><span class="SCXW213961340 BCX4">&nbsp;</span><br class="SCXW213961340 BCX4"></span><span class="TextRun SCXW213961340 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW213961340 BCX4">• Familiarity with CI/CD systems used for ML workflows.</span></span><span class="LineBreakBlob BlobObject DragDrop SCXW213961340 BCX4"><span class="SCXW213961340 BCX4">&nbsp;</span><br class="SCXW213961340 BCX4"></span><span class="TextRun SCXW213961340 BCX4" lang="EN-GB" data-contrast="auto"><span class="NormalTextRun SCXW213961340 BCX4">• Experience contributing to open-source ML frameworks or tooling.</span></span><span class="EOP SCXW213961340 BCX4" data-ccp-props="{&quot;134233117&quot;:true,&quot;134233118&quot;:true,&quot;201341983&quot;:0,&quot;335559740&quot;:240}">&nbsp;</span></span></p> <h2><strong>Benefits</strong></h2> <p>In addition to a competitive salary, Graphcore offers flexible working, a generous annual leave policy, private medical insurance and health cash plan, a dental plan, pension (matched up to 5%), life assurance and income protection. We have a generous parental leave policy and an employee assistance programme (which includes health, mental wellbeing, and bereavement support). We offer a range of healthy food and snacks at our central Bristol office and have our own barista bar! We welcome people of different backgrounds and experiences; we’re committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interview and encourage you to chat to us if you require any reasonable adjustments.</p> <p>&nbsp;</p> <p><em data-stringify-type="italic">Applicants for this position must hold the right to work in the UK. Unfortunately at this time, we are unable to provide visa sponsorship or support for visa applications</em></p>