Site Reliability Engineering Lead
Background
At Motion Applied – Connected Intelligence (CI), we create cutting-edge wireless connectivity solutions that are transforming customer experience across the transport industry. We create solutions that drive efficiency and cost-effectiveness for customers, delivering unrivalled internet connectivity services.
By applying our expertise, we deliver real benefits and pioneer a better future.
Purpose of the Role
This is an opportunity to join our Software Engineering community as a Site Reliability Engineer (SRE), leading initiatives that create and improve the CI ‘Platform as a Service’ (PaaS) offering to the product delivery teams:
- Enabling product development teams to deliver software products on immutable infrastructure.
- Developing and facilitating production and development infrastructure and associated tooling.
- Integrating third-party managed services used for delivery and development lifecycles.
You will lead the implementation of the CI cloud governance policy in the form of the PaaS and be a key participant in maintaining and updating the policy in accordance with technology shifts, customer feedback and product development needs.
You will need to be an open-minded technologist who values a collaborative work environment and is willing to learn and explore as our fast-paced industry evolves.
Key Responsibilities
- Key contributor to the Roadmap for the CI Platform.
- Development and maintenance of the CI Platform.
- Collaborate and consult with software engineers and data scientists to help design and implement robust and scalable software products.
- Knowledge sharing and education of team members to enable our DevOps culture.
- Proactively monitor costs and security posture of the CI Platform and products running on it.
- Define and implement tooling to continually improve our software development, release and maintenance processes.
- Develop product features with product delivery teams, building upon the CI Platform offering.
- Supporting our live systems, including identifying and implementing improvements to products, tools and processes that strengthen the ongoing reliability of our solutions.
Example scenarios you will be helping us with
- Design and implement infrastructure for multi-region and multi-tenant products and platform.
- Design and implement monitoring infrastructure for real-time data streams.
- Design network and access to allow software engineers and data scientists to access services in AWS while keeping the services and data safe and secure.
- Enable and collaborate with teams to automate the entire delivery of a product, from a single web application to the configuration of a cloud account.
- Design and implement security and access management so that users and roles have access only to the resources they need, both within the AWS account and when attaching IoT devices.
- Identify the root cause of live issues, both to help recover from the immediate situation and to design and implement improvements for future reliability.
Experience we are looking for
- Working with delivery teams deploying software on the cloud.
- Strategic technical leadership, in particular related to Site Reliability Engineering.
- Evidence of tailored and contextual communication in all directions to realise value through feedback and enquiry.
- Hands-on experience in delivering production-quality services.
- Experience in supporting live production systems.
Required
We are looking for an applicant who has:
- Bachelor's degree in computer science or a similar technical field of study, or equivalent practical experience.
- 5+ years of hands-on experience with large cloud providers such as AWS, Azure, GCP, or OCI.
- Experience leading technical initiatives, including roadmap input, cross-team collaboration and mentoring.
- Proficiency in Infrastructure as Code (IaC) tools such as Terraform.
- 3+ years of hands-on experience with containerization and orchestration tools like Docker and Kubernetes.
- Experience working in and developing for a Linux environment.
- Excellent programming, debugging, and optimization skills in at least one general-purpose programming language (Go or Python preferred).
- Solid understanding of DevOps practices, CI/CD pipelines and version control (e.g. Git).
- Knowledge of observability and monitoring tools such as Prometheus, Grafana and ELK.
- Experience in troubleshooting incidents and live environments.
- Ability to write and speak English fluently.
Desirable & Development Areas
Some of the below would be beneficial to the applicant, but opportunities to develop in these areas will also be provided:
- Familiarity with AWS Well-Architected Framework.
- Designing, building and maintaining multi-tenant and multi-region cloud infrastructure.
- Exposure to configuration management tools like Ansible or SaltStack.
- Knowledge of cyber security, including but not limited to threat intelligence, IAM, key management systems, data security, application security, applied cryptography and certificate management.
- Cost optimisation experience at a platform or organisation level.
- Experience of IoT systems integration, protocols, and services, including gRPC and MQTT.
- Networking concepts and network performance modelling / optimisation.
- Experience with HashiCorp tools like Vault or Consul.
- AWS Certified Solutions Architect – Professional or AWS Certified DevOps Engineer – Professional.
Location
This role is based at our head office in central Woking. The role is included in our hybrid working policy with the expectation of a minimum of two days a week in the office, which should be co-ordinated with other members of the Engineering team.
What we can offer you
In return for everything you bring to the table, we can ensure an exciting, challenging role in a dynamic business surrounded by some of the best people in their respective fields.
At Motion Applied we firmly believe it’s the relationships and friendships we create while working that make us special. We’re also aware that the world is changing and we are part of that change. We all want and need different things from our work and home lives, so, if you have commitments outside of work, we’re open to talking through flexible working options that work for you and us.
- Annual leave (25 days + bank holidays, pro-rated for part time colleagues).
- Enhanced Company Maternity, Paternity and Adoption leave and pay.
- Flexible working policies, including Hybrid Working.
- Life assurance to the value of 4 times base salary.
- Opportunity to join the Motion Applied Pension Plan.
- Company funded individual private healthcare with the opportunity to extend to partner or spouse and/or dependents at a discounted rate.
- Electric car scheme – opportunity to drive a brand-new car in a more affordable way through this salary sacrifice scheme. Employees are eligible to join the scheme after successful completion of their probationary period.
Who we are
Motion Applied are a medium-sized tech firm spun out of McLaren Group. We’re looking for people who will thrive in a non-hierarchical, growth-orientated company, self-starters who are flexible and somewhat entrepreneurial in their approach.
Motion Applied are committed to Diversity, Equality and Inclusion (DEI) and promote DEI in all we do. Motion Applied are also members of the UK Government Disability Confident Scheme.