Software Engineering Manager 1 â Streaming & Cloud Platform Reliability
 Â
This role has been designed as ââOnsiteâ with an expectation that you will primarily work from an HPE office.Who We Are:
Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in todayâs complex world. Our culture thrives on finding new and better ways to accelerate whatâs next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE.
Job Description:
  Â
Weâre looking for a handsâon Software Engineering Manager to lead a small team (2â4 developers) focused on improving the reliability of Mistâs cloud platform by driving concrete postmortem action items from our incident management process.
This team owns followâups from production incidentsâespecially those involving our streaming data pipelines (Kafka / Flink / Storm) and core APIs. Youâll work closely with senior engineers to turn incident learnings into durable engineering improvements.
This is a hybrid role requiring onâsite collaboration multiple days per week in Cupertino, California. Due to the requirements of this position, this role requires a US Citizen or Green Card holder.
What Youâll Do
- Own and drive postâincident followâups from our Incident Management process, turning incident reports into design and implementation work.
- Lead, mentor, and grow a 2â4 person engineering team, while contributing handsâon code in production services.
- Design, implement, and harden streaming topologies using Kafka, Storm, and/or Flink (e.g., stats, telemetry, alerts, pcaps).
- Improve reliability of core APIs (REST API, WebSocket, Webhooks, etc.), including auth, rate limiting, and DRâsensitive flows.
- Enhance observability and runbooks: add metrics/alerts, define SLOs, and codify playbooks for recurring incident patterns.
- Collaborate with SRE, Platform, and Data teams on DR, multiâregion, and multiâcloud behavior (AWS, GCP, DR regions).
- Ensure robust testing and deployment practices (unit/integration tests, regression tests for past incidents, safe rollout/rollback).
Experience Required for this Role
- 7+ years total professional software engineering experience.
- This is a hybrid role requiring onâsite collaboration multiple days per week in Cupertino, California. Due to the requirements of this position, this role requires a US Citizen or Green Card holder.
- 2+ years in a team lead role (mentors, performance feedback, prioritization), while remaining handsâon technically.
- 5+ years building backend or distributed systems in Python, Go, or Java proficiency in at least one of these languages to lead design reviews and contribute production code.
- 3+ years designing, implementing, and operating distributed, eventâdriven systems using:
- Kafka and at least one of Flink or Storm, or a comparable streaming framework.
- 3+ years building and operating RESTful APIs (service design, auth, rate limiting, client IP handling, versioning).
- 3+ years working with cloudânative infrastructure:
- Kubernetes, containerized microservices, CI/CD pipelines.
- 3+ years with production datastores such as:
- Redis, Postgres, Cassandra/Datastax, S3/GCS, or similar distributed storage systems.
- 2+ years directly involved in production incident response:
- Onâcall participation, postmortems, and driving remediation work through to completion.
- Proven ability to debug latency, throughput, data correctness, and availability issues in streaming pipelines and/or APIs.
- Experience adding or improving metrics, logging, tracing, and alerts for production services.
Preferred Qualifications
- 2+ years working with bigâdata / analytics or ETL systems
(e.g., Apache Spark, Airflow, Snowflake, or similar). - Experience with webhook or eventâdelivery systems (idempotency, retries, ordering, DLQs).
- Exposure to multiâregion / DR design: crossâcloud migrations, DNS and certificate management, environmentâdriven configuration.
- Familiarity with DevOps practices, CI/CD automation, and service ownership.
- Experience with observability stacks such as Prometheus, Grafana, Kibana/Elasticsearch.
Why This Role
- Direct, visible impact on the stability and reliability of Mistâs cloud platform and AIâdriven networking products.
- A focused charter with real, concrete backlogs driven by incidentsânot vague âplatform work.â
- Close collaboration with strong senior engineers and SREs, with room to shape both technical direction and team culture.
Additional Skills:
What We Can Offer You:
Health & Wellbeing
We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.
Personal & Professional Development
We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have â whether you want to become a knowledge expert in your field or apply your skills to another division.
Unconditional Inclusion
We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.
Let's Stay Connected:
Follow @HPECareers on Instagram to see the latest on people, culture and tech at HPE.
Job:
EngineeringJob Level:
Manager_1Â Â Â Â
"The expected salary/wage range for this position is provided below. Actual offer may vary from this range based upon geographic location, work experience, education/training, and/or skill level.â United States of America: Annual Salary USD 155,500 - 315,000 in California
The listed salary range reflects base salary. Variable incentives may also be offered."
Information about employee benefits offered in the US can be found at https://myhperewards.com/main/new-hire-enrollment.html
HPE is an Equal Employment Opportunity/ Veterans/Disabled/LGBT employer. We do not discriminate on the basis of race, gender, or any other protected category, and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together. Please click here: Equal Employment Opportunity.
Hewlett Packard Enterprise is EEO Protected Veteran/ Individual with Disabilities.
  Â
HPE will comply with all applicable laws related to employer use of arrest and conviction records, including laws requiring employers to consider for employment qualified applicants with criminal histories.
  Â
No Fees Notice & Recruitment Fraud Disclaimer
Â
It has come to HPEâs attention that there has been an increase in recruitment fraud whereby scammer impersonate HPE or HPE-authorized recruiting agencies and offer fake employment opportunities to candidates. These scammers often seek to obtain personal information or money from candidates.
Â
Please note that Hewlett Packard Enterprise (HPE), its direct and indirect subsidiaries and affiliated companies, and its authorized recruitment agencies/vendors will never charge any candidate a registration fee, hiring fee, or any other fee in connection with its recruitment and hiring process.  The credentials of any hiring agency that claims to be working with HPE for recruitment of talent should be verified by candidates and candidates shall be solely responsible to conduct such verification. Any candidate/individual who relies on the erroneous representations made by fraudulent employment agencies does so at their own risk, and HPE disclaims liability for any damages or claims that may result from any such communication.