[Remote] Engineering Manager, DevOps
Note: The job is a remote job and is open to candidates in USA. Vannevar is a defense technology company focused on building AI to enhance national security and deter adversaries. They are looking for a DevOps Engineering Manager to lead a team that ensures platform reliability and secure delivery while collaborating across engineering, security, and product teams.
Responsibilities
- Lead and develop a DevOps team: Hire, mentor, and grow engineers; set clear expectations and create an environment of ownership, high standards, and continuous improvement
- Drive platform reliability: Own the health of CI/CD, deployment, and runtime infrastructure; improve availability, performance, and incident response through measurable SLOs and operational rigor
- Build self-service and automation: Create developer-facing tooling to reduce toil (golden paths, paved roads, templates, and automation for common workflows)
- Evolve CI/CD and release engineering: Improve build and deploy pipelines, change management, release safety (progressive delivery, rollbacks), and supply-chain security
- Observability and monitoring: Implement and mature logging, metrics, and alerting; build dashboards and guardrails that help teams understand and improve system behavior
- Infrastructure as code: Standardize and scale infrastructure management using modern IaC patterns and review/approval workflows
- Security and compliance partnership: Work closely with Security and compliance stakeholders to deliver secure-by-default systems and audit-ready practices
- Cross-functional collaboration: Partner with application teams to improve deployability and operability, and translate business priorities into an executable roadmap
Skills
- 3+ years leading and developing software engineers, including coaching, performance management, hiring, and building healthy team practices
- 8+ years of experience as an individual contributor in DevOps/SRE/platform/infrastructure or software engineering roles
- Strong experience operating production systems in a major cloud environment (AWS preferred) and building for reliability and scale
- Proven experience designing and operating modern build and deploy pipelines and automating operational workflows
- Experience with logging/metrics/alerting stacks (e.g., Datadog, Grafana, CloudWatch, ELK/OpenSearch) and using them to drive reliability improvements
- Experience with Terraform, Pulumi, or equivalent tools and associated engineering practices (code review, testing, drift detection)
- Familiarity with containerization and orchestration (Docker; Kubernetes/ECS preferred)
- Ability to align stakeholders, explain tradeoffs, and drive execution across teams and functions
- Experience building DevSecOps practices (secret management, policy as code, artifact signing, SBOMs)
- Experience with multi-account AWS environments, network segmentation, and zero-trust patterns
- Experience supporting regulated environments (e.g., FedRAMP, DoD/IC, export-controlled systems)
- Experience with GitHub Actions or other CI platforms at scale
- U.S. TS Security clearance with SCI Eligibility
Benefits
- Health, dental, and vision insurance
- 100% remote first culture. You can work from anywhere in the US and all full time employees have WeWork access
- Unlimited PTO including competitive vacation and holiday schedules
- Lifestyle stipends - Monthly mental health, wellness & fitness stipend, in-home office setup stipend and family planning assistance
- Salary top-up during military reserve duty
- Fully paid parental leave
- Child and pet care reimbursement during travel
Company Overview