[Remote] Site Reliability Engineering Manager

Remote · USA · Full-time

Note: This is a remote position open to candidates in the USA. Dice is seeking a Senior Manager of Site Reliability Engineering (SRE) to strengthen SRE practices within the Financial Services & Innovation (FS&I) organization. The role involves establishing operational discipline, driving SRE standards, and aligning teams to improve reliability and performance.

Responsibilities

Drive adoption of the SRE operating model across application teams
Establish clarity in roles between:
    SRE
    Production Support Engineering (PSE)
    Application teams
Ensure SRE practices are embedded into the development lifecycle, not treated as post-production activities
Define and enforce:
    SLIs, SLOs, and Error Budgets
    Production readiness criteria
    Reliability best practices
Lead SLO adoption and compliance reviews across the organization
Establish governance frameworks to ensure consistent application of standards
Partner with:
    Application product teams
    Production Support Engineering (MG team)
    Platform / Infrastructure / Observability teams
Drive alignment and reduce friction between engineering and operations
Ensure clear handoffs, escalation models, and operational ownership
Lead adoption of centralized observability standards across:
    Metrics
    Logging
    Tracing
Align tooling (AppDynamics, Splunk, Prometheus, etc.)
Ensure monitoring and alerting are SLO-driven and actionable, not noise-based
Partner with PSE to strengthen:
    Incident management processes
    RCA (Root Cause Analysis) standards
Drive identification of patterns and systemic issues
Ensure learnings translate into engineering improvements and automation
Identify opportunities to:
    Reduce manual operational work
    Improve system resilience
    Enable self-healing capabilities
Promote a culture of engineering over reaction
Define and track reliability metrics across FS&I
Build reporting that provides visibility into:
    System health
    Incident trends
    SLO performance
Translate technical data into actionable business insights

Skills

10+ years in engineering, operations, or SRE roles
5+ years leading SRE, platform, or reliability-focused teams
Proven experience implementing SRE practices at scale (SLIs, SLOs, error budgets)
Strong background in cloud environments (AWS, Azure, Google Cloud Platform)
Hands-on experience with observability tools (Splunk, AppDynamics, Prometheus, etc.)
Experience in incident management and production operations at scale
Ability to operate effectively in high-pressure and complex enterprise environments
Experience driving organizational transformation (not just technical implementation)
Strong understanding of CI/CD, DevOps, and automation practices
Experience working in regulated or large enterprise environments
Familiarity with AIOps or advanced automation strategies

Company Overview

Dice is the go-to career marketplace for tech professionals. It was founded in 2010 and is headquartered in Drachten, Friesland, NLD, with a workforce of 201-500 employees. Its website is https://www.or-quest.nl/.

Company H1B Sponsorship

Dice has a track record of offering H1B sponsorships, with 2 in 2022, 4 in 2021, and 5 in 2020. Please note that this does not guarantee sponsorship for this specific role.

