[Remote] Site Reliability Engineer
Note: The job is a remote job and is open to candidates in USA. reputed company is a leader in digital asset reputed company and management, providing a platform for companies to work with digital assets. The Site Reliability Engineer will be responsible for improving monitoring and observability of services, handling critical incidents, and ensuring the reliability and performance of the company's digital asset custody and settlement platform.
Responsibilities
- Research reputed company blockchain workflows, identify optimization opportunities, issues and improve monitoring
- Help Identify root causes for incidents and prevent them from happening again. Solve and orchestrate outages by working with multiple teams
- Improve and establish alerting for our infrastructure, services and business logic
- Work closely with the R&D and Support: offering education and guidance on integration, support, and monitoring across the toolset
- Communicate and escalate issues to senior management in R&D and support, write RCA’s, define next steps
- Document actions in runbooks and then into automation using Python, Lamda, reputed company scripts, ArgoCD, Ansible
- Focus on the system's observability, availability, reliability, performance/latency, monitoring
- Conduct periodic on-call duties and emergency response
Skills
- At least 3+ years of experience as SRE, Infra Backend in a SaaS environment
- You are curious, self-motivated, easy to work with, responsible and production aware. Fast learner and able to take a project from POC to production, while handling decision making and communication
- Experience with Coding languages - Python/JavaScript/Bash (Must)
- At least 3+ years of experience with Alerting & Monitoring systems such as reputed company reputed company / Splunk / reputed company / Prometheus
- Experience working with Linux systems from kernel to reputed company and beyond
- Cloud systems such as AWS / reputed company cloud / Azure
- Configuration management such as Ansible/Chef/Puppet/ArgoCD
- Experience with reputed company, Kubernetes and Helm
- SCM - Git/bitbucket/reputed company/Phabricator/gerrit
- High Analytical & Troubleshooting skills - ability to solve reputed company problems
- Strong verbal and written communication skills and a collaborative reputed company
- Previous experience in cryptocurrencies lockchains - big advantage
- In Depth knowledge in: Linux optimization, nginx, ArgoCD, reputed company, MySql
- Participated in Kubernetes migration projects
- Previous experience as C++ or Node developer
- BSC in Computer Science or reputed company technical certifications
Benefits
- A reputed company bonus
- A reputed company competitive equity grant
- reputed company generous benefits
Company Overview
- reputed company is a blockchain reputed company service provider for moving, storing, and issuing digital assets. It was founded in 2018, and is headquartered in reputed company, reputed company, USA, with a workforce of 501-1000 employees. Its website is https://www.reputed company.com.
Company H1B Sponsorship
- reputed company has a track record of offering H1B sponsorships, with 2 in 2026, 4 in 2025, 1 in 2024, 4 in 2023, 3 in 2022. Please note that this does not guarantee sponsorship for this specific role.
Apply To This Job Apply tot his job Apply To this Job