Remote Senior Site Reliability Engineer
Tipi i punes: Full-Time
Vendndodhja: Remote
Aplikimet skadojne me: 28-03-2024
Skills & Experience:
Roles and Responsibilities
- Supporting business infrastructure to ensure service availability, even outside of regular business hours when necessary.
- Optimizing services across the company to manage costs, including right-sizing and deprecating systems.
- Contributing to the technology strategy by guiding production and development technical architecture, maintaining high-quality standards, fostering a culture of long-term thinking and innovation.
- Overseeing technical scoping and planning for the team, guiding and empowering the development approach.
- Researching new technologies to address future deployment, monitoring, and scaling needs.
- Managing and participating in 24x7 on-call rotations to ensure site reliability and performance.
- Defining best practices for monitoring, alerting, and incident management.
- Leading and participating in root cause analysis and documenting procedures.
Job Requirements
- BS in Computer Science or related field or equivalent work experience
- 5+ years of experience working with cloud infrastructure (GCP prefered, AWS, Private Cloud, etc.) in a secure environment (ISO27001, SOC 2 type 2, GDPR, etc.).
- 4+ years of technical operations experience, with a background in SaaS and cloud-based platforms.
- Experience dealing with environments that leverage container orchestration tools like Kubernetes.
- Experience building scalable and fault-tolerant systems.
- Experience in successfully leading one or more DevOps projects (CI/CD, pipeline tools, operations management, etc.) to completion through tools like Jenkins, Helm, Terraform.
- Experience with system health monitoring tools such as New Relic, OpsGenie, and Uptime Robot.
- Experience with databases, including relational and non-relational. Proficiency in MySQL and MS SQL is a plus.
- Proficiency with scripting and/or programming languages - Bash, Python, and Golang preferred.
What you bring to the role
- Ability to maintain, design, and build development and deployment systems.
- Active management of hosting at scale at multiple companies, ensuring reliability, stability, scalability, and 24x7 uptime.
- Experience migrating from Data Centers to Cloud-based solutions and migrating solutions from other cloud providers.
- Understanding of DevOps as a culture and practice in organizations of our size or larger.
- Comfortable in a fast-paced development environment.
- Familiarity with Intranet tools and processes including Confluence, Jira, and Microsoft Teams.
- Excellent verbal and written communication skills.
What We Offer
- 100% Remote Work- Work From Anywhere
- Opportunity To Learn & Develop New Skills
- An Open & Collaborative Work Environment
- Cutting Edge Technology and Implementations
- Generous Compensation based on Industry Standards + Benefits
Working Hours
9 AM - 5 PM EST (flexibility required during upgrades or critical issues for on-call support)