Site Reliability Engineer (Remote)
Job Description
Responsibilities
- Develop tooling and infrastructure to support thousands of customers on the company's SaaS offering
- Write thoughtful and high-quality code in Go
- Define infrastructure in code with Terraform
- Implement, maintain and tune monitoring and alerting systems
- Build custom tools and services to automatically recover from incidents
- Respond on-call to incidents with quick and effective resolutions
- Deploy applications to and manage Kubernetes clusters
- Write clear and concise plans for incident response playbooks
Requirements
- Bachelor's degree in Computer Science or a related field, or significant professional software development or DevOps experience
- Strong demonstrable experience in building and maintaining highly reliable services
- Strong experience with SRE and DevOps methodologies
- Experience with Go, or the ability to become proficient in it quickly
- Familiarity with containers and orchestration systems, like Docker and Kubernetes
- Comfortable working with infrastructure as code tools, such as Terraform
- Ability to be on-call
Pluses
- Experience with distributed application systems using HTTP, WebSockets, RPC, and pub/sub at scale
- Practical AWS experience
- Knowledge of Grafana and Prometheus
- Comfortable with GitHub, Jira, Jenkins, CircleCI
- Experience working in open source communities
Job Type
Client Payroll
Positions
DevOps Engineer
Must have Skills
Languages
English - Basic
Up to 450 USD/Hour
Up to 450K USD/Year (Annual salary)
Long-term (Duration)
Fully Remote
Teresa N