Site Reliability Engineer - (Remote)

Information

What you'll be doing

- Be on a PagerDuty rotation to respond to swapcard.com availability incidents.
- Run our infrastructure with Terraform, Kubernetes and ECS.
- Make monitoring and alerting fire on symptoms rather than on outages.
- Improve the deployment process.
- Design, build and maintain the core infrastructure pieces that allow Swapcard to scale to hundreds of thousands of concurrent users.
- Debug production issues and abnormal behavior or metrics on the infrastructure.
- Plan the growth of Swapcard's infrastructure.

What you should have

- MS/BS or equivalent and 3+ years of experience in computer science.
- An understanding of Linux.
- Comfort with Bash scripting.
- Experience building CI/CD pipelines.
- Skill with Docker and Kubernetes.
- Proficiency with AWS.
- Experience with Infrastructure as Code (Terraform, Ansible, Packer).
- A highly motivated, goal-driven, can-do approach.
- An innovative, entrepreneurial, team-player mindset and the ability to multitask.

Bonus points

- Strong in at least one backend language (Node.js, Python, Go, etc.).
- Familiar with GitOps.
- You have already designed, analysed and troubleshot large-scale distributed systems.
