We are looking for a Site Reliability Engineer (SRE) who will be responsible for the day-to-day operation of our servers, as well as load analysis and long-term optimization. We want to work with a person who deeply understands servers, can accurately analyze what they are doing, debug issues in real-time and work proactively to anticipate and prevent problems.
About us and the job
OnTheGoSystems is a software development company, which creates and sells WordPress plugins. Our sites serve over 250,000 clients who regularly log-in, get support, download updates and read the content.
We use a number of services and technologies from AWS, including CloudWatch, Elastic Load Balancer, EC2, RDS, VPC, Network ACLs, Security Groups, S3, Route53, CloudFront.
You will be joining a talented team of developers and systems engineers, who design, build and run our infrastructure. As we grow, we are looking for a talented and passionate SRE, who specialises in load analysis, server cost and uptime. You will be helping us design, develop and analyse our infrastructure considering the implementation of monitoring and alert systems and perform data analysis that can support capacity planning, continuous improvement and incident response.
What responsibilities you will have
What’s required for this role
Tools that you must master:
What we offer:
This is a 100% remote position. Candidates must be self-motivated, focused and organized to succeed.
Most of our development team is located in Europe. We are looking for candidates from Europe, the Middle East or Africa working hours.
If you’re interested in joining us, please send your application and let’s talk.