Telecom Company
• Ensure environment stability, security, and performance through SLOs and CI/CD enforcement
• Create and improve delivery- and stability-focused tooling across a range of languages and environments
• Monitor and maintain the availability of production services 24/7
• Detect problems in the production environment and perform 1st- and 2nd-tier infrastructure and application troubleshooting
• Participate in system design consulting, platform management, and capacity planning
• 3+ years’ experience in a similar role
• Coding experience with Python, JavaScript, or an equivalent language
• Hands-on experience with public cloud (AWS) and serverless architecture
• Experience using Infrastructure as Code (Terraform and/or CloudFormation)
• Experience with monitoring tools and vendors such as Prometheus, Grafana, ELK, New Relic, SignalFx, CloudWatch, Datadog, PagerDuty, etc.