Scope
We are seeking a skilled Platform Reliability Engineer to ensure the reliability, performance, and scalability of Live & Media Asset platforms running on customers’ premises, and eventually in the cloud, for major players in the broadcast industry.
Sitting at the intersection of T3 Engineering and Site Reliability Engineering (SRE) and R&D DevOps Engineer this role emphasizes proactive monitoring, technical troubleshooting, and the continuous improvement of platform infrastructure. The ideal candidate will play a pivotal role in ensuring customer success by supporting field teams, collaborating with R&D, and driving platform reliability through automation, monitoring, and system optimization.
Job Description
Key Responsibilities
1. Proactive Monitoring and Incident Management
- Monitor customer ecosystems using tools like Grafana, Prometheus, and Loki, ensuring issues are detected and resolved before impacting operations.
- Respond to incidents, perform root cause analysis, and implement preventive measures to minimize downtime.
- Prepare and support intervention plans and installations on Platform matters.
- Define and monitor SLIs and SLOs to maintain system availability and reliability.
2. Technical Support and Collaboration
- Act as an expert resource for Global and local support engineers, providing guidance and escalation support for customer issues.
- Partner with Product Owners, R&D, and Test teams to enhance platform reliability based on real-world usage.
- Serve as a bridge between R&D and operational teams to translate customer experiences into actionable improvements.
3. Platform Infrastructure and Automation
- Maintain and optimize Linux-based platform infrastructure and ensure seamless deployment processes, in collaboration with R&D.
- Design and implement CI/CD pipelines to streamline delivery and configuration.
- In collaboration with R&D, develop Installers & toolkits
4. Training and Knowledge Sharing
- Provide advanced technical training to field support teams and customers.
- Document system configurations, troubleshooting procedures, and best practices for knowledge sharing.
5. Platform Evolution and Security
- Evaluate and implement tools and technologies to improve platform reliability, security, and performance.
- Conduct regular security assessments and ensure compliance with industry standards.
- Be part of the Cloud initiatives to productize, support and master EVS products Cloud deployments.
6. Customer Intimacy and Crisis Management
- Occasionally travel to audit customer platforms, reinforce relationships, and participate in crisis management scenarios (Emergency Response Team).
- Be available for on-call rotations to assist during critical events or platform upgrades.
Profile
Required Skills and Qualifications
- Strong Linux expertise, including troubleshooting and performance optimization.
- Proficiency in monitoring tools like Grafana, Prometheus, and Loki.
- Solid experience with containerization technologies such as Docker and Kubernetes.
- Experience in microservice & web architecture
- Scripting and automation expertise Bash, Python, …
- Familiarity with CI/CD pipelines, version control systems (e.g., TeamCity CI/CD)
- Strong understanding of networking (TCP/IP) and virtualization technologies.
- Experience with relational databases, particularly PostgreSQL.
- Excellent analytical, problem-solving, and communication skills.
Preferred Skills
- Exposure to microservices infrastructure running on K8S Clusters.
- Experience with related technologies RabbitMQ, Kafka, …
- Knowledge of storage technologies and advanced virtualization concepts.
- Stay updated with the latest trends and technologies in the industry
- Proven experience in a similar role, such as DevOps, Systems Engineering or Software Engineer.
- Aligned with EVS values: Innovation, Passion, Excellence, Agility, Accountability, Teamwork, and Customer Success
Languages
- Fluent in English
- Other languages are considered as asset
Working Conditions
- Availability for on-call rotations and occasional travel to customer sites.
- You may be expected to work in a shift pattern to cover operating hours in other regions
- Collaborative work environment with opportunities for professional growth and learning.
Offer
Becoming Part of the EVS Team not only means that you will receive a competitive salary in line with your skills and the market, but also a range of other additional wellness and healthcare benefits. Our flexible schedules and hybrid way of working (homeworking) policies will help you preserve your work-life balance.
EVS will give you the tools to develop your skills and your career by giving you the opportunities of internal mobilities and a wide range of trainings. We encourage our motivated talents with a friendly, lively, and inclusive environment.
This role will offer you the unique opportunity to shape the reliability and scalability of modern platform (Cloud or “on prem”) in the media industry while directly influencing customer satisfaction and operational excellence. If you’re passionate about combining hands-on technical expertise with a customer-centric approach, this is the role for you!