Some details
US video broadcasting company built pretty solid geo-distributed infrastructure to serve broadcast video streams for TV channels around a world. When your infrastructure becomes huge, it always requires regular operation activities like configuration improvements, description of changes in the code, security updates. The core tech team has a tight plan of rolling new features and business wants them to focus on that instead
of wasting time on day-to-day activities. The company decided to hire us as an operation team to cover these needs.We started with knowledge transfer and realized, that some parts of infrastructure weren',t 100% covered by the monitoring system. A lot of monitoring metrics and alerts were added to address this situation. As a next step, our team wrote a lot of ansible roles and terraform modules to ensure that the infrastructure state fully described in code. Our engineers built mongodb and redis clusters there to improve resilience, added service discovery with consul and optimized elasticsearch cluster.As a result, the company received a reliable operation team that helps with day-to-day maintenance and infrastructure improvements. Core tech team was able to spend time on product development and focus on the business-critical roadmap.