Ensure the reliability, scalability, and performance of our systems by monitoring infrastructure, troubleshooting issues, implementing automation, and collaborating with development teams to maintain smooth production environments. Responsibilities include supporting and developing monitoring systems, configuring and optimizing monitoring solutions (Prometheus, Grafana, Loki, etc.), and supporting logging systems by organizing and improving logging systems (Opensearch or ELK). Additional responsibilities involve creation and maintenance of traffic routing, configuring and optimizing traffic routing solutions (CloudFlare, Nginx, HAproxy, etc.) to ensure high availability and system resilience, updating and automating processes and services, providing IT support, and promptly addressing any issues that arise.