Observability Engineer

Observability Engineer

Do you want to be part of the team?
Observability Engineer
Observability Engineer

Information

Observability Engineer (SRE) Join our Site Reliability Engineering (SRE) team as an Observability Engineer. Your mission is to implement and optimize automated monitoring tools to ensure the stability, availability, and high performance of our cloud products in mission-critical production environments. You will be challenged to guarantee the operational continuity of large-scale data centers that support the critical, uninterrupted infrastructure we deploy. Remote Locations: Mexico, Chile, Argentina, Colombia, Uruguay, and Peru. Responsibilities Design & Implementation: Create and optimize robust monitoring solutions for cloud infrastructures. Data Visualization: Define, analyze, and implement advanced dashboards to visualize critical Key Performance Indicators (KPIs). Platform Management: Ensure the correct operation of production clouds based on open-source technologies (specifically Kubernetes and OpenStack). Incident Management: Address critical platform incidents, escalating to Senior Engineers or Product Development teams as necessary. Continuous Improvement: Provide the technical insights required for bug fixes and proactive system optimization. Why Whitestack? We are Latin American leaders in Telco Cloud and Open Networking solutions. We offer a high-level technical environment working with cutting-edge stacks (Prometheus, Grafana, ELK, etc.) within a collaborative culture recognized as a Great Place to Work. If you are passionate about observability and high-scale environments, this is the place for you! 📊☁️
Nombre de la empresa
Whitestack
Location
LATAM
Work scheme
Remote
Años de experiencia requeridos
4

Join the event!

See all the content and easy-to-use features by logging in or registering!