Ver oferta completa

SENIOR SITE RELIABILITY ENGINEER

Miguel Hidalgo - Ciudad de México

Descripción de la oferta de empleo

Job Description Ideally you are an ex-application programmer who moved to SRE/DevOps out of a love for automation and to satisfy your curiosity about computer systems.
You will join and lead a team of passionate technologists dedicated to core SRE principles and building an exemplary technology organization.
The role of the Senior – Site Reliability Engineer is to be hands-on and provide mentorship to other team members on core SRE principles and tools.
The Senior SRE will participate in end to end operational aspects of Production environment.
The individual concerned will be able to work on cloud systems, networks, databases and help drive incident lifecycle management.
As a member of the SRE team, you will also be working closely with the Architects, DevOps, Product and development teams to ensure we get the most out of the software on AWS platform.
This role requires a highly skilled technology professional with excellent communication skills, strategic mindset, strong analytical and troubleshooting skills on AWS Cloud Platform.
MEX-Distrito Federal-Blvd Manu Apply Job Description Ideally you are an ex-application programmer who moved to SRE/DevOps out of a love for automation and to satisfy your curiosity about computer systems.
You will join and lead a team of passionate technologists dedicated to core SRE principles and building an exemplary technology organization.
The role of the Senior – Site Reliability Engineer is to be hands-on and provide mentorship to other team members on core SRE principles and tools.
The Senior SRE will participate in end to end operational aspects of Production environment.
The individual concerned will be able to work on cloud systems, networks, databases and help drive incident lifecycle management.
As a member of the SRE team, you will also be working closely with the Architects, DevOps, Product and development teams to ensure we get the most out of the software on AWS platform.
This role requires a highly skilled technology professional with excellent communication skills, strategic mindset, strong analytical and troubleshooting skills on AWS Cloud Platform.
Other responsibilities include working with internal business partners to gather requirements, prototyping, architecting, implementing/updating solutions, building and executing test plans, performing quality reviews, managing operations, and triaging and fixing operational issues.
Site Reliability Engineers must be able to adjust to constant business change; common types of changes include new requirements, evolving goals and strategies, and emerging technologies.
Key Responsibilities Be hands-on and provide mentorship to a growing SRE team on core SRE principles and tools.
Foster a sense of automation in issue resolution; everything possible should be automated, and only when automation can’t resolve an issue should people get involved in the resolution Lead efforts for updating production with new versions/infrastructures as they are available Lead capacity planning efforts in collaboration with Architects and DevOps engineers to determine changes to infrastructure that are needed to support new load and performance characteristics Leads engagement with software developers, DevOps and other infrastructure engineers to integrate software development and delivery from inception to full operation, ensuring robust released software and systems.
Ensure highest level of uptime to meet the customer SLA by implementing system wide corrections to prevent reoccurrence of issues.
Mentor other SRE team members to further develop their soft and hard skills Triage, troubleshoot and resolve issues using golden signals and go past golden signals Go past golden signals with additional principles such as chaos engineering to detect failure points and lead Game days for testing resiliency of team when it comes to incident response and remediations and synthetic monitoring.
Lead SRE team members to create and maintain Recovery Procedures, RCA’s in collaboration with other engineering teams.
Ensure Incidents assigned to the team are being managed within agreed SLAs Ensure alarms are documented in up to date Knowledge Base Articles.
Ensures Production infrastructure is up to date with server/security patches and certificates.
Continuous improvement of system and application monitoring and automation Identify and automate manual workarounds and process improvements Proactive monitoring of Monitor the availability, latency, scalability and efficiency of all services Perform periodic on-call duty as part of the SRE team Qualifications Skilled with cloud operations/administration in Amazon AWS.
Tax/Accounting domain experience Bachelors or Master’s in Computer Science discipline.
5+ years’ experience focussed on Site Reliability Engineering or related position in AWS Cloud Platform.
AWS Certification is a Plus.
Experience working with SQL, Windows Servers, Load balancers, Linux Deep experience with AWS Services and Windows support.
Program at a high level in at least one language such as.
Java, C#, Javascript, Python or Ruby.
Integration experience with PagerDuty, ServiceNow, Datadog, CloudWatch.
Good understanding of Site Reliability Engineering (SRE) philosophies, technologies, platforms and tools, SLO management, incident resolution, and automation; Ability to explain technical concepts in clear, non-technical language Working knowledge of infrastructure components (e.
.
routers, load balancers, cloud products, container systems, compute, storage, and networks) Knowledge of security and compliance standards such as SOC/PCI is a plus
Ver oferta completa

Detalles de la oferta

Empresa
  • Sin especificar
Municipio
Dirección
  • Sin especificar - Sin especificar
Fecha de publicación
  • 06/05/2024
Fecha de expiración
  • 04/08/2024
Microsoft Dynamics Product Support Engineer _ Remote
Cliecon solution inc

Required/minimum skills/qualifications: minimum 2+ years relevant experience as technical/functional consultant or engineer engineering or master’s degree in computer science/information technology (it) or equivalent relevant product certifications from microsoft excellent communication skills - verbal......

Desarrollador Full Stack .Net Senior
Consultoria YOKMAK

Nuestra visión es que el software es un medio ideal para solucionar los problemas que el negocio sufre diariamente... net, proyectos con arquitecturas muy variadas, siempre buscando conseguir la mejor solución para cada caso, así como cumplir las necesidades, escalabilidad, estabilidad, responsabilidad......

Arquitecto de Soluciones Senior
Consultoria YOKMAK

Net avanzado otros datos del puestobajo esquema híbrido... arquitectura de soluciones (capas de arquitectura, patrones de diseño, modelado de arquitectura)... escolaridad: ingeniería / licenciatura en sistemas deseable: maestría inglés: avanzado experiencia: 7 años lidereando equipos de trabajo, con......

Technical Department
Rainsteal Oil and Gas Limited, UK.

Administrative department business analyst, payroll manager, marketing specialist, administration supervisor, human resources officer, financial analyst, senior marketing analyst, logistics coordinator / expert, procurement officer, secretary / office assistants / office clerks / front desk clerks, account......

Google Ads Manager
No Bull Marketing

Join us, and let’s generate remarkable results together!oliver, senior google ads team leader at nobull marketing llc... remote flexibility: work from anywhere in the world and connect with our international team... com/to/hmsg7wxo(please note: you must be physically located in latin america to apply......

Pasante de Derecho
EMBOTELLADORA ZACATECAS, S.A. DE C.V.

Apoyar en la atención y seguimiento de asuntos legales bajo la supervisión de abogados senior... requisitos del puestorequisitos: estudiante activo de la carrera de derecho, preferiblemente en los últimos años de la carrera... capacidad para realizar investigaciones legales y redactar documentos con......