Work with the team to design for the performance, capacity and high availability of infrastructure and services.
Participate in problem resolution activities.
Troubleshoot issues across the entire stack: software, database and infrastructure.
Diagnose and troubleshoot complex distributed systems handling large volumes of data, and develop solutions that have a significant impact at scale.
Participate in building advanced tooling for testing, monitoring, administration and operations of multiple clusters across multiple geographically distributed data centers.
Develop innovative ways to measure, monitor and report application and infrastructure health.
Improve the performance of microservices and solve scaling and performance issues.
Define and monitor SLIs, SLOs and error budgets.
Drive efficiencies in systems and processes: capacity planning, configuration management, performance tuning, monitoring and root cause analysis.
Requirements
3+ years of hands-on experience with cloud computing, including infrastructure, storage, platforms and data management, preferably in AWS.
Experience with container and container orchestration technologies, such as Docker and Kubernetes.
Hands-on experience with Amazon Elastic Kubernetes Service (EKS).
Hands-on experience with GitHub Actions.
2024-11-27