Five9
Country
USA
Salary
$71,800 – $190,000
Remote · Full-time

Site Reliability Engineer

AI assessment

An excellent position at a stable public company with a transparent salary range and a strong benefits package. The role offers a sound balance between development and operations, which supports an SRE's professional growth.


Vacancy from Quick Offer Global, a list of international companies

Job difficulty

AI assessment

The role requires deep knowledge of both development (50% of the time) and operations, including experience with Kubernetes, Terraform, and monitoring of high-load systems. The need to take part in 24/7 on-call rotations and to manage error budgets adds responsibility and complexity.

Salary analysis

Median: $155,000
Market: $130,000 – $210,000
AI assessment

The offered range of $71,800 – $190,000 is very wide. The lower bound corresponds to a Junior/early Middle level, while the upper bound ($190k) is competitive for a Senior SRE in the US, although at top technology companies (Big Tech) it can be higher because of stock compensation.

Cover letter

I am writing to express my strong interest in the Site Reliability Engineer position at Five9. With over three years of experience managing large-scale production environments and a deep-seated passion for the SRE philosophy, I am confident in my ability to contribute to your team-first culture and help maintain the high availability of your cloud contact center solutions. My background in balancing software development with operational automation aligns perfectly with your 50/50 split approach, ensuring that reliability is built into the code itself.

In my previous roles, I have successfully implemented comprehensive observability stacks using Prometheus and Grafana, and I am well-versed in defining SLIs and SLOs to manage error budgets effectively. My expertise in Infrastructure as Code with Terraform and container orchestration via Kubernetes has consistently led to reduced toil and more resilient CI/CD pipelines. I am particularly drawn to Five9's commitment to diversity and inclusion, and I am eager to bring my technical skills in Python and cloud infrastructure to a team that values both innovation and the human element of customer experience.


Job description

![](https://www.five9.com/sites/default/files/2025-02/five9-logo.svg)

Join us in bringing joy to customer experience.  Five9 is a leading provider of cloud contact center software, bringing the power of cloud innovation to customers worldwide.

Living our values every day results in our team-first culture and enables us to innovate, grow, and thrive while enjoying the journey together. We celebrate diversity and foster an inclusive environment, empowering our employees to be their authentic selves.

We are seeking a Site Reliability Engineer (SRE) to join our team and help build and maintain highly reliable, scalable systems. This role combines software engineering and operations expertise to ensure our services meet ambitious reliability targets while enabling rapid development and deployment. This position requires approximately 50% software development and 50% operational work, focusing on automation, monitoring, and system reliability rather than manual operations. The team works collaboratively with our platform, application and database teams to provide a reliable and available service.

Key Responsibilities:

  • Observability & Monitoring
      • Dashboards & Metrics: Design and implement comprehensive dashboards covering OS/platform-level and application-level monitoring, organized into primary (RED) and secondary (USE) indicators.
      • Availability & Reliability: Establish and maintain SLIs (Service Level Indicators), SLOs (Service Level Objectives), and error budgets for the service.
      • Performance Monitoring: Build alerting systems and performance monitoring to proactively identify and resolve issues before they impact users.
      • Incident Response: Participate in on-call rotations and lead incident response efforts, including post-mortem analysis and remediation. Maintain the official on-call routing. Assign application-level problems to the engineering team and track them.

  • Infrastructure Automation & Deployment
      • CI/CD Pipeline Management: Maintain continuous integration and deployment pipelines, working with our cloud and on-premise deployment teams.
      • Infrastructure as Code: Develop and maintain infrastructure using tools like Terraform, Ansible, or similar.
      • Configuration Management: Automate system configuration and ensure consistency across environments. Recommend and implement best practices for configuration control.

  • Security & Compliance
      • Security Automation: Ensure security scanning systems are in place and review escalated vulnerabilities.
      • Access Control: Maintain proper authentication, authorization, and audit logging systems.
      • Compliance Reporting: Ensure systems meet regulatory requirements and industry standards.
      • Security Incident Response: Participate in security incident response and remediation efforts.

  • Cost Optimization
      • Resource Management: Monitor and optimize cloud resource usage and costs, watching for planned and unplanned resource changes.
      • Capacity Planning: Analyze usage patterns and plan for future capacity needs.
      • Cost Analysis: Provide recommendations for cost-effective architecture and resource allocation.
      • Right-sizing: Implement automated scaling and resource optimization strategies.

  • Common Services & Platform Engineering
      • Shared Infrastructure: Build and maintain common services like notification systems, caching layers, and message queues or third-party software stacks.
      • Database Operations: Manage database reliability, performance, and scaling (where not handled by dedicated DB teams).
      • Service Mesh & Networking: Implement and maintain service discovery, load balancing, and network policies.
      • Developer Tools: Create and maintain tools and platforms that improve developer productivity and system reliability.

Required Qualifications:

  • Operational Experience
      • Production Systems: 3+ years managing large-scale production environments.
      • On-call Experience: Comfortable with 24/7 on-call responsibilities and incident response.
      • System Administration: Strong Linux/Unix system administration skills.
      • Networking: Understanding of TCP/IP, DNS, load balancing, and network security.
      • Database Systems: Experience with SQL and NoSQL databases in production environments.

  • Technical Skills
      • Programming Languages: Proficiency in at least two of: Python, Shell, PHP, Java, or similar languages.
      • Cloud Platforms: Experience with one of AWS, GCP, or Azure infrastructure and services.
      • Containerization: Hands-on experience with Docker, Kubernetes, and container orchestration.
      • Monitoring & Observability: Experience with Prometheus, Grafana, ELK stack, or similar tools.
      • Infrastructure as Code: Proficiency with Terraform, CloudFormation, or similar tools.
      • Version Control: Expert-level Git usage and collaborative development practices.

  • SRE-Specific Knowledge
      • SLI/SLO Management: Experience defining and maintaining service level objectives.
      • Error Budget Policy: Understanding of error budget concepts and implementation.
      • Toil Reduction: Track record of identifying and eliminating repetitive manual work.
      • Capacity Planning: Experience with performance testing and capacity management.

Preferred Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or equivalent experience.
  • Experience with microservices architecture and distributed systems.
  • Knowledge of security best practices and compliance frameworks.
  • Experience with chaos engineering and reliability testing.
  • Previous experience in an SRE or DevOps role at a technology company.
  • Contributions to open-source projects or technical communities.

Work Location: This role is fully remote for candidates who reside outside a 50-mile radius of our San Ramon office. For candidates who reside within 50 miles of our San Ramon office, this role is hybrid and requires 3 days a week (M, W, TH) in the office.


As part of our continued commitment to diversity, equity, and inclusion, Five9 supports pay transparency during the entire recruitment process.  Actual compensation packages are based on several factors that are unique to each candidate including, but not limited to: skill set, depth of experience, certifications, and specific work location. The range displayed reflects the minimum and maximum target for new hire salaries for the job across the United States. Your recruiter can share more about the specific compensation package during your hiring process.

Additionally, the total compensation package for this position may also include an annual performance bonus, stock, and/or other applicable incentive compensation plans.

Our total reward package also includes:

  • Health, dental, and vision coverage, beginning on the first day of employment. Five9 covers 100% of the employee portion of the health, dental, and vision coverage and shares a high portion of the dependent cost. We also offer Short & Long-Term Disability, Basic Life Insurance, and a 401(k) savings plan with employer matching.
  • Access to an innovative mental health support platform that offers personalized care and resources in areas such as: therapy, coaching and self-guided mindfulness exercises for all covered employees and their covered dependents.
  • Generous employee stock purchase plan.
  • Paid time off, company-paid holidays, paid volunteer hours, and 12 weeks of paid parental leave.

All compensation and benefits are subject to the requirements and restrictions set forth in the applicable plan documents and any written agreements between the parties.

The US base salary range for this role is below.

$71,800—$190,000 USD

Five9 embraces diversity and is committed to building a team that represents a variety of backgrounds, perspectives, and skills.  The more inclusive we are, the better we are.  Five9 is an equal opportunity employer. 


View our privacy policy, including our privacy notice to California residents here: https://www.five9.com/pt-pt/legal.  

Note: Five9 will never request that an applicant send money as a prerequisite for commencing employment with Five9.


Skills

  • Python
  • Shell
  • PHP
  • Java
  • AWS
  • GCP
  • Azure
  • Docker
  • Kubernetes
  • Prometheus
  • Grafana
  • ELK stack
  • Terraform
  • CloudFormation
  • Ansible
  • Git
  • Linux
  • SQL
  • NoSQL
  • TCP/IP
  • DNS

Possible interview questions

SREs at Five9 spend 50% of their time on development. It is important to understand how the candidate automates routine work.

Describe the most significant example of toil reduction you have delivered. What tool did you build, and what measurable result did it produce?

The posting emphasizes SLIs/SLOs and error budgets.

How would you approach defining SLIs and SLOs for a critical microservice in Five9's cloud environment? What would you do if the error budget were exhausted?
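
As preparation for this question, the error-budget arithmetic can be sketched in a few lines of Python. This is a minimal illustration, not Five9's method: the function name `error_budget_report`, its parameters, and the request-based SLI (successful requests divided by total requests) are all assumptions made for the example.

```python
from fractions import Fraction


def error_budget_report(slo_percent: str, total_requests: int, failed_requests: int) -> dict:
    """Summarize error-budget consumption for one SLO window (hypothetical helper)."""
    if total_requests <= 0 or not 0 <= failed_requests <= total_requests:
        raise ValueError("need total_requests > 0 and 0 <= failed_requests <= total_requests")

    # Parse the SLO from a string so that "99.9" becomes exactly 999/1000,
    # which avoids float rounding at the exhaustion boundary.
    slo = Fraction(slo_percent) / 100
    if not 0 < slo < 1:
        raise ValueError("SLO must be strictly between 0% and 100%")

    # Request-based SLI (an assumption): share of requests that succeeded.
    sli = Fraction(total_requests - failed_requests, total_requests)
    # The error budget is the number of failures the SLO tolerates in the window.
    allowed_failures = (1 - slo) * total_requests
    consumed = failed_requests / allowed_failures

    return {
        "sli_percent": float(sli * 100),
        "allowed_failures": float(allowed_failures),
        "budget_consumed": float(consumed),  # 1.0 means the budget is fully spent
        "exhausted": consumed >= 1,
    }


if __name__ == "__main__":
    print(error_budget_report("99.9", 1_000_000, 250))
```

With a 99.9% SLO over 1,000,000 requests, the budget is 1,000 failed requests, so 250 failures consume a quarter of it. A common response to an exhausted budget is to pause feature releases and prioritize reliability work until the SLI recovers, although the exact policy varies by team.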

The work includes incident management and post-mortems.

Describe your experience leading the resolution of a critical production incident. How do you organize communication and the subsequent root cause analysis (RCA)?

Experience with Terraform and cloud platforms is required.

What were the main problems you encountered when scaling infrastructure as code (IaC), and how did you ensure consistency across environments?

The posting mentions work with databases and network protocols.

How do you ensure observability at the database and network level so that you can proactively identify performance bottlenecks?
