yandex
okta
Страна
США
Зарплата
194 000 $ – 267 000 $
+500% приглашений

Откликайтесь
на вакансии с ИИ

Ускорим процесс поиска работы
ГибридПолная занятость

Staff Site Reliability Engineer - Observability

Оценка ИИ

Отличная позиция в топовой компании с высокой зарплатой, сильным соцпакетом и возможностью работать над критически важной инфраструктурой.


Вакансия из Quick Offer Global, списка международных компаний
Пожаловаться

Сложность вакансии

ЛегкоСложно
Оценка ИИ

Высокая сложность обусловлена ролью уровня Staff, требованиями к глубокому знанию GKE, программированию на Go/Python и необходимостью доступа к федеральным данным США.

Анализ зарплаты

Медиана235 000 $
Рынок190 000 $ – 280 000 $
Оценка ИИ

Предлагаемый диапазон $194k–$267k полностью соответствует рыночным стандартам для позиции Staff SRE в Сан-Франциско, где медиана составляет около $230k.

Сопроводительное письмо

I am writing to express my strong interest in the Staff Site Reliability Engineer - Observability position at Okta. With extensive experience in managing large-scale distributed systems and a deep focus on Google Cloud Platform, I am confident in my ability to lead the expansion of your observability ecosystem. My background in automating infrastructure with Terraform and developing internal tools in Go and Python aligns perfectly with Okta's mission to eliminate toil and deliver high-availability services.

Throughout my career, I have specialized in building comprehensive monitoring and logging solutions using Splunk and Grafana. I am particularly excited about the opportunity to optimize data collection and processing within GKE and to drive "observability-driven development" across the organization. My proactive approach to incident response and commitment to infrastructure-as-code will ensure that Okta's infrastructure remains robust and scalable as you continue to secure identities in the era of AI.

+250% к просмотрам

Составьте идеальное письмо к вакансии с ИИ-агентом

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в okta уже сейчас

Присоединяйтесь к Okta, чтобы строить будущее безопасности ИИ и масштабировать Observability-платформу мирового уровня!

Описание вакансии

Secure Every Identity, from AI to HumanIdentity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.

This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.

We are seeking a highly technical Observability Site Reliability Engineer with a specialty in Google Cloud, to own and expand our Observability ecosystem into GCP. In this role, you will move beyond simple monitoring to delivering a world class, comprehensive, scalable Observability Platform that enables our SRE teams and business partners. You will treat infrastructure as code—utilizing Terraform and strong coding proficiency in Go, Python, or Ruby—to automate the deployment of agents and collectors across complex distributed systems.

Key Responsibilities

  • Automated Infrastructure: Design, build, and maintain scalable observability infrastructure using tools like Terraform.
  • GCP Observabilty Engineering: Optimize the collection, processing, and storage of Observabilty data to ensure high reliability and low latency of our Splunk and Grafana services
  • Incident Response: Participate in on-call rotations and lead post-incident reviews to drive systemic improvements and "observability-driven development."
  • Automation: Eliminate "toil" by automating the deployment and scaling of observability agents and collectors.

Required Skills & Experience (The Essentials)

GKE: Minimum 5+ Experience scaling and managing observability in a Google Cloud platform. Visualization: Expertise in creating intuitive, actionable Splunk or Grafana dashboards that correlate data across multiple sources.SRE Mindset: Minimum 3+ years of experience in an SRE, DevOps, or Systems Engineering role with a focus on high-availability systems.

  • Programming Proficiency: Strong coding skills in Python, Go for building internal tools and automating workflows.
  • Distributed Systems: Deep understanding of Linux internals, networking (TCP/IP, DNS, Load Balancing), and container orchestration (Kubernetes/GKE).
  • Problem Solving: A data-driven approach to debugging complex, cross-service performance bottlenecks.

Bonus Skills (The "Nice-to-Haves")

  • Telemetry Standards: Hands-on experience with OpenTelemetry (OTel), Vector, or similar frameworks for instrumenting applications.
  • Grafana Loki: Experience in migrating Splunk to Grafana Loki

Other Cloud Platforms: Experience managing observability native tools within AWS.

Additional requirements:

  • This position requires the ability to access federal environments and/or have access to protected federal data.  As a condition of employment for this position, the successful candidate must be able to submit documentation establishing U.S. Person status (e.g. a U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee. 22 CFR 120.15) upon hire.

#LI-MM

#LI-Hybrid

P24517_3387022

Below is the annual base salary range for candidates located in San Francisco Bay Area. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location. In addition, Okta offers equity (where applicable), bonus, and benefits, including health, dental and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies. To learn more about our Total Rewards program please visit:https://rewards.okta.com/us.

The annual base salary range for this position for candidates located in the San Francisco Bay area is between:

$194,000—$267,000 USD

The Okta Experience

We are intentional about connection. Our global community, spanning over 20 offices worldwide, is united by a drive to innovate. Your journey begins with an immersive, in-person onboarding experience designed to accelerate your impact and connect you to our mission and team from day one.

Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and convictions records, consistent with applicable laws.

If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding please use this Form to request an accommodation.

Notice for New York City Applicants & Employees: Okta may use Automated Employment Decision Tools (AEDT), as defined by New York City Local Law 144, that use artificial intelligence, machine learning, or other automated processes to assist in our recruitment and hiring process. In accordance with NYC Local Law 144, if you are an applicant or employee residing in New York City, please click here to view our full NYC AEDT Notice.

Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Personnel and Job Candidate Privacy Notice at https://www.okta.com/legal/personnel-policy/.

+400% к собеседованиям

Создайте идеальное резюме с помощью ИИ-агента

Создайте идеальное резюме с помощью ИИ-агента

Навыки

  • Google Cloud Platform
  • Google Kubernetes Engine
  • Terraform
  • Go
  • Python
  • Ruby
  • Splunk
  • Grafana
  • Kubernetes
  • Linux
  • OpenTelemetry
  • Vector
  • Grafana Loki
  • Amazon Web Services

Возможные вопросы на собеседовании

Проверка опыта работы с конкретным стеком и понимания масштабируемости в GCP.

Расскажите о вашем опыте масштабирования систем мониторинга в среде GKE. С какими основными узкими местами вы сталкивались?

Оценка навыков автоматизации и владения инструментами IaC.

Как вы организуете структуру Terraform-модулей для управления агентами и коллекторами в мульти-кластерной среде?

Проверка аналитических способностей и подхода к решению проблем.

Опишите случай, когда вам пришлось отлаживать сложную проблему производительности между сервисами, используя только данные Observability.

Оценка понимания современных стандартов сбора данных.

Каков ваш подход к внедрению OpenTelemetry в существующую микросервисную архитектуру?

Проверка лидерских качеств и SRE-культуры.

Как вы внедряете принципы 'observability-driven development' в командах разработки, которые привыкли к традиционному мониторингу?

Похожие вакансии

Комплексные технологии
200 000 ₽ – 220 000 ₽

DevOps Middle +/ Senior

SeniorУдалённоРоссия
SQL · Kubernetes · Docker · Ansible · Prometheus · Grafana · ELK stack · CI/CD · Java · Go · C++ · Bash · Terraform · SonarQube · SAST · Python · Linux · Windows Server · Cisco · MikroTik · Fortinet · Ubiquiti · TCP/IP · DNS · DHCP · BGP · OSPF · VLAN · NAT · Zero Trust · RBAC · SIEM · Zabbix · Wazuh · PowerShell · VMware · Proxmox · Hyper-V · KVM
+39 навыков
WMT Group
300 000 ₽ – 400 000 ₽

Senior DevOps/Mlops

SeniorУдалённоРоссия
Docker · Helm · Jenkins · GitLab CI · Python · Airflow · JupyterHub · MLflow · Seldon Core · CUDA · Kubernetes · Hadoop · Apache Spark · Apache Kafka · ELK stack · LLM · Computer Vision
+17 навыков
DstLab
240 000 ₽ – 280 000 ₽

Devops Middle+ / Senior

SeniorУдалённоРоссия
Kubernetes · Redis · Kafka · Keycloak · PostgreSQL · MonetDB · VK Cloud · GitLab CI · ArgoCD · HashiCorp Vault · Prometheus · Grafana · ELK stack · Linux
+14 навыков
Hi, Rockits!
300 000 ₽ – 400 000 ₽

Senior DevOps/SRE Engineer (On-Premise инфраструктура)

SeniorУдалённоРоссия
Kubernetes · Ansible · Terraform · GitLab CI/CD · PostgreSQL · Redis · RabbitMQ · ElasticSearch · Prometheus · Grafana · Linux · Go · Python · Kafka · Vault · NATS · Bash
+17 навыков
Volna.tech
268 000 ₽ – 294 000 ₽

DevOps - senior

SeniorУдалённоРоссия
Linux · RHEL · Debian · TCP/IP · Docker · Git · GitLab CI · GitHub Actions · TeamCity · Jenkins · Nexus · Artifactory · Terraform · Ansible · Chef · Puppet · OpenStack · AWS · Molecule · TestInfra · REST API
+21 навыков
Тезис
130 000 ₽ – 200 000 ₽

Junior+ / Middle DevOps Engineer

MiddleУдалённоРоссия
Kubernetes · Helm · Docker · Terraform · Linux · Bash · Python · Go · GitLab CI · PostgreSQL · Redis · Prometheus · Grafana · Loki · Ansible · Yandex Cloud · Selectel · ArgoCD · FluxCD · ClickHouse · MongoDB · Kafka · Vault · Trivy · Teleport · ETL · CDC · Debezium · PgBouncer · HAProxy · Velero · Cilium · ELK
+33 навыков
более 1000 офферов получено
4.9

1000+ офферов получено

Устали искать работу? Мы найдём её за вас

Quick Offer улучшит ваше резюме, подберёт лучшие вакансии и откликнется за вас. Результат — в 3 раза больше приглашений на собеседования и никакой рутины!

okta
Страна
США
Зарплата
194 000 $ – 267 000 $