Country: Czech Republic

Remote, full-time

Manager, Site Reliability Engineering

Veeam is a recognized market leader with an excellent benefits package and clearly structured processes. The position offers real influence on a world-class product and work with a cutting-edge technology stack.

Vacancy from Quick Offer Global, a list of international companies

Vacancy difficulty

The role requires a combination of deep technical background (Azure, K8s, IaC) and management experience. High responsibility for the global availability of services and for rolling out SRE practices company-wide raises the bar.

Salary analysis

Median: 6,500 €

Market range: 5,500 € to 8,000 €

The offered SRE manager position in the Czech Republic corresponds to the upper end of the market for experienced managers. The stated range reflects current market conditions for international technology companies in Prague and Brno.

Cover letter

I am writing to express my strong interest in the Manager, Site Reliability Engineering position at Veeam. With over seven years of experience in reliability and platform engineering, including successful leadership of engineering teams, I have developed a deep understanding of how to balance rapid innovation with systemic stability. My background in managing Azure-based environments and implementing Kubernetes-driven architectures aligns perfectly with the technical requirements of the Veeam Data Cloud.

Throughout my career, I have been a staunch advocate for SRE principles, specifically in operationalizing SLIs/SLOs and fostering a blameless postmortem culture. I am particularly drawn to Veeam’s commitment to a 'software-first' approach to reliability and your 'follow-the-sun' operational model. I am confident that my experience in scaling SRE teams and driving cross-functional initiatives will allow me to contribute significantly to Veeam’s mission of ensuring data resilience for over 550,000 customers worldwide.

Job description

Veeam is the Data and AI Trust Company, specializing in helping organizations ensure their data and AI are fully understood, secured, and resilient to enable the acceleration of safe AI at scale. As the market leader in both data resilience and data security posture management, Veeam is built for the convergence of identity, data, security, and AI risk. Headquartered in Seattle with offices in more than 30 countries, Veeam protects over 550,000 customers worldwide, who trust Veeam to keep their businesses running. Join us as we go fearlessly forward together, growing, learning, and making a real impact for some of the world’s biggest brands.

About the Role

Veeam is expanding its global Site Reliability Engineering (SRE) organization to support the Veeam Data Cloud. As an SRE Manager, you will report to our Global Director of SRE and will build and lead a high-performing team that partners with product, platform, and security engineering to make our systems reliable, scalable, and observable from the ground up. You’ll collaborate with peer engineering leaders to embed reliability into service roadmaps, and you’ll represent your team in global SRE planning and delivery of cross-cutting reliability initiatives across all VDC services.

You’ll drive adoption of SRE principles (SLIs/SLOs/error budgets, toil reduction, blameless learning) and operate a healthy, daytime follow-the-sun on-call model in partnership with our other regions. You will lead your team to make code contributions that improve the overall operability, reliability, resilience, and security of the codebase(s) we support.

What You’ll Do

People & Team Leadership

Hire, onboard, and grow your SRE team; coach career development and performance
Foster a psychologically safe, blameless culture that favors learning over blame and emphasizes engineering over firefighting
Ensure sustainable operational coverage; monitor on-call health and workload
Track and cap toil so engineers spend the majority of time on project work that reduces future toil

Reliability Strategy & Governance

Establish and operationalize SLIs/SLOs and error budgets with service owners; run reliability reviews and hold teams accountable to outcomes
Define reliability standards, runbooks, readiness checklists, and alerting patterns (including SLO-based alerting)
Partner with product/EMs to align reliability work with service goals and customer experience, not as a gate but as an enabler
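
As an illustration of the SLO and error-budget items above, here is a minimal sketch of the underlying arithmetic. The function names, the event-based (good/bad request) SLI, and the sample numbers are assumptions made for this example; the posting does not describe how Veeam actually measures these.

```python
def error_budget_remaining(slo_target: float, total_events: int, bad_events: int) -> float:
    """Fraction of the error budget still unspent (negative if overspent).

    Assumes an event-based SLI: slo_target is the required share of good events.
    """
    if not 0 < slo_target < 1:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_events <= 0:
        return 1.0  # no traffic yet, so nothing has been spent
    allowed_bad = (1 - slo_target) * total_events
    return 1 - bad_events / allowed_bad


def burn_rate(slo_target: float, total_events: int, bad_events: int) -> float:
    """Observed error rate divided by the error rate the SLO allows.

    A value of 1.0 means the budget is being spent exactly on schedule.
    """
    if not 0 < slo_target < 1:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_events <= 0:
        return 0.0
    return (bad_events / total_events) / (1 - slo_target)


if __name__ == "__main__":
    # Hypothetical numbers: 99.9% SLO, one million requests, 250 failures.
    print(f"budget remaining: {error_budget_remaining(0.999, 1_000_000, 250):.2%}")
    print(f"burn rate: {burn_rate(0.999, 1_000_000, 250):.2f}")
```

SLO-based alerting typically pages when the burn rate over some window exceeds a chosen threshold; the window and threshold are decisions for each team and are not specified in the posting.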

Operations & Incident Excellence

Ensure incident response readiness; lead/coordinate major incidents; drive fast, high-quality postmortems and systemic fixes
Measure MTTR, change failure rate, SLO posture, and repeat-incident reduction; publish learning broadly
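
For readers unfamiliar with the metrics named above, the following sketch shows one common way to compute two of them. It assumes MTTR means mean time to restore, measured from detection to resolution, and that change failure rate is failed deployments divided by total deployments; the `Incident` type and both function names are hypothetical and not taken from the posting.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Incident:
    detected_at: datetime
    resolved_at: datetime


def mean_time_to_restore(incidents: list[Incident]) -> timedelta:
    """Average of (resolved_at - detected_at) across incidents."""
    if not incidents:
        raise ValueError("at least one incident is required")
    total = sum((i.resolved_at - i.detected_at for i in incidents), timedelta())
    return total / len(incidents)


def change_failure_rate(deployments: int, failed_deployments: int) -> float:
    """Share of deployments that caused a failure needing remediation."""
    if deployments <= 0:
        raise ValueError("deployments must be positive")
    return failed_deployments / deployments


if __name__ == "__main__":
    start = datetime(2024, 1, 1, 9, 0)
    incidents = [
        Incident(start, start + timedelta(minutes=30)),
        Incident(start, start + timedelta(minutes=90)),
    ]
    print(mean_time_to_restore(incidents))
    print(change_failure_rate(deployments=40, failed_deployments=3))
```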

Engineering & Automation

Lead software-first reliability investments: observability, deployment safety (canary/blue-green), resilience testing/chaos, and self-service guardrails
Drive platform improvements (IaC, CI/CD, Kubernetes) and internal tools that scale operations and improve developer experience

What You’ll Bring

7+ years in Software, Platform, and/or Reliability Engineering with 2+ years managing engineers
Demonstrable experience leading engineering teams to predictably deliver outcomes
Experience leading cross-functional initiatives collaboratively with peers through influence
Experience with public cloud (Azure preferred), Kubernetes, IaC (Terraform, Pulumi), CI/CD (GitHub Actions, ArgoCD, Azure DevOps), and observability (OpenTelemetry, Elastic, Datadog, Prometheus, Grafana)
Coding background with experience improving service reliability
Hands-on incident management and postmortem practice; excellent cross-geo communication
Willingness to participate in an on-call rotation (typically during daytime hours, including weekends/holidays)

Bonus Skills

Demonstrated success leading SLO/error-budget adoption and reliability programs for cloud services
Experience operating a multi-region, follow-the-sun on-call model
Background in chaos/resilience/performance testing and release validation
Track record building or scaling SRE teams and influencing org-wide standards
Familiarity with compliance frameworks common to SaaS

What You’ll Get

25 vacation days, 4 sick days, 21 paid medical leave days, plus 4 extra global VeeaMe Days for self-care and 24 paid volunteer hours annually through Veeam Cares
Premium private medical insurance for employees and dependents
Daily meal vouchers for restaurants and groceries (180 CZK per working day)
Flexible cafeteria platform with thousands of lifestyle benefit options
Multisport Card for gym and wellness, with family add-on options
Annual public transport reimbursement up to a set limit
Corporate mobile plan with optional family tariff
Opportunities to learn and grow through on-demand libraries (LinkedIn Learning, O’Reilly), mentoring, workshops and learning events like our annual Global Day of Learning

Please note: If the applicant is permanently located outside the Czech Republic, Veeam reserves the right to refuse to consider the application. Remote work is possible only if the employee is located in the Czech Republic.

Veeam Software is an equal opportunity employer and does not tolerate discrimination in any form on the basis of race, color, religion, gender, age, national origin, citizenship, disability, veteran status or any other classification protected by federal, state or local law. All your information will be kept confidential.

Please note that any personal data collected from you during the recruitment process will be processed in accordance with our Recruiting Privacy Notice.

The Privacy Notice sets out the basis on which the personal data collected from you, or that you provide to us, will be processed by us in connection with our recruitment processes.

By applying for this position, you consent to the processing of your personal data in accordance with our Recruiting Privacy Notice.

By submitting your application, you acknowledge that the information provided in your job application and any supporting documents is complete and accurate to the best of your knowledge. Any misrepresentation, omission, or falsification of information may result in disqualification from consideration for employment or, if discovered after employment begins, termination of employment.

Skills

Azure
Kubernetes
Terraform
Pulumi
GitHub Actions
ArgoCD
Azure DevOps
OpenTelemetry
Elasticsearch
Datadog
Prometheus
Grafana
SRE
CI/CD
Infrastructure as Code

Possible interview questions

Tests understanding of key SRE concepts and the ability to apply them in practice to manage risk.

How do you approach defining SLIs/SLOs for a new cloud service, and what do you do when the error budget is exhausted?

Assesses leadership qualities and the ability to foster a learning culture in the team.

Describe your experience introducing a culture of blameless postmortems. How do you handle situations where the team tends to look for someone to blame?

Tests skills in managing workload and preventing engineer burnout.

How do you measure and cap toil (routine work) in your team to ensure that engineers stay focused on project work?

Assesses technical vision and experience with a modern stack.

What is your approach to deployment safety (for example, canary or blue-green) in a Kubernetes environment when using GitOps?

Tests the ability to work in a distributed environment.

Describe your experience managing incidents in a follow-the-sun model. How do you ensure effective handover of context between regions?
