Страна: Румыния

+500% приглашений

Откликайтесь
на вакансии с ИИ

LeadУдалённоПолная занятость

Team Lead, Site Reliability Engineering

Name: Quick Offer — сервис для поиска работы на hh.ru
Brand: Quick Offer
SKU: quick-offer-saas
Availability: InStock
Rating: 4.9 (682 reviews)

Veeam — стабильный лидер рынка с отличным социальным пакетом (страховка, бонусы, выходные). Позиция предлагает реальное влияние на архитектуру и процессы крупной международной компании, а также удаленный формат работы в пределах Румынии.

Вакансия из Quick Offer Global, списка международных компаний

Пожаловаться

Сложность вакансии

ЛегкоСложно

Роль требует сочетания сильных лидерских качеств и глубокой технической экспертизы в SRE, Kubernetes и облачных технологиях. Сложность заключается в необходимости внедрения стандартов надежности (SLO/Error Budgets) на уровне всей организации и управлении инцидентами в глобальном масштабе.

Анализ зарплаты

Медиана5 500 €

Рынок4 500 € – 7 000 €

Зарплата в объявлении не указана, но для позиции Team Lead в SRE в Бухаресте рыночные показатели обычно выше среднего по IT-сектору из-за высокой ответственности. Предлагаемый соцпакет (питание, медицина, страхование жизни) значительно повышает общую ценность компенсации.

I am writing to express my strong interest in the SRE Team Lead position at Veeam. With over 3 years of experience in engineering management and a deep background in platform reliability, I have a proven track record of building high-performing teams and operationalizing SLI/SLO frameworks that bridge the gap between development and operations.

In my previous roles, I have successfully led cross-functional initiatives to improve system observability and resilience using Kubernetes and Infrastructure as Code. I am particularly drawn to Veeam's commitment to a 'software-first' reliability investment and your follow-the-sun operational model. I am confident that my experience in incident management and my passion for fostering a blameless engineering culture will allow me to make a significant impact on your SRE organization.

I look forward to the possibility of discussing how my leadership skills and technical expertise can support Veeam's mission of ensuring data resilience for over 550,000 customers worldwide.

+250% к просмотрам

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в veeamsoftware уже сейчас

Присоединяйтесь к лидеру рынка данных и ИИ, чтобы возглавить SRE-трансформацию в глобальном масштабе!

Описание вакансии

Veeam is the Data and AI Trust Company, specializing in helping organizations ensure their data and AI are fully understood, secured, and resilient to enable the acceleration of safe AI at scale. As the market leader in both data resilience and data security posture management, Veeam is built for the convergence of identity, data, security, and AI risk. Headquartered in Seattle with offices in more than 30 countries, Veeam protects over 550,000 customers worldwide, who trust Veeam to keep their businesses running. Join us as we go fearlessly forward together, growing, learning, and making a real impact for some of the world’s biggest brands.

About the Role

Veeam is expanding its Site Reliability Engineering (SRE) organization to support Veaam services. As an SRE Team Leader, you will build and lead a high-performing team that partners with product, platform, and security engineering to make our systems reliable, scalable, and observable from the ground up. You’ll collaborate with peer engineering leaders to embed reliability into service roadmaps.

You’ll drive adoption of SRE principles (SLIs/SLOs/error budgets) and operate a healthy, daytime follow-the-sun on-call model in partnership with other regions. You will lead your team to make improvements in the overall operability, reliability, resilience, and security of the services we support.

What You’ll Do

People & Team Leadership

Hire, onboard, and develop your SRE team
Encourage culture that prioritizes learning and engineering over fault-finding and firefighting
Ensure a sustainable operational coverage; monitor on-call health and workload

Reliability Strategy & Governance

Establish and operationalize SLIs/SLOs and error budgets with service owners
Run reliability reviews and hold teams accountable to outcomes
Define reliability standards, runbooks, readiness checklists, and alerting patterns (including SLO-based alerting)

Operations & Incident Excellence

Ensure incident response readiness
Lead and coordinate major incidents
Measure MTTR, change failure rate, SLO posture, and repeat-incident reduction

Engineering & Automation

Lead software-first reliability investments: observability, resilience testing/chaos, and self-service guardrails
Drive platform improvements and internal tools

What You’ll Bring

3+ years in managing Software, Platform, and/or Reliability Engineering
Experience in IT Platform Engineering or Software Development
Demonstrable experience leading engineering teams to predictably deliver outcomes
Demonstrated success leading SLO/error-budget adoption and reliability programs for services
Experience leading cross-functional initiatives collaboratively with peers through influence
Experience with public clouds, Kubernetes, IaC, CI/CD, and observability
Hands-on incident management and postmortem practice
Readiness to participate in an on-call rotation (typically during daytime hours, including weekends/holidays)

Bonus Skills

Experience operating a multi-region follow-the-sun on-call model
Background in chaos/resilience/performance testing
Experience in building or scaling SRE teams and influencing org-wide standards
Coding background with experience improving service reliability

What You’ll Get

21 annual vacation days, additional days based on tenure, plus4 extra global VeeaMe Days for self-care and 24 paid volunteer hours annually through Veeam Cares
Private health, dental, and vision insurance for employees and dependents, including outpatient care, hospitalization, pregnancy monitoring, and psychology support
Monthly lifestyle and daily meal benefits: 40 RON/day via Edenred and 600 RON/month through a flexible cafeteria platform
Life insurance (2× annual gross salary), critical illness, and disability coverage, plus vision reimbursement
Free access to Bookster library platform for borrowing your favorite books for free
Opportunities to learn and grow through on-demand libraries (LinkedIn Learning, O’Reilly), mentoring, workshops and learning events like our annual Global Day of Learning

Please note: If an applicant is permanently located outside of the Romania, Veeam reserves the right to decline the application for this position.

**#LI-Remote

#LI-JS4**

Veeam Software is an equal opportunity employer and does not tolerate discrimination in any form on the basis of race, color, religion, gender, age, national origin, citizenship, disability, veteran status or any other classification protected by federal, state or local law. All your information will be kept confidential.

Please note that any personal data collected from you during the recruitment process will be processed in accordance with our Recruiting Privacy Notice.

The Privacy Notice sets out the basis on which the personal data collected from you, or that you provide to us, will be processed by us in connection with our recruitment processes.

By applying for this position, you consent to the processing of your personal data in accordance with our Recruiting Privacy Notice.

By submitting your application, you acknowledge that the information provided in your job application and any supporting documents is complete and accurate to the best of your knowledge. Any misrepresentation, omission, or falsification of information may result in disqualification from consideration for employment or, if discovered after employment begins, termination of employment.

+400% к собеседованиям

Создайте идеальное резюме с помощью ИИ-агента

Навыки

SRE
Kubernetes
Infrastructure as Code
CI/CD
Observability
Incident Management
Chaos Engineering
SLO
SLI

Возможные вопросы на собеседовании

Проверка опыта внедрения методологии SRE и умения договариваться с продуктовыми командами.

Расскажите о вашем опыте внедрения SLO и бюджетов ошибок: как вы убеждали стейкхолдеров приоритизировать надежность над новыми фичами?

Оценка лидерских качеств и подхода к формированию команды.

Каков ваш подход к найму и развитию инженеров в SRE-команде? Как вы поддерживаете здоровую культуру дежурств (on-call)?

Проверка навыков управления кризисными ситуациями.

Опишите самый сложный инцидент, которым вы руководили. Какие шаги были предприняты для снижения MTTR и предотвращения повторения ситуации?

Оценка технического видения автоматизации.

Как вы определяете баланс между операционной работой ('toil') и инженерными задачами по автоматизации в вашей команде?

Проверка опыта работы с современным стеком.

С какими основными вызовами вы сталкивались при обеспечении надежности приложений в среде Kubernetes и как их решали?

Устали искать работу? Мы найдём её за вас

Quick Offer улучшит ваше резюме, подберёт лучшие вакансии и откликнется за вас. Результат — в 3 раза больше приглашений на собеседования и никакой рутины!

Румыния

Откликайтесь
на вакансии с ИИ

Team Lead, Site Reliability Engineering

Анализ зарплаты

Сопроводительное письмо

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в veeamsoftware уже сейчас

Описание вакансии

About the Role

What You’ll Do

What You’ll Bring

Bonus Skills

What You’ll Get

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Возможные вопросы на собеседовании

Расскажите о вашем опыте внедрения SLO и бюджетов ошибок: как вы убеждали стейкхолдеров приоритизировать надежность над новыми фичами?

Каков ваш подход к найму и развитию инженеров в SRE-команде? Как вы поддерживаете здоровую культуру дежурств (on-call)?

Опишите самый сложный инцидент, которым вы руководили. Какие шаги были предприняты для снижения MTTR и предотвращения повторения ситуации?

Как вы определяете баланс между операционной работой ('toil') и инженерными задачами по автоматизации в вашей команде?

С какими основными вызовами вы сталкивались при обеспечении надежности приложений в среде Kubernetes и как их решали?

Похожие вакансии

Ведущий DevOps инженер CDEK.Shopping

Tech Lead Infrastructure (K8s, SRE, AI)

Руководитель группы DevOps 1С

Руководитель группы SRE офисных сетей

Email Infrastructure Deliverability Lead

Lead DevOps

Устали искать работу? Мы найдём её за вас

Откликайтесьна вакансии с ИИ

Team Lead, Site Reliability Engineering

Анализ зарплаты

Сопроводительное письмо

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в veeamsoftware уже сейчас

Описание вакансии

About the Role

What You’ll Do

What You’ll Bring

Bonus Skills

What You’ll Get

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Возможные вопросы на собеседовании

Расскажите о вашем опыте внедрения SLO и бюджетов ошибок: как вы убеждали стейкхолдеров приоритизировать надежность над новыми фичами?

Каков ваш подход к найму и развитию инженеров в SRE-команде? Как вы поддерживаете здоровую культуру дежурств (on-call)?

Опишите самый сложный инцидент, которым вы руководили. Какие шаги были предприняты для снижения MTTR и предотвращения повторения ситуации?

Как вы определяете баланс между операционной работой ('toil') и инженерными задачами по автоматизации в вашей команде?

С какими основными вызовами вы сталкивались при обеспечении надежности приложений в среде Kubernetes и как их решали?

Похожие вакансии

Ведущий DevOps инженер CDEK.Shopping

Tech Lead Infrastructure (K8s, SRE, AI)

Руководитель группы DevOps 1С

Руководитель группы SRE офисных сетей

Email Infrastructure Deliverability Lead

Lead DevOps

Устали искать работу? Мы найдём её за вас

Откликайтесь
на вакансии с ИИ