yandex
mattermost
Страна
США
Зарплата
170 000 $ – 200 000 $
+500% приглашений

Откликайтесь
на вакансии с ИИ

Ускорим процесс поиска работы
LeadУдалённоПолная занятость

Lead Site Reliability Engineer

Оценка ИИ

Отличная вакансия в известной open-source компании с прозрачной вилкой и интересными задачами на стыке DevOps и кибербезопасности. Высокий уровень компенсации и удаленный формат работы.


Вакансия из Quick Offer Global, списка международных компаний
Пожаловаться

Сложность вакансии

ЛегкоСложно
Оценка ИИ

Роль требует глубоких знаний в области безопасности (FedRAMP, ITAR) и управления распределенными системами, а также подтвержденного опыта лидерства в удаленных командах. Высокая ответственность за инфраструктуру, используемую в оборонном секторе США.

Анализ зарплаты

Медиана185 000 $
Рынок160 000 $ – 220 000 $
Оценка ИИ

Предложенная вилка $170k–$200k полностью соответствует рыночным стандартам для позиции Lead SRE в США, особенно в компаниях, работающих с государственными контрактами. Это конкурентоспособное предложение для опытного специалиста.

Сопроводительное письмо

I am writing to express my strong interest in the Lead Site Reliability Engineer position at Mattermost. With extensive experience in managing Kubernetes-based containerized workloads and a deep proficiency in Terraform and AWS, I have consistently driven operational excellence and scalability in complex, high-stakes environments. My background in building resilient infrastructure aligns perfectly with Mattermost’s mission to provide secure, mission-critical collaboration tools for defense and intelligence sectors.

Throughout my career, I have successfully led SRE teams through the challenges of incident management, observability implementation using Prometheus and Grafana, and the automation of cloud infrastructure. I am particularly drawn to Mattermost’s remote-first, open-source culture and your commitment to meeting rigorous compliance standards like FedRAMP and DoD ATO. I am confident that my technical leadership and strategic approach to infrastructure will contribute significantly to the continued reliability and performance of your platform.

+250% к просмотрам

Составьте идеальное письмо к вакансии с ИИ-агентом

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в mattermost уже сейчас

Присоединяйтесь к лидеру в сфере защищенных коммуникаций и возглавьте SRE-направление в Mattermost прямо сейчас!

Описание вакансии

Mattermost is the leading collaborative workflow platform for defense, intelligence, security, and critical infrastructure. Trusted by the U.S. Department of War and Fortune 500s, our platform runs on-premises and in private clouds, delivering secure messaging, file sharing, workflow automation, audio/screenshare, and project management—all with full data and operational control. Mattermost powers high-stakes workflows across mission planning, real-time, real-world operations, DevSecOps, incident response, and cyber defense—enabling secure collaboration from tactical edge and DDIL environments to enterprise HQ. Teams operate across web, desktop, and mobile, with embedded interoperability for Microsoft Teams, Outlook, and Microsoft 365.

To learn more, visit www.mattermost.com

Mattermost is seeking an experienced and visionary Lead Site Reliability Engineer (SRE) to guide the architecture, reliability, and operational excellence of the infrastructure powering our secure, mission-critical collaboration platform. 

In this role, you will provide technical leadership across our SRE function, driving strategic initiatives for scalability, observability, performance, and automation across cloud and hybrid environments. You will mentor engineers, establish best practices, and collaborate closely with development, security, and operations teams to ensure our customers in defense, government, and critical infrastructure sectors experience exceptional reliability and performance. 

Responsibilities Include:

  • Define the strategy, architecture, and roadmap for Mattermost’s site reliability engineering function, aligning infrastructure initiatives with product and business goals.
  • Lead the design, deployment, and optimization of production-grade containerized workloads, infrastructure-as-code, and compliant cloud environments for regulated domains (e.g., FedRAMP, DoD).
  • Establish and evolve observability, monitoring, and alerting frameworks to ensure performance, reliability, and capacity planning at scale.
  • Drive incident management processes, including on-call rotations, root cause analysis, and systemic reliability improvements.
  • Partner with security and compliance teams to meet data sovereignty, security, and regulatory requirements.
  • Champion automation and operational excellence to improve efficiency, reduce risk, and scale operations.
  • Oversee cloud cost management and capacity planning to optimize infrastructure spending while meeting performance targets.
  • Build and maintain a developer platform that enables fast, secure software delivery and improves application stability in production.
  • Mentor and coach SRE team members, fostering a culture of learning, collaboration, and technical excellence.

 Requirements:

  • BS in Computer Science, Cybersecurity, Software Engineering, or a related technical field, or equivalent experience, with 5+ years of relevant experience in site reliability engineering, DevOps, or cloud infrastructure roles.
  • Proven expertise in container orchestration platforms, ideally Kubernetes.
  • Extensive experience with infrastructure-as-code, ideally Terraform.
  • Strong background in cloud platforms, ideally AWS.
  • Demonstrated experience designing and implementing monitoring, alerting, and performance optimization strategies.
  • Exceptional troubleshooting and incident management skills for distributed systems.
  • Proficiency in at least one scripting or programming language for automation.
  • Excellent communication skills with a track record of influencing cross-functional teams.
  • Experience leading globally distributed teams in a remote-first environment.
  • For candidates residing in the U.S.: This role may require the ability to obtain and maintain a U.S. government security clearance in the future. As such, U.S. applicants must be U.S. citizens and eligible under applicable clearance requirements.
  • Applicants must meet eligibility requirements for access to export-controlled information as defined by U.S. export control laws, including EAR and ITAR.

 Preferences:

  • Familiarity with observability stacks such as Grafana and Prometheus.
  • Experience designing high-availability, disaster recovery, and scaling architectures.
  • Exposure to GCP and Azure cloud environments.
  • Leadership experience in highly regulated industries such as defense, finance, or critical infrastructure.
  • Experience with U.S. federal compliance frameworks and authorization processes, including FedRAMP, DoD ATO, NIST 800-53, and related government standards.
  • Experience preparing, delivering, and maintaining software offerings through AWS Marketplace and other cloud provider marketplaces (e.g., Azure Marketplace, Google Cloud Marketplace), including packaging, compliance validation, and ongoing operational support.
  • Open-source contributions in reliability, DevOps, or infrastructure tooling.
  • Certifications in cloud infrastructure, reliability, or DevOps engineering (e.g., CKA, CKAD, AWS Certified Solutions Architect).

Mattermost takes a market-based approach to pay and pay may vary depending on your location. The successful candidate’s starting pay will be determined based on job-related skills, experience, qualifications, work location, and market conditions. These ranges may be modified in the future.

Posting Range

$170,000—$200,000 USD

Mattermost is an EEO Employer, we are a remote-first, open-source company.

We are continually working to expand our hiring in more countries and regions, ensuring compliance with local laws and regulations, which takes time.

Mattermost values your unique perspective—we welcome all applicants. We encourage individuals from all backgrounds to apply and are committed to assessing candidates based on their skills and qualifications. We do not tolerate discrimination against staff or applicants based on race, religion, national origin, age, disability, pregnancy status, veteran status, or other personal characteristics.

If you require accommodations during the interview process, please let us know—we’re happy to assist.

+400% к собеседованиям

Создайте идеальное резюме с помощью ИИ-агента

Создайте идеальное резюме с помощью ИИ-агента

Навыки

  • Kubernetes
  • Terraform
  • AWS
  • SRE
  • DevOps
  • Prometheus
  • Grafana
  • Go
  • Python
  • Incident Management
  • FedRAMP
  • NIST 800-53
  • Google Cloud Platform
  • Azure

Возможные вопросы на собеседовании

Для Lead-позиции важно понимать, как кандидат расставляет приоритеты между стабильностью и скоростью разработки.

Как вы балансируете между внедрением новых фич и поддержанием SLO/SLA в условиях быстрорастущей платформы?

Mattermost работает с госсектором США, поэтому знание специфических стандартов критично.

Расскажите о вашем опыте работы с комплаенс-фреймворками, такими как FedRAMP или NIST 800-53, при проектировании инфраструктуры.

Проверка навыков архитектурного мышления в облачных средах.

Опишите ваш подход к проектированию катастрофоустойчивой (DR) архитектуры для мультирегионального развертывания в AWS.

SRE — это прежде всего автоматизация. Важно понять, как кандидат избавляется от рутины.

Приведите пример самого сложного случая 'toil' (рутинной работы), который вы успешно автоматизировали в своей практике.

Оценка лидерских качеств и способности развивать команду.

Как вы подходите к менторству инженеров и внедрению культуры 'blameless post-mortems' в распределенной команде?

Похожие вакансии

industrialelectricmanufacturing
95 000 $ – 130 000 $

Engineering Configuration Management Lead

LeadУдалённоСША
Configuration Management · Change Control · PLM · PDM · ERP · ISO 10007 · EIA-649 · Engineering Change Management · Document Control · Version Control
+10 навыков
industrialelectricmanufacturing
95 000 $ – 130 000 $

Engineering Configuration Management Lead

LeadВ офисеКанада
Configuration Management · PLM · PDM · ERP Systems · ISO 10007 · EIA-649 · Change Management · Document Control · Engineering Change Management
+9 навыков
540
Не указана

Data Platform Operations Lead (GCP, PostgreSQL) - Contract

LeadУдалённоСША
PostgreSQL · Google Cloud Platform · AlloyDB · BigQuery · Google Cloud Dataflow · Google Cloud Pub/Sub · Google Cloud Functions · Infrastructure as Code · SRE · FedRAMP · Data Governance · Performance Tuning
+12 навыков
klaviyo
216 000 $ – 324 000 $

Senior Lead Software Engineer - Developer Infrastructure

LeadГибридСША
Python · Go · Django · FastAPI · AWS · Kubernetes · Docker · Terraform · Kafka · PostgreSQL · Redis · React · TypeScript · CI/CD · Distributed Systems
+15 навыков
accenturefederalservices
91 300 $ – 184 900 $

AWS Cloud Implementation Lead

LeadГибридСША
AWS · Cloud Architecture · ETL · Generative AI · Data Science · Agile · SAFe
+7 навыков
accenturefederalservices
113 500 $ – 190 100 $

Principal DevSecOps Lead Engineer

LeadВ офисеСША
Kubernetes · AWS · DevSecOps · Agile · Infrastructure as Code · GitOps · Helm · Terraform · Linux · Windows · Cybersecurity
+11 навыков
более 1000 офферов получено
4.9

1000+ офферов получено

Устали искать работу? Мы найдём её за вас

Quick Offer улучшит ваше резюме, подберёт лучшие вакансии и откликнется за вас. Результат — в 3 раза больше приглашений на собеседования и никакой рутины!

mattermost
Страна
США
Зарплата
170 000 $ – 200 000 $