Close
Country: USA
Remote · Full-time

Site Reliability Engineer (USA Only - 100% Remote)

AI assessment

An exceptionally attractive vacancy for an SRE: a profitable company, a modern stack (LGTM, ArgoCD), a remote-work culture since 2013, and distinctive perks such as an 80% work-week option and a paid sabbatical.


Vacancy from Quick Offer Global, a list of international companies

Vacancy difficulty

AI assessment

The high difficulty stems from the need for deep expertise in managing very large data clusters (terabytes in MongoDB/PostgreSQL) and a complex stack (K8s, ArgoCD, ClickHouse). The role entails being the final point of escalation, which demands exceptional troubleshooting skills.

Salary analysis

Median: $195,000
Market: $160,000 – $240,000
AI assessment

The vacancy does not state a salary, but for remote Senior/Staff SRE roles in the USA, market ranges typically start at $160k and can reach $230k+ at Staff level, plus bonuses. Close's offer includes a 6% 401k match and annual bonuses, which is in line with the top tier of the market.

Cover letter

I am writing to express my strong interest in the Site Reliability Engineer position at Close. With extensive experience in managing large-scale infrastructure on AWS and a deep proficiency in Kubernetes, Terraform, and Ansible, I am confident in my ability to contribute to your Infrastructure Team. I have a proven track record of maintaining high-availability systems, including multi-terabyte MongoDB and PostgreSQL clusters, which aligns perfectly with the scale and complexity of Close's environment.

What excites me most about Close is your commitment to engineering excellence and open-source contribution. I have followed 'The Making of Close' blog and admire your 'No BS' and 'Build a house you want to live in' values. My background in automating database lifecycles and enhancing disaster recovery systems directly matches your upcoming projects. I am eager to bring my expertise in the LGTM stack and CI/CD optimization to help Close maintain its impressive record of zero scheduled downtime.

Job description

About Us

Close is a bootstrapped, profitable, 100% remote, ~100 person team of thoughtful individuals who prioritize taking ownership and making a meaningful impact. We’re eager to make a product our customers fall in love with over and over again.

We 💛 small scaling businesses. Since 2013, we’ve been building a CRM that focuses on better communication, without the hassle of manual data entry or a complex UI. We are out to supercharge sales productivity with the most modern, thoughtfully designed, all-in-one, communication-focused CRM.

Our backend tech stack consists primarily of Python Flask web apps with our TaskTiger scheduler handling many of the backend asynchronous task processing chores. Our data stores include MongoDB, PostgreSQL, Elasticsearch, and Redis. The underlying infrastructure runs on AWS using a combination of managed services like EKS, MSK, RDS and ElastiCache and non-managed services running on EC2 instances. We have CI/CD pipelines that build Docker images, run automated tests and deploy to Kubernetes clusters. We also use these images in our local development environment, which lets us code locally against all of our services. We have a well-documented public API that is consumed by our front-end JavaScript app as well as numerous integrations. Our infrastructure is heavily automated using Terraform, Ansible and other AWS tools.

We love open sourcing our code and ideas on our GitHub and on The Making of Close, our behind-the-scenes Product & Engineering blog. Check out our open source projects like close-mongo-ops-manager, SocketShark, TaskTiger, LimitLion and ciso8601.

About the Role

You will be joining the Infrastructure Team at Close. This team builds and maintains the platform that runs all Close systems (and do we have a lot of those). Work with us and you’ll be working with:

About You

  • You are a rock in the storm. With your hard won expertise, gained through battles won and lost, you consistently build robust systems from quality components fit to underpin mission critical applications. You value simplicity over familiarity. You value resilience over speed. You take pride in building composable and maintainable tools.
  • You’ve worked with a diverse array of infrastructure tools and systems, including:
      + CI/CD (CircleCI, GitHub Actions, ArgoCD)
      + Configuration Management (Ansible, Terraform)
      + Databases (Elasticsearch, MongoDB, PostgreSQL, ClickHouse)
      + Cloud Computing (Kubernetes, AWS)
      + Telemetry (Loki, Tempo, Grafana, Mimir/Prometheus)

  • You're comfortable working in a fast-paced environment with a small and talented team where you're supported in your efforts to grow professionally. You're able to manage time well, communicate effectively, and collaborate in a fully distributed team.

Come help us with projects like...

  • Fully automating our databases’ lifecycles with Argo Workflows
  • Eliminating all static credentials wherever they may be
  • Reducing downtime and disruption due to maintenance or disaster to new lows
  • Improving our multi-region disaster recovery system

Requirements...

  • Senior 1 & 2 level candidates should have 5+ years of experience building modern infrastructure systems.
  • Staff level candidates should have 8+ years of experience.
  • The buck stops with you! You are the kind of person who is respected as an expert on the systems you run.
  • You have been the final point of escalation in the support of mission critical production systems
  • You are familiar with some of the following technologies: AWS, Terraform, Kubernetes, Ansible, MongoDB, PostgreSQL, Elasticsearch
  • You have a strong grasp of common networking and data transfer protocols such as DNS, HTTP, TCP
  • You are able to speak and write in English
  • You are located in the USA (ET, CT, MT, PT)

Bonus points if you have…

  • Contributed open source code related to our tech stack
  • Maintained very large databases
  • Been through a successful disaster response
  • Worked with multi-region architectures
  • Run MLOps systems
  • Scaled Temporal

Benefits

  • Competitive compensation including an organization-wide goal-based bonus
  • Paid Time Off: ~5 Weeks PTO upon joining + Winter and Summer Holiday Breaks. Each year with the company, you’ll receive 2 additional PTO days.
  • 80% Work Option: Work with your manager to choose between working 5 day weeks (standard full-time) or 4 day weeks @ 80% pay
  • Paid Parental Leave for primary and secondary caregivers
  • Sabbatical: After 5 years with the team, you’re eligible for a 1 month paid sabbatical
  • Healthcare (US residents): Medical, Dental, Vision with HSA option, Dependent care FSA
  • 401k (US residents): We match 6% contributions with immediate vesting

Our Values

Build a house you want to live in - Examine long-term thinking and action

No BS - Practice transparency and honesty, especially when it’s hard

Invest in each other - Build successful relationships with your coworkers and customers

Discipline equals freedom - Keep your word to yourself and others

Strive for greatness - Constantly challenge yourself and others

Learn More

Listen to our CEO and Founder, Steli Efti, tell the story of Close’s journey in the $0-30m Blueprint.

Watch our culture video from our 2023 team retreat in Milan. Every year our entire team gathers in person to build connection, foster cross-functional collaboration, and have fun. In 2026, we’re headed to Barcelona, Spain!

Explore our product. Check out a demo!

Our Hiring Process

We ask a few role-specific questions as part of our application process. These questions are designed to help us learn more about you from the start so please answer each question thoughtfully. We see this as an opportunity to get to know you beyond your resume.

While we are excited by all the opportunities that generative AI has unlocked, we request that you refrain from relying exclusively on AI tools when completing an application, unless explicitly stated. Every application is read closely by humans and any obviously AI generated applications will be disregarded.

Regardless of fit, you can expect to hear back from our team with an update on the status of your candidacy.

If you progress to the interview process, you’ll receive a full outline of the role-specific interview process in your first touchpoint with us. We do our best to make the hiring process clear and human.

Skills

  • AWS
  • Python
  • Terraform
  • Kubernetes
  • GitHub Actions
  • Prometheus
  • Grafana
  • PostgreSQL
  • Redis
  • Docker
  • TCP
  • HTTP
  • Ansible
  • Elasticsearch
  • MongoDB
  • EC2
  • DNS
  • Flask
  • RDS
  • ArgoCD
  • ClickHouse
  • EKS
  • Loki
  • ElastiCache
  • MSK

Possible interview questions

Tests experience at the scale described in the vacancy.

Tell us about your experience optimizing the performance of, or migrating, multi-terabyte MongoDB/PostgreSQL clusters. What were the main difficulties you ran into?

The vacancy mentions the goal of minimizing downtime during maintenance.

How would you design a Kubernetes version upgrade for a production cluster with zero downtime for applications?

Assesses automation and GitOps skills.

Describe your experience with ArgoCD and Argo Workflows. How do you structure repositories to manage configuration for multiple regions?

Tests readiness for the 'last line of escalation' role in critical situations.

Describe the most difficult incident of your career in which you were the person ultimately responsible. How did you make decisions under pressure, and what did you learn?

Assesses understanding of the network protocols listed in the requirements.

How would you diagnose intermittent latency in HTTP requests between microservices inside EKS, taking the DNS and TCP layers into account?
