- Country
- Portugal

Senior Reliability Engineer (Data Infrastructure)
An excellent opening at a well-known fintech unicorn with a modern stack, equity options, and a focus on professional development. The hybrid format in Lisbon and unlimited time off make the offer very attractive.
Job difficulty
The high difficulty stems from the need for deep knowledge of both SRE tooling (Kubernetes, Terraform) and data systems (Kafka, Cassandra, Spark), plus participation in on-call rotations.
Salary analysis
The posting does not state a salary, but the market range for a Senior SRE position in Lisbon is 60,000 to 85,000 euros per year. Given the equity and the company's standing, total compensation may be above the market average.
Cover letter
I am writing to express my strong interest in the Senior Site Reliability Engineer (Data Infrastructure) position at ComplyAdvantage. With extensive experience in managing complex data systems like Kafka, PostgreSQL, and Cassandra within Kubernetes environments, I am confident in my ability to enhance the reliability and performance of your critical data layers. My background in automating cloud infrastructure using Terraform and Helm, combined with a deep commitment to GitOps principles, aligns perfectly with your tech stack and operational goals.
Throughout my career, I have successfully implemented SLOs/SLAs and led incident response efforts in multi-cloud environments (AWS and GCP). I am particularly drawn to ComplyAdvantage's mission of using AI to combat financial crime and am excited about the opportunity to contribute to a platform that handles massive scale while maintaining low latency. I look forward to bringing my expertise in distributed databases and infrastructure automation to your Lisbon-based team.
Job description
What you will be doing:
We are seeking a highly skilled Senior Site Reliability Engineer (SRE) to join our Data Infrastructure team. You will be responsible for ensuring the reliability, availability, and performance of our critical data systems running on AWS and GCP. Your expertise in cloud infrastructure, automation, and operational excellence will be crucial in supporting our product across our global client base.
As a Senior Site Reliability Engineer you will:
- Design, implement, and maintain highly available and reliable data infrastructure services, including SQL, NoSQL, Kafka, and Spark-based data layers. Define and monitor Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
- Participate in an on-call rotation to respond to incidents and ensure rapid resolution of production issues. Conduct thorough post-incident reviews to identify root causes and implement preventative measures.
- Manage and automate cloud infrastructure using Terraform and Helm, adhering to GitOps principles.
- Implement and maintain comprehensive monitoring, logging, and tracing solutions to proactively identify and resolve performance and reliability issues.
- Monitor and manage data infrastructure capacity, plan for future growth, and optimize performance through tuning and automation.
- Develop and maintain automation scripts and tools to streamline operational tasks, improve efficiency, and reduce manual effort.
- Ensure the security and compliance of data infrastructure services, implementing best practices for access control, data protection, and vulnerability management.
- Collaborate with development and data engineering teams to ensure smooth deployments and operational support. Maintain thorough documentation of infrastructure configurations, processes, and procedures.
- Manage and maintain distributed databases running within a Kubernetes environment.
Our Tech Stack:
- Cloud-Based Infrastructure: Fully cloud-based with a Kubernetes-focused tech stack. Compute workloads run in Kubernetes clusters across multiple regions.
- Infrastructure Management: Heavy use of Terraform and Helm, adhering to GitOps paradigms for managing cloud infrastructure and Kubernetes applications.
- Core Technologies: Extensive use of Kafka, distributed PostgreSQL and Cassandra QL, Elasticsearch, and Databricks/Spark. Development of inter-cloud failover options to support multi-cloud plans.
- Wide Array of Applications: Teams build and release containerised applications for low latency APIs, machine learning models, and data processing pipelines.
About You:
- Experience as an SRE managing cloud infrastructure (AWS and/or GCP) and data systems (Apache Kafka, Apache Spark, Elasticsearch, PostgreSQL, Cassandra). Proven track record of improving reliability and availability in complex production environments.
- Extensive experience codifying infrastructure using Terraform and Helm charts.
- Proven experience managing and troubleshooting distributed databases within Kubernetes.
- Deep understanding of monitoring, logging, and tracing tools and techniques.
- Strong incident response and troubleshooting skills.
- Proficiency in scripting and automation tools.
- Understanding of security best practices for cloud infrastructure and data systems.
- Familiarity with CI tooling, test pipelines, and asset generation (e.g., Docker images, Helm charts). Understanding of security considerations in data systems.
Education:
- BSc/BA degree in computer science, engineering, or related discipline OR equivalent experience in required skills.
Nice to have
- Familiarity with distributed SQL and NoSQL databases such as Yugabyte, Cockroach, Spanner, HBase, or CouchDB.
- Familiarity with data modelling, sharding, and indexing strategies for large-scale databases.
What’s in it for you?
- Equity as we want you to have a part of what we are building
- Private medical insurance, giving you peace of mind while you excel in your career
- Unlimited Time Off Policy: work-life balance and a focus on well-being are critical to keeping us performing at our best
- We embrace a hybrid approach that requires employees to be in the office for two days a week. We strongly believe that this approach fosters collaboration and enables the building of meaningful relationships
- You will also get a new starter budget to kit out your home office
- Opportunity to work on innovative projects with smart-minded people keen to share their knowledge and continuously improve
- Annual learning budget (prorated based on start date) to drive your performance and career development
About us:
Our mission is to empower every business to eliminate financial crime.
By harnessing AI, a unified platform, and an extensive partner ecosystem, we help customers turn compliance into a catalyst for growth, operational resilience, and enduring regulatory trust.
More than 3,000 enterprises across 75 countries rely on our end-to-end platform and the world’s most comprehensive financial crime risk intelligence. With full-stack agentic automation, we help organizations automate up to 95% of KYC, AML, and sanctions reviews, cut onboarding times by 50%, reduce false positives by 70%, and handle 7x more work with the same staff.
ComplyAdvantage is headquartered in London and has global hubs in New York, Lisbon, Singapore, and Cluj-Napoca. It is backed by Balderton Capital, Index Ventures, Ontario Teachers’ Pension Plan, Goldman Sachs, and Andreessen Horowitz. Learn more about compliance re-engineered for the age of AI at complyadvantage.com.

Skills
- AWS
- Terraform
- GCP
- Kubernetes
- Helm
- PostgreSQL
- Docker
- GitOps
- Apache Spark
- Databricks
- Apache Kafka
- Elasticsearch
- Cassandra
Possible interview questions
Tests experience with critical components of the company's stack.
Describe your experience scaling Kafka or Cassandra clusters in Kubernetes and making them fault-tolerant. What were the main problems you ran into?
Assesses automation skills and adherence to GitOps principles.
How do you structure Terraform modules and Helm charts to manage infrastructure across several clouds (AWS/GCP) at once?
Tests your methodological approach to reliability.
How do you draw the lines between SLI, SLO, and SLA for data infrastructure, and what do you do when the error budget is breached?
Assesses troubleshooting skills and resilience under stress.
Describe the most difficult data incident you have investigated. How did you identify the root cause, and what preventive measures did you put in place?
Tests knowledge of data security.
What best practices do you apply to ensure security and compliance for distributed databases running in a public cloud?
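For the error-budget question above, it can help to have the arithmetic at hand. The following is a minimal sketch, assuming a simple time-based availability SLO over a rolling window; the function names and the 30-day window are illustrative assumptions, not anything stated in the posting.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Allowed 'bad' minutes for an availability SLO over the window.

    slo_target is a fraction such as 0.999 (hypothetical example value).
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes


def budget_remaining(slo_target: float, bad_minutes: float,
                     window_days: int = 30) -> float:
    """Fraction of the error budget still unspent (negative once exceeded)."""
    budget = error_budget_minutes(slo_target, window_days)
    return (budget - bad_minutes) / budget


if __name__ == "__main__":
    # A 99.9% target over 30 days allows roughly 43.2 minutes of unavailability.
    print(round(error_budget_minutes(0.999), 1))
    # After 21.6 bad minutes, about half of the budget is left.
    print(round(budget_remaining(0.999, 21.6), 2))
```

A negative result from `budget_remaining` signals that the budget is spent, which is the usual trigger for slowing feature releases in favor of reliability work.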