Страна: Индия

+500% приглашений

Откликайтесь
на вакансии с ИИ

LeadВ офисеПолная занятость

Lead DevOps Engineer

Name: Quick Offer — сервис для поиска работы на hh.ru
Brand: Quick Offer
SKU: quick-offer-saas
Availability: InStock
Rating: 4.9 (682 reviews)

Отличная позиция в быстрорастущем AI-секторе с возможностью влиять на архитектуру и использовать передовой стек технологий. Хороший пакет бенефитов и четко прописанные задачи по лидерству.

Вакансия из Quick Offer Global, списка международных компаний

Пожаловаться

Сложность вакансии

ЛегкоСложно

Роль требует глубоких знаний как в классическом DevOps (Kubernetes, Terraform), так и в специфических задачах MLOps (оркестрация GPU, Kubeflow). Высокий уровень ответственности за переход на self-hosted решения и менторство команды повышают планку требований.

Анализ зарплаты

Медиана65 000 $

Рынок50 000 $ – 90 000 $

Зарплата для Lead DevOps в Бангалоре (Индия) в AI-стартапах обычно выше среднего по рынку из-за дефицита кадров на стыке DevOps и ML. Предложенная позиция соответствует верхнему эшелону локального рынка.

I am writing to express my strong interest in the Lead DevOps Engineer position at Observe.AI. With over 6 years of experience in scaling cloud infrastructure and a deep focus on Kubernetes and MLOps, I am excited about the opportunity to lead your transition toward self-hosted infrastructure and optimize GPU orchestration for your AI-driven platform.

In my previous roles, I have successfully managed large-scale EKS clusters and implemented robust CI/CD pipelines using Terraform and ArgoCD. I am particularly drawn to Observe.AI's mission of unifying conversation intelligence and look forward to applying my expertise in FinOps and high-availability systems to ensure your AI models are deployed with maximum efficiency and minimum latency.

I am confident that my technical leadership and hands-on experience with Elasticsearch, Prometheus, and AI-focused cloud services will allow me to make an immediate impact on your engineering team. Thank you for considering my application.

+250% к просмотрам

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в observeai уже сейчас

Присоединяйтесь к лидерам в области AI и возглавьте трансформацию облачной инфраструктуры в Observe.AI!

Описание вакансии

About Us

Observe.AI is the enterprise-grade Customer Experience AI platform that unifies conversations, intelligence, and action to turn contact centers into performance engines. Built to optimize the full lifecycle of human and AI agents, Observe.AI enables enterprises to automate customer interactions, augment agent performance, and deliver governed AI at scale.

On a single platform, Observe.AI combines Voice and Chat AI Agents, real-time AI Copilots, and Conversation Intelligence with 100% interaction coverage for quality, compliance, and performance management. Trusted by brands like DoorDash, Affordable Care, Signify Health, and Verida, Observe.AI delivers fast time-to-value, measurable ROI, and consistent, high-quality customer experiences across every channel.

Why Join Us

Joining Observe.AI as a Lead DevOps Engineer puts you at the forefront of AI and cloud infrastructure, where you’ll own and scale systems powering real-world customer interactions. You’ll drive high-impact initiatives like GPU orchestration, self-hosting, and low-latency AI deployments while working closely with ML teams to productionize cutting-edge models. With end-to-end ownership, a modern tech stack, and the opportunity to shape MLOps best practices, this role offers strong technical leadership, tangible business impact, and accelerated growth in a fast-scaling AI company.

What you’ll be doing

Manager Self-Hosting tools: Lead the transition from managed services to self-hosted Elastic search, Prometheus, and other critical infrastructure components to optimize performance and cost.
Optimize AI Infrastructure: Work closely with ML engineers and data scientists to efficiently deploy and scale AI/ML models, ensuring high availability and low-latency inference.
Infrastructure Scalability & Reliability: Design and implement scalable, fault-tolerant systems capable of handling large-scale AI workloads, distributed training, and high-throughput data pipelines.
Technology Evaluation & Implementation: Continuously assess and introduce new technologies to enhance automation, reliability, and security in AI model deployment and training pipelines.
CI/CD for AI Workflows: Enhance and automate ML model deployment pipelines using MLOps best practices and tools like Kubeflow, MLflow, and Argo Workflows.
Observability & Monitoring: Implement and enhance monitoring, logging, and alerting strategies using Prometheus, Grafana, ELK, OpenTelemetry, etc., tailored for AI workloads.
Security Best Practices: Implement security measures for AI data pipelines, model storage, and cloud infrastructure.
Mentorship & Best Practices: Set high standards by implementing best practices in DevOps and MLOps, mentoring team members to raise the technical bar.

What you bring to the role

6+ years of experience in DevOps, SRE, or Cloud Infrastructure roles, preferably in AI or data-intensive environments.
Strong expertise in Kubernetes (EKS, AKS preferred ) for deploying AI workloads and managing GPU & non-CPU clusters.
Experience with self-hosting services like Elasticsearch, Prometheus, Grafana, Kafka, etc.
Hands-on expertise in Infrastructure as Code (Terraform, CloudFormation).
Deep understanding of cloud platforms (AWS, Azure, GCP) and AI-focused services like AWS Sagemaker, Vertex AI, or Azure ML.
Strong automation and scripting skills in Python, Bash, or Go.
Experience in CI/CD tools (Jenkins, GitHub Actions, ArgoCD, etc.) with a focus on AI model deployment.
Strong leadership and mentorship skills to guide DevOps and ML teams.
FinOps expertise for optimizing GPU and AI cloud compute costs.
Familiarity with service meshes (Istio, Linkerd) and API gateways.
Knowledge of compliance frameworks (SOC2, ISO 27001, etc.) for AI data pipelines.

Perks & Benefits

Excellent medical insurance options and free online doctor consultations
Yearly privilege and sick leaves as per Karnataka S&E Act
Generous holidays (National and Festive) recognition and parental leave policies
Learning & Development fund to support your continuous learning journey and professional development
Fun events to build culture across the organization
Flexible benefit plans for tax exemptions (i.e. Meal card, PF, etc.)

Our Commitment to Inclusion and Belonging

Observe.AIis an Equal Employment Opportunity employer that proudly pursues and hires a diverse workforce. Observe AI does not make hiring or employment decisions on the basis of race, color, religion or religious belief, ethnic or national origin, nationality, sex, gender, gender identity, sexual orientation, disability, age, military or veteran status, or any other basis protected by applicable local, state, or federal laws or prohibited by Company policy. Observe.AI also strives for a healthy and safe workplace and strictly prohibits harassment of any kind.

We welcome all people. We celebrate diversity of all kinds and are committed to creating an inclusive culture built on a foundation of respect for all individuals. We seek to hire, develop, and retain talented people from all backgrounds. Individuals from non-traditional backgrounds, historically marginalized or underrepresented groups are strongly encouraged to apply.

If you are ambitious, make an impact wherever you go, and you're ready to shape the future of Observe.AI, we encourage you to apply. For more information, visitwww.observe.ai.

+400% к собеседованиям

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Kubernetes
Amazon EKS
Azure Kubernetes Service (AKS)
Terraform
AWS CloudFormation
Python
Go
Bash
ElasticSearch
Prometheus
Grafana
Apache Kafka
Jenkins
GitHub Actions
Argo CD
Kubeflow
MLflow
Argo Workflows
Istio
Linkerd
Amazon SageMaker
Google Vertex AI
Azure Machine Learning
FinOps
MLOps

Возможные вопросы на собеседовании

Вакансия предполагает переход от управляемых сервисов к self-hosted решениям (Elasticsearch, Prometheus). Важно понять опыт кандидата в поддержке таких систем.

Расскажите о вашем опыте миграции с управляемых облачных сервисов (например, AWS Managed Elasticsearch) на self-hosted решения в Kubernetes. С какими основными трудностями вы столкнулись?

Работа с AI-нагрузками требует специфических знаний по управлению GPU.

Как вы подходите к планированию и оптимизации ресурсов для GPU-кластеров в Kubernetes, чтобы минимизировать затраты и обеспечить высокую доступность для обучения моделей?

Роль включает в себя внедрение практик MLOps.

Какие инструменты (например, Kubeflow, MLflow) вы использовали для автоматизации жизненного цикла ML-моделей и как вы интегрировали их в стандартные CI/CD процессы?

Упоминается необходимость FinOps экспертизы.

Какие стратегии вы применяете для мониторинга и контроля облачных расходов, особенно в контексте дорогостоящих AI-вычислений?

Позиция уровня Lead подразумевает управление командой и процессами.

Опишите ваш подход к менторству и установлению технических стандартов в команде, где пересекаются интересы DevOps и ML-инженеров.

Устали искать работу? Мы найдём её за вас

Quick Offer улучшит ваше резюме, подберёт лучшие вакансии и откликнется за вас. Результат — в 3 раза больше приглашений на собеседования и никакой рутины!

Индия

Откликайтесь
на вакансии с ИИ

Lead DevOps Engineer

Анализ зарплаты

Сопроводительное письмо

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в observeai уже сейчас

Описание вакансии

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Возможные вопросы на собеседовании

Какие инструменты (например, Kubeflow, MLflow) вы использовали для автоматизации жизненного цикла ML-моделей и как вы интегрировали их в стандартные CI/CD процессы?

Какие стратегии вы применяете для мониторинга и контроля облачных расходов, особенно в контексте дорогостоящих AI-вычислений?

Опишите ваш подход к менторству и установлению технических стандартов в команде, где пересекаются интересы DevOps и ML-инженеров.

Похожие вакансии

Ведущий DevOps инженер CDEK.Shopping

Руководитель группы DevOps 1С

Tech Lead Infrastructure (K8s, SRE, AI)

Руководитель группы SRE офисных сетей

Email Infrastructure Deliverability Lead

Lead DevOps

Устали искать работу? Мы найдём её за вас

Откликайтесьна вакансии с ИИ

Lead DevOps Engineer

Анализ зарплаты

Сопроводительное письмо

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в observeai уже сейчас

Описание вакансии

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Возможные вопросы на собеседовании

Какие инструменты (например, Kubeflow, MLflow) вы использовали для автоматизации жизненного цикла ML-моделей и как вы интегрировали их в стандартные CI/CD процессы?

Какие стратегии вы применяете для мониторинга и контроля облачных расходов, особенно в контексте дорогостоящих AI-вычислений?

Опишите ваш подход к менторству и установлению технических стандартов в команде, где пересекаются интересы DevOps и ML-инженеров.

Похожие вакансии

Ведущий DevOps инженер CDEK.Shopping

Руководитель группы DevOps 1С

Tech Lead Infrastructure (K8s, SRE, AI)

Руководитель группы SRE офисных сетей

Email Infrastructure Deliverability Lead

Lead DevOps

Устали искать работу? Мы найдём её за вас

Откликайтесь
на вакансии с ИИ