Страна: Канада
Зарплата: 10 100 000 ₽ – 11 500 000 ₽

+500% приглашений

Откликайтесь
на вакансии с ИИ

ГибридПолная занятость

ML Infrastructure Engineer

Name: Quick Offer — сервис для поиска работы на hh.ru
Brand: Quick Offer
SKU: quick-offer-saas
Availability: InStock
Rating: 4.9 (682 reviews)

Отличная позиция для роста: роль 'первого в своем роде' сотрудника дает огромный масштаб влияния. Компания работает с мировыми брендами, предлагает конкурентную зарплату и современный стек технологий.

Вакансия из Quick Offer Global, списка международных компаний

Пожаловаться

Сложность вакансии

ЛегкоСложно

Роль требует глубоких знаний как в области DevOps (Terraform, Kubernetes), так и в специфике ML (SageMaker, MLflow). Высокая ответственность обусловлена тем, что это первая позиция такого рода в компании, что подразумевает самостоятельное формирование стратегии.

Анализ зарплаты

Медиана150 000 CA$

Рынок130 000 CA$ – 175 000 CA$

Предложенная зарплата в 145,000 - 165,000 CAD находится в верхнем сегменте рыночного диапазона для Ванкувера. Это соответствует уровню Senior/Lead специалиста в области ML Infrastructure.

I am writing to express my strong interest in the ML Infrastructure Engineer position at Later. With over 4 years of experience in building production-grade ML systems and a deep focus on MLOps, I am excited about the opportunity to become your first dedicated engineer in this domain. My background in designing CI/CD pipelines for machine learning and managing containerized workloads on AWS and GCP aligns perfectly with your goal of building a scalable foundation for AI innovation.

In my previous roles, I have successfully automated end-to-end ML lifecycles, from model validation to deployment and monitoring using tools like SageMaker, Docker, and Terraform. I am particularly drawn to Later’s data-driven approach to influencer marketing and am eager to apply my expertise in infrastructure-as-code and GPU-based workload management to accelerate your data science initiatives. I look forward to the possibility of discussing how my technical skills can help Later bridge the gap between experimentation and high-impact production models.

+250% к просмотрам

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в later уже сейчас

Присоединяйтесь к Later в качестве первого инженера по ML-инфраструктуре и определите будущее AI-технологий в маркетинге!

Описание вакансии

Later is the world’s most intelligent influencer marketing company, built to give brands the confidence to create unforgettable campaigns. By combining real creator relationships, trusted intelligence, and expert guidance, Later removes fear and guesswork from one of marketing’s most visible investments.

Built on a native, AI-powered platform and more than a decade of proprietary data—including billions of social interactions, impressions, and $2.4B+ in verified influencer-driven purchases—Later helps teams understand what will work before they launch.

By combining trusted insight with expert guidance, Later removes guesswork from influencer marketing, enabling brands to choose the right creators, execute fully managed campaigns, and drive meaningful growth across awareness, engagement, and revenue. Trusted by leading enterprise brands including Nike, Wayfair, Unilever, and Southwest Airlines, Later bridges creativity and performance so campaigns don’t just look good—they deliver results. Learn more at later.com.

About this position:

We’re looking for a Machine Learning Infrastructure Engineer to join our growing Data & Platform team and build the foundation that powers our AI and machine learning capabilities across Later’s product portfolio. As our first dedicated ML Infrastructure Engineer, you will own the systems that support model experimentation, training, deployment, and monitoring at scale.

This role is critical to accelerating our data science initiatives and enabling future AI innovation. You’ll design and operate reliable, secure, and scalable ML infrastructure that empowers data scientists and engineers to ship high-impact models with confidence. If you’re excited about building robust ML systems in a fast-moving environment—and want to define the standard for ML Ops at Later—this is your opportunity.

What you'll be doing:

Strategy

Define and own the long-term ML infrastructure roadmap, ensuring it supports both current experimentation needs and future AI initiatives.
Establish best practices for model lifecycle management, deployment standards, monitoring, and governance.
Identify infrastructure gaps and proactively design scalable solutions to enable high-velocity ML development.
Contribute to cross-functional technical planning, ensuring ML systems align with product and platform strategy.

Technical/ Execution

Design, build, and maintain production-grade model deployment and inference systems using CI/CD pipelines, containerized services (Docker), and API frameworks (e.g., Flask).
Automate end-to-end ML lifecycle workflows including training pipelines, model validation, registry management, deployment, and rollback strategies.
Implement robust monitoring systems for model performance, latency, drift detection, and infrastructure health using tools such as CloudWatch, Prometheus, and Grafana.
Operate across AWS and GCP environments to manage training and inference workloads, including GPU-based infrastructure and BigQuery datasets.
Develop and maintain infrastructure-as-code (Terraform, CloudFormation) to ensure scalable, repeatable, and secure cloud environments.
Implement and optimize CI/CD workflows (e.g., GitHub Actions, GitLab CI, Bitbucket Pipelines) for ML and infrastructure automation.

Team / Collaboration

Partner closely with Data Scientists, Analysts, Platform Engineers, and Product Engineers to support end-to-end ML workflows.
Translate data science experimentation needs into production-ready infrastructure solutions.
Serve as the technical bridge between ML experimentation and productized deployment.
Share knowledge and best practices to elevate ML maturity across teams.

Research/Best Practices

Stay current on emerging ML Ops practices, tools, and frameworks to continuously improve system reliability and efficiency.
Evaluate and implement model-serving frameworks (e.g., TorchServe, Seldon, TensorRT) where appropriate.
Contribute to governance, reproducibility, and auditability standards for ML systems.
Experiment with new tooling and workflows to improve reproducibility, performance, and developer velocity.

What success looks like:

ML models move from experimentation to production quickly and reliably, with minimal manual intervention.
CI/CD pipelines enable safe, repeatable deployments with clear rollback strategies.
Model performance, drift, and infrastructure health are proactively monitored and observable.
Infrastructure supports scalable GPU training and real-time inference without bottlenecks.
Data scientists report improved velocity, reproducibility, and confidence in deploying models.
ML systems are secure, compliant, and aligned with evolving product and AI strategy.

What you bring:

4+ years of experience in ML Ops, ML infrastructure, backend engineering, or related roles supporting production ML systems.
Experience working in cloud-native environments (AWS and/or GCP) with hands-on deployment of ML workloads.
Proven track record designing and implementing CI/CD pipelines for ML systems.
Strong experience with Amazon SageMaker, Docker, Flask-based APIs, and infrastructure automation tools.
Hands-on experience with ML lifecycle tooling such as MLflow, SageMaker Studio, or Weights & Biases.
Experience managing container orchestration platforms (Kubernetes, EKS, or GKE).
Strong programming experience in Python (additional experience in Go, Java, or Scala is a plus).
Experience working with infrastructure-as-code tools such as Terraform or CloudFormation.
Familiarity with observability tools such as CloudWatch, Prometheus, Grafana, Datadog, or centralized logging platforms.
Experience managing GPU-based workloads and scaling training/inference systems.
Familiarity with data infrastructure tools such as BigQuery and cloud-native data pipelines.
Bonus: Experience supporting LLMs or generative AI pipelines, distributed training systems, feature stores (e.g., Feast), real-time inference systems, or ML governance frameworks.
A mindset focused on automation, reliability, performance, and continuous improvement in fast-scaling environments.

How you work:

Driven by Impact: You deliver results that matter—prioritizing high-value work, meeting deadlines, and adapting quickly while keeping outcomes clear.
Strategic & Customer-Centric: You anticipate risks and opportunities, connect decisions to long-term growth, and build trust through proactive insights.
Curious & Growth-Oriented: You seek knowledge, ask sharp questions, and apply learnings fast—challenging the status quo with a mindset of improvement.
Collaborative & Resilient: You thrive in change by staying resourceful, solution-focused, and positive—removing roadblocks, sharing insights, and keeping morale high.
Accountable & Honest: You own your work, hold yourself and others to a high bar, and use transparent feedback to drive growth.
Emotionally Intelligent: You build trust through empathy and collaboration, foster inclusion, and inspire others with grit, optimism, and integrity.

Our approach to compensation:

We take a market-based & data-driven approach to compensation. We leverage data from trusted third-party compensation sources to help us understand the market value of a role based on function, level, geographic location, and scope. We evaluate compensation bi-annually, including performance and market-related factors.

Our salaries are benchmarked against market Total Cash Compensation for the geographic location of our job posting. Compensation for some roles is structured as On Target Earnings (OTE = base + commission/variable) while for others it is structured as Salary only.

To comply with local legislation and ensure transparency, we share salary ranges on all job postings. Skills, experience and other factors help determine the final salary we offer which may vary from the original range posted.

Additionally, all permanent team members are eligible to participate in various benefits plans as part of their overall compensation package.

Salary Range:

$ 145,000 -165,000 CAD

#LI-Hybrid

Where we work:

We have offices in Boston, MA; Vancouver, BC; Chicago, IL; and Vancouver, WA. For select positions, we are open to hiring fully remote candidates. We post our positions in the location(s) where we are open to having the successful candidate be located.

Diversity, inclusion, and accessibility:

At Later, we are committed to fostering a culture rooted in an inclusion-first mindset at every level of the company, embracing the importance of hiring and building teams for culture add rather than culture fit. We openly build and maintain unbiased hiring, pay, and promotion practices to create a foundation for an equitable workplace, paving the way for systemic change.

We are committed to creating a diverse environment and are proud to be an equal opportunity employer. All applications will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, national origin, disability, or age. Please let us know if you require any accommodations or support during the recruitment process.

+400% к собеседованиям

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Python
AWS
Google Cloud Platform
Docker
Kubernetes
Terraform
Amazon SageMaker
MLflow
CI/CD
Flask
Prometheus
Grafana
BigQuery
Infrastructure as Code

Возможные вопросы на собеседовании

Проверка опыта работы с основным инструментом, указанным в вакансии.

Опишите ваш опыт настройки и оптимизации пайплайнов в Amazon SageMaker для обучения и деплоя моделей.

Важно понять, как кандидат обеспечивает надежность систем.

Как вы организуете мониторинг дрейфа данных (data drift) и производительности моделей в продакшене?

Вакансия требует навыков автоматизации инфраструктуры.

Расскажите о наиболее сложном кейсе использования Terraform или CloudFormation для развертывания ML-ресурсов.

Проверка навыков работы с современными тяжелыми моделями.

Какие подходы вы используете для масштабирования GPU-инфраструктуры при обучении больших моделей?

Оценка способности работать на стыке команд.

Как вы выстраиваете процесс передачи модели от Data Scientist к инженеру для вывода в продакшн, чтобы минимизировать ошибки?

Устали искать работу? Мы найдём её за вас

Quick Offer улучшит ваше резюме, подберёт лучшие вакансии и откликнется за вас. Результат — в 3 раза больше приглашений на собеседования и никакой рутины!

Канадаот 10 100 000 ₽

Откликайтесь
на вакансии с ИИ

ML Infrastructure Engineer

Анализ зарплаты

Сопроводительное письмо

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в later уже сейчас

Описание вакансии

About this position:

What you'll be doing:

Strategy

Technical/ Execution

Team / Collaboration

Research/Best Practices

What success looks like:

What you bring:

How you work:

Our approach to compensation:

Where we work:

Diversity, inclusion, and accessibility:

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Возможные вопросы на собеседовании

Опишите ваш опыт настройки и оптимизации пайплайнов в Amazon SageMaker для обучения и деплоя моделей.

Как вы организуете мониторинг дрейфа данных (data drift) и производительности моделей в продакшене?

Расскажите о наиболее сложном кейсе использования Terraform или CloudFormation для развертывания ML-ресурсов.

Какие подходы вы используете для масштабирования GPU-инфраструктуры при обучении больших моделей?

Как вы выстраиваете процесс передачи модели от Data Scientist к инженеру для вывода в продакшн, чтобы минимизировать ошибки?

Похожие вакансии

T-shape Аналитик AI (Middle / Senior)

Архитектор мультиагентных систем на базе LLM

Fullstack разработчик-подмастерье (AI Engineer)

Специалист по AI-инструментам

Fullstack / AI разработчик (подмастерье)

AI engineer (ML/DS)

Устали искать работу? Мы найдём её за вас

Откликайтесьна вакансии с ИИ

ML Infrastructure Engineer

Анализ зарплаты

Сопроводительное письмо

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в later уже сейчас

Описание вакансии

About this position:

What you'll be doing:

Strategy

Technical/ Execution

Team / Collaboration

Research/Best Practices

What success looks like:

What you bring:

How you work:

Our approach to compensation:

Where we work:

Diversity, inclusion, and accessibility:

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Возможные вопросы на собеседовании

Опишите ваш опыт настройки и оптимизации пайплайнов в Amazon SageMaker для обучения и деплоя моделей.

Как вы организуете мониторинг дрейфа данных (data drift) и производительности моделей в продакшене?

Расскажите о наиболее сложном кейсе использования Terraform или CloudFormation для развертывания ML-ресурсов.

Какие подходы вы используете для масштабирования GPU-инфраструктуры при обучении больших моделей?

Как вы выстраиваете процесс передачи модели от Data Scientist к инженеру для вывода в продакшн, чтобы минимизировать ошибки?

Похожие вакансии

T-shape Аналитик AI (Middle / Senior)

Архитектор мультиагентных систем на базе LLM

Fullstack разработчик-подмастерье (AI Engineer)

Специалист по AI-инструментам

Fullstack / AI разработчик (подмастерье)

AI engineer (ML/DS)

Устали искать работу? Мы найдём её за вас

Откликайтесь
на вакансии с ИИ