Страна: Канада
Зарплата: 225 000 $ – 315 000 $

+500% приглашений

Откликайтесь
на вакансии с ИИ

УдалённоПолная занятость

AI/ML Specialist Solutions Architect

Name: Quick Offer — сервис для поиска работы на hh.ru
Brand: Quick Offer
SKU: quick-offer-saas
Availability: InStock
Rating: 4.9 (682 reviews)

Отличная вакансия с очень высокой зарплатой (до $315k OTE), работой с передовым оборудованием (H200/B200) и возможностью удаленной работы из США или Канады. Компания быстро растет и предлагает сильный соцпакет.

Вакансия из Quick Offer Global, списка международных компаний

Пожаловаться

Сложность вакансии

ЛегкоСложно

Высокая сложность обусловлена требованиями к глубокому техническому опыту (7-10 лет) в области MLOps и распределенного обучения на сотнях GPU, а также необходимостью совмещать инженерные навыки с ролью архитектора-консультанта.

Анализ зарплаты

Медиана250 000 $

Рынок200 000 $ – 350 000 $

Предлагаемый диапазон $225k - $315k OTE находится на верхнем уровне рыночных ожиданий для опытных архитекторов решений в области ИИ в Северной Америке, особенно для удаленного формата.

I am writing to express my strong interest in the AI/ML Specialist Solutions Architect position at Nebius. With over 8 years of experience in MLOps and Machine Learning engineering, I have developed a deep expertise in designing and scaling distributed training pipelines across multi-node GPU environments. My background aligns perfectly with Nebius's mission to provide cutting-edge cloud infrastructure for the global AI economy, especially given my hands-on experience with PyTorch, Kubernetes, and Terraform.

In my previous roles, I have successfully transitioned complex ML models from POC to production, managing large-scale deployments that required meticulous optimization of GPU resources. I am particularly excited about the opportunity to work with advanced hardware like the H200 and B200 clusters. I am confident that my technical proficiency, combined with my ability to act as a trusted advisor for enterprise clients, will allow me to contribute significantly to the success of Nebius's customers and the evolution of your AI Cloud platform.

+250% к просмотрам

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в nebius уже сейчас

Присоединяйтесь к Nebius и создавайте будущее ИИ-инфраструктуры на базе мощнейших GPU-кластеров!

Описание вакансии

Why work at NebiusNebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field.

Where we workHeadquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with R&D hubs across Europe, North America, and Israel. The team of over 1400 employees includes more than 400 highly skilled engineers with deep expertise across hardware and software engineering, as well as an in-house AI R&D team.

Customer experience:

Customer experience at Nebius AI Cloud involves tackling customers’ challenges and directly impacting their success by solving real-world AI and ML problems at massive GPU cloud scale. You’ll not only resolve issues, but play a key role in shaping clients’ business success by optimizing their AI solutions. Working with advanced GPUs such as H200, B200 and GB200, as well as modern ML frameworks, you’ll influence the development of the Nebius AI Cloud and gain experience at the intersection of infrastructure and AI. With minimal bureaucracy, you’ll have the freedom to innovate, take ownership and drive change. Opportunities for growth are abundant in this vibrant and supportive professional community.

The role

We seek an experienced AI/ML Specialist Solutions Architect to support AI-focused customers leveraging Nebius services. In this role, you will be a trusted advisor, collaborating with clients to design scalable AI solutions, resolve technical challenges and manage large-scale AI deployments involving hundreds to thousands of GPUs.

You’re welcome to work remotely from the United States or Canada.

Your responsibilities will include:

Designing customer-centric solutions that maximize business value and align with strategic goals.
Building and maintaining long-term relationships to foster trust and ensure customer satisfaction.
Delivering technical presentations, producing whitepapers, creating manuals and hosting webinars for audiences with varying technical expertise.
Collaborating with engineering and product teams to effectively prioritize and relay customer feedback.

We expect you to have:

7-10 + years of experience with cloud technologies in MLOps engineering, Machine Learning engineering or similar roles.
Strong understanding of ML ecosystems, including models, use cases and tooling.
Proven experience in setting up and optimizing distributed training pipelines across multi-node and multi-GPU environments.
Hands-on knowledge of frameworks like PyTorch or JAX.
Excellent verbal and written communication skills.

It will be an added bonus if you have:

Expertise in deploying inference infrastructure for production workloads.
Ability to transition ML pipelines from POC to scalable production systems.

Preferred tooling:

Programming Languages – Python, Go, Java, C++
Orchestration – Kubernetes (K8s), Slurm
DevOps Tools – Git, Docker, Helm
Infrastructure as Code (IaC) – Terraform
ML Frameworks and Libraries – PyTorch, TensorFlow, JAX, HuggingFace, Scikit-learn

Key Employee Benefits:

Health Insurance:100% company-paid medical, dental, and vision coverage for employees and families.
401(k) Plan:Up to 4% company match with immediate vesting.
Parental Leave:20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
Remote Work Reimbursement:Up to $85/month for mobile and internet.
Disability & Life Insurance:Company-paid short-term, long-term, and life insurance coverage.

Compensation

We offer competitive salaries, ranging from 225k - 315k OTE (On-Target Earnings) and equity based on your experience, skills, and location.

Join Nebius Today!

What we offer

Competitive salary and comprehensive benefits package.
Opportunities for professional growth within Nebius.
Flexible working arrangements.
A dynamic and collaborative work environment that values initiative and innovation.

We’re growing and expanding our products every day. If you’re up to the challenge and are excited about AI and ML as much as we are, join us!

+400% к собеседованиям

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Python
Go
Java
C++
Kubernetes
Slurm
Git
Docker
Helm
Terraform
PyTorch
TensorFlow
JAX
Huggingface
Scikit-learn
MLOps
Machine Learning
Distributed Training
Inference

Возможные вопросы на собеседовании

Вакансия подразумевает работу с огромными кластерами GPU. Важно понимать, как кандидат решает проблемы пропускной способности и задержек.

Опишите ваш опыт оптимизации распределенного обучения (Distributed Training) на нескольких узлах. С какими узкими местами в сетевой инфраструктуре вы сталкивались?

Nebius использует K8s и Slurm. Кандидат должен понимать разницу в подходах к оркестрации для ИИ-задач.

В каких сценариях вы бы предпочли использовать Slurm вместо Kubernetes для управления ML-нагрузками, и наоборот?

Роль Solutions Architect требует умения объяснять сложные вещи клиентам.

Как бы вы объяснили клиенту преимущества перехода с обучения на одном узле на распределенное обучение, учитывая затраты и сложность инфраструктуры?

Упоминается работа с новейшими GPU (H200, B200). Важно знание специфики железа.

Какие архитектурные особенности последних поколений GPU NVIDIA (например, H100/H200) наиболее критичны для оптимизации производительности LLM?

В бонусах указано развертывание инференса. Это критично для бизнес-задач клиентов.

Расскажите о вашем опыте построения масштабируемой инфраструктуры для инференса. Как вы обеспечиваете низкую задержку при высокой нагрузке?

Устали искать работу? Мы найдём её за вас

Quick Offer улучшит ваше резюме, подберёт лучшие вакансии и откликнется за вас. Результат — в 3 раза больше приглашений на собеседования и никакой рутины!

Канадаот 225 000 $

Откликайтесь
на вакансии с ИИ

AI/ML Specialist Solutions Architect

Анализ зарплаты

Сопроводительное письмо

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в nebius уже сейчас

Описание вакансии

The role

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Возможные вопросы на собеседовании

Опишите ваш опыт оптимизации распределенного обучения (Distributed Training) на нескольких узлах. С какими узкими местами в сетевой инфраструктуре вы сталкивались?

В каких сценариях вы бы предпочли использовать Slurm вместо Kubernetes для управления ML-нагрузками, и наоборот?

Как бы вы объяснили клиенту преимущества перехода с обучения на одном узле на распределенное обучение, учитывая затраты и сложность инфраструктуры?

Какие архитектурные особенности последних поколений GPU NVIDIA (например, H100/H200) наиболее критичны для оптимизации производительности LLM?

Расскажите о вашем опыте построения масштабируемой инфраструктуры для инференса. Как вы обеспечиваете низкую задержку при высокой нагрузке?

Похожие вакансии

Архитектор мультиагентных систем на базе LLM

Fullstack разработчик-подмастерье (AI Engineer)

Fullstack / AI разработчик (подмастерье)

Applied AI / LLM Engineer (Python)

AI-разработчик (Senior)

Аналитик AI-агентов Senior

Устали искать работу? Мы найдём её за вас

Откликайтесьна вакансии с ИИ

AI/ML Specialist Solutions Architect

Анализ зарплаты

Сопроводительное письмо

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в nebius уже сейчас

Описание вакансии

The role

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Возможные вопросы на собеседовании

Опишите ваш опыт оптимизации распределенного обучения (Distributed Training) на нескольких узлах. С какими узкими местами в сетевой инфраструктуре вы сталкивались?

В каких сценариях вы бы предпочли использовать Slurm вместо Kubernetes для управления ML-нагрузками, и наоборот?

Как бы вы объяснили клиенту преимущества перехода с обучения на одном узле на распределенное обучение, учитывая затраты и сложность инфраструктуры?

Какие архитектурные особенности последних поколений GPU NVIDIA (например, H100/H200) наиболее критичны для оптимизации производительности LLM?

Расскажите о вашем опыте построения масштабируемой инфраструктуры для инференса. Как вы обеспечиваете низкую задержку при высокой нагрузке?

Похожие вакансии

Архитектор мультиагентных систем на базе LLM

Fullstack разработчик-подмастерье (AI Engineer)

Fullstack / AI разработчик (подмастерье)

Applied AI / LLM Engineer (Python)

AI-разработчик (Senior)

Аналитик AI-агентов Senior

Устали искать работу? Мы найдём её за вас

Откликайтесь
на вакансии с ИИ