- Страна
- США
- Зарплата
- 225 000 $ – 315 000 $
Откликайтесь
на вакансии с ИИ

Senior ML Solutions Architect - Token Factory
Исключительно привлекательная вакансия в быстрорастущей AI-компании с листингом на Nasdaq. Высокий уровень компенсации (до $315k OTE), отличный соцпакет и возможность работать с передовыми технологиями инференса LLM.
Сложность вакансии
Высокая сложность обусловлена требованием глубокой экспертизы в LLM (RAG, агенты, оптимизация вывода) и необходимостью совмещать навыки архитектора с клиентским консалтингом. Важен опыт работы с современным стеком (vLLM, LangChain) и понимание инфраструктурных аспектов.
Анализ зарплаты
Предлагаемый диапазон $225k - $315k OTE полностью соответствует и даже несколько превышает рыночные стандарты для Senior ML Solutions Architect в США, особенно учитывая наличие опционов (equity). Это топовый уровень компенсации для Tier-1 технологических компаний.
Сопроводительное письмо
I am writing to express my strong interest in the Senior ML Solutions Architect position at Nebius. With over 5 years of experience in ML systems and a deep focus on Generative AI, I have closely followed Nebius's emergence as a leader in AI cloud infrastructure. My background in building production-ready RAG architectures and agentic pipelines using LangChain and vLLM aligns perfectly with the goals of the Token Factory team.
In my previous roles, I have successfully guided enterprise clients from initial POC to full-scale production, optimizing for both performance and cost-efficiency. I am particularly excited about the opportunity to work with your serverless inference platform and contribute to the roadmap of open-source LLM solutions. My technical proficiency in Python and experience with multimodal models will allow me to provide immediate value to your clients and internal engineering teams alike.
Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в nebius уже сейчас
Присоединяйтесь к Nebius, чтобы проектировать будущее LLM-инфраструктуры на острие технологий!
Описание вакансии
Why work at NebiusNebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field.
Where we workHeadquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with R&D hubs across Europe, North America, and Israel. The team of over 1400 employees includes more than 400 highly skilled engineers with deep expertise across hardware and software engineering, as well as an in-house AI R&D team.
The role
We seek an experienced Senior ML Solutions Architect to support customers leveraging Nebius Token Factory's serverless inference platform for open-source LLMs across multiple modalities. In this role, you will be collaborating with clients to design and implement customized LLM-based solution and architect scalable AI applications using our served models, and working together with our backend team to improve our platform to match the clients' needs.
Your responsibilities will include:
- Design and implement LLM-based solutions using Nebius Token Factory’s inference services to drive business value and support customer goals.
- Build production-ready applications leveraging our serverless LLM APIs, including multimodal models (text, vision, audio) and domain-specific models.
- Provide technical expertise in prompt engineering, RAG architectures, model selection, and inference optimization.
- Collaborate with product and engineering teams to surface customer feedback and shape the platform roadmap.
- Guide customers in scaling from POC to production with a focus on performance, reliability, and cost efficiency.
We expect you to have:
- 5+ years of experience in ML/AI systems, with at least 2 years focused on LLMs and generative AI.
- Deep knowledge of the LLM ecosystem, including model architectures and fine-tuning approaches.
- Hands-on experience with:
+ Prompt engineering and LLM pipeline development, including evaluation.
+ Agentic frameworks such as Langchain, Langsmith, smolagents, or equivalent.
+ Vector databases and RAG implementation patterns.
+ Deploying LLM-powered applications using APIs from OpenAI, Anthropic, or open-source models.
- Strong Python programming skills.
- Excellent communication skills, with the ability to clearly explain technical concepts to diverse audiences.
It will be an added bonus if you have:
- Experience with inference frameworks and libraries (e.g., vLLM, SGLang, TensorRT-LLM, Transformers).
- Familiarity with inference optimization techniques such as quantization, batching, caching, and routing.
- Work with multimodal AI models (e.g., vision-language, speech).
- Proficiency with DevOps tools (Docker, Kubernetes).
- Contributions to open-source ML/AI projects.
Preferred tooling:
- Programming Languages– Python
- ML Frameworks and Libraries– vLLM, SGLang, TensorRT-LLM, Transformers, OpenAI/Anthropic SDKs
- Frameworks for Agentic Pipelines : Langchain / Langsmith / smolagents / equivalent
- API and Web Frameworks– FastAPI, Flask
- MLOps and DevOps tools– Kubernetes (K8s), Docker, Git
- Cloud Platforms– AWS (SageMaker, Bedrock), GCP (Vertex AI), Azure (Azure ML)
Key Employee Benefits:
- Health Insurance:100% company-paid medical, dental, and vision coverage for employees and families.
- 401(k) Plan:Up to 4% company match with immediate vesting.
- Parental Leave:20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
- Remote Work Reimbursement:Up to $85/month for mobile and internet.
- Disability & Life Insurance:Company-paid short-term, long-term, and life insurance coverage.
Compensation
We offer competitive salaries, ranging from $225k - $315k OTE (On-Target Earnings) and equity based on your experience, skills, and location.
Join Nebius Today!
What we offer
- Competitive salary and comprehensive benefits package.
- Opportunities for professional growth within Nebius.
- Flexible working arrangements.
- A dynamic and collaborative work environment that values initiative and innovation.
We’re growing and expanding our products every day. If you’re up to the challenge and are excited about AI and ML as much as we are, join us!
Создайте идеальное резюме с помощью ИИ-агента

Навыки
- Python
- LLM
- Generative AI
- RAG
- LangChain
- vLLM
- Docker
- Kubernetes
- FastAPI
- TensorRT-LLM
- SGLang
- Transformers
- Vector Databases
- Prompt Engineering
Возможные вопросы на собеседовании
Проверка практического опыта оптимизации стоимости и производительности в реальных проектах.
Как бы вы подошли к оптимизации задержки (latency) и стоимости для RAG-системы, работающей с миллионами документов?
Оценка понимания современных фреймворков для создания автономных систем.
В каких случаях вы бы предпочли использование LangGraph или smolagents вместо стандартных последовательных цепочек (chains)?
Проверка знаний в области инференса, что критично для Token Factory.
Расскажите о вашем опыте работы с техниками квантования (например, AWQ или FP8). Как они влияют на точность модели в специфических доменах?
Оценка навыков архитектурного проектирования и выбора инструментов.
Опишите процесс выбора векторной базы данных для проекта: на какие метрики и особенности архитектуры вы обращаете внимание в первую очередь?
Проверка soft skills и умения работать с клиентами.
Как вы объясните нетехническому заказчику риски галлюцинаций в LLM и какие стратегии минимизации предложите для production-решения?
Похожие вакансии
Middle, Middle+, Senior GenAI/LLM Разработчик
Middle / Senior GenAI Engineer (CV)
Senior / Lead LLM Engineer
Senior Computer Vision Engineer
AI Platform Engineer (RAG/Agents/Skills)
GenAI Engineer (LLMs · RAG · ML Systems) — Senior
1000+ офферов получено
Устали искать работу? Мы найдём её за вас
Quick Offer улучшит ваше резюме, подберёт лучшие вакансии и откликнется за вас. Результат — в 3 раза больше приглашений на собеседования и никакой рутины!
- Страна
- США
- Зарплата
- 225 000 $ – 315 000 $