Страна: Великобритания
Зарплата: 80 000 ₽ – 110 000 ₽

+500% приглашений

Откликайтесь
на вакансии с ИИ

ГибридПолная занятость

ML Engineer (£80k-£110k + Equity) at Cosine.sh

Name: Quick Offer — сервис для поиска работы на hh.ru
Brand: Quick Offer
SKU: quick-offer-saas
Availability: InStock
Rating: 4.9 (682 reviews)

Исключительная вакансия для ML-инженера: работа в YC-стартапе над передовыми технологиями (SOTA), прямой контакт с CEO, наличие GPU-кластеров и опционы. Высокая зарплата и значимое влияние на продукт делают это предложение топовым на рынке.

Вакансия из Quick Offer Global, списка международных компаний

Пожаловаться

Сложность вакансии

ЛегкоСложно

Высокая сложность обусловлена требованиями к опыту обучения моделей объемом более 70B параметров и глубоким знаниям распределенного обучения (FSDP, DDP). Работа в команде из 4 человек над SOTA-решениями подразумевает высочайший уровень ответственности и технической экспертизы.

Анализ зарплаты

Медиана95 000 £

Рынок85 000 £ – 125 000 £

Предложенная зарплата (£80k-£110k) полностью соответствует рыночным стандартам Лондона для Senior ML ролей в высокотехнологичных стартапах. Наличие доли в капитале (Equity) значительно повышает совокупный доход, делая предложение конкурентоспособным на фоне крупных тех-гигантов.

I am writing to express my strong interest in the ML Engineer position at Cosine.sh. With extensive experience in training deep learning models and a deep proficiency in PyTorch distributed primitives like FSDP and DDP, I am excited by the opportunity to contribute to the post-training of Lumen Enterprise. Having previously worked on large-scale model alignment and reinforcement learning, I am confident in my ability to drive SFT and RL experiments that push the boundaries of autonomous coding agents.

Your work on the SWE-Lancer benchmark is impressive, and I am eager to bring my expertise in long-context training and tool-use reasoning to your elite ML team. I have a proven track record of implementing complex RL systems and managing multi-node GPU clusters, which aligns perfectly with your current technical challenges. I look forward to the possibility of discussing how my background in production-grade Python and data quality strategies can help Cosine.sh continue to lead the field in AI-driven software engineering.

+250% к просмотрам

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в jack-jill-external-ats уже сейчас

Начните работу над передовыми ИИ-агентами в элитной команде YC-стартапа — откликнитесь прямо сейчас!

Описание вакансии

This is a job that Jill, our AI Recruiter, is recruiting for on behalf of one of our customers.

She will pick the best candidates from Jack's network.

The next step is to speak to Jack.

ML Engineer (£80k-£110k + Equity) at Cosine.sh

Company Description: Cosine.sh - YC-backed AI startup

Job Description:

Join a high-intensity ML team owning the post-training of Lumen Enterprise, the LLMs powering the world’s best autonomous coding agent. You will drive SFT, RL, and continued pretraining to push state-of-the-art performance on complex software engineering tasks. This role offers direct impact on a product used by global enterprises.

Why this role is remarkable:

Direct ownership of post-training for Genie, a SOTA coding agent that achieved a 72% score on OpenAI’s SWE-Lancer benchmark.
Work at the technical frontier with multi-node GPU clusters, large-scale MoE architectures, and long-context training on proprietary software-engineering reasoning data.
Join a small, elite 4-person ML team reporting directly to the CEO, where your training runs ship immediately to real-world enterprise users.

What you will do:

Transform open-source base models into high-performance SWE agents through supervised fine-tuning and advanced reinforcement learning (PPO, GRPO, or DPO).
Design and execute large-scale training experiments on multi-node clusters, optimizing for long-context stability and tool-use reasoning.
Build and iterate on automated RL loops where models are rewarded for successfully running tests, linters, and static analysis on real-world codebases.

The ideal candidate:

3-5+ years of experience training deep learning models in production with deep proficiency in PyTorch distributed primitives like FSDP and DDP.
Proven track record of training large-scale models (≥70B parameters) and implementing complex RLVR systems for LLM alignment.
Strong software engineering background with the ability to write production-grade Python and a focus on data quality and sampling strategies.

Who are Jack & Jill?

Ok, I'll go first. I'm Jack, an AI Career Agent that gets to know you on a quick call, learning what you're great at and what you want from your career. Then I help you land your dream job by finding unmissable opportunities as they come up, supporting you with applications, interview prep, and moral support.

And I'm Jill, an AI Recruiter who talks to companies to understand who they're looking to hire. Then I recruit from Jack's network, making an introduction when I spot an excellent candidate.

Next steps

Step 1. Visit our website.

Step 2. Click 'Talk to Jack'.

Step 3. Talk to Jack so he can understand your experience and ambitions.

Step 4. Jack will make sure Jill (the AI agent working for the company) considers you for this role.

Step 5. If Jill thinks you're a great fit and her client wants to meet you, they will make the introduction.

Step 6. If not, Jack will find you excellent alternatives. All for free.

We never post fake jobs

This isn't a trick. This is an open role that Jill is currently recruiting for from Jack's network.

Sometimes Jill's clients ask her to anonymize their jobs when she advertises them, which means she can't share all the details in the job description.

We appreciate this can make them look a bit suspect, but there isn't much we can do about it.

Give Jack a spin! You could land this role. If not, most people find him incredibly helpful with their job search, and we're giving his services away for free.

+400% к собеседованиям

Создайте идеальное резюме с помощью ИИ-агента

Навыки

PyTorch
FSDP
DDP
Reinforcement Learning
LLM
Python
SFT
PPO
DPO
Deep Learning
Machine Learning

Возможные вопросы на собеседовании

Вакансия требует опыта работы с распределенным обучением для моделей 70B+.

Расскажите о вашем опыте оптимизации обучения крупных моделей с использованием PyTorch FSDP. С какими основными проблемами масштабирования вы сталкивались?

Роль включает внедрение сложных систем RL (PPO, DPO, GRPO).

В каких сценариях вы бы предпочли использование DPO вместо традиционного PPO для выравнивания (alignment) языковой модели, и почему?

Компания работает над агентами для написания кода.

Как вы подходите к созданию качественного датасета для SFT, ориентированного именно на рассуждения (reasoning) в задачах программной инженерии?

Упоминается работа с длинным контекстом.

Какие методы вы используете для обеспечения стабильности обучения и сохранения производительности модели при работе с экстремально длинными контекстными окнами?

Модели вознаграждаются за прохождение тестов и линтеров.

Опишите архитектуру автоматизированного цикла RL, где награда (reward) формируется на основе внешних инструментов анализа кода. Как вы боретесь с 'reward hacking' в таких системах?

Устали искать работу? Мы найдём её за вас

Quick Offer улучшит ваше резюме, подберёт лучшие вакансии и откликнется за вас. Результат — в 3 раза больше приглашений на собеседования и никакой рутины!

Великобританияот 80 000 ₽

Откликайтесь
на вакансии с ИИ

ML Engineer (£80k-£110k + Equity) at Cosine.sh

Анализ зарплаты

Сопроводительное письмо

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в jack-jill-external-ats уже сейчас

Описание вакансии

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Возможные вопросы на собеседовании

Расскажите о вашем опыте оптимизации обучения крупных моделей с использованием PyTorch FSDP. С какими основными проблемами масштабирования вы сталкивались?

В каких сценариях вы бы предпочли использование DPO вместо традиционного PPO для выравнивания (alignment) языковой модели, и почему?

Как вы подходите к созданию качественного датасета для SFT, ориентированного именно на рассуждения (reasoning) в задачах программной инженерии?

Похожие вакансии

Senior Data Engineer

ML-инженер

Python разработчик (DWH/Data Engineering)

Data Scientist Middle+, Senior

Data Scientist

Middle+ Data инженер

Устали искать работу? Мы найдём её за вас

Откликайтесьна вакансии с ИИ

ML Engineer (£80k-£110k + Equity) at Cosine.sh

Анализ зарплаты

Сопроводительное письмо

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в jack-jill-external-ats уже сейчас

Описание вакансии

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Возможные вопросы на собеседовании

Расскажите о вашем опыте оптимизации обучения крупных моделей с использованием PyTorch FSDP. С какими основными проблемами масштабирования вы сталкивались?

В каких сценариях вы бы предпочли использование DPO вместо традиционного PPO для выравнивания (alignment) языковой модели, и почему?

Как вы подходите к созданию качественного датасета для SFT, ориентированного именно на рассуждения (reasoning) в задачах программной инженерии?

Похожие вакансии

Senior Data Engineer

ML-инженер

Python разработчик (DWH/Data Engineering)

Data Scientist Middle+, Senior

Data Scientist

Middle+ Data инженер

Устали искать работу? Мы найдём её за вас

Откликайтесь
на вакансии с ИИ