yandex
graphcore
Страна
Великобритания
+500% приглашений

Откликайтесь
на вакансии с ИИ

Ускорим процесс поиска работы
SeniorГибридПолная занятость

Staff Machine Learning Engineer (Large Systems)

Оценка ИИ

Высокий балл обусловлен работой в лидирующей компании по производству AI-чипов, поддержкой SoftBank и участием в разработке фундаментальных технологий будущего. Отличный пакет льгот и работа с уникальным оборудованием.


Вакансия из Quick Offer Global, списка международных компаний
Пожаловаться

Сложность вакансии

ЛегкоСложно
Оценка ИИ

Роль требует исключительных знаний в области распределенного обучения на тысячах ускорителей и глубокого понимания архитектуры железа. Кандидат должен обладать навыками как в высокоуровневом Python/PyTorch, так и в низкоуровневой оптимизации (C++/CUDA).

Анализ зарплаты

Медиана110 000 £
Рынок95 000 £ – 140 000 £
Оценка ИИ

Предлагаемая позиция Staff/Senior уровня в Кембридже предполагает зарплату выше среднего по рынку Великобритании для ML-инженеров, учитывая дефицитность навыков работы с железом. Рыночные оценки для таких ролей в UK варьируются от £90,000 до £130,000+ в зависимости от опыта.

Сопроводительное письмо

I am writing to express my strong interest in the Staff Machine Learning Engineer position at Graphcore. With extensive experience in optimizing deep learning models and a proven track record of scaling distributed training across large-scale accelerator clusters, I am eager to contribute to the Applied AI team. My background in PyTorch and C++ development, combined with a deep understanding of performance bottlenecks in high-performance computing environments, aligns perfectly with Graphcore's mission to deliver cutting-edge AI compute solutions.

In my previous roles, I have successfully implemented and optimized large-scale systems, focusing on both performance and accuracy. I am particularly drawn to Graphcore's unique position within the SoftBank AI ecosystem and the opportunity to work on the complete AI compute stack. I am confident that my technical expertise in distributed inference and my passion for staying at the forefront of AI research will allow me to make a significant impact on your next generation of AI hardware and software libraries.

+250% к просмотрам

Составьте идеальное письмо к вакансии с ИИ-агентом

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в graphcore уже сейчас

Присоединяйтесь к команде Graphcore и создавайте будущее вычислений для ИИ на передовом оборудовании!

Описание вакансии

About Graphcore

At Graphcore, we’re building the future of AI compute.We’re a team of semiconductor, software and AI experts, with deep experience in creating the complete AI compute stack - from silicon and software to infrastructure at datacenter scale.As part of the SoftBank Group, backed by significant long-term investment, we are delivering key technology into the fast-growing SoftBank AI ecosystem.To meet the vast and exciting AI opportunity, Graphcore is expanding its teams around the world.We are bringing together the brightest minds to solve the toughest problems, in a place where everyone has the opportunity to make an impact on the company, our products and the future of artificial intelligence.

Job Summary

As a Senior Machine Learning Engineer in the Applied AI team at Graphcore, you will contribute to advancing AI technology by developing and optimising AI models tailored to our specialised hardware. You will work on large scale systems where performance is critical to the success of our projects. Working closely with the Software development and Research teams, you will play a critical role in identifying opportunities to innovate and differentiate Graphcore’s technology. We seek engineers with strong technical skills and an understanding of AI model implementation at scale, eager to make a tangible impact in this rapidly evolving field.

The Team

The Applied AI team’s role is to be proxies for our customers, we need to understand the latest AI models, applications, and software to ensure that Graphcore’s technology works seamlessly with the AI ecosystem and at scale. We build reference applications, contribute to key software libraries e.g. optimising kernels for efficiency on our hardware, and collaborate with the Research team to develop and publish novel ideas in domains such as efficient compute, model scaling and distributed training and inference of AI models for multiple modalities and applications.

If you're excited about advancing the next generation of AI models on cutting-edge hardware, we’d love to hear from you!

Responsibilities and Duties

  • Implement latest machine learning models and optimise them for performance and accuracy, scaling to 1000s of accelerators.
  • Test and evaluate new internal software releases, provide feedback to software engineering teams, make necessary code fixes, and conduct code reviews.
  • Benchmark models and key ML techniques to identify performance bottlenecks and improve model efficiency.
  • Design and conduct experiments on novel AI methods, implement them and evaluate results.
  • Collaborate with Research, Software, and Product teams to define, build, and test Graphcore’s next generation of AI hardware.
  • Engage with AI community and keep in touch with the latest developments in AI.

Candidate Profile

Essential:

  • Bachelor/Master's/PhD or equivalent experience in Machine Learning, Computer Science, Maths, Data Science, or related field.
  • Proficiency in deep learning frameworks like PyTorch/JAX.
  • Strong Python or C++ software development skills
  • Expertise in deep learning from model training to optimisation and evaluation.
  • Experience in distributed training or inference of ML models across 64+ accelerators.
  • Capable of designing, executing and reporting from ML experiments.
  • Developed deep understanding of performance bottlenecks and how to overcome them.
  • Ability to move quickly in a dynamic environment
  • Enjoy cross-functional work collaborating with other teams.
  • Strong communicator - able to explain complex technical concepts to different audiences.

Desirable:

  • Experience in one or more of:

+ MLOps for Kubernetes-based clusters

+ Building production systems with large language models

+ Efficient computing based on low-precision arithmetic.

  • Experience writing C++/Triton/CUDA kernels for performance optimisation of ML models.
  • Familiarity with HPC systems and networking including Infiniband, NVLink, RoCE technologies.
  • Have contributed to open-source projects or published research papers in relevant fields.
  • Knowledge of cloud computing platforms.
  • Keen to present, publish and deliver talks in the AI community.

Benefits

In addition to a competitive salary, Graphcore offers flexible working, a generous annual leave policy, private medical insurance and health cash plan, a dental plan, pension (matched up to 5%), life assurance and income protection. We have a generous parental leave policy and an employee assistance programme (which includes health, mental wellbeing, and bereavement support). We offer a range of healthy food and snacks at our central Bristol office and have our own barista bar! We welcome people of different backgrounds and experiences; we’re committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interview and encourage you to chat to us if you require any reasonable adjustments.

Applicants for this position must hold the right to work in the UK. Unfortunately at this time, we are unable to provide visa sponsorship or support for visa applications

+400% к собеседованиям

Создайте идеальное резюме с помощью ИИ-агента

Создайте идеальное резюме с помощью ИИ-агента

Навыки

  • Python
  • C++
  • PyTorch
  • JAX
  • Machine Learning
  • Deep Learning
  • Distributed Training
  • Kubernetes
  • MLOps
  • Triton
  • CUDA
  • Infiniband
  • HPC

Возможные вопросы на собеседовании

Проверка опыта работы с масштабируемыми системами, что критично для данной роли.

Опишите ваш опыт настройки распределенного обучения на кластерах из 64+ ускорителей: с какими основными проблемами синхронизации вы сталкивались?

Вакансия подразумевает работу на стыке софта и железа Graphcore.

Как бы вы подошли к профилированию модели, чтобы выявить узкие места, специфичные для кастомных AI-ускорителей, а не стандартных GPU?

В описании упоминается оптимизация ядер.

Есть ли у вас опыт написания кастомных ядер (Triton/CUDA/C++) для оптимизации производительности конкретных слоев нейросетей?

Работа в Applied AI требует внедрения последних достижений науки.

Расскажите о последней прочитанной вами статье по теме эффективных вычислений или LLM, которую вы смогли бы адаптировать для нашего стека.

Оценка навыков работы в междисциплинарной команде.

Приведите пример, когда вам приходилось объяснять сложные архитектурные ограничения ML-модели команде разработчиков аппаратного обеспечения.

Похожие вакансии

roku
Не указана

Senior Machine Learning Engineer

SeniorГибридВеликобритания
PyTorch · TensorFlow · Computer Vision · Natural Language Processing · AWS · GCP · Azure · GraphQL · REST · Airflow · Argo · Metaflow · Docker · Kubernetes · Terraform · CloudFormation · Python · CI/CD
+18 навыков
jetbrains
Не указана

Senior Software Developer (IntelliJ AI)

SeniorУдалённоСербия
Java · Kotlin · LLM · Artificial Intelligence · Multithreading · Software Architecture · Algorithms
+7 навыков
flowtraders
Не указана

Senior Research Engineer

SeniorВ офисеВеликобритания
Python · PyTorch · NumPy · Polars · Pandas · Ray · Machine Learning · MLOps · Statistics · Docker · Kubernetes · AWS · Azure · GCP · CI/CD · Agile
+16 навыков
parloa
Не указана

Senior AI Agent Architect

SeniorУдалённоВеликобритания
LLM · Natural Language Understanding · Prompt Engineering · API Integration · SSML · Regex · Data Structures · Project Management · Stakeholder Management · Testing
+10 навыков
ionq
Не указана

Senior Quantum Field Engineer - Germany

SeniorГибридГермания
Quantum Computing · Python · Qiskit · Cirq · Jupyter Notebook · Technical Sales · Solution Architecture · Machine Learning · Data Science
+9 навыков
manychat
Не указана

Senior Python Engineer (AI Product Team)

SeniorГибридНидерланды
Python · FastAPI · Django · Flask · asyncio · PostgreSQL · Docker · PyTest · CI/CD · AWS · Google Cloud · Azure · LLM · OpenAI · Grafana · Prometheus · Distributed Systems
+17 навыков
более 1000 офферов получено
4.9

1000+ офферов получено

Устали искать работу? Мы найдём её за вас

Quick Offer улучшит ваше резюме, подберёт лучшие вакансии и откликнется за вас. Результат — в 3 раза больше приглашений на собеседования и никакой рутины!

graphcore
Страна
Великобритания