- Country
- USA

Staff Systems Engineer
This is a prestigious position at Graphcore, an innovative company owned by SoftBank. Working on cutting-edge AI systems and helping build infrastructure for Artificial Super Intelligence offers excellent career prospects and professional growth.
Job difficulty
The role requires deep knowledge of server architecture, board-level debugging, and HPC systems. The high level of responsibility for bringing up and validating hardware prototypes within the SoftBank Group calls for substantial engineering experience.
Salary analysis
The salary for a Staff Engineer position in Austin typically ranges from $160,000 to $210,000 per year, excluding bonuses and equity. Given the company's standing and the complexity of the work, the offer should be at the upper end of the market.
Cover letter
I am writing to express my strong interest in the Staff Systems Engineer position at Graphcore. With extensive experience in server hardware architectures and board-level debugging, I have a proven track record of supporting complex hardware bring-up and validation for high-performance computing environments. My background in diagnosing system-level failures involving thermal behavior, power anomalies, and BIOS/BMC issues aligns perfectly with the requirements of your next-generation AI compute platforms.
Throughout my career, I have excelled in fast-paced engineering environments, collaborating closely with firmware and platform teams to perform root cause analysis and implement corrective actions. I am particularly drawn to Graphcore's mission within the SoftBank Group to enable Artificial Super Intelligence. I am confident that my technical expertise in rack-scale infrastructure and my commitment to operational reliability will make me a valuable asset to your Systems Engineering team in Austin.
Job description
About us
Graphcore is one of the world’s leading innovators in Artificial Intelligence compute. It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.
As part of the SoftBank Group, Graphcore is a member of an elite family of companies responsible for some of the world’s most transformative technologies. Together, they share a bold vision: to enable Artificial Super Intelligence and ensure its benefits are accessible to everyone.
Graphcore’s teams are drawn from diverse backgrounds and bring a broad range of skills and perspectives. A melting pot of AI research specialists, silicon designers, software engineers and systems architects, Graphcore enjoys a culture of continuous learning and constant innovation.
Job Summary
We are seeking a Staff Hardware Engineer to provide advanced operational, diagnostic, and engineering support for Graphcore’s Arm-based hardware platforms across lab and data center environments.
This role focuses on supporting hardware bring-up, validation, and troubleshooting of complex AI compute platforms, including server blades, racks, and rack-scale infrastructure. The successful candidate will collaborate closely with engineering, platform, and data center teams to ensure the reliability and performance of next-generation AI systems.
The Team
The Systems Engineering and Hardware Engineering teams are responsible for enabling the bring-up, validation, and operational reliability of Graphcore’s AI infrastructure platforms.
The team works closely with server engineering, firmware teams, platform architects, and data center operations to support the development, testing, and deployment of next-generation AI compute systems.
This collaborative environment enables rapid problem-solving and continuous improvement of Graphcore’s hardware platforms from early development through production deployment.
Responsibilities and Duties
- Lead advanced break-fix troubleshooting for server blades, motherboards, power systems, and rack-scale infrastructure.
- Support engineering bring-up activities, including component validation and firmware interaction testing.
- Diagnose system-level failures involving thermal behavior, power anomalies, network configuration, and BIOS/BMC issues.
- Collaborate with server engineering teams to perform root cause analysis and propose corrective actions or design improvements.
- Support deployment and rollout of next-generation hardware platforms through structured validation and qualification cycles.
- Interface with facilities and infrastructure teams to understand environmental factors impacting system reliability.
- Develop and maintain standard operating procedures (SOPs), troubleshooting guides, and validation documentation.
- Provide guidance and mentorship to junior technicians and engineers on troubleshooting methodologies and hardware diagnostics.
- Participate in on-call rotations or off-hours support during critical engineering milestones or hardware bring-up phases.
Candidate Profile
Essential
- Bachelor’s degree in Electrical Engineering, Computer Engineering, Computer Science, or related discipline.
- Strong experience with server hardware architectures and board-level debugging.
- Experience analyzing system logs, hardware telemetry, and power/thermal metrics to isolate hardware failures.
- Hands-on experience with HPC systems, AI compute platforms, or rack-scale infrastructure.
- Strong collaboration skills and ability to work effectively in fast-paced engineering environments.
- Excellent written and verbal communication skills.
Desirable
- Experience supporting prototype or pre-production hardware bring-up.
- Familiarity with data center facilities, including liquid cooling and power distribution systems.
- Experience using Python, Bash, or automation tools for hardware validation or troubleshooting.
- Exposure to structured failure analysis and reliability engineering methodologies.
Skills
- Electrical Engineering
- Computer Engineering
- Hardware Architecture
- Debugging
- HPC
- Python
- Bash
- Root Cause Analysis
- BIOS
- BMC
- Server Hardware
- Hardware Diagnostics
Possible interview questions
Assessing skills in diagnosing complex system failures.
Describe your root cause analysis process when power or thermal anomalies occur at the server rack level.
Assessing experience with pre-production hardware.
What were the main difficulties you encountered during bring-up of new hardware, and how did you resolve them?
Assessing process automation skills.
How have you used Python or Bash to automate hardware validation or telemetry collection?
Assessing understanding of AI infrastructure specifics.
What is your experience with liquid cooling and power distribution systems in the context of high-density AI compute?
Assessing leadership and mentoring.
How do you approach training junior engineers and technicians in troubleshooting methodologies for complex systems?