- Страна
- США
Откликайтесь
на вакансии с ИИ

Reliability Engineer
Исключительно привлекательная вакансия для инженеров: работа над передовыми ИИ-чипами, отличный соцпакет (100% страховка, субсидия на жилье $2000) и возможность быть в центре инноваций в Купертино. Единственный минус для некоторых — строго офисный формат.
Сложность вакансии
Высокая сложность обусловлена необходимостью глубоких знаний в области надежности полупроводников и опыта работы с контрактными производителями (ODM/JDM). Работа в стартапе над уникальной архитектурой ASIC требует не только технической экспертизы, но и способности быстро адаптироваться к изменениям.
Анализ зарплаты
Зарплата в вакансии не указана, но для позиции Reliability Engineer с опытом 5+ лет в районе Купертино (Кремниевая долина) рыночные ставки значительно выше средних по США. Учитывая щедрые бонусы (субсидия на жилье, питание), совокупный доход может быть очень конкурентным.
Сопроводительное письмо
I am writing to express my strong interest in the Reliability Engineer position at Etched. With over five years of experience in reliability engineering and a deep understanding of datacenter applications, I am excited by Etched's mission to build model-specific ASICs that outperform traditional GPUs. My background in managing supplier reliability standards and working closely with ODMs/JDMs aligns perfectly with your requirements for ensuring the long-term stability of the Sohu chip and future products.
In my previous roles, I have successfully led DFMEA/PFMEA processes and established rigorous testing methodologies for high-performance hardware. I am particularly drawn to Etched's "Bitter Lesson" philosophy and the collaborative, in-person culture in Cupertino. I am confident that my technical expertise in silicon reliability and my proactive approach to quality assurance will contribute significantly to maintaining the maximum operational uptime required for your groundbreaking AI infrastructure.
Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в etchedai уже сейчас
Присоединяйтесь к команде Etched в Купертино и создавайте самое надежное ИИ-железо в мире!
Описание вакансии
About Etched
Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning.
Reliability Engineer
We are seeking a skilled and detail-oriented Reliability Engineer to join our team. As a Reliability Engineer at Etched, you will play a critical role in ensuring that all components and systems meet our rigorous reliability standards, essential for our datacenter applications. This position requires a deep understanding of reliability engineering principles, as well as experience working with suppliers, ODMs, and JDMs.
Representative Projects:
- Lead the development, implementation, and management of reliability standards for all suppliers working with Etched. Ensure that all components and systems meet or exceed the required reliability benchmarks.
- Review and verify reliability reports from suppliers, ensuring accuracy and adherence to Etched’s standards. Provide guidance and feedback to suppliers to ensure continuous improvement in reliability performance.
- Collaborate with cross-functional teams to review and recommend component selection criteria based on reliability performance. Ensure that all selected components are capable of meeting the long-term reliability requirements of our datacenter applications.
- Evaluate and approve reliability test plans proposed by external vendors. Ensure that the test methodologies and conditions are sufficient to validate long-term reliability under expected operating conditions.
- Conduct in-depth analysis of reliability data provided by suppliers and vendors. Identify trends, potential issues, and areas for improvement to enhance overall reliability.
- Work closely with ODMs (Original Design Manufacturers) and JDMs (Joint Design Manufacturers) to ensure that all products meet Etched quality and reliability standards. Provide technical guidance and support to maintain maximum operational uptime and long-term reliability.
- Review and establish reliability metrics and standards for silicon components, ensuring they meet the stringent requirements for long-term reliability in data center environments.
You maybe a good fit if you have
- Bachelor’s or Master’s degree in Reliability Engineering, Electrical Engineering, or a related field.
- 5+ years of experience in reliability engineering, with a focus on datacenter applications preferred.
- Strong understanding of reliability standards, testing methodologies, and data analysis techniques. DFMEA / PFMEA / SPC Engineering analysis experience desired.
- Experience working with suppliers, ODMs, and JDMs in a high-tech environment.
- Excellent communication skills, with the ability to convey complex technical concepts to diverse stakeholders.
- Proven ability to manage multiple projects and deliver results in a fast-paced environment.
We encourage you to apply even if you do not believe you meet every single qualification.
How we’re different:
Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs.
We are a fully in-person team in Cupertino, and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.
Benefits:
- Full medical, dental, and vision packages, with 100% of premium covered, 90% for dependents
- Housing subsidy of $2,000/month for those living within walking distance of the office
- Daily lunch and dinner in our office
- Relocation support for those moving to Cupertino
Создайте идеальное резюме с помощью ИИ-агента

Навыки
- Reliability Engineering
- Electrical Engineering
- DFMEA
- PFMEA
- SPC
- Data Analysis
- ASIC
- Silicon Reliability
- Supplier Management
- Testing Methodologies
Возможные вопросы на собеседовании
Проверка опыта работы с ключевыми методологиями анализа рисков, указанными в вакансии.
Расскажите о вашем опыте проведения DFMEA для сложных систем. Какие критические риски вы выявили и как их минимизировали?
Важно понять, как кандидат взаимодействует с внешними партнерами для обеспечения качества.
Как вы выстраиваете процесс верификации отчетов о надежности от поставщиков и ODM-партнеров?
Позиция сфокусирована на дата-центрах, где условия эксплуатации специфичны.
Какие специфические факторы надежности наиболее критичны для ASIC, работающих в режиме 24/7 в условиях современного дата-центра?
Проверка навыков работы с данными и статистическими методами.
Какие статистические модели вы используете для прогнозирования интенсивности отказов (failure rate) на основе ускоренных испытаний?
Оценка способности работать в междисциплинарной среде стартапа.
Опишите случай, когда вам пришлось убеждать команду разработчиков изменить выбор компонента или дизайн на основе ваших данных о надежности.
Похожие вакансии
C++ Developer (System Programming / COM & RPC)
C++ разработчик (ethernet-коммутатор)
C++ Developer (Desktop VPN Client)
Разработчик C/C++ macOS
Руководитель эксплуатации серверного оборудования
Стажёр RPA Developer
1000+ офферов получено
Устали искать работу? Мы найдём её за вас
Quick Offer улучшит ваше резюме, подберёт лучшие вакансии и откликнется за вас. Результат — в 3 раза больше приглашений на собеседования и никакой рутины!
- Страна
- США