Country: USA
Salary: $230,000 – $322,000

Remote, full-time

Staff Research Engineer, Pre-training Science

This is a unique opportunity to work on building in-house LLMs at one of the world's largest social platforms. The high salary, remote work within the US, and cutting-edge technology (AWS Trainium, multimodality) make this role highly attractive.

Listing from Quick Offer Global, a list of international companies

Job difficulty

The role requires exceptional expertise in LLM training, an understanding of Transformer architecture, and experience with distributed systems at very large scale. The candidate needs skills in both applied research and deep engineering optimization.

Salary analysis

Median: $280,000

Market range: $220,000 – $350,000

The offered range of $230k–$322k is fully in line with market standards for Staff-level positions at leading (Tier-1) US technology companies. With bonuses and equity (RSUs), total compensation may significantly exceed the market median.

Cover letter

I am writing to express my strong interest in the Staff Research Engineer, Pre-training Science position at Reddit. With over 7 years of experience in machine learning and a deep focus on large-scale model pre-training, I am excited by the opportunity to lead the technical strategy for Reddit-native foundational models. My background in domain adaptation and multimodal learning aligns perfectly with your mission to bridge the gap between general intelligence and community-specific context.

In my previous roles, I have successfully managed complex training runs and addressed challenges like catastrophic forgetting and training instabilities. I am particularly drawn to Reddit's unique data structure, including conversational trees and multimodal content, and I am eager to apply my expertise in scaling laws and distributed training on AWS Trainium clusters to help build the 'engine room' of Reddit's AI future. I look forward to the possibility of contributing to your distinguished engineering team.

Join Reddit to build next-generation LLMs that understand the language of the internet!

Job description

Reddit is a community of communities. It’s built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 121 million daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit www.redditinc.com.

Reddit is continuing to grow our teams with the best talent. This role is completely remote friendly within the United States. If you happen to live close to one of our physical office locations (San Francisco, Los Angeles, New York City & Chicago), our doors are open for you to come into the office as often as you'd like.

The AI Engineering team at Reddit is embarking on a strategic initiative to build our own Reddit-native foundational Large Language Models (LLMs). This team sits at the intersection of applied research and massive-scale infrastructure, tasked with training models that truly understand the unique culture, language, and structure of Reddit communities. You will be joining a team of distinguished engineers and safety experts to build the "engine room" of Reddit's AI future—creating the foundational models that will power Safety & Moderation, Search, Ads, and the next generation of user products.

As a Staff Research Engineer for Pre-training Science, you will serve as the technical lead for defining the Continual Pre-Training (CPT) strategies that transform generic foundation models into Reddit-native experts. You will bridge the gap between "General Intelligence" and "Community Context," designing scientific frameworks that inject Reddit’s unique knowledge (conversational trees, slang, multimodal memes) into base models without causing catastrophic forgetting. You will define the "learning recipe"—the precise mix of data, hyperparameters, and architectural adaptations needed to build a model that speaks the language of the internet.

Responsibilities:

Architect and validate rigorous Continual Pre-Training (CPT) frameworks, focusing on domain adaptation techniques that effectively transfer Reddit’s knowledge into licensed frontier models.
Design the "Science of Multimodality": Lead research into fusing vision and language encoders to process Reddit’s rich media (images, video) alongside conversational text threads.
Formulate data curriculum strategies: scientifically determining the optimal ratio of "Reddit data" vs. "General data" to maximize community understanding while maintaining safety and reasoning capabilities.
Conduct deep-dive research into Scaling Laws for Graph-based data: investigating how Reddit’s tree-structured conversations impact model convergence compared to flat text.
Design and scale continuous evaluation pipelines (the "Reddit Gym") that monitor model reasoning and safety capabilities in real-time, enabling dynamic adjustments to training recipes.
Drive high-stakes architectural decisions regarding compute allocation, distributed training strategies (3D parallelism), and checkpointing mechanisms on AWS Trainium/Nova clusters.
Serve as a force multiplier for the engineering team by setting coding standards, conducting high-level design reviews, and mentoring senior engineers on distributed systems and ML fundamentals.

Required Qualifications:

7+ years of experience in Machine Learning engineering or research, with a specific focus on LLM Pre-training, Domain Adaptation, or Transfer Learning.
Expert-level proficiency in Python and deep learning frameworks (PyTorch or JAX), with a track record of debugging complex training instabilities at scale.
Deep theoretical understanding of Transformer architectures and Pre-training dynamics—specifically regarding Catastrophic Forgetting and Knowledge Injection.
Experience with Multimodal models (VLM): understanding how to align image/video encoders (e.g., CLIP, SigLIP) with language decoders.
Experience implementing continuous integration/evaluation systems for ML models, measuring generalization and reasoning performance.
Demonstrated ability to communicate complex technical concepts (like loss spikes or convergence issues) to leadership and coordinate efforts across Infrastructure and Data teams.

Nice to Have:

Published research or open-source contributions in Continual Learning, Curriculum Learning, or Efficient Fine-Tuning (LoRA/PEFT).
Experience with Graph Neural Networks (GNNs) or processing tree-structured data.
Proficiency in low-level optimization (CUDA, Triton) or distributed training frameworks (Megatron-LM, DeepSpeed, FSDP).
Familiarity with Safety alignment techniques (RLHF/DPO) to understand how pre-training objectives impact downstream safety.

Benefits:

Comprehensive Healthcare Benefits and Income Replacement Programs
401k with Employer Match
Global Benefit programs that fit your lifestyle, from workspace to professional development to caregiving support
Family Planning Support
Gender-Affirming Care
Mental Health & Coaching Benefits
Flexible Vacation & Paid Volunteer Time Off
Generous Paid Parental Leave

Pay Transparency:

This job posting may span more than one career level.

In addition to base salary, this job is eligible to receive equity in the form of restricted stock units, and depending on the position offered, it may also be eligible to receive a commission. Additionally, Reddit offers a wide range of benefits to U.S.-based employees, including medical, dental, and vision insurance, 401(k) program with employer match, generous time off for vacation, and parental leave. To learn more, please visit https://www.redditinc.com/careers/.

To provide greater transparency to candidates, we share base salary ranges for all US-based job postings regardless of state. We set standard base pay ranges for all roles based on function, level, and country location, benchmarked against similar stage growth companies. Final offer amounts are determined by multiple factors, including skills, depth of work experience, and relevant licenses/credentials, and may vary from the amounts listed below.

The base salary range for this position is:

$230,000–$322,000 USD

In select roles and locations, the interviews will be recorded, transcribed and summarized by artificial intelligence (AI). You will have the opportunity to opt out of recording, transcription and summarization prior to any scheduled interviews.

During the interview, we will collect the following categories of personal information: Identifiers, Professional and Employment-Related Information, Sensory Information (audio/video recording), and any other categories of personal information you choose to share with us. We will use this information to evaluate your application for employment or an independent contractor role, as applicable. We will not sell your personal information or disclose it to any third party for their marketing purposes. We will delete any recording of your interview promptly after making a hiring decision. For more information about how we will handle your personal information, including our retention of it, please refer to our Candidate Privacy Policy for Potential Employees and Contractors.

Reddit is proud to be an equal opportunity employer, and is committed to building a workforce representative of the diverse communities we serve. Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If, due to a disability, you need an accommodation during the interview process, please let your recruiter know.

Skills

Python
PyTorch
JAX
LLM
Transfer Learning
Multimodal Learning
AWS Trainium
Distributed Training
DeepSpeed
FSDP
CUDA
Triton
NLP

Possible interview questions

Tests understanding of the key problem in further training models on domain-specific data.

How do you plan to combat catastrophic forgetting during Continual Pre-Training on Reddit data?

Assesses experience with multimodality, an important part of the role.

Describe your approach to aligning vision encoders with language decoders for processing memes and video content.

Tests infrastructure and scaling skills.

What were the hardest training-instability problems (loss spikes) you have encountered when using 3D parallelism, and how did you resolve them?

Assesses the ability to work with Reddit's unique data structure.

In your view, how should the tree structure of Reddit comments affect the choice of loss function or model architecture, compared with plain text?

Tests leadership and the ability to make architectural decisions.

How do you determine the optimal ratio (data curriculum) of general data to Reddit-specific data to achieve the best model quality?
