- Страна
- США
- Зарплата
- 345 040 $ – 399 420 $
Откликайтесь
на вакансии с ИИ

Principal Software Engineer, Stateless Jobs Platform (Core Services)
Исключительная вакансия с очень высокой зарплатой, возможностью работать над уникальными технологическими вызовами мирового масштаба и сильным социальным пакетом. Единственный минус — требование присутствия в офисе 3 дня в неделю.
Сложность вакансии
Это позиция высочайшего уровня сложности, требующая более 10 лет опыта, глубочайших знаний внутреннего устройства Kubernetes и опыта создания кастомных систем оркестрации. Кандидат должен уметь проектировать системы для миллионов одновременных пользователей в гибридных облачных средах.
Анализ зарплаты
Предложенная зарплата ($345k - $399k) находится на верхнем пределе рынка для Principal-инженеров даже в Кремниевой долине. Это значительно выше медианы, что отражает критическую важность роли и высокие требования к экспертизе.
Сопроводительное письмо
I am writing to express my strong interest in the Principal Software Engineer position for the Stateless Jobs Platform at Roblox. With over a decade of experience in building high-throughput distributed systems and a deep expertise in Golang and Kubernetes internals, I am excited by the challenge of architecting a custom control plane that pushes the boundaries of standard orchestration. My background in developing Kubernetes Operators and managing hybrid-cloud mobility aligns perfectly with your mission to provide a 'deploy and forget' experience for critical workloads.
In my previous roles, I have successfully led greenfield projects that involved extending the Kubernetes control plane and implementing complex reconciliation loops for massive-scale fleets. I am particularly drawn to Roblox's vision of connecting a billion people and the technical complexity of managing global inference and microservices footprints. I am confident that my experience in distributed state management and multi-region failover strategies will allow me to make an immediate impact on the Core Services team and help shape the next generation of your global real-time platform.
Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в roblox уже сейчас
Присоединяйтесь к Roblox, чтобы создавать инфраструктуру будущего для миллиарда пользователей — откликайтесь сейчас!
Описание вакансии
Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers and creators.
At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device.We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there.
A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.
The Core Services team manages the core infrastructure and API stack and builds high throughput microservices that powers https://www.roblox.com. These services need to be fast, reliable and highly scalable as they have a huge impact on the day-to-day experience of every Roblox user. In addition, the team owns shared libraries, infrastructure microservices, and the web infrastructure used by all other Roblox full-stack feature teams. We ship with testable and configurable features that allow for rapid experimentation, data collection, and optimize for performance and user engagement. From serving basic user information to populating content into in-experience. Core Services is integral to the Roblox experience.
We are building a massive-scale, multi-region platform designed to power the next generation of global real-time experiences. At the intersection of Cloud Engineering and AI Infrastructure, you will build the foundation for a platform that supports millions of concurrent users, defining how stateless jobs are executed at a scale that pushes the boundaries of standard open-source tooling.
As the orchestrator for our global inference and microservices footprint, our platform provides a "deploy and forget" experience for critical workloads. You won't just be managing clusters; you will be building the custom control plane that automates scheduling, scaling, and recovery across a hybrid-cloud environment, ensuring our infrastructure remains resilient regardless of where it runs.
You will:
- Build the Orchestration Engine: Design and develop custom Kubernetes Operators and Controllers in Go to automate the entire lifecycle of high-throughput, mission-critical stateless workloads.
- Architect Hybrid-Cloud Mobility: Create systems that enable workloads to move seamlessly between on-premise and public cloud environments, ensuring high availability and automated failover during regional outages.
- Extend the Kubernetes Control Plane: Write performant reconciliation loops and Custom Resource Definitions (CRDs) to handle complex scheduling logic and resource optimization for massive CPU and GPU-intensive fleets.
- Empower Developer Velocity: Build high-level platform abstractions and automation that allow service owners to deploy global-scale code without needing to manage the underlying container orchestration.
You have:
- 10+ years of experience building web services using Golang or similar language.
- Experience building and operating K8’s clusters.
- Deep understanding of Kubernetes internals (control plane, reconciliation loops, scheduling, networking).
- Experience building large scale distributed systems with focus on scalability, reliability, and availability. Experience building or operating control-plane or orchestration systems (e.g., schedulers, workflow engines, or compute platforms).
- Strong knowledge of distributed systems fundamentals such as leader election, event-driven architectures, messaging/queuing, or distributed state management.
- Experience designing systems that handle multi-region orchestration, failover, disaster recovery, or large-scale reliability challenges.
- Experience with Oncall and in troubleshooting live site issues. Experience leading cross team greenfield projects.
- Bachelor’s degree in Computer Science or a related field, or equivalent experience.
- Experience writing Kubernetes Operators or custom controllers using Operator-SDK or control runtime.
For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits as described on this page.
Annual Salary Range
$345,040—$399,420 USD
Roles that are based in an office are onsite Tuesday, Wednesday, and Thursday, with optional presence on Monday and Friday (unless otherwise noted).
Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Roblox also provides reasonable accommodations to candidates with qualifying disabilities or religious beliefs during the recruiting process.
Создайте идеальное резюме с помощью ИИ-агента

Навыки
- Go
- Kubernetes
- Distributed Systems
- Microservices
- Cloud Engineering
- Kubernetes Operators
- CRD
- Hybrid Cloud
- System Architecture
- Scalability
Возможные вопросы на собеседовании
Проверка глубоких знаний K8s, необходимых для расширения его функционала.
Можете ли вы подробно описать процесс работы reconciliation loop в кастомном контроллере Kubernetes и как вы обеспечиваете его производительность при управлении тысячами ресурсов?
Важно для обеспечения надежности платформы Roblox в разных регионах.
Как бы вы спроектировали систему автоматического переключения (failover) для stateless-задач между on-premise дата-центром и публичным облаком, минимизируя задержки?
Роль требует опыта в создании сложных распределенных систем.
С какими проблемами согласованности данных вы сталкивались при реализации распределенного планировщика (scheduler) и как их решали?
Позиция Principal подразумевает лидерство в новых проектах.
Расскажите о вашем опыте руководства greenfield-проектом: как вы принимали архитектурные решения и координировали работу нескольких команд?
Платформа должна поддерживать GPU-интенсивные задачи для ИИ.
Какие специфические вызовы возникают при оркестрации рабочих нагрузок, требующих GPU, по сравнению с обычными CPU-задачами в масштабах Roblox?
Похожие вакансии
.NET разработчик Middle+ , Senior
Senior C++ Developer (ATM / Payment Systems)
Senior Java developer
Старший разработчик C++
Senior Java Developer
Middle+ Java-разработчик
1000+ офферов получено
Устали искать работу? Мы найдём её за вас
Quick Offer улучшит ваше резюме, подберёт лучшие вакансии и откликнется за вас. Результат — в 3 раза больше приглашений на собеседования и никакой рутины!
- Страна
- США
- Зарплата
- 345 040 $ – 399 420 $