- Страна
- США
- Зарплата
- 210 000 $ – 280 000 $
Откликайтесь
на вакансии с ИИ

Principal Software Engineer, Video Engineering
Исключительная вакансия в топовом стартапе с огромным финансированием ($107M) и поддержкой NVIDIA. Работа над передовыми технологиями на стыке Video Engineering и AI, удаленный формат в США и возможность влиять на фундаментальные модели делают это предложение крайне привлекательным для экспертов.
Сложность вакансии
Роль требует исключительной экспертизы: более 12 лет опыта, глубочайшие знания внутренностей FFmpeg и кодеков, а также умение работать на стыке системного программирования (C++/Rust/Go) и машинного обучения. Высокая планка ответственности за архитектуру петабайтных хранилищ и пайплайнов делает эту позицию одной из самых сложных на рынке.
Анализ зарплаты
Для позиции уровня Principal в высокотехнологичном стартапе в США (особенно с поддержкой Tier-1 фондов) рыночная зарплата значительно выше средней. Указанный диапазон отражает текущие реалии рынка Сан-Франциско и удаленной работы в США для инженеров редкой специализации.
Сопроводительное письмо
I am writing to express my strong interest in the Principal Software Engineer, Video Engineering position at Twelve Labs. With over 12 years of experience in software engineering and a deep specialization in video processing pipelines, I have spent a significant portion of my career mastering the intricacies of FFmpeg, libav, and various codecs like H.265 and AV1. My background in building scalable media systems aligns perfectly with your mission to redefine video understanding for AI, moving beyond traditional playback optimization toward machine-centric processing.
In my previous roles, I have successfully architected high-throughput ingestion and decoding pipelines, often leveraging GPU acceleration (NVDEC) to optimize for both performance and cost. I am particularly excited by Twelve Labs' focus on semantic and heuristic chunking; I have extensive experience in scene boundary detection and content-aware segmentation, which I believe are critical for maximizing the performance of downstream multimodal foundation models. I am eager to bring my expertise in systems engineering and media toolchains to help Twelve Labs push the boundaries of what is possible in video-language modeling.
Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в twelve-labs уже сейчас
Присоединяйтесь к лидерам в области мультимодального ИИ и определите будущее видеотехнологий — подайте заявку в Twelve Labs сегодня!
Описание вакансии
Who we are
At Twelve Labs, we are pioneering the development of cutting-edge multimodal foundation models that have the ability to comprehend videos just like humans do. Our models have redefined the standards in video-language modeling, empowering us with more intuitive and far-reaching capabilities, and fundamentally transforming the way we interact with and analyze various forms of media.
With a remarkable $107 million in Seed and Series A funding, our company is backed by top-tier venture capital firms such as NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, and prominent AI visionaries and founders such as Fei-Fei Li, Silvio Savarese, Alexandr Wang and more. Headquartered in San Francisco, with an influential APAC presence in Seoul, our global footprint underscores our commitment to driving worldwide innovation.
We are a global company that values the uniqueness of each person’s journey. It is the differences in our cultural, educational, and life experiences that allow us to constantly challenge the status quo. We are looking for individuals who are motivated by our mission and eager to make an impact as we push the bounds of technology to transform the world. Join us as we revolutionize video understanding and multimodal AI.
About The Role
Most video engineering roles at Netflix, YouTube, or Twitch optimize for human playback — better compression, lower bitrate, smoother streaming. At Twelve Labs, video is processed for machine understanding. The tradeoffs are fundamentally different: we optimize for AI model performance, not just perceptual quality. This is a rare opportunity to define how video is engineered for intelligence — not just delivery.
As the Principal Software Engineer, Video Engineering, you will own the architecture and implementation of Twelve Labs' video processing pipelines — from byte ingestion through decode, chunking, storage, and playback — ensuring it is fast, cost-efficient, and purpose-built for AI-native video intelligence at scale. You will be the internal subject matter expert on all things related to video engineering.
In This Role You Will:
- Own the video pipeline end-to-end: Architect and implement ingestion → decode → chunking → storage → retrieval → playback, across batch and streaming modes based on AI/ML workflows or media application workflows.
- Deep codec & decode mastery: Drive decisions on decode strategies (hardware vs. software, GPU-accelerated pipelines), container format handling (fMP4, CMAF, MKV, TS), and codec support (H.264, H.265, VP9, AV1) with pragmatic cost/quality tradeoffs.
- Semantic & heuristic chunking: Work with our ML Research Scientists to design and implement intelligent video segmentation that goes beyond fixed-interval splitting — scene boundary detection, shot change analysis, content-aware chunking that optimizes downstream AI model performance.
- Streaming ingestion: Architect low-latency streaming pipelines (HLS, DASH, LL-HLS, WebRTC ingest) that process video in near-real-time, including streaming decode and incremental chunking.
- Video storage architecture: Design storage tiers and retrieval patterns optimized for AI workloads — balancing hot/warm/cold access, frame-level random access, and cost at petabyte scale.
- Playback & delivery: Ensure video can be served back to users with accurate temporal navigation, supporting time-coded references from AI analysis results.
- FFmpeg & media toolchain expertise: Be the internal authority on FFmpeg, libav, and related tooling. Build and maintain custom processing pipelines, filters, and integrations.
- Cost engineering: Quantify and optimize cost-per-hour-of-video-processed. Drive decode efficiency through hardware acceleration (NVDEC, VA-API), pipeline parallelism, and intelligent resource allocation.
- Cross-team technical leadership: Partner with ML teams on how video is preprocessed for model consumption, with platform teams on infrastructure, and with product on customer-facing media capabilities.
- Standards & best practices: Establish video engineering standards, author reference implementations, and mentor engineers across teams on media fundamentals.
You May Be A Good Fit If You Have:
- 12+ years in software engineering with 7+ years focused on video/media engineering in production systems processing video at scale.
- Deep FFmpeg expertise: Not just CLI usage — understanding of libavcodec, libavformat, filter graphs, custom demuxers/decoders, and performance tuning.
- Codec internals knowledge: H.264/H.265 bitstream structure, AV1 adoption tradeoffs, hardware decode paths, quality metrics (VMAF, SSIM, PSNR).
- Streaming protocol fluency: HLS, DASH, LL-HLS, WebRTC. Experience with live/real-time ingest pipelines.
- Systems engineering depth: Comfortable in C/C++, Rust, or Go for performance-critical media code; Python for pipeline orchestration. Can reason about memory layout, SIMD, GPU pipelines.
- Storage & retrieval at scale: Experience designing video storage systems — object stores, frame-indexed access patterns, tiered storage strategies.
- Content-aware processing: Experience with scene detection, shot boundary analysis, temporal segmentation, or perceptual quality optimization.
- Production instincts: Incident response, observability for media pipelines, debugging decode failures at scale, handling format edge cases gracefully.
- AI/ML integration experience (strongly preferred): Worked with teams consuming video frames for model training/inference. Understands how preprocessing decisions (resolution, frame rate, chunking strategy) impact model quality.
Qualified Candidates May Also Have:
- Made major contributions to FFmpeg, GStreamer, or open-source media projects.
- Deep familiarity with GPU-accelerated video processing (ex. NVDEC/NVENC).
- Experience running media pipelines in constrained environments such as on-prem or edge settings.
Candidates must be able to travel up to 10% of the time annually to attend conferences, off-site meetings, and other business-related events as required by the role. This role may require participation in on-site interviews and/or completion of in-person onboarding processes.
Benefits and Perks
🤝 An open and inclusive culture and work environment.
🚀 Work closely with a collaborative, mission-driven team on cutting-edge AI technology.
🏥 Full health, dental, and vision benefits
✈️ Extremely flexible PTO and parental leave policy. Office closed the week of Christmas and New Years.
🛂 VISA support where applicable
Создайте идеальное резюме с помощью ИИ-агента

Навыки
- C++
- Python
- Rust
- Machine Learning
- Go
- HLS
- WebRTC
- GPU
- DASH
- FFmpeg
- H.264
- AV1
- H.265
- NVDEC
- VA-API
Возможные вопросы на собеседовании
Проверка глубоких знаний FFmpeg и умения оптимизировать производительность на низком уровне.
Расскажите о вашем опыте работы с libavcodec и libavformat. Можете ли вы описать случай, когда вам приходилось писать кастомный демуксер или фильтр для решения специфической проблемы производительности?
Критически важный навык для данной роли — понимание специфики подготовки данных для ИИ.
Как бы вы спроектировали стратегию 'умного' чанкинга видео для обучения мультимодальной модели, чтобы сбалансировать вычислительные затраты и семантическую целостность фрагментов?
Оценка навыков системного проектирования и управления затратами в облаке.
Как вы организуете эффективный доступ к кадрам на уровне frame-level random access в хранилище петабайтного масштаба, минимизируя задержки и стоимость S3/облачных запросов?
Проверка опыта работы с аппаратным ускорением.
В каких случаях вы предпочтете программное декодирование аппаратному (NVDEC/VA-API) в контексте масштабируемого ИИ-пайплайна, и какие подводные камни вы встречали при работе с GPU-декодерами в Docker-контейнерах?
Оценка лидерских качеств и умения работать в кросс-функциональной команде.
Опишите ваш подход к установлению стандартов видеоинженерии в команде, где большинство разработчиков специализируются на ML, а не на медиа-технологиях.
Похожие вакансии
Go - разработчик (Senior)
Senior Java Developer
.NET разработчик Middle+ , Senior
Senior C++ Developer (ATM / Payment Systems)
Разработчик C++ ( Senior )
Rust Developer
1000+ офферов получено
Устали искать работу? Мы найдём её за вас
Quick Offer улучшит ваше резюме, подберёт лучшие вакансии и откликнется за вас. Результат — в 3 раза больше приглашений на собеседования и никакой рутины!
- Страна
- США
- Зарплата
- 210 000 $ – 280 000 $