Ruby Labs
Senior · Remote · Full-time

AI Engineer

AI assessment

A strong vacancy for an experienced engineer: a modern stack, a focus on data and metrics rather than intuition, and attractive benefits such as unlimited PTO and remote work. The company demonstrates a mature approach to AI development.


Vacancy from Quick Offer Global, a list of international companies

Job difficulty
AI assessment

The role requires deep knowledge of both classic web development (Node.js/Next.js) and AI-specific tools (Langfuse, OpenRouter, LangChain). The high difficulty comes from the need to build complex evaluation systems (evals) and to work with the uncertainty of LLM outputs.

Salary analysis

Median: $105,000
Market: $75,000 – $140,000
AI assessment

The salary is not stated, but for a Senior AI Engineer position at European companies with remote work, the market range is typically $80,000 – $130,000 per year. Ruby Labs offers a contractor arrangement, which often implies a rate above the market average to compensate for taxes.

Cover letter

I am writing to express my strong interest in the Senior AI Engineer position at Ruby Labs. With a deep background in Node.js and Next.js, combined with extensive experience in building production-ready LLM workflows, I am confident in my ability to drive your AI infrastructure forward. My expertise in advanced prompt engineering and structured outputs aligns perfectly with your mission to deliver high-quality, data-driven AI experiences.

In my previous roles, I have successfully implemented observability pipelines using tools like Langfuse and managed complex model deployments via OpenRouter. I am particularly drawn to Ruby Labs' commitment to quantitative benchmarking and systematic A/B testing, as I believe that rigorous evaluation is the key to overcoming hallucinations and optimizing model performance. I am excited about the opportunity to bring my analytical mindset and iterative approach to your fast-growing team.

Join Ruby Labs and build the future of consumer AI products in a dynamic environment with unlimited time off!

Job description

About us

Ruby Labs is a leading tech company that creates and operates innovative consumer products. We offer a diverse range of opportunities across the health, education, and entertainment industries. Our innovative teams are driving the future of consumer-led products, and we're always looking for passionate individuals to join us. Learn more about our story at: https://rubylabs.com/about-us/

About the role

At Ruby Labs, we’re seeking a Senior AI Engineer (Node.js / Next.js / TypeScript) to shape our AI infrastructure and drive production-ready LLM experiences. You’ll work in a modern stack, making data-driven decisions around model performance, reliability, and cost.

You’ll own advanced prompt systems, structured outputs, and complex LLM workflows using LangChain or LlamaIndex. Observability, debugging, and evaluation are core to the role, leveraging Langfuse and AI gateways like OpenRouter to continuously improve model quality and operational efficiency. You’ll take full ownership of key AI features from experimentation to live production.

Key Responsibilities

  • Advanced Prompt Engineering: Designing complex, dynamic prompt templates with conditional logic and efficiently reusing information and context within prompts to maximize generation quality and reasoning.
  • Structured Outputs & Schemas: Implementing various response formats (JSON mode, function calling, Zod/JSON schemas) to ensure AI outputs are predictable and ready for seamless integration into application logic.
  • Prompt Engineering & Evaluations: Building robust evaluation pipelines and using Langfuse to collect feedback and score the quality of responses in real time.
  • Tracing & Debugging: Performing deep debugging of complex LLM chains using Langfuse traces to identify bottlenecks and optimize for cost, latency, and context window usage.
  • AI A/B Testing: Running systematic experiments across different models via OpenRouter (e.g., comparing Claude 3.5 Sonnet vs. GPT-4o) and analyzing results based on quantitative metrics.
  • Data-Driven Decisions: Making deployment decisions for new prompts or models strictly based on quantitative benchmarks and trace data, rather than intuition.
  • Output Scoring & Analysis: Developing scoring systems to analyze the “Problem → Solution” chain and identify root causes of hallucinations or logic errors using Langfuse analytics.
  • Model Performance & Fine-Tuning: Regularly re-evaluating model performance as new architectures emerge and performing fine-tuning when necessary to meet specific domain requirements.
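
As an illustration of the A/B testing and data-driven decision items above, here is a minimal sketch of comparing two models on quantitative metrics. It is not Ruby Labs' method: the ScoredRun shape, the field names, and the "model-a"/"model-b" labels are all hypothetical, and in practice the scores, latencies, and costs would come from whatever tracing and evaluation tooling is in use rather than from a hard-coded array.

```typescript
// Hypothetical record of one scored generation; all field names are illustrative.
interface ScoredRun {
  model: string;     // label of the model variant under test
  score: number;     // quality score in [0, 1], e.g. from an LLM-as-a-judge
  latencyMs: number; // end-to-end latency of the call
  costUsd: number;   // cost of the call
}

interface ModelSummary {
  model: string;
  runs: number;
  meanScore: number;
  meanLatencyMs: number;
  totalCostUsd: number;
}

// Group runs by model and aggregate the metrics for each group.
function summarize(runs: ScoredRun[]): ModelSummary[] {
  const byModel = new Map<string, ScoredRun[]>();
  for (const run of runs) {
    const group = byModel.get(run.model) ?? [];
    group.push(run);
    byModel.set(run.model, group);
  }
  const summaries: ModelSummary[] = [];
  byModel.forEach((group, model) => {
    const n = group.length;
    summaries.push({
      model,
      runs: n,
      meanScore: group.reduce((sum, r) => sum + r.score, 0) / n,
      meanLatencyMs: group.reduce((sum, r) => sum + r.latencyMs, 0) / n,
      totalCostUsd: group.reduce((sum, r) => sum + r.costUsd, 0),
    });
  });
  return summaries;
}

// Pick the model with the highest mean score; break ties by lower mean latency.
function pickWinner(summaries: ModelSummary[]): ModelSummary | undefined {
  const sorted = summaries
    .slice()
    .sort((a, b) => b.meanScore - a.meanScore || a.meanLatencyMs - b.meanLatencyMs);
  return sorted[0];
}

// Made-up sample data, for demonstration only.
const sampleRuns: ScoredRun[] = [
  { model: "model-a", score: 0.75, latencyMs: 1200, costUsd: 0.004 },
  { model: "model-a", score: 0.5, latencyMs: 1000, costUsd: 0.004 },
  { model: "model-b", score: 1.0, latencyMs: 2000, costUsd: 0.01 },
  { model: "model-b", score: 0.5, latencyMs: 1800, costUsd: 0.01 },
];

const winner = pickWinner(summarize(sampleRuns));
console.log(winner?.model); // → "model-b"
```

A real comparison would also need enough runs per model and some check that the difference is not noise; the sketch only shows the aggregation step that turns raw logs into a decision metric.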

Qualifications

  • Node.js & Next.js: Deep knowledge of the stack to build reliable services and handle complex LLM-generated data.
  • Dynamic Prompting Skills: Proven experience in building prompts where content is highly dependent on input variables and context injection.
  • OpenRouter Experience: Experience working with unified APIs, managing rate limits, and selecting the most cost-effective models for specific tasks.
  • Langfuse (or similar): Understanding of LLM observability principles — setting up tracing, creating test datasets, and integrating scoring systems.
  • Evaluation Methodology: Experience with frameworks like RAGAS or building custom “LLM-as-a-judge” systems.
  • Analytical Mindset: Ability to transform raw generation logs into actionable business metrics and technical insights.
  • Iterative Mindset: Focus on continuous product improvement through constant feedback loops.

Nice to have

  • Fine-Tuning: Practical experience in fine-tuning models for specific domain tasks or JSON compliance.
  • RAG Architecture: Understanding how to build and optimize Retrieval-Augmented Generation systems, including indexing, retrieval, and re-ranking.
  • Python: Basic knowledge for working with data science scripts or AI evaluation libraries.

Location

Ruby Labs operates within the CET (Central European Time) zone. Applicants from any country are welcome to apply for the position as long as they are located within approximately ± 4 hours of CET. This ensures optimal collaboration and communication during working hours.

Benefits

Discover the perks of being part of our vibrant team! We offer:

  • Remote Work Environment: Embrace the freedom to work from anywhere, anytime, promoting a healthy work-life balance.
  • Unlimited PTO: Enjoy unlimited paid time off to recharge and prioritize your well-being, without counting days.
  • Paid National Holidays: Celebrate and relax on national holidays with paid time off to unwind and recharge.
  • Company-provided MacBook: Experience seamless productivity with top-notch Apple MacBooks provided to all employees who need them.
  • Flexible Independent Contractor Agreement: Unlock the benefits of flexibility, autonomy, and entrepreneurial opportunities. Benefit from tax advantages, networking opportunities, reduced employment obligations, and the freedom to work from anywhere. Read more about it here: https://docs.google.com/document/d/1tzxGX4Uu7Ts_HCLFXESKLnKaaBfVCPf1f9AYZPrkjJM/preview?tab=t.0

Be part of our fast-growing team and seize this excellent opportunity for personal and professional growth!

Interview Process

After submitting your application, we conduct a thorough review which typically takes 3 to 5 days, but may occasionally take longer due to the volume of applications received. If we see a potential fit, we proceed with the following steps:

  • Recruiter Screening (40 minutes)
  • Technical Interview (60 minutes)
  • Final Interview (30 minutes)

Life at Ruby Labs

At Ruby Labs, we move fast, aim high, and expect the same from our team. We’re not here to play small—we’re here to build, grow, and win. That means we look for people who are ambitious, driven, and ready to give their best every single day.

This is a place for individuals who thrive under pressure, embrace challenges, and see opportunity in every obstacle. If you’re hungry to achieve, motivated by impact, and want to grow at the speed of your own ambition, Ruby Labs offers the platform to make it happen.

Here, effort is matched with reward. We recognize those who go all in and deliver results, and we create space for people who want more—more responsibility, more growth, and more success.



Skills

  • TypeScript
  • Python
  • RAG
  • Prompt Engineering
  • Node.js
  • Next.js
  • LangChain
  • JSON Schema
  • LlamaIndex
  • Langfuse
  • OpenRouter
  • Zod

Possible interview questions

The vacancy emphasizes using Langfuse for debugging and monitoring.

Describe your experience setting up evaluation pipelines in Langfuse. Which metrics do you consider most critical for assessing the quality of LLM responses?

The company uses OpenRouter to switch between models (Claude, GPT-4).

How do you choose a specific model for a task, given the trade-off between cost, latency, and generation quality? Give an example from your own practice.

Zod and JSON schemas are mentioned for structured outputs.

How do you ensure 100% JSON schema compliance when working with models that do not support native 'JSON mode' or 'Function Calling'?
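
One common way to approach the schema-compliance question is a validate-and-retry loop: parse the model's reply, check it against the expected shape, and feed any validation error back into the next attempt. The sketch below is only an illustration under stated assumptions. The Ticket shape, the ModelCall type, and the fake model are hypothetical; the hand-written validator stands in for a schema library such as Zod; and the model call is synchronous to keep the example self-contained, whereas a real call would be asynchronous. The loop does not guarantee 100% compliance: it fails loudly once the attempts run out.

```typescript
// Hypothetical target shape for the model's answer.
interface Ticket {
  title: string;
  priority: "low" | "medium" | "high";
}

type ParseResult =
  | { kind: "ok"; value: Ticket }
  | { kind: "error"; error: string };

function isPriority(v: unknown): v is Ticket["priority"] {
  return v === "low" || v === "medium" || v === "high";
}

// Hand-written validator standing in for a schema library.
function parseTicket(raw: string): ParseResult {
  let data: unknown;
  try {
    data = JSON.parse(raw);
  } catch (_err) {
    return { kind: "error", error: "response is not valid JSON" };
  }
  if (typeof data !== "object" || data === null) {
    return { kind: "error", error: "response is not a JSON object" };
  }
  const obj = data as Record<string, unknown>;
  const title = obj["title"];
  const priority = obj["priority"];
  if (typeof title !== "string" || title.length === 0) {
    return { kind: "error", error: "'title' must be a non-empty string" };
  }
  if (!isPriority(priority)) {
    return { kind: "error", error: "'priority' must be low, medium, or high" };
  }
  return { kind: "ok", value: { title, priority } };
}

// Hypothetical model interface: takes a prompt, returns the raw text reply.
type ModelCall = (prompt: string) => string;

function generateTicket(call: ModelCall, prompt: string, maxAttempts = 3): Ticket {
  let currentPrompt = prompt;
  let lastError = "no attempts made";
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const result = parseTicket(call(currentPrompt));
    if (result.kind === "ok") return result.value;
    lastError = result.error;
    // Feed the validation error back so the next attempt can correct it.
    currentPrompt = `${prompt}\n\nYour previous reply was rejected: ${lastError}. Reply with JSON only.`;
  }
  throw new Error(`no valid output after ${maxAttempts} attempts: ${lastError}`);
}

// Fake model for demonstration: fails once, then returns valid JSON.
let calls = 0;
const fakeModel: ModelCall = () => {
  calls += 1;
  return calls === 1
    ? "Sure! Here is the ticket you asked for."
    : '{"title": "Login page crashes", "priority": "high"}';
};

const ticket = generateTicket(fakeModel, "Summarize the bug report as a ticket.");
console.log(ticket.priority, calls); // → high 2
```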

The description mentions designing complex prompts with conditional logic.

Describe the architecture of the most complex prompt you have built. How did you manage the context window and the reuse of information?

RAG is mentioned as a nice-to-have skill.

What problems have you faced when scaling RAG systems, and how did you optimize retrieval and re-ranking?
