yandex
xsolla
Страна
Малайзия
+500% приглашений

Откликайтесь
на вакансии с ИИ

Ускорим процесс поиска работы
ГибридПолная занятость

AI Infrastructure Engineer

Оценка ИИ

Xsolla — лидер в игровой индустрии, предлагающий работу на острие технологий (AIOps). Позиция предполагает высокую степень автономии, работу с современным стеком (GCP, K8s, LLMs) и отличный социальный пакет.


Вакансия из Quick Offer Global, списка международных компаний
Пожаловаться

Сложность вакансии

ЛегкоСложно
Оценка ИИ

Роль требует редкого сочетания глубоких знаний в DevOps (GCP, Kubernetes, IaC) и практического опыта работы с ИИ (LLM, ML-пайплайны). Высокий порог входа обусловлен необходимостью не просто использовать готовые инструменты, а создавать собственные ИИ-решения для автоматизации инфраструктуры.

Анализ зарплаты

Медиана55 000 $
Рынок45 000 $ – 75 000 $
Оценка ИИ

Зарплата для данной роли не указана, однако для специалистов уровня Senior/Lead в области AI Infrastructure в Куала-Лумпуре рыночные показатели значительно выше средних по IT-сектору из-за уникальности компетенций. Мы оцениваем медиану в районе 180,000 - 240,000 MYR в год.

Сопроводительное письмо

I am writing to express my strong interest in the AI Infrastructure Engineer position at Xsolla. With over 6 years of experience in DevOps and SRE roles, combined with a deep passion for integrating LLMs and predictive analytics into operational workflows, I am confident in my ability to help Xsolla transition from reactive to predictive infrastructure management. My background includes extensive work with GCP and Terraform, as well as developing custom Python-based automation that leverages OpenAI APIs to streamline incident response.

In my previous roles, I have successfully implemented anomaly detection systems that reduced MTTR by 30% and automated cloud cost optimization strategies that saved significant operational budget. I am particularly excited about Xsolla's vision of building internal AI agents for developer self-service and auto-generating IaC configurations. I thrive in environments where I can experiment with frameworks like LangChain to solve real-world infrastructure challenges, and I am eager to bring this expertise to your global team.

+250% к просмотрам

Составьте идеальное письмо к вакансии с ИИ-агентом

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в xsolla уже сейчас

Присоединяйтесь к Xsolla и станьте архитектором будущего, где ИИ управляет глобальной игровой инфраструктурой!

Описание вакансии

ABOUT YOU

We are seeking a hands-on and forward-thinking AI Infrastructure Engineer to help build and operate the intelligent systems that power Xsolla's infrastructure. As part of our Infrastructure Team, you will implement AI-driven solutions across cloud optimization, security, automation, and developer support — helping us shift from manual and reactive operations to predictive, self-optimizing infrastructure management.

The ideal candidate brings solid infrastructure engineering experience combined with practical knowledge of AI/ML integration. You are comfortable working with LLMs, ML pipelines, and AI automation frameworks, and you know how to apply them to real operational problems at scale. You thrive in environments that require both technical depth and the ability to experiment, iterate, and deliver.

If you're passionate about using AI to transform how infrastructure is built and operated — and want to be part of a team that is driving that transformation at a global gaming company — we'd love to hear from you.

ABOUT US

Xsolla is a global commerce company with robust tools and services designed to help developers solve the inherent challenges of the video game industry. From indie to AAA, companies partner with Xsolla to help them fund, distribute, market, and monetize their games. Grounded in the belief in the future of video games, Xsolla is resolute in the mission to bring opportunities together, and continually make new resources available to creators. Headquartered and incorporated in Los Angeles, California, Xsolla operates as the merchant of record and has helped over 1,500+ game developers to reach more players and grow their businesses around the world.

For more information, visit xsolla.com.

Responsibilities:

  • Design and implement AI/ML-powered solutions for infrastructure use cases, including predictive autoscaling, anomaly detection, intelligent cost optimization, and automated remediation across GCP and multi-cloud environments
  • Build and maintain AI-driven monitoring and observability systems that correlate logs, metrics, and traces to surface root causes, predict bottlenecks, and reduce mean time to resolution (MTTR)
  • Develop and operate automated incident response workflows using AI-powered playbooks that diagnose, contain, and resolve infrastructure issues with minimal manual intervention
  • Integrate AI tooling into CI/CD pipelines to improve deployment reliability, automate test prediction, score release health, and support rollback automation
  • Contribute to the development of internal AI agents and virtual assistants integrated into developer workflows (Slack, IDEs, Confluence) — enabling self-service for provisioning, troubleshooting, and infrastructure guidance
  • Implement AI/ML-based anomaly detection and automated vulnerability management workflows to enhance the security posture of Xsolla's infrastructure
  • Prototype and productionize Generative AI solutions for infrastructure automation, including auto-generation of Terraform/Puppet modules, IaC configurations, runbooks, and change documentation
  • Collaborate with senior engineers and leadership to evolve and execute the infrastructure AI strategy across its implementation phases
  • Maintain clear documentation of AI tools, integrations, and automated workflows; share knowledge and best practices across the team

Qualifications:

  • 5–7 years of experience in infrastructure engineering, DevOps, SRE, or a related field
  • Hands-on experience with GCP (priority) and/or AWS; solid understanding of cloud resource management, scaling, and cost structures
  • Practical experience building or integrating AI/ML-powered tools in an operational context (anomaly detection, predictive models, LLM-based automation, or similar)
  • Experience with infrastructure-as-code tools — Terraform, Puppet, Ansible, or equivalent
  • Proficiency in Python for scripting, automation, and AI/ML integration; Bash or Go a plus
  • Working knowledge of Kubernetes and container orchestration in production environments
  • Familiarity with observability and monitoring stacks (Prometheus, Grafana, ELK, Datadog, or similar)
  • Familiarity with LLM APIs (OpenAI, Anthropic, or similar) and prompt engineering for operational use cases
  • Strong problem-solving mindset with a bias toward automation and eliminating toil
  • Fluent in English (written and verbal)

Nice To Have:

  • Experience with AI workflow orchestration frameworks (LangChain, LlamaIndex, n8n, or similar)
  • Exposure to AIOps platforms (Dynatrace, Datadog AI, Moogsoft, BigPanda, or similar)
  • Background in FinOps or AI-driven cloud cost optimization
  • Familiarity with vector databases (Weaviate, Pinecone, Qdrant) for knowledge retrieval systems
  • Experience with VMware or hybrid cloud environments
  • GCP and/or AWS cloud certifications
  • Prior experience in gaming, high-growth tech, or SaaS platform environments
  • The duties and responsibilities of this position may evolve over time to support the organization's goals and individual growth

Note

This job description is intended to outline the general nature and level of work being performed and is not intended to be an exhaustive list of all duties, responsibilities, and qualifications required.

Benefits:We are passionate about fostering a supportive environment for our team, so we prioritize the physical, mental, and emotional well-being of our employees and their families through a comprehensive Benefits Program. This includes medical, dental, and vision, PTO, and a personalized career roadmap for each employee. By investing in professional development through training and educational opportunities, we ensure that our team thrives both personally and professionally. Together, we’re not just building a business; we’re cultivating a community that values creativity, collaboration, and the transformative power of play.

Equal Employment Opportunity Statement:

Xsolla is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We do not discriminate based on race, color, religion, sex, national origin, age, disability, sexual orientation, gender identity, or any other characteristic protected by law.

We consider qualified applicants with criminal histories in accordance with the Fair Chance Act.

Criminal History Consideration:

For the AI Infrastructure Engineer, we will conduct a background check that may include the following:

Criminal history check

Employment verification

Education verification

Relevance to Job Responsibilities:

The background check is relevant to this position because of the following role responsibilities:

Accessing confidential company data

Ensuring compliance with regulatory requirements

Rights Under the Fair Chance Act:

Applicants are encouraged to inquire about their rights under the Fair Chance Act. If you have questions regarding our hiring practices, please contact careers@xsolla.com.

+400% к собеседованиям

Создайте идеальное резюме с помощью ИИ-агента

Создайте идеальное резюме с помощью ИИ-агента

Навыки

  • AWS
  • Python
  • Terraform
  • GCP
  • Kubernetes
  • Bash
  • Prometheus
  • Grafana
  • Go
  • Ansible
  • LangChain
  • Puppet
  • Datadog
  • ELK
  • OpenAI
  • Anthropic
  • LlamaIndex

Возможные вопросы на собеседовании

Проверка опыта интеграции ИИ в реальные процессы.

Расскажите о наиболее сложном случае, когда вы использовали ИИ или ML для решения конкретной инфраструктурной проблемы. Каких метрик удалось достичь?

Оценка навыков работы с облачной инфраструктурой и инструментами автоматизации.

Как бы вы спроектировали систему предиктивного автоскейлинга в GCP, используя исторические данные о нагрузке и ML-модели?

Проверка понимания современных LLM-фреймворков.

Какие подходы вы бы использовали для минимизации галлюцинаций при генерации Terraform-кода с помощью LLM?

Оценка опыта в SRE и мониторинге.

Как интегрировать ИИ в существующий стек Prometheus/Grafana для автоматического выявления первопричин (Root Cause Analysis) инцидентов?

Проверка навыков программирования и автоматизации.

Опишите ваш опыт создания кастомных инструментов на Python для взаимодействия с API облачных провайдеров или ИИ-сервисов.

Похожие вакансии

более 1000 офферов получено
4.9

1000+ офферов получено

Устали искать работу? Мы найдём её за вас

Quick Offer улучшит ваше резюме, подберёт лучшие вакансии и откликнется за вас. Результат — в 3 раза больше приглашений на собеседования и никакой рутины!

xsolla
Страна
Малайзия