- Страна
- Сербия
Откликайтесь
на вакансии с ИИ

Inference Server – Product Software Intern
Исключительная возможность для старта карьеры в одной из самых инновационных AI-компаний мира под руководством легендарных инженеров. Работа с RISC-V и кастомным железом в Белграде — это редкий и ценный опыт для регионального рынка.
Сложность вакансии
Позиция ориентирована на студентов старших курсов, что предполагает наличие крепкой базы в Computer Science. Основная сложность заключается в необходимости понимания специфики AI-инференса и работы с кастомным оборудованием, а также в строгих требованиях экспортного контроля США.
Анализ зарплаты
Для интернатуры в Белграде в международной технологической компании предлагаемая компенсация обычно выше среднего по рынку Сербии. Указанный диапазон соответствует уровню топовых R&D центров в регионе для студентов технических специальностей.
Сопроводительное письмо
I am writing to express my strong interest in the Inference Server Product Software Intern position at Tenstorrent in Belgrade. As a final-year student with a deep passion for machine learning infrastructure and backend systems, I have been following Tenstorrent’s work in democratizing high-performance AI hardware with great admiration. My background in Computer Science and proficiency in Python, combined with a growing interest in C++ and performance optimization, aligns perfectly with the goals of your Inference Server Technologies team.
During my studies and personal projects, I have focused on understanding how ML models transition from research to production. I am particularly excited about the opportunity to work on custom AI hardware and learn how to optimize end-to-end inference through batching and model parallelism. I am eager to contribute to your backend features and help benchmark workloads on Tenstorrent’s stack, while learning from your world-class team of engineers. I am confident that my commitment to solving hard problems and my collaborative mindset would make me a valuable addition to your internship program.
Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в tenstorrentuniversity уже сейчас
Присоединяйтесь к команде Tenstorrent в Белграде и начните свою карьеру в авангарде разработки AI-технологий и RISC-V!
Описание вакансии
Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.
At Tenstorrent, we believe the future of computing must be open, which is why our interns don’t just watch from the sidelines - they help build the core of it. We provide a "code-to-career" pipeline where students collaborate with industry experts to solve high-stakes problems in RISC-V and AI hardware-software co-design. By joining us, you are taking an internship to democratize high-performance computers that are accessible to everyone.
Join our Inference Server Technologies team, where we build the software layer that powers state-of-the-art AI inference on Tenstorrent hardware. This team develops APIs, deploys workloads, and benchmarks end-to-end model performance so developers can efficiently scale inference on our stack. You will work on a project under the guidance of experienced engineers and a dedicated mentor. We are looking for a minimum of 3 months for this role with the potential for extension to 6 months.
This role is hybrid based in Belgrade, Serbia.
Who You Are
- Final-year BSc or MSc student in Computer Science, Software Engineering, Electrical Engineering, or a related technical field
- Strong programming fundamentals in Python, with familiarity in C++ considered a plus
- Interested in backend systems, API design, and how ML models are deployed in production environments
- Curious about performance optimization techniques such as batching, caching, and model parallelism
- Motivated to learn and contribute in a collaborative engineering environment
What We Need
- Contribute to backend features and APIs that support AI inference workloads
- Assist in deploying, testing, and benchmarking models running on Tenstorrent hardware
- Analyze inference performance and help identify optimization opportunities
- Write clean, maintainable code with guidance from senior engineers
- Collaborate with the team to improve reliability, usability, and performance of the inference server stack
What You Will Learn
- How end-to-end ML inference is optimized on custom AI hardware
- How scalable backend systems are designed to serve real-world AI applications
- How APIs and infrastructure shape the developer experience for AI workloads
- Practical performance analysis techniques in production-like environments
- How modern AI software stacks integrate models, runtimes, and hardware
Hiring Timelines
This internship opportunity is available throughout our 3 terms with the following corresponding recruitment cycles:
- Winter Term: Mar–May work term, Nov–Jan recruit.
- Summer Term: Jul–Sep work term, Jan–Apr recruit.
- Fall Term: Oct–Dec work term, Apr–May recruit.
Please note these timelines are for reference only. Actual timelines may vary.
Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.
This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology. Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2). These requirements apply to persons located in the U.S. and all countries outside the U.S. As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency. If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.
Создайте идеальное резюме с помощью ИИ-агента

Навыки
- C++
- Python
- Linux
- Machine Learning
- Backend Development
- API Design
- Performance Optimization
- RISC-V
Возможные вопросы на собеседовании
Проверка базовых знаний Python, необходимых для разработки бэкенда инференс-сервера.
Объясните разницу между многопоточностью (threading) и многопроцессорностью (multiprocessing) в Python. В каких случаях для AI-инференса лучше использовать второй вариант?
Оценка понимания специфики развертывания ML-моделей, упомянутой в описании.
Что такое батчинг (batching) в контексте инференса нейронных сетей и как он влияет на пропускную способность и задержку (latency)?
Проверка навыков работы с API и бэкенд-системами.
Как бы вы спроектировали REST или gRPC API для сервиса, который принимает изображения для классификации на удаленном ускорителе?
Оценка интереса к оптимизации производительности.
Знакомы ли вы с понятием квантования (quantization) моделей? Как это помогает при запуске моделей на специализированном AI-железе?
Проверка навыков отладки и работы с кодом.
Расскажите о случае, когда вам пришлось оптимизировать производительность вашего кода. Какие инструменты вы использовали для профилирования?
Похожие вакансии
Стажер Менеджер по ИИ-инструментам
Стажёр Prompt Engineer
Product Builder Trainee, AI-Native
AI-инженер (Middle+)
A.I. Engineering Intern (Colombia)
AI-инженер (Middle+) & Node.js
1000+ офферов получено
Устали искать работу? Мы найдём её за вас
Quick Offer улучшит ваше резюме, подберёт лучшие вакансии и откликнется за вас. Результат — в 3 раза больше приглашений на собеседования и никакой рутины!
- Страна
- Сербия