Страна: Сербия

+500% приглашений

Откликайтесь
на вакансии с ИИ

InternВ офисеПолная занятость

ML Models Implementation & Performance Optimization, Intern (Serbia)

Name: Quick Offer — сервис для поиска работы на hh.ru
Brand: Quick Offer
SKU: quick-offer-saas
Availability: InStock
Rating: 4.9 (682 reviews)

Исключительная возможность для старта карьеры в одной из самых инновационных компаний индустрии ИИ-железа. Работа с открытым исходным кодом, менторство экспертов и реальное влияние на продукт делают эту позицию очень привлекательной.

Вакансия из Quick Offer Global, списка международных компаний

Пожаловаться

Сложность вакансии

ЛегкоСложно

Стажировка требует серьезных знаний как в области ML (PyTorch), так и в системном программировании (C++, оптимизация производительности). Высокая планка обусловлена работой с низкоуровневым стеком и специфическим оборудованием Tenstorrent.

Анализ зарплаты

Медиана1 200 €

Рынок800 € – 1 500 €

Зарплата для стажеров в Tenstorrent обычно соответствует или слегка превышает рыночные показатели для международных технологических компаний в Сербии. Учитывая сложность задач и престиж компании, компенсация является конкурентоспособной для локального рынка Белграда.

I am writing to express my strong interest in the ML Models Implementation & Performance Optimization Internship at Tenstorrent in Belgrade. As a final-year student with a deep passion for high-performance computing and machine learning, I have been following Tenstorrent’s work in revolutionizing AI hardware-software co-design with great admiration. My background in Python and C++, combined with a solid foundation in ML frameworks like PyTorch, aligns perfectly with the technical requirements of this role.

During my academic projects, I have focused on optimizing code for efficiency and exploring how software interacts with underlying hardware. The opportunity to work with Tenstorrent’s open-source stack, including tt-metalium and tt-nn, is incredibly exciting to me. I am particularly drawn to your "code-to-career" philosophy and the chance to own a well-defined engineering project under the mentorship of industry experts. I am eager to contribute to pushing the boundaries of inference speed and accuracy on your cutting-edge RISC-V platforms.

+250% к просмотрам

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в tenstorrentuniversity уже сейчас

Присоединяйтесь к команде Tenstorrent в Белграде и внесите свой вклад в будущее открытых вычислений и ИИ-технологий!

Описание вакансии

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.

At Tenstorrent, we believe the future of computing must be open, which is why our interns don’t just watch from the sidelines - they help build the core of it. We provide a "code-to-career" pipeline where students collaborate with industry experts to solve high-stakes problems in RISC-V and AI hardware-software co-design. By joining us, you are taking an internship to democratize high-performance computers that are accessible to everyone.

In this role, you will implement state of art ML models on Tenstorrent hardware using Python and C++, focusing on pushing both accuracy and inference speed. You will work hands-on with Tenstorrent’s open-source software stack (tt-metalium, tt-nn, tt-llk), taking models from framework to silicon and iterating on performance. You will own a well-defined engineering project under the guidance of a dedicated mentor, with direct impact on how real workloads run on our chips. We are looking for a minimum of 3 months for this role with the potential for extension to 6 months.

This role is onsite, based in our Belgrade office.

Who You Are

Enrolled in the final year of BSc or MSc studies in Computer Science, Computer Engineering, Software Engineering, Electronics, Math, or a related field.
Solid coding skills in Python and C++, with a basic understanding of machine learning concepts and frameworks.
You have a passion for programming, are eager to learn, and enjoy solving complex performance and optimization problems.
You are collaborative, open to feedback, and excited to work closely with experienced engineers and a dedicated mentor.

What We Need

Implement functional ML models on Tenstorrent hardware using Python and popular ML frameworks like PyTorch.
Benchmark, analyze, and optimize the performance of the implemented model's inference using existing tools and coding in C++ and Python.
Collaborate with experienced engineers to validate the accuracy of implemented models and iterate on improvements.
Contribute to performance optimization efforts where success is measured by achieving both high accuracy and fast execution (inference) of ML models on Tenstorrent hardware.

What You Will Learn

How to implement state-of-the-art ML models on Tenstorrent hardware using Python, C++, and popular ML frameworks like PyTorch.
Techniques for benchmarking, analyzing, and optimizing the performance of ML model inference using existing tools and code in C++ and Python.
How to use (and potentially debug and fix) Tenstorrent’s open-source software libraries, such as tt-metalium, tt-nn, and tt-llk.
How to collaborate with experienced engineers, apply various problem-solving techniques, and drive a well-defined engineering project under the guidance of a dedicated mentor.

Hiring Timelines

This internship opportunity is available throughout our 3 terms with the following corresponding recruitment cycles:

Winter Term: Mar–May work term, Nov–Jan recruit.
Summer Term: Jul/Aug–Sep work term, Jan–Apr/May recruit.
Fall Term: Oct–Dec work term, Apr–May recruit.

Please note these timelines are for reference only. Actual timelines may vary.

Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.

This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology. Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2). These requirements apply to persons located in the U.S. and all countries outside the U.S. As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency. If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.

+400% к собеседованиям

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Python
C++
Machine Learning
PyTorch
Performance Optimization
Benchmarking
RISC-V
Computer Architecture

Возможные вопросы на собеседовании

Проверка базовых знаний C++, критически важных для оптимизации производительности на уровне железа.

Объясните разницу между выделением памяти в стеке и в куче, и как это может повлиять на производительность ML-инференса?

Поскольку роль связана с оптимизацией моделей, важно понимать внутреннее устройство популярных фреймворков.

Как работает механизм автоматического дифференцирования в PyTorch и какие накладные расходы он создает при инференсе?

Работа в Tenstorrent предполагает понимание того, как код исполняется на чипе.

Что такое SIMD-инструкции и как они используются для ускорения матричных вычислений в нейронных сетях?

Проверка навыков профилирования и поиска узких мест.

Опишите ваш подход к поиску причин медленной работы модели: на какие метрики вы будете смотреть в первую очередь?

Оценка интереса к специфике компании.

Что вы знаете об архитектуре RISC-V и почему она становится популярной в сфере ИИ-ускорителей?

Устали искать работу? Мы найдём её за вас

Quick Offer улучшит ваше резюме, подберёт лучшие вакансии и откликнется за вас. Результат — в 3 раза больше приглашений на собеседования и никакой рутины!

Сербия

Откликайтесь
на вакансии с ИИ

ML Models Implementation & Performance Optimization, Intern (Serbia)

Анализ зарплаты

Сопроводительное письмо

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в tenstorrentuniversity уже сейчас

Описание вакансии

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Возможные вопросы на собеседовании

Объясните разницу между выделением памяти в стеке и в куче, и как это может повлиять на производительность ML-инференса?

Как работает механизм автоматического дифференцирования в PyTorch и какие накладные расходы он создает при инференсе?

Что такое SIMD-инструкции и как они используются для ускорения матричных вычислений в нейронных сетях?

Опишите ваш подход к поиску причин медленной работы модели: на какие метрики вы будете смотреть в первую очередь?

Что вы знаете об архитектуре RISC-V и почему она становится популярной в сфере ИИ-ускорителей?

Похожие вакансии

Дата инженер (ученик)

Стажер data science (ИИ в агростраховании)

Стажер Data Engineer в Аналитика Datagovernance [Big Data, МТС Веб Сервисы]

Стажер Data Science

Стажер/антифрод-аналитик

Стажёр в центр развития MLOps

Устали искать работу? Мы найдём её за вас

Откликайтесьна вакансии с ИИ

ML Models Implementation & Performance Optimization, Intern (Serbia)

Анализ зарплаты

Сопроводительное письмо

Составьте идеальное письмо к вакансии с ИИ-агентом

Откликнитесь в tenstorrentuniversity уже сейчас

Описание вакансии

Создайте идеальное резюме с помощью ИИ-агента

Навыки

Возможные вопросы на собеседовании

Объясните разницу между выделением памяти в стеке и в куче, и как это может повлиять на производительность ML-инференса?

Как работает механизм автоматического дифференцирования в PyTorch и какие накладные расходы он создает при инференсе?

Что такое SIMD-инструкции и как они используются для ускорения матричных вычислений в нейронных сетях?

Опишите ваш подход к поиску причин медленной работы модели: на какие метрики вы будете смотреть в первую очередь?

Что вы знаете об архитектуре RISC-V и почему она становится популярной в сфере ИИ-ускорителей?

Похожие вакансии

Дата инженер (ученик)

Стажер data science (ИИ в агростраховании)

Стажер Data Engineer в Аналитика Datagovernance [Big Data, МТС Веб Сервисы]

Стажер Data Science

Стажер/антифрод-аналитик

Стажёр в центр развития MLOps

Устали искать работу? Мы найдём её за вас

Откликайтесь
на вакансии с ИИ