Tag: TinyLlama

All the articles with the tag "TinyLlama".

Implementing a TinyLlama Chatbot with Intel NPU Acceleration

3 Feb, 2025

A walkthrough of building a chatbot that runs TinyLlama inference on the NPU using the Intel NPU Acceleration Library. It covers transformers version compatibility issues and the NPU acceleration setup.