intel-extension-for-transformers

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Archived

GitHub

2k stars
28 watching
211 forks
Language: Python
last commit: about 1 month ago
Linked from 2 awesome lists

4-bitsautoroundchatbotchatpdfgaudi3habanaintel-optimized-llamacpplarge-language-modelllm-cpullm-inferenceneural-chatneural-chat-7bragretrievalspeculative-decodingstreamingllm

Backlinks from these awesome lists: