intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
Archived
2k stars
28 watching
211 forks
Language: Python
last commit: about 1 month ago
Linked from 2 awesome lists
4-bitsautoroundchatbotchatpdfgaudi3habanaintel-optimized-llamacpplarge-language-modelllm-cpullm-inferenceneural-chatneural-chat-7bragretrievalspeculative-decodingstreamingllm