DeepSeek-V2

Mixture-of-Experts Language Model

A high-performance mixture-of-experts language model with strong performance and efficient inference capabilities.

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

GitHub

4k stars

32 watching

171 forks

last commit: 10 months ago

Related projects:

Repository	Description	Stars
deepseek-ai/deepseek-coder	A code completion model trained on large amounts of programming language data to help developers write code more efficiently.	6,987
deepseek-ai/deepseek-moe	A large language model with improved efficiency and performance compared to similar models	1,024
confident-ai/deepeval	A framework for evaluating large language models	4,003
microsoft/deepspeed	A deep learning optimization library that simplifies distributed training and inference on modern computing hardware.	35,863
huggingface/text-generation-inference	A toolkit for deploying and serving Large Language Models (LLMs) for high-performance text generation	9,456
coqui-ai/tts	A deep learning toolkit for generating human-like speech from text	36,118
brightmart/text_classification	An NLP project offering various text classification models and techniques for deep learning exploration	7,881
google/big-bench	A benchmark designed to probe large language models and extrapolate their future capabilities through a diverse set of tasks.	2,899
deepseek-ai/deepseek-coder-v2	A code intelligence model designed to generate and complete code in various programming languages	2,322
deepseek-ai/deepseek-vl	A multimodal AI model that enables real-world vision-language understanding applications	2,145
dair-ai/ml-papers-explained	An explanation of key concepts and advancements in the field of Machine Learning	7,352
databrickslabs/dolly	A large language model trained on a commercial machine learning platform with limited capabilities	10,820
huggingface/alignment-handbook	Provides recipes and guidelines for training language models to align with human preferences and AI goals	4,800
qwenlm/qwen2.5	A large language model series with various sizes and variants for text generation and understanding.	10,959
tju-drl-lab/ai-optimizer	A next-generation deep reinforcement learning toolkit with libraries for multiagent, self-supervised, offline, and transfer/reinforcement learning	4,848