GLM-130B
Bilingual Language Model
An open-source bilingual (English and Chinese) language model with 130 billion parameters, pre-trained on over 400 billion text tokens.
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
8k stars
99 watching
608 forks
Language: Python
last commit: over 1 year ago

Related projects:
Repository | Description | Stars |
---|---|---|
thudm/glm | A general-purpose language model pre-trained with an autoregressive blank-filling objective and designed for various natural language understanding and generation tasks. | 3,199 |
openbmb/toolbench | A platform for training, serving, and evaluating large language models to enable tool use capability | 4,843 |
fminference/flexllmgen | Generates large language model outputs in high-throughput mode on single GPUs | 9,192 |
thunlp/ultrachat | Large-scale dialogue data and models for training chatbots and conversational AI systems | 2,259 |
lyogavin/airllm | A Python library that optimizes inference memory usage for large language models on limited GPU resources. | 5,259 |
young-geng/easylm | A framework for training and serving large language models using JAX/Flax | 2,409 |
sjtu-ipads/powerinfer | An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs | 7,964 |
qwenlm/qwen | Provides large language models and chat capabilities based on pre-trained Chinese models. | 14,164 |
huawei-noah/pretrained-language-model | A collection of pre-trained language models and optimization techniques for efficient natural language processing | 3,028 |
openbmb/bmtools | Tools and platform for building and extending large language models | 2,898 |
x-d-lab/langchain-chatglm-webui | Provides an online UI for deploying large language models based on LangChain and ChatGLM | 3,173 |
thunlp/plmpapers | Compiles and organizes key papers on pre-trained language models, providing a resource for developers and researchers. | 3,328 |
sgl-project/sglang | A framework for serving large language models and vision models with efficient runtime and flexible interface. | 6,082 |
modeltc/lightllm | An LLM inference and serving framework providing a lightweight design, scalability, and high-speed performance for large language models. | 2,609 |
brexhq/prompt-engineering | Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4. | 8,440 |