GLM-130B

Bilingual Language Model

An open-source bilingual (English and Chinese) language model with 130 billion parameters, pre-trained on a large text corpus.

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

GitHub

8k stars
99 watching
608 forks
Language: Python
Last commit: over 1 year ago

Related projects:

Repository | Description | Stars
thudm/glm | A general-purpose language model pre-trained with an autoregressive blank-filling objective and designed for various natural language understanding and generation tasks | 3,199
openbmb/toolbench | A platform for training, serving, and evaluating large language models to enable tool use capability | 4,843
fminference/flexllmgen | Generates large language model outputs in high-throughput mode on single GPUs | 9,192
thunlp/ultrachat | Large-scale dialogue data and models for training chatbots and conversational AI systems | 2,259
lyogavin/airllm | A Python library that optimizes inference memory usage for large language models on limited GPU resources | 5,259
young-geng/easylm | A framework for training and serving large language models using JAX/Flax | 2,409
sjtu-ipads/powerinfer | An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs | 7,964
qwenlm/qwen | This repository provides large language models and chat capabilities based on pre-trained Chinese models | 14,164
huawei-noah/pretrained-language-model | A collection of pre-trained language models and optimization techniques for efficient natural language processing | 3,028
openbmb/bmtools | Tools and platform for building and extending large language models | 2,898
x-d-lab/langchain-chatglm-webui | Provides an online UI for deploying large language models based on LangChain and ChatGLM | 3,173
thunlp/plmpapers | Compiles and organizes key papers on pre-trained language models, providing a resource for developers and researchers | 3,328
sgl-project/sglang | A framework for serving large language models and vision models with efficient runtime and flexible interface | 6,082
modeltc/lightllm | An LLM inference and serving framework providing a lightweight design, scalability, and high-speed performance for large language models | 2,609
brexhq/prompt-engineering | Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4 | 8,440