GLM-130B
Bilingual Language Model
An open-source implementation of a large bilingual language model pre-trained on vast amounts of text data.
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
8k stars
99 watching
608 forks
Language: Python
last commit: over 1 year ago Related projects:
Repository | Description | Stars |
---|---|---|
thudm/glm | A general-purpose language model pre-trained with an autoregressive blank-filling objective and designed for various natural language understanding and generation tasks. | 3,207 |
openbmb/toolbench | A platform for training, serving, and evaluating large language models to enable tool use capability | 4,888 |
fminference/flexllmgen | Generates large language model outputs in high-throughput mode on single GPUs | 9,236 |
thunlp/ultrachat | Large-scale dialogue data and models for training chatbots and conversational AI systems | 2,272 |
lyogavin/airllm | Optimizes large language model inference on limited GPU resources | 5,446 |
young-geng/easylm | A framework for training and serving large language models using JAX/Flax | 2,428 |
sjtu-ipads/powerinfer | An efficient Large Language Model inference engine leveraging consumer-grade GPUs on PCs | 8,011 |
qwenlm/qwen | This repository provides large language models and chat capabilities based on pre-trained Chinese models. | 14,797 |
huawei-noah/pretrained-language-model | A collection of pre-trained language models and optimization techniques for efficient natural language processing | 3,039 |
openbmb/bmtools | Tools and platform for building and extending large language models | 2,907 |
x-d-lab/langchain-chatglm-webui | Provides an online UI for deploying large language models based on LangChain and ChatGLM | 3,194 |
thunlp/plmpapers | Compiles and organizes key papers on pre-trained language models, providing a resource for developers and researchers. | 3,331 |
sgl-project/sglang | A fast serving framework for large language models and vision language models. | 6,551 |
modeltc/lightllm | A Python-based framework for serving large language models with low latency and high scalability. | 2,691 |
brexhq/prompt-engineering | Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4. | 8,487 |