Aquila2

Language model toolkit

Provides pre-trained language models and tools for fine-tuning and evaluation

The official repository of the Aquila2 series from BAAI, including pretrained and chat large language models.

GitHub

437 stars
5 watching
30 forks
Language: Python
last commit: about 1 month ago
Topics: llm, llm-inference, llm-training

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| openai/finetune-transformer-lm | Code and a model for improving language understanding through generative pre-training with a transformer-based architecture | 2,160 |
| vhellendoorn/code-lms | A guide to using pre-trained large language models in source code analysis and generation | 1,782 |
| openai/lm-human-preferences | Training methods and tools for fine-tuning language models using human preferences | 1,229 |
| aiplanethub/beyondllm | An open-source toolkit for building and evaluating large language models | 261 |
| zhuiyitechnology/pretrained-models | A collection of pre-trained language models for natural language processing tasks | 987 |
| flageval-baai/flageval | An evaluation toolkit and platform for assessing large models in various domains | 300 |
| langboat/mengzi3 | 8B and 13B language models based on the Llama architecture, with multilingual capabilities | 2,032 |
| melih-unsal/demogpt | A comprehensive toolset for building Large Language Model (LLM) based applications | 1,710 |
| baai-wudao/model | A repository of pre-trained language models for various tasks and domains | 121 |
| chendelong1999/polite-flamingo | Training methods to improve the politeness and natural flow of multi-modal Large Language Models | 63 |
| openlmlab/openchinesellama | An incrementally pre-trained Chinese large language model based on the LLaMA-7B model | 234 |
| bilibili/index-1.9b | A lightweight, multilingual language model with a long context length | 904 |
| llava-vl/llava-plus-codebase | A platform for training and deploying large language and vision models that can use tools to perform tasks | 704 |
| kendryte/toucan-llm | A large language model with 70 billion parameters designed for chatbot and conversational AI tasks | 29 |
| luogen1996/lavin | An open-source implementation of a vision-language instructed large language model | 508 |