BiLLa
Bilingual Reasoning Model
A bilingual LLaMA model with enhanced reasoning ability, trained on a mix of task-oriented and conversational data.
BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability
421 stars
6 watching
47 forks
Language: Python
last commit: over 1 year ago

Related projects:
Repository | Description | Stars |
---|---|---|
kendryte/toucan-llm | A large language model with 70 billion parameters designed for chatbot and conversational AI tasks | 29 |
elanmart/psmm | An implementation of a neural network model for character-level language modeling. | 50 |
bilibili/index-1.9b | A lightweight, multilingual language model with a long context length | 904 |
airaria/visual-chinese-llama-alpaca | Develops a multimodal Chinese language model with visual capabilities | 424 |
datacanvasio/alaya | A conversational AI model pre-trained on high-quality data and fine-tuned for tasks such as question answering, code generation, and text summarization. | 43 |
chendelong1999/polite-flamingo | Develops training methods to improve the politeness and natural flow of multi-modal Large Language Models | 63 |
turkunlp/wikibert | Provides pre-trained language models derived from Wikipedia texts for natural language processing tasks | 34 |
flagai-open/aquila2 | Provides pre-trained language models and tools for fine-tuning and evaluation | 437 |
brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230 |
langboat/mengzi3 | 8B and 13B language models based on the Llama architecture with multilingual capabilities. | 2,032 |
linksoul-ai/chinese-llama-2-7b | A deep learning project providing an open-source implementation of the LLaMA2 model with Chinese and English text data | 2,228 |
clue-ai/chatyuan-7b | An updated version of a large language model designed to improve performance on multiple tasks and datasets | 13 |
yxuansu/tacl | Improves pre-trained language models by encouraging an isotropic and discriminative distribution of token representations. | 92 |
ncbi-nlp/bluebert | Pre-trained language models for biomedical natural language processing tasks | 558 |
cluebenchmark/electra | Trains and evaluates a Chinese language model using adversarial training on a large corpus. | 140 |