BiLLa

Bilingual Reasoning Model

A bilingual LLaMA model with enhanced reasoning ability, trained on a mix of task-oriented and conversational data.

BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability

GitHub

421 stars
6 watching
47 forks
Language: Python
Last commit: over 1 year ago

Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| kendryte/toucan-llm | A large language model with 70 billion parameters designed for chatbot and conversational AI tasks | 29 |
| elanmart/psmm | An implementation of a neural network model for character-level language modeling | 50 |
| bilibili/index-1.9b | A lightweight, multilingual language model with a long context length | 904 |
| airaria/visual-chinese-llama-alpaca | Develops a multimodal Chinese language model with visual capabilities | 424 |
| datacanvasio/alaya | A pre-trained conversational AI model trained on high-quality data and fine-tuned for tasks such as question answering, code generation, and text summarization | 43 |
| chendelong1999/polite-flamingo | Develops training methods to improve the politeness and natural flow of multi-modal large language models | 63 |
| turkunlp/wikibert | Provides pre-trained language models derived from Wikipedia texts for natural language processing tasks | 34 |
| flagai-open/aquila2 | Provides pre-trained language models and tools for fine-tuning and evaluation | 437 |
| brightmart/xlnet_zh | Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks | 230 |
| langboat/mengzi3 | 8B and 13B language models based on the Llama architecture, with multilingual capabilities | 2,032 |
| linksoul-ai/chinese-llama-2-7b | A deep learning project providing an open-source implementation of the LLaMA2 model with Chinese and English text data | 2,228 |
| clue-ai/chatyuan-7b | An updated version of a large language model designed to improve performance on multiple tasks and datasets | 13 |
| yxuansu/tacl | Improves pre-trained language models by encouraging an isotropic and discriminative distribution of token representations | 92 |
| ncbi-nlp/bluebert | Pre-trained language models for biomedical natural language processing tasks | 558 |
| cluebenchmark/electra | Trains and evaluates a Chinese language model using adversarial training on a large corpus | 140 |