BiLLa

Bilingual Reasoning Model

A bilingual LLaMA model with enhanced reasoning ability, trained on a mix of task-oriented and conversational data.

BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability

GitHub

421 stars

6 watching

47 forks

Language: Python

Last commit: about 3 years ago

Related projects:

Repository	Description	Stars
kendryte/toucan-llm	A large language model with 70 billion parameters designed for chatbot and conversational AI tasks	29
elanmart/psmm	An implementation of a neural network model for character-level language modeling	50
bilibili/index-1.9b	A lightweight, multilingual language model with a long context length	920
airaria/visual-chinese-llama-alpaca	Develops a multimodal Chinese language model with visual capabilities	429
datacanvasio/alaya	A pre-trained AI model that can engage in natural language conversations with high accuracy and understanding	43
chendelong1999/polite-flamingo	Develops training methods to improve the politeness and natural flow of multi-modal Large Language Models	63
turkunlp/wikibert	Provides pre-trained language models derived from Wikipedia texts for natural language processing tasks	34
flagai-open/aquila2	Provides pre-trained language models and tools for fine-tuning and evaluation	439
brightmart/xlnet_zh	Trains a large Chinese language model on massive data and provides a pre-trained model for downstream tasks	230
langboat/mengzi3	8B and 13B language models based on the Llama architecture with multilingual capabilities	2,031
linksoul-ai/chinese-llama-2-7b	A deep learning project providing an open-source implementation of the LLaMA2 model with Chinese and English text data	2,235
clue-ai/chatyuan-7b	An updated version of a large language model designed to improve performance on multiple tasks and datasets	13
yxuansu/tacl	Improves pre-trained language models by encouraging an isotropic and discriminative distribution of token representations	92
ncbi-nlp/bluebert	Pre-trained language models for biomedical natural language processing tasks	560
cluebenchmark/electra	Trains and evaluates a Chinese language model using adversarial training on a large corpus.	140