roformer-sim

SimBERT variant

An upgraded version of SimBERT with integrated retrieval and generation capabilities

An upgraded version of SimBERT (SimBERTv2)!

441 stars

5 watching

73 forks

Language: Python

last commit: over 3 years ago

Related projects:

Repository	Description	Stars
zhuiyitechnology/roformer-v2	An improved version of a transformer-based language model with enhanced speed and accuracy through structural simplification and pre-training	148
zhuiyitechnology/roformer	An enhanced transformer model with improved relative position embeddings for natural language processing tasks	837
zhuiyitechnology/wobert	A word-based Chinese BERT model trained on large-scale text data using pre-trained models as a foundation	460
ieit-yuan/yuan2.0-m32	A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation	182
yunwentechnology/unilm	Pre-trained models and tools for Chinese natural language understanding (NLU) and generation (NLG) tasks	439
sww9370/rocbert	A pre-trained Chinese language model designed to be robust against maliciously crafted texts	15
zhuiyitechnology/t5-pegasus	Pretrained Chinese text generation model trained on large-scale data	558
xverse-ai/xverse-13b	A large language model developed to support multiple languages and applications	648
langboat/mengzi3	Language models in 8B and 13B sizes, based on the Llama architecture, with multilingual capabilities	2,031
xverse-ai/xverse-65b	A large language model developed by XVERSE Technology Inc. using transformer architecture and fine-tuned on diverse data sets for various applications.	132
zhuiyitechnology/pretrained-models	A collection of pre-trained language models for natural language processing tasks	989
xverse-ai/xverse-moe-a36b	Large multilingual language models built on a mixture-of-experts architecture	37
zhuiyitechnology/gau-alpha	An implementation of a transformer-based NLP model utilizing gated attention units	98
tencent/tencent-hunyuan-large	This project makes a large language model accessible for research and development	1,245
shawn-ieitsystems/yuan-1.0	Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing	591