roformer-sim
SimBERT variant
An upgraded version of SimBERT with integrated retrieval and generation capabilities
An upgraded version of SimBERT (SimBERTv2)!
441 stars
5 watching
73 forks
Language: Python
last commit: almost 3 years ago

Related projects:
Repository | Description | Stars |
---|---|---|
zhuiyitechnology/roformer-v2 | An improved version of a transformer-based language model with enhanced speed and accuracy through structural simplification and pre-training | 148 |
zhuiyitechnology/roformer | An enhanced transformer model with improved relative position embeddings for natural language processing tasks | 837 |
zhuiyitechnology/wobert | A word-based Chinese BERT model trained on large-scale text data, using pre-trained models as a foundation | 460 |
ieit-yuan/yuan2.0-m32 | A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation | 182 |
yunwentechnology/unilm | This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese. | 439 |
sww9370/rocbert | A pre-trained Chinese language model designed to be robust against maliciously crafted texts | 15 |
zhuiyitechnology/t5-pegasus | Pretrained Chinese text generation model trained on large-scale data | 558 |
xverse-ai/xverse-13b | A large language model developed to support multiple languages and applications | 648 |
langboat/mengzi3 | 8B and 13B language models based on the Llama architecture, with multilingual capabilities | 2,031 |
xverse-ai/xverse-65b | A large language model developed by XVERSE Technology Inc. using transformer architecture and fine-tuned on diverse data sets for various applications. | 132 |
zhuiyitechnology/pretrained-models | A collection of pre-trained language models for natural language processing tasks | 989 |
xverse-ai/xverse-moe-a36b | A large multilingual language model built on a mixture-of-experts (MoE) architecture | 37 |
zhuiyitechnology/gau-alpha | An implementation of a transformer-based NLP model utilizing gated attention units | 98 |
tencent/tencent-hunyuan-large | This project makes a large language model accessible for research and development | 1,245 |
shawn-ieitsystems/yuan-1.0 | Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing | 591 |