roformer-sim

Model upgrader

An upgraded version of SimBERT model with integrated retrieval and generation capabilities

SimBERT升级版(SimBERTv2)!

GitHub

438 stars
5 watching
73 forks
Language: Python
last commit: over 2 years ago

Related projects:

Repository Description Stars
zhuiyitechnology/roformer-v2 A faster and more effective text processing model based on the RoFormer architecture 149
zhuiyitechnology/roformer An enhanced transformer model with improved relative position embeddings for natural language processing tasks 819
zhuiyitechnology/wobert A pre-trained Chinese language model that uses word embeddings and is designed to process Chinese text 458
ieit-yuan/yuan2.0-m32 A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation 180
yunwentechnology/unilm This project provides pre-trained models for natural language understanding and generation tasks using the UniLM architecture. 438
sww9370/rocbert A pre-trained Chinese language model designed to be robust against maliciously crafted texts 15
zhuiyitechnology/t5-pegasus Chinese generation model based on T5 architecture, trained using PEGASUS method 555
xverse-ai/xverse-13b A large language model developed to support multiple languages and applications 649
langboat/mengzi3 An 8B and 13B language model based on the Llama architecture with multilingual capabilities. 2,032
xverse-ai/xverse-65b A large language model developed by XVERSE Technology Inc. using transformer architecture and fine-tuned on diverse data sets for various applications. 132
zhuiyitechnology/pretrained-models A collection of pre-trained language models for natural language processing tasks 987
xverse-ai/xverse-moe-a36b Develops and publishes large multilingual language models with advanced mixing-of-experts architecture. 36
zhuiyitechnology/gau-alpha An implementation of a Gated Attention Unit-based Transformer model for natural language processing tasks 96
tencent/tencent-hunyuan-large This project makes a large language model accessible for research and development 1,114
shawn-ieitsystems/yuan-1.0 Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing 591