roformer-v2
Transformative Language Model
An improved version of a transformer-based language model with enhanced speed and accuracy through structural simplification and pre-training
RoFormer升级版
148 stars
6 watching
15 forks
Language: Python
last commit: over 2 years ago Related projects:
Repository | Description | Stars |
---|---|---|
zhuiyitechnology/roformer | An enhanced transformer model with improved relative position embeddings for natural language processing tasks | 837 |
zhuiyitechnology/roformer-sim | An upgraded version of SimBERT with integrated retrieval and generation capabilities | 441 |
zhuiyitechnology/gau-alpha | An implementation of a transformer-based NLP model utilizing gated attention units | 98 |
thudm/chinese-transformer-xl | A pre-trained Chinese language model based on the Transformer-XL architecture. | 218 |
tongjilibo/bert4torch | An implementation of transformer models in PyTorch for natural language processing tasks | 1,257 |
zhuiyitechnology/wobert | A Word-based Chinese BERT model trained on large-scale text data using pre-trained models as a foundation | 460 |
fastnlp/cpt | A pre-trained transformer model for natural language understanding and generation tasks in Chinese | 482 |
lucidrains/reformer-pytorch | An implementation of Reformer, an efficient Transformer model for natural language processing tasks. | 2,132 |
zhuiyitechnology/t5-pegasus | Pretrained Chinese text generation model trained on large-scale data | 558 |
ros2/geometry2 | A set of libraries providing data structures and algorithms for managing coordinate transforms in robotics applications. | 122 |
openai/finetune-transformer-lm | This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,167 |
yangjianxin1/ofa-chinese | Transforms the OFA-Chinese model to work with the Hugging Face Transformers framework | 123 |
ieit-yuan/yuan2.0-m32 | A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation | 182 |
xverse-ai/xverse-65b | A large language model developed by XVERSE Technology Inc. using transformer architecture and fine-tuned on diverse data sets for various applications. | 132 |
zmk5/jupyter-ros2 | Enables interactive development and visualization of ROS2-based projects in Jupyter notebooks | 29 |