roformer-v2

Transformative Language Model

An improved version of a transformer-based language model with enhanced speed and accuracy through structural simplification and pre-training

RoFormer升级版

GitHub

148 stars
6 watching
15 forks
Language: Python
last commit: over 2 years ago

Related projects:

Repository Description Stars
zhuiyitechnology/roformer An enhanced transformer model with improved relative position embeddings for natural language processing tasks 837
zhuiyitechnology/roformer-sim An upgraded version of SimBERT with integrated retrieval and generation capabilities 441
zhuiyitechnology/gau-alpha An implementation of a transformer-based NLP model utilizing gated attention units 98
thudm/chinese-transformer-xl A pre-trained Chinese language model based on the Transformer-XL architecture. 218
tongjilibo/bert4torch An implementation of transformer models in PyTorch for natural language processing tasks 1,257
zhuiyitechnology/wobert A Word-based Chinese BERT model trained on large-scale text data using pre-trained models as a foundation 460
fastnlp/cpt A pre-trained transformer model for natural language understanding and generation tasks in Chinese 482
lucidrains/reformer-pytorch An implementation of Reformer, an efficient Transformer model for natural language processing tasks. 2,132
zhuiyitechnology/t5-pegasus Pretrained Chinese text generation model trained on large-scale data 558
ros2/geometry2 A set of libraries providing data structures and algorithms for managing coordinate transforms in robotics applications. 122
openai/finetune-transformer-lm This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. 2,167
yangjianxin1/ofa-chinese Transforms the OFA-Chinese model to work with the Hugging Face Transformers framework 123
ieit-yuan/yuan2.0-m32 A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation 182
xverse-ai/xverse-65b A large language model developed by XVERSE Technology Inc. using transformer architecture and fine-tuned on diverse data sets for various applications. 132
zmk5/jupyter-ros2 Enables interactive development and visualization of ROS2-based projects in Jupyter notebooks 29