thbert
Thai BERT
A pre-trained BERT model designed to facilitate NLP research and development with limited Thai language resources
Yet another pre-trained BERT particularly in Thai
6 stars
2 watching
0 forks
Language: Python
last commit: over 4 years ago Related projects:
Repository | Description | Stars |
---|---|---|
pythainlp/pythainlp | A Python package for text processing and linguistic analysis focused on Thai language | 993 |
zhuiyitechnology/wobert | A Word-based Chinese BERT model trained on large-scale text data using pre-trained models as a foundation | 460 |
pythainlp/lexicon-thai | A Thai language corpus and lexicon repository for natural language processing | 142 |
thunlp-aipoet/bert-ccpoem | A BERT-based pre-trained model for Chinese classical poetry | 146 |
allenai/scibert | A BERT model trained on scientific text for natural language processing tasks | 1,532 |
turkunlp/wikibert | Provides pre-trained language models derived from Wikipedia texts for natural language processing tasks | 34 |
wittawatj/jtcc | A Java library to tokenize Thai text into groups of characters | 18 |
tongjilibo/bert4torch | An implementation of transformer models in PyTorch for natural language processing tasks | 1,257 |
tal-tech/edu-bert | A pre-trained language model designed to improve natural language processing tasks in education | 186 |
kobkrit/tf-nlp-thai-word-embedding | An implementation of a word embedding technique using TensorFlow for Thai language processing | 11 |
wannaphong/thai-ner | Named Entity Recognition for Thai Text using PyThaiNLP and custom implementation. | 53 |
ymcui/chinese-mobilebert | An implementation of MobileBERT, a pre-trained language model, in Python for NLP tasks. | 81 |
vchahun/teny | Tools and techniques for improving machine translation in resource-constrained environments. | 3 |
ethan-yt/guwenbert | Pre-trained language model for classical Chinese texts using RoBERTa architecture | 511 |
ymcui/macbert | Improves pre-trained Chinese language models by incorporating a correction task to alleviate inconsistency issues with downstream tasks | 646 |