bert-distillation
BERT distiller
A high-level API for distilling BERT models to create smaller, more efficient variants with reduced training time and improved inference speed.
Distillation of BERT model with catalyst framework
75 stars
4 watching
7 forks
Language: Python
last commit: over 1 year ago bertcatalystdistilbertdistillationnlprubert
Related projects:
Repository | Description | Stars |
---|---|---|
hobbitlong/repdistiller | A Python-based project implementing contrastive representation distillation and benchmarking recent knowledge distillation methods | 2,191 |
autodistill/autodistill | Automatically trains models from large foundation models to perform specific tasks with minimal human intervention. | 1,983 |
dbmdz/berts | Provides pre-trained language models for natural language processing tasks | 155 |
thunlp-aipoet/bert-ccpoem | A BERT-based pre-trained model for Chinese classical poetry | 145 |
zhuiyitechnology/wobert | A pre-trained Chinese language model that uses word embeddings and is designed to process Chinese text | 458 |
turkunlp/wikibert | Provides pre-trained language models derived from Wikipedia texts for natural language processing tasks | 34 |
yoshitomo-matsubara/torchdistill | A framework for designing and running deep learning experiments without writing code | 1,392 |
diogenes0319/fedmd_clean | An implementation of a heterogenous federated learning framework using model distillation. | 149 |
marscod/bert-sort | Automated sorting of data using a neural network model | 9 |
ncbi-nlp/bluebert | Pre-trained language models for biomedical natural language processing tasks | 558 |
kamalkraj/bert-ner | An implementation of named entity recognition using Google's BERT model for the CoNLL-2003 dataset and Python. | 1,211 |
mop/bier | This project implements a deep metric learning framework using an adversarial auxiliary loss to improve robustness. | 39 |
ag14774/diffdist | Enables backpropagation in distributed settings and facilitates model parallelism using differentiable communication between processes | 61 |
laurentmazare/ocaml-bert | Implementing BERT-like NLP models in OCaml using PyTorch bindings and pre-trained weights from popular sources. | 23 |
birch-san/diffusers | A toolkit for creating and manipulating state-of-the-art diffusion models in PyTorch | 8 |