multifit
Fine-tuning
Reproduces results from a paper on efficient multi-lingual language model fine-tuning using a rewritten framework on top of the fastai library
The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" https://arxiv.org/abs/1909.04761
284 stars
17 watching
56 forks
Language: Jupyter Notebook
last commit: over 4 years ago fastaimultiple-languagesnlpulmfit
Related projects:
Repository | Description | Stars |
---|---|---|
| A tool to streamline fine-tuning of multimodal models for vision-language tasks | 1,415 |
| A framework for fine-tuning large language models with multiple tasks to improve their accuracy and efficiency | 647 |
| Large language models designed to perform well in multiple languages and address performance issues with current multilingual models. | 476 |
| This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture. | 2,167 |
| Training methods and tools for fine-tuning language models using human preferences | 1,240 |
| Improves pre-trained Chinese language models by incorporating a correction task to alleviate inconsistency issues with downstream tasks | 646 |
| A Chinese finance-focused large language model fine-tuning framework | 596 |
| Improves safety and helpfulness of large language models by fine-tuning them using safety-critical tasks | 47 |
| Improves federated learning performance by incorporating domain knowledge and regularization to adapt models across diverse domains | 93 |
| Extending pretraining models to handle multiple modalities by aligning language and video representations | 751 |
| A repository providing aligned multilingual word vectors for 78 languages using the SVD method. | 1,197 |
| Fine-tuning the LLaMA 2 chat model using DeepSpeed and Lora for improved performance on a large dataset. | 171 |
| Training and deploying large language models on computer vision tasks using region-of-interest inputs | 517 |
| Evaluates and benchmarks multimodal language models' ability to process visual, acoustic, and textual inputs simultaneously. | 15 |
| Automates the search for optimal neural network configurations in deep learning applications | 468 |