multifit
Fine-tuning
Reproduces results from a paper on efficient multi-lingual language model fine-tuning using a rewritten framework on top of the fastai library
Code to reproduce the results from the paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" (https://arxiv.org/abs/1909.04761)
284 stars
17 watching
56 forks
Language: Jupyter Notebook
Last commit: over 4 years ago
Topics: fastai, multiple-languages, nlp, ulmfit
Related projects:
Repository | Description | Stars |
---|---|---|
roboflow/maestro | A tool to streamline fine-tuning of multimodal models for vision-language tasks | 1,386 |
codefuse-ai/mftcoder | A framework for fine-tuning large language models with multiple tasks to improve their accuracy and efficiency | 637 |
eleutherai/polyglot | Large language models designed to perform well in multiple languages and address performance issues with current multilingual models. | 475 |
openai/finetune-transformer-lm | Provides code and a model for improving language understanding through generative pre-training with a transformer-based architecture. | 2,160 |
openai/lm-human-preferences | Training methods and tools for fine-tuning language models using human preferences | 1,229 |
ymcui/macbert | Improves pre-trained Chinese language models by incorporating a correction task to alleviate inconsistency issues with downstream tasks | 645 |
jerry1993-tech/cornucopia-llama-fin-chinese | A Chinese finance-focused large language model fine-tuning framework | 589 |
ys-zong/vlguard | Improves safety and helpfulness of large language models by fine-tuning them using safety-critical tasks | 45 |
wenkehuang/rethinkfl | Improves federated learning performance by incorporating domain knowledge and regularization to adapt models across diverse domains | 91 |
pku-yuangroup/languagebind | Extends pretrained models to handle multiple modalities by aligning language and video representations | 723 |
babylonhealth/fasttext_multilingual | A repository providing aligned multilingual word vectors for 78 languages using the SVD method. | 1,197 |
git-cloner/llama2-lora-fine-tuning | Fine-tuning the LLaMA 2 chat model using DeepSpeed and LoRA for improved performance on a large dataset. | 167 |
jshilong/gpt4roi | Training and deploying large language models on computer vision tasks using region-of-interest inputs | 506 |
multimodal-art-projection/omnibench | Evaluates and benchmarks multimodal language models' ability to process visual, acoustic, and textual inputs simultaneously. | 14 |
microsoft/archai | Automates the search for optimal neural network configurations in deep learning applications | 467 |