multifit

Fine-tuning

Reproduces the results of a paper on efficient multi-lingual language model fine-tuning, using a rewritten framework built on top of the fastai library.

Code to reproduce the results from the paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning": https://arxiv.org/abs/1909.04761

GitHub

284 stars
17 watching
56 forks
Language: Jupyter Notebook
Last commit: over 4 years ago
Topics: fastai, multiple-languages, nlp, ulmfit

Related projects:

Repository | Description | Stars
roboflow/maestro | A tool to streamline fine-tuning of multimodal models for vision-language tasks | 1,386
codefuse-ai/mftcoder | A framework for fine-tuning large language models with multiple tasks to improve their accuracy and efficiency | 637
eleutherai/polyglot | Large language models designed to perform well in multiple languages and address performance issues with current multilingual models | 475
openai/finetune-transformer-lm | Code and model for improving language understanding through generative pre-training using a transformer-based architecture | 2,160
openai/lm-human-preferences | Training methods and tools for fine-tuning language models using human preferences | 1,229
ymcui/macbert | Improves pre-trained Chinese language models by incorporating a correction task to alleviate inconsistency issues with downstream tasks | 645
jerry1993-tech/cornucopia-llama-fin-chinese | A Chinese finance-focused large language model fine-tuning framework | 589
ys-zong/vlguard | Improves safety and helpfulness of large language models by fine-tuning them using safety-critical tasks | 45
wenkehuang/rethinkfl | Improves federated learning performance by incorporating domain knowledge and regularization to adapt models across diverse domains | 91
pku-yuangroup/languagebind | Extends pretraining models to handle multiple modalities by aligning language and video representations | 723
babylonhealth/fasttext_multilingual | Aligned multilingual word vectors for 78 languages using the SVD method | 1,197
git-cloner/llama2-lora-fine-tuning | Fine-tuning the LLaMA 2 chat model using DeepSpeed and LoRA for improved performance on a large dataset | 167
jshilong/gpt4roi | Training and deploying large language models on computer vision tasks using region-of-interest inputs | 506
multimodal-art-projection/omnibench | Evaluates and benchmarks multimodal language models' ability to process visual, acoustic, and textual inputs simultaneously | 14
microsoft/archai | Automates the search for optimal neural network configurations in deep learning applications | 467