multifit

Fine-tuning

Reproduces results from a paper on efficient multi-lingual language model fine-tuning using a rewritten framework on top of the fastai library

The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" https://arxiv.org/abs/1909.04761

GitHub

284 stars

17 watching

56 forks

Language: Jupyter Notebook

last commit: over 5 years ago

fastaimultiple-languagesnlpulmfit

Related projects:

Repository	Description	Stars
roboflow/maestro	A tool to streamline fine-tuning of multimodal models for vision-language tasks	1,415
codefuse-ai/mftcoder	A framework for fine-tuning large language models with multiple tasks to improve their accuracy and efficiency	647
eleutherai/polyglot	Large language models designed to perform well in multiple languages and address performance issues with current multilingual models.	476
openai/finetune-transformer-lm	This project provides code and model for improving language understanding through generative pre-training using a transformer-based architecture.	2,167
openai/lm-human-preferences	Training methods and tools for fine-tuning language models using human preferences	1,240
ymcui/macbert	Improves pre-trained Chinese language models by incorporating a correction task to alleviate inconsistency issues with downstream tasks	646
jerry1993-tech/cornucopia-llama-fin-chinese	A Chinese finance-focused large language model fine-tuning framework	596
ys-zong/vlguard	Improves safety and helpfulness of large language models by fine-tuning them using safety-critical tasks	47
wenkehuang/rethinkfl	Improves federated learning performance by incorporating domain knowledge and regularization to adapt models across diverse domains	93
pku-yuangroup/languagebind	Extending pretraining models to handle multiple modalities by aligning language and video representations	751
babylonhealth/fasttext_multilingual	A repository providing aligned multilingual word vectors for 78 languages using the SVD method.	1,197
git-cloner/llama2-lora-fine-tuning	Fine-tuning the LLaMA 2 chat model using DeepSpeed and Lora for improved performance on a large dataset.	171
jshilong/gpt4roi	Training and deploying large language models on computer vision tasks using region-of-interest inputs	517
multimodal-art-projection/omnibench	Evaluates and benchmarks multimodal language models' ability to process visual, acoustic, and textual inputs simultaneously.	15
microsoft/archai	Automates the search for optimal neural network configurations in deep learning applications	468