unilm

Multimodal model trainer

Large-scale pre-training of general-purpose models across multiple tasks and modalities

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

GitHub

20k stars

306 watching

3k forks

Language: Python

last commit: 7 months ago

beitbeit-3bitnetdeepnetdocument-aifoundation-modelskosmoskosmos-1layoutlmlayoutxlmllmminilmmllmmultimodalnlppre-trained-modeltextdiffusertrocrunilmxlm-e

aka.ms/GeneralAI

Related projects:

Repository	Description	Stars
microsoft/lmops	A research initiative focused on developing fundamental technology to improve the performance and efficiency of large language models.	3,747
flagai-open/flagai	An open-source toolkit for training and deploying large-scale AI models on various downstream tasks with multi-modality	3,840
fminference/flexllmgen	Generates large language model outputs in high-throughput mode on single GPUs	9,236
eleutherai/gpt-neox	Provides a framework for training large-scale language models on GPUs with advanced features and optimizations.	6,997
dair-ai/ml-papers-explained	An explanation of key concepts and advancements in the field of Machine Learning	7,352
optimalscale/lmflow	A toolkit for fine-tuning and inferring large machine learning models	8,312
microsoft/flaml	Automates machine learning workflows and optimizes model performance using large language models and efficient algorithms	3,968
sakanaai/ai-scientist	A system that enables large language models to conduct fully automated scientific discovery and generate research papers independently.	8,359
microsoft/deepspeed	A deep learning optimization library that simplifies distributed training and inference on modern computing hardware.	35,863
deepseek-ai/deepseek-v2	A high-performance mixture-of-experts language model with strong performance and efficient inference capabilities.	3,758
brexhq/prompt-engineering	Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4.	8,487
ai4finance-foundation/fingpt	Developing a lightweight adaptation of large language models for financial applications	14,384
microsoft/ai-for-beginners	A 12-week curriculum teaching the basics of Artificial Intelligence through practical lessons, quizzes, and labs using popular frameworks like TensorFlow and PyTorch.	35,160
salesforce/transmogrifai	An AutoML library that automates machine learning model development on Apache Spark with minimal hand-tuning	2,248
eleutherai/lm-evaluation-harness	Provides a unified framework to test generative language models on various evaluation tasks.	7,200