unilm

Multimodal model trainer

Large-scale pre-training of general-purpose models across multiple tasks and modalities

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

GitHub

20k stars
306 watching
3k forks
Language: Python
last commit: about 1 month ago
beitbeit-3bitnetdeepnetdocument-aifoundation-modelskosmoskosmos-1layoutlmlayoutxlmllmminilmmllmmultimodalnlppre-trained-modeltextdiffusertrocrunilmxlm-e

Related projects:

Repository Description Stars
microsoft/lmops A research initiative focused on developing fundamental technology to improve the performance and efficiency of large language models. 3,747
flagai-open/flagai An open-source toolkit for training and deploying large-scale AI models on various downstream tasks with multi-modality 3,840
fminference/flexllmgen Generates large language model outputs in high-throughput mode on single GPUs 9,236
eleutherai/gpt-neox Provides a framework for training large-scale language models on GPUs with advanced features and optimizations. 6,997
dair-ai/ml-papers-explained An explanation of key concepts and advancements in the field of Machine Learning 7,352
optimalscale/lmflow A toolkit for fine-tuning and inferring large machine learning models 8,312
microsoft/flaml Automates machine learning workflows and optimizes model performance using large language models and efficient algorithms 3,968
sakanaai/ai-scientist A system that enables large language models to conduct fully automated scientific discovery and generate research papers independently. 8,359
microsoft/deepspeed A deep learning optimization library that simplifies distributed training and inference on modern computing hardware. 35,863
deepseek-ai/deepseek-v2 A high-performance mixture-of-experts language model with strong performance and efficient inference capabilities. 3,758
brexhq/prompt-engineering Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4. 8,487
ai4finance-foundation/fingpt Developing a lightweight adaptation of large language models for financial applications 14,384
microsoft/ai-for-beginners A 12-week curriculum teaching the basics of Artificial Intelligence through practical lessons, quizzes, and labs using popular frameworks like TensorFlow and PyTorch. 35,160
salesforce/transmogrifai An AutoML library that automates machine learning model development on Apache Spark with minimal hand-tuning 2,248
eleutherai/lm-evaluation-harness Provides a unified framework to test generative language models on various evaluation tasks. 7,200