unilm
Multimodal model trainer
Large-scale pre-training of general-purpose models across multiple tasks and modalities
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
20k stars
307 watching
3k forks
Language: Python
last commit: 12 days ago
Topics: beit, beit-3, bitnet, deepnet, document-ai, foundation-models, kosmos, kosmos-1, layoutlm, layoutxlm, llm, minilm, mllm, multimodal, nlp, pre-trained-model, textdiffuser, trocr, unilm, xlm-e
Related projects:
Repository | Description | Stars |
---|---|---|
microsoft/lmops | A research initiative focused on developing fundamental technology to improve the performance and efficiency of large language models. | 3,695 |
flagai-open/flagai | An open-source toolkit for training and deploying large-scale AI models on a variety of multimodal downstream tasks | 3,830 |
fminference/flexllmgen | Generates large language model outputs in high-throughput mode on single GPUs | 9,192 |
eleutherai/gpt-neox | Provides a framework for training large-scale language models on GPUs with advanced features and optimizations. | 6,941 |
dair-ai/ml-papers-explained | An explanation of key concepts and advancements in the field of Machine Learning | 7,315 |
optimalscale/lmflow | A toolkit for finetuning large language models and providing efficient inference capabilities | 8,273 |
microsoft/flaml | Automates machine learning workflows and optimizes model performance using large language models and efficient algorithms | 3,919 |
sakanaai/ai-scientist | A system that enables large language models to conduct fully automated scientific discovery and generate research papers independently. | 8,184 |
microsoft/deepspeed | A deep learning optimization library that makes distributed training and inference easy, efficient, and effective. | 35,463 |
deepseek-ai/deepseek-v2 | A high-performance mixture-of-experts language model with strong performance and efficient inference capabilities. | 3,590 |
brexhq/prompt-engineering | Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4. | 8,440 |
ai4finance-foundation/fingpt | A lightweight adaptation of large language models for financial applications | 14,022 |
microsoft/ai-for-beginners | An educational resource for learning the basics of Artificial Intelligence using practical lessons and code examples in Python | 34,875 |
salesforce/transmogrifai | An AutoML library that automates machine learning model development on Apache Spark with minimal hand-tuning | 2,244 |
eleutherai/lm-evaluation-harness | Provides a unified framework to test generative language models on various evaluation tasks. | 6,970 |