modelz-ChatGLM

Language model deployer

Provides code for deploying ChatGLM, a large-scale pre-trained language model, on Modelz.

Deploy ChatGLM on Modelz

15 stars
3 watching
3 forks
Language: Dockerfile
Last commit: over 1 year ago
Linked from 1 awesome list


Related projects:

| Repository | Description | Stars |
| --- | --- | --- |
| talkdai/dialog | An application framework to simplify the deployment and testing of large language models (LLMs) for natural language processing tasks | 377 |
| batzner/tensorlm | A library for text generation with recurrent neural networks using TensorFlow | 61 |
| balavenkatesh3322/model_deployment | Provides tools and frameworks for deploying machine learning models in production environments | 73 |
| mmuller88/alf-cdk-cognito | A CDK-based Cognito User Pool deployment project | 0 |
| iglaweb/tfprofiler | An app for profiling and optimizing the performance of TensorFlow Lite models on mobile devices | 27 |
| langchain-ai/langserve | Provides a REST API for deploying and managing LangChain runnables and chains | 1,944 |
| andrewnguonly/chatabstractions | Provides a framework for creating custom chat models with dynamic failover and load balancing features | 79 |
| german-nlp-group/german-transformer-training | Trains German transformer models to improve language understanding | 23 |
| nndeploy/nndeploy | An end-to-end model deployment framework providing cross-platform simplicity and high performance | 632 |
| mbzuai-oryx/groundinglmm | An end-to-end trained model capable of generating natural language responses integrated with object segmentation masks | 781 |
| tensorflow/tflite-support | A toolkit to deploy machine learning models on mobile devices using the TensorFlow Lite framework | 378 |
| kendryte/toucan-llm | A large language model with 70 billion parameters designed for chatbot and conversational AI tasks | 29 |
| mainframecomputer/fullmoon-ios | An iOS app that provides a chat interface to local large language models, optimized for Apple silicon | 410 |
| tensorflow/tfjs | An open-source JavaScript library for training and deploying machine learning models using WebGL acceleration | 18,495 |
| combust/mleap | Enables deployment of machine learning data pipelines and algorithms to production | 1,504 |