psmm

Language Model

An implementation of a neural network model for character-level language modeling.

GitHub

50 stars
7 watching
5 forks
Language: Python
last commit: over 6 years ago

Related projects:

Repository Description Stars
ieit-yuan/yuan2.0-m32 A high-performance language model designed to excel in tasks like natural language understanding, mathematical computation, and code generation 182
ibm-granite/granite-3.0-language-models A collection of lightweight state-of-the-art language models designed to support multilinguality, coding, and reasoning tasks on constrained resources. 232
csuhan/onellm A framework for training and fine-tuning multimodal language models on various data types 601
bilibili/index-1.9b A lightweight, multilingual language model with a long context length 920
luogen1996/lavin An open-source implementation of a vision-language instructed large language model 513
google-deepmind/recurrentgemma An implementation of a fast and efficient language model architecture 613
vhellendoorn/code-lms A guide to using pre-trained large language models in source code analysis and generation 1,786
multimodal-art-projection/map-neo A large language model designed for research and application in natural language processing tasks. 887
microsoft/mpnet Develops a method for pre-training language understanding models by combining masked and permuted techniques, and provides code for implementation and fine-tuning. 288
langboat/mengzi3 An 8B and 13B language model based on the Llama architecture with multilingual capabilities. 2,031
eleutherai/polyglot Large language models designed to perform well in multiple languages and address performance issues with current multilingual models. 476
yunwentechnology/unilm This project provides pre-trained models and tools for natural language understanding (NLU) and generation (NLG) tasks in Chinese. 439
xverse-ai/xverse-7b A multilingual large language model developed by XVERSE Technology Inc. 50
umass-foundation-model/3d-llm Developing a Large Language Model capable of processing 3D representations as inputs 979
shawn-ieitsystems/yuan-1.0 Large-scale language model with improved performance on NLP tasks through distributed training and efficient data processing 591