grok-1

MoE model

An implementation of a Mixture of Experts (MoE) model with large parameters and specialized features for natural language processing tasks.

Grok open release

GitHub

50k stars
571 watching
8k forks
Language: Python
last commit: 3 months ago

Related projects:

Repository Description Stars
sanchit-gandhi/whisper-jax An optimized implementation of OpenAI's Whisper Model for speech recognition and speech-to-text tasks using JAX. 4,444
karpathy/mingpt A minimal PyTorch implementation of a transformer-based language model 20,175
openai/baselines High-quality implementations of reinforcement learning algorithms for research and development purposes 15,810
openai/tiktoken A fast and efficient tokeniser for natural language models based on Byte Pair Encoding (BPE) 12,420
gradio-app/gradio Enables rapid creation and deployment of web applications for machine learning models and functions using Python 33,962
ricklamers/gpt-code-ui An interactive code generation and execution tool using AI models 3,561
openai/gpt-2 A repository providing code and models for research into language modeling and multitask learning 22,516
brexhq/prompt-engineering Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4. 8,440
minimaxir/gpt-2-simple A tool for retraining and fine-tuning the OpenAI GPT-2 text generation model on new datasets. 3,397
openai/gym A toolkit for developing and comparing reinforcement learning algorithms using a standardized API to interact with environments. 34,798
openai/whisper A general-purpose speech recognition system trained on large-scale weak supervision 71,257
ktr0731/evans A gRPC client library with two modes: REPL and CLI, providing automatic service inspection and task automation 4,282
openai/consistency_models A PyTorch-based framework for training and sampling consistency models in image generation 6,166
eriklindernoren/pytorch-yolov3 An implementation of YOLOv3 in PyTorch for object detection and tracking 7,324
flagai-open/flagai An open-source toolkit for training and deploying large-scale AI models on various downstream tasks with multi-modality 3,830