grok-1

MoE model

An implementation of a Mixture of Experts (MoE) model with large parameters and specialized features for natural language processing tasks.

Grok open release

GitHub

50k stars
589 watching
8k forks
Language: Python
last commit: 5 months ago

Related projects:

Repository Description Stars
sanchit-gandhi/whisper-jax An optimized implementation of OpenAI's Whisper Model for speech recognition and speech-to-text tasks using JAX. 4,467
karpathy/mingpt A minimal PyTorch implementation of a transformer-based language model 20,474
openai/baselines High-quality implementations of reinforcement learning algorithms for research and development purposes 15,885
openai/tiktoken A fast and efficient tokeniser for natural language models based on Byte Pair Encoding (BPE) 12,703
gradio-app/gradio Enables rapid creation and deployment of web applications for machine learning models and functions using Python 34,557
ricklamers/gpt-code-ui An interactive code generation and execution tool using AI models 3,567
openai/gpt-2 A repository providing code and models for research into language modeling and multitask learning 22,644
brexhq/prompt-engineering Guides software developers on how to effectively use and build systems around Large Language Models like GPT-4. 8,487
minimaxir/gpt-2-simple A tool for retraining and fine-tuning the OpenAI GPT-2 text generation model on new datasets. 3,398
openai/gym A toolkit for developing and comparing reinforcement learning algorithms using a standardized API to interact with environments. 34,966
openai/whisper A general-purpose speech recognition system trained on large-scale weak supervision 72,752
ktr0731/evans A gRPC client library with two modes: REPL and CLI, providing automatic service inspection and task automation 4,304
openai/consistency_models A PyTorch-based framework for training and sampling consistency models in image generation 6,199
eriklindernoren/pytorch-yolov3 An implementation of YOLOv3 in PyTorch for object detection and tracking 7,343
flagai-open/flagai An open-source toolkit for training and deploying large-scale AI models on various downstream tasks with multi-modality 3,840