snorkel

Training Data Manager

A system designed to streamline the process of creating and managing training data for machine learning models with weak supervision.

A system for quickly generating training data with weak supervision

GitHub

6k stars
166 watching
857 forks
Language: Python
last commit: 9 months ago
Linked from 3 awesome lists

aidata-augmentationdata-sciencedata-slicinglabelingmachine-learningpythonsnorkeltraining-dataweak-supervision

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
lightly-ai/lightly A Python library for self-supervised learning on images using contrastive learning and deep learning techniques. 3,204
pycaret/pycaret An automation tool for machine learning workflows in Python 9,026
trekhleb/homemade-machine-learning Practices implementing popular machine learning algorithms from scratch to gain a deeper understanding of their mathematics 23,191
neptune-ai/neptune-client An experiment tracker for machine learning model training that allows users to log and visualize their experiments in detail. 590
sdv-dev/sdv A library for generating synthetic tabular data based on real-world patterns 2,416
aimhubio/aim An experiment tracking tool designed to handle large numbers of training runs and provide a UI for exploring and comparing results. 5,261
graal-research/poutyne A PyTorch framework simplifying neural network training with automated boilerplate code and callback utilities 572
coqui-ai/stt A toolkit for building and deploying speech-to-text models using deep learning techniques 2,302
sakanaai/ai-scientist A system that enables large language models to conduct fully automated scientific discovery and generate research papers independently. 8,359
pythagora-io/gpt-pilot Researches AI-assisted development of fully working apps while human oversight is required 32,067
explosion/spacy Industrial-strength NLP library for Python and Cython 30,459
packtpublishing/hands-on-intelligent-agents-with-openai-gym Teaching software developers to build intelligent agents using deep reinforcement learning and OpenAI Gym 374
stvir/pysot A software system designed to support research in visual tracking using deep learning algorithms 4,452
clips/pattern A comprehensive Python module for web mining and analysis of text data. 8,758
instructor-ai/instructor A Python library that simplifies working with structured outputs from large language models 8,551