snorkel

Training Data Manager

A system designed to streamline the process of creating and managing training data for machine learning models with weak supervision.

A system for quickly generating training data with weak supervision

GitHub

6k stars
166 watching
857 forks
Language: Python
last commit: 7 months ago
Linked from 3 awesome lists

aidata-augmentationdata-sciencedata-slicinglabelingmachine-learningpythonsnorkeltraining-dataweak-supervision

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
lightly-ai/lightly An open-source framework for self-supervised learning on images using deep learning techniques. 3,165
pycaret/pycaret An automation tool for machine learning workflows in Python 8,955
trekhleb/homemade-machine-learning Practices implementing popular machine learning algorithms from scratch to gain a deeper understanding of their mathematics 23,121
neptune-ai/neptune-client An experiment tracker for machine learning model training that allows users to log and visualize their experiments in detail. 584
sdv-dev/sdv A library for generating synthetic tabular data based on real-world patterns 2,380
aimhubio/aim An experiment tracking tool designed to handle large numbers of training runs and provide a UI for exploring and comparing results. 5,220
graal-research/poutyne A PyTorch framework simplifying neural network training with automated boilerplate code and callback utilities 569
coqui-ai/stt A toolkit for building and deploying speech-to-text models using deep learning techniques 2,283
sakanaai/ai-scientist A system that enables large language models to conduct fully automated scientific discovery and generate research papers independently. 8,184
pythagora-io/gpt-pilot Researches AI-assisted development of fully working apps while human oversight is required 31,900
explosion/spacy Industrial-strength NLP library for Python and Cython 30,230
packtpublishing/hands-on-intelligent-agents-with-openai-gym Teaching software developers to build intelligent agents using deep reinforcement learning and OpenAI Gym 373
stvir/pysot A software system designed to support research in visual tracking using deep learning algorithms 4,438
clips/pattern A comprehensive Python module for web mining and analysis of text data. 8,750
instructor-ai/instructor A Python library that provides structured outputs from large language models (LLMs) and facilitates seamless integration with various LLM providers. 8,163