snorkel
Training Data Manager
A system designed to streamline the process of creating and managing training data for machine learning models with weak supervision.
A system for quickly generating training data with weak supervision
6k stars
166 watching
857 forks
Language: Python
last commit: 10 months ago
Linked from 3 awesome lists
aidata-augmentationdata-sciencedata-slicinglabelingmachine-learningpythonsnorkeltraining-dataweak-supervision
Related projects:
Repository | Description | Stars |
---|---|---|
| A Python library for self-supervised learning on images using contrastive learning and deep learning techniques. | 3,204 |
| An automation tool for machine learning workflows in Python | 9,026 |
| Practices implementing popular machine learning algorithms from scratch to gain a deeper understanding of their mathematics | 23,191 |
| An experiment tracker for machine learning model training that allows users to log and visualize their experiments in detail. | 590 |
| A library for generating synthetic tabular data based on real-world patterns | 2,416 |
| An experiment tracking tool designed to handle large numbers of training runs and provide a UI for exploring and comparing results. | 5,261 |
| A PyTorch framework simplifying neural network training with automated boilerplate code and callback utilities | 572 |
| A toolkit for building and deploying speech-to-text models using deep learning techniques | 2,302 |
| A system that enables large language models to conduct fully automated scientific discovery and generate research papers independently. | 8,359 |
| Researches AI-assisted development of fully working apps while human oversight is required | 32,067 |
| Industrial-strength NLP library for Python and Cython | 30,459 |
| Teaching software developers to build intelligent agents using deep reinforcement learning and OpenAI Gym | 374 |
| A software system designed to support research in visual tracking using deep learning algorithms | 4,452 |
| A comprehensive Python module for web mining and analysis of text data. | 8,758 |
| A Python library that simplifies working with structured outputs from large language models | 8,551 |