datacardsplaybook

Dataset doc framework

A framework and toolkit for creating standardized documentation for machine learning datasets

The Data Cards Playbook helps dataset producers and publishers adopt a people-centered approach to transparency in dataset documentation.

GitHub

169 stars
15 watching
42 forks
Language: TypeScript
last commit: 6 months ago
data-carddatacardsplaybookdatasetsdocumentationtransparency

Related projects:

Repository Description Stars
ivylee/model-cards-and-datasheets A collection of documentation and resources for various machine learning models, including their architectures, applications, and usage examples. 71
packing-box/python-dsff A Python library for working with the DataSet File Format (DSFF), converting it to other formats, and handling packed datasets. 2
polymathorg/dataframe A Smalltalk-based implementation of tabular data structures for data analysis 74
nathanepstein/datakit A lightweight JavaScript framework for data analysis and manipulation 291
fielddb/datatags A system to assess and categorize the sensitivity and privacy risk of datasets 0
gopherdata/resources A collection of Go-based resources and tools for data science tasks 876
wildmeorg/wildbook Supports collaboration and automation in wildlife research and data analysis. 106
src-d/datasets Provides datasets and tools for analyzing source code in various aspects such as programming languages, commits, and more. 323
ymcui/cmrc2018 A collection of data for evaluating Chinese machine reading comprehension systems 415
mengtingwan/goodreads Provides code samples and notebooks to download, read, and analyze Goodreads datasets for research purposes. 251
packing-box/dataset-packed-elf A collection of packed ELF binaries used for training machine learning models to detect and analyze executable packing techniques 17
techascent/tech.ml.dataset A Clojure library for efficient tabular data processing and analysis 681
karthikncode/nlp-datasets A curated list of Natural Language Processing datasets used to train and evaluate NLP models. 919
sitecore/data-exchange-framework-docs A documentation project for an ETL tool used in Sitecore to exchange and process data 1
vincjo/datatables A toolkit for creating custom data table components with advanced features like filtering, sorting, and pagination 465