datacardsplaybook
Dataset doc framework
A framework and toolkit for creating standardized documentation for machine learning datasets
The Data Cards Playbook helps dataset producers and publishers adopt a people-centered approach to transparency in dataset documentation.
169 stars
15 watching
42 forks
Language: TypeScript
last commit: 6 months ago data-carddatacardsplaybookdatasetsdocumentationtransparency
Related projects:
Repository | Description | Stars |
---|---|---|
ivylee/model-cards-and-datasheets | A collection of documentation and resources for various machine learning models, including their architectures, applications, and usage examples. | 71 |
packing-box/python-dsff | A Python library for working with the DataSet File Format (DSFF), converting it to other formats, and handling packed datasets. | 2 |
polymathorg/dataframe | A Smalltalk-based implementation of tabular data structures for data analysis | 74 |
nathanepstein/datakit | A lightweight JavaScript framework for data analysis and manipulation | 291 |
fielddb/datatags | A system to assess and categorize the sensitivity and privacy risk of datasets | 0 |
gopherdata/resources | A collection of Go-based resources and tools for data science tasks | 876 |
wildmeorg/wildbook | Supports collaboration and automation in wildlife research and data analysis. | 106 |
src-d/datasets | Provides datasets and tools for analyzing source code in various aspects such as programming languages, commits, and more. | 323 |
ymcui/cmrc2018 | A collection of data for evaluating Chinese machine reading comprehension systems | 415 |
mengtingwan/goodreads | Provides code samples and notebooks to download, read, and analyze Goodreads datasets for research purposes. | 251 |
packing-box/dataset-packed-elf | A collection of packed ELF binaries used for training machine learning models to detect and analyze executable packing techniques | 17 |
techascent/tech.ml.dataset | A Clojure library for efficient tabular data processing and analysis | 681 |
karthikncode/nlp-datasets | A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919 |
sitecore/data-exchange-framework-docs | A documentation project for an ETL tool used in Sitecore to exchange and process data | 1 |
vincjo/datatables | A toolkit for creating custom data table components with advanced features like filtering, sorting, and pagination | 465 |