llm_dataset_inference

Dataset checker

Detects whether a given text sequence is part of the training data used to train a large language model.

Official Repository for Dataset Inference for LLMs

23 stars

1 watching

4 forks

Language: Jupyter Notebook

Last commit: almost 2 years ago

Related projects:

Repository	Description	Stars
iamgroot42/mimir	A Python package for measuring memorization in Large Language Models.	126
radi-cho/datasetgpt	A command-line interface to generate textual datasets with Large Language Models.	293
mlcommons/inference	Measures the performance of deep learning models in various deployment scenarios.	1,256
pythainlp/prachathai-67k	An article classification dataset built from news articles scraped from Prachathai.com, with multiple benchmark models for multi-label classification.	16
karthikncode/nlp-datasets	A curated list of Natural Language Processing datasets used to train and evaluate NLP models.	919
jagilley/fact-checker	A tool for fact-checking LLM outputs with self-ask prompting and prompt chaining.	289
snowflake-labs/snowflake-arctic	A project providing optimized stacks for fine-tuning and inference of large language models, focusing on low-latency and high-throughput performance.	525
at-aaims/forge	Pre-training large language models on scientific data for downstream applications.	12
i-gallegos/fair-llm-benchmark	Compiles bias evaluation datasets and provides access to original data sources for large language models.	115
mirfan899/urdu	A collection of Urdu language datasets for various NLP tasks and applications.	71
mengtingwan/goodreads	Provides code samples and notebooks to download, read, and analyze Goodreads datasets for research purposes.	252
bmander/busbuzzard	Analyzes GPS data to infer probabilistic schedules from transit vehicle movements.	10
ymcui/cmrc2018	A collection of data for evaluating Chinese machine reading comprehension systems.	419
lter/lterdatasampler	A collection of curated environmental datasets from US LTER sites, designed for teaching and training in data science.	48
truera/trulens	A tool to evaluate and track the performance of large language model (LLM) experiments.	2,233