llm_dataset_inference

Dataset checker

Detects whether a given text sequence is part of the training data used to train a large language model.

Official Repository for Dataset Inference for LLMs

GitHub

23 stars
1 watching
4 forks
Language: Jupyter Notebook
last commit: 4 months ago

Related projects:

Repository Description Stars
iamgroot42/mimir Measures memorization in Large Language Models (LLMs) to detect potential privacy issues 121
radi-cho/datasetgpt A command-line interface to generate textual datasets with Large Language Models 293
mlcommons/inference Measures the performance of deep learning models in various deployment scenarios. 1,236
pythainlp/prachathai-67k An article classification dataset created from news articles scraped from Prachathai.com with multiple benchmark models for multi-label classification 16
karthikncode/nlp-datasets A curated list of Natural Language Processing datasets used to train and evaluate NLP models. 919
jagilley/fact-checker A tool for fact-checking LLM outputs with self-ask using prompt chaining 286
snowflake-labs/snowflake-arctic A project providing optimized stacks for fine-tuning and inference of large language models, focusing on low-latency and high-throughput performance. 519
at-aaims/forge Pre-training large language models on scientific data for downstream applications 12
i-gallegos/fair-llm-benchmark Compiles bias evaluation datasets and provides access to original data sources for large language models 110
mirfan899/urdu A collection of Urdu language datasets for various NLP tasks and applications 71
mengtingwan/goodreads Provides code samples and notebooks to download, read, and analyze Goodreads datasets for research purposes. 251
bmander/busbuzzard Analyzes GPS data to infer probabilistic schedules from transit vehicle movements 10
ymcui/cmrc2018 A collection of data for evaluating Chinese machine reading comprehension systems 415
lter/lterdatasampler A collection of curated environmental datasets from US LTER sites, designed for teaching and training in data science. 48
truera/trulens A tool to evaluate and track the performance of large language model (LLM) experiments 2,163