llm_dataset_inference
Dataset checker
Detects whether a given text sequence is part of the training data used to train a large language model.
Official Repository for Dataset Inference for LLMs
23 stars
1 watching
4 forks
Language: Jupyter Notebook
last commit: 4 months ago

Related projects:

Repository | Description | Stars |
---|---|---|
iamgroot42/mimir | Measures memorization in Large Language Models (LLMs) to detect potential privacy issues | 121 |
radi-cho/datasetgpt | A command-line interface to generate textual datasets with Large Language Models | 293 |
mlcommons/inference | Measures the performance of deep learning models in various deployment scenarios. | 1,236 |
pythainlp/prachathai-67k | An article classification dataset created from news articles scraped from Prachathai.com with multiple benchmark models for multi-label classification | 16 |
karthikncode/nlp-datasets | A curated list of Natural Language Processing datasets used to train and evaluate NLP models. | 919 |
jagilley/fact-checker | A tool for fact-checking LLM outputs with self-ask using prompt chaining | 286 |
snowflake-labs/snowflake-arctic | A project providing optimized stacks for fine-tuning and inference of large language models, focusing on low-latency and high-throughput performance. | 519 |
at-aaims/forge | Pre-training large language models on scientific data for downstream applications | 12 |
i-gallegos/fair-llm-benchmark | Compiles bias evaluation datasets and provides access to original data sources for large language models | 110 |
mirfan899/urdu | A collection of Urdu language datasets for various NLP tasks and applications | 71 |
mengtingwan/goodreads | Provides code samples and notebooks to download, read, and analyze Goodreads datasets for research purposes. | 251 |
bmander/busbuzzard | Analyzes GPS data to infer probabilistic schedules from transit vehicle movements | 10 |
ymcui/cmrc2018 | A collection of data for evaluating Chinese machine reading comprehension systems | 415 |
lter/lterdatasampler | A collection of curated environmental datasets from US LTER sites, designed for teaching and training in data science. | 48 |
truera/trulens | A tool to evaluate and track the performance of large language model (LLM) experiments | 2,163 |