tabred

Tabular model benchmark

An effort to analyze pitfalls and improve the evaluation of tabular deep learning models by creating a new benchmark with real-world datasets

TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks

GitHub

56 stars
3 watching
2 forks
Language: Python
last commit: 7 days ago
benchmarksmachine-learningresearchtabular-data

Related projects:

Repository Description Stars
yandex/rep A toolset for building and running reproducible machine learning experiments in Python 689
manujosephv/pytorch_tabular A deep learning framework specifically designed for tabular data, providing a standardized approach to modeling and deploying complex machine learning models. 1,382
juliadata/dataframes.jl A package providing tools and data structures for efficiently working with tabular data in Julia. 1,738
yandexdataschool/ysda_deeplearning17 Repository containing lecture and seminar materials for a deep learning course taught in 2017 116
datonic/datadex A platform for collaborative open data management and analysis 260
rajasegar/htmx-tabular An application that demonstrates tabular data display with features like search, sorting, and pagination using the htmx library in Node. 8
hepdata/hepdata A web application for managing and sharing high-energy physics data from experiments 41
pyg-team/pytorch-frame A deep learning framework for handling heterogeneous tabular data with diverse column types 543
awesomedata/awesome-public-datasets A curated collection of high-quality public datasets organized by topic. 60,953
mohamedadaly/labr A dataset of Arabic book reviews for natural language processing tasks 44
rucaibox/recsysdatasets A repository of public data sources for Recommender Systems. 856
cidree/forestdata A package providing easy access to forestry and land use datasets. 13
jxshin/mzdata A comprehensive dataset of Mozilla issue tracking history, providing multiple extracts and levels for analysis. 7
gopherdata/resources A collection of Go-based resources and tools for data science tasks 876
datasciencemasters/data A curated list of accessible data sources with clear licensing terms and moderate restrictions. 506