datasharing
Data sharing guide
A guide for sharing data with statisticians and data scientists. It standardizes data preparation and documentation so that analysis can be efficient and timely.
The Leek group guide to data sharing
7k stars
566 watching
244k forks
last commit: 4 months ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
yjs/yjs | A CRDT framework for building collaborative software, enabling real-time sharing and merging of data without conflicts. | 17,096 |
rhiever/data-analysis-and-machine-learning-projects | A repository of teaching materials, code, and data for various data analysis and machine learning projects. | 6,128 |
huggingface/datasets | A library for efficiently loading and manipulating datasets for machine learning models | 19,282 |
alexeygrigorev/data-science-interviews | A collection of interview questions and answers in data science | 8,963 |
jadianes/data-science-your-way | An introduction to data science concepts and applications in R and Python using hands-on tutorials | 596 |
gdsbook/book | An interactive introduction to geospatial data analysis using Python and Jupyter Notebook | 338 |
ujjwalkarn/datasciencer | A comprehensive resource for learning R programming and data science concepts | 2,012 |
juliaearth/geostatsimages.jl | Provides preprocessed data for geostatistical simulations in Julia. | 15 |
jakevdp/pythondatasciencehandbook | An online guide and set of executable Jupyter notebooks providing an introduction to core libraries for data science in Python. | 43,265 |
wesm/pydata-book | Materials and IPython notebooks for data analysis with Python | 22,248 |
sdv-dev/sdv | A library for generating synthetic tabular data based on real-world patterns | 2,380 |
datasciencespecialization/courses | Provides course materials and resources for learning data science fundamentals | 4,067 |
jtablesaw/tablesaw | A Java library for data manipulation and visualization | 3,551 |
simonw/datasette | An interactive platform for exploring and publishing data in various formats | 9,562 |
zjh-819/llmdatahub | A curated collection of high-quality datasets for training large language models. | 2,635 |