datachain

Data Warehouse

An AI-data warehouse that transforms and analyzes unstructured data from various formats

AI-data warehouse to enrich, transform and analyze unstructured data

GitHub

2k stars
17 watching
89 forks
Language: Python
last commit: 6 days ago
aicvdata-analyticsdata-wranglingembeddingsllmllm-evalmachine-learningmlopsmultimodal

Related projects:

Repository Description Stars
jaimegildesagredo/booby A Python library for defining and validating data structures with built-in support for complex data models and relationships. 177
dotchain/dotjs A distributed, reactive, and functional data structure library for JavaScript 8
databricks/lilac A tool to improve data quality and efficiency for large language models 969
datamol-io/datamol A Python library for manipulating molecules 469
h2oai/datatable A Python package for manipulating 2-dimensional tabular data structures with an emphasis on speed and big data support. 1,817
dataoneorg/d1_python A collection of Python libraries and tools for interacting with DataONE repositories 17
f483/btctxstore A library to store and retrieve data in Bitcoin transactions using OP_RETURN nulldata outputs. 10
hellokaton/anima A minimal Java library for simple database operations with a focus on ease of use and support for multiple databases and relational mappings. 228
accelerationnet/data-table Provides a data structure to represent tabular data in Common Lisp, enabling easy interaction with databases and report generation. 22
ujjwalkarn/datasciencepython A curated list of tutorials and resources for learning Python for data science, machine learning, and other related topics. 5,274
whitaker-io/machine A library for creating data workflows that can be simple or complex, with features like recursion and memoization. 158
sabiwara/aja An Elixir standard library extension focused on efficient data structures and manipulation 198
indy256/codelibrary A comprehensive collection of algorithms and data structures implemented in multiple programming languages 1,939
tiledb-inc/tiledb-py Provides a Python interface to store and manage large datasets in a distributed, columnar storage system. 190
scalamolecule/molecule A library that translates custom domain code to database queries for multiple databases 18