omniparse

Data processor

A platform for ingesting and processing unstructured data from various sources to generate structured, actionable data optimized for AI applications.

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

GitHub

6k stars
36 watching
460 forks
Language: Python
last commit: 18 days ago
ingestion-apiocromniparserparse-serverparser-libraryvision-transformerweb-crawlerwhisper-api

Related projects:

Repository Description Stars
quivrhq/quivr An AI-powered personal assistant framework that integrates various natural language models and databases to provide fast and efficient answers. 36,681
truefoundry/cognita A modular framework for building production-ready AI applications with integrated data management and model deployment capabilities 3,305
ory/keto A permission server built using Google's Zanzibar approach, supporting scalable and customizable access control with a flexible language. 4,838
llmware-ai/llmware A framework for building enterprise LLM-based applications using small, specialized models 6,651
unclecode/crawl4ai A tool for web crawling and data extraction, designed to work with large language models. 16,180
ombi-app/ombi Automatically syncs media request information with Plex/Emby servers. 3,737
orhanerday/open-ai A PHP SDK for accessing the OpenAI API and interacting with its GPT-3 and DALL-E services. 2,268
explodinggradients/ragas A toolkit for evaluating and optimizing Large Language Model applications with data-driven insights 7,233
scisharp/llamasharp A C#/.NET library to efficiently run Large Language Models (LLMs) on local devices 2,673
pathwaycom/llm-app Pre-built templates for integrating large language models into enterprise applications with real-time data APIs and various data sources. 4,642
google-ai-edge/mediapipe A platform providing pre-built machine learning models and APIs for cross-platform deployment on various devices 27,608
fastai/fastai Provides high-level components for practical deep learning tasks and low-level components for building new approaches 26,291
langgenius/dify An open-source LLM app development platform that enables users to build and deploy AI-powered applications quickly and efficiently. 51,873
infiniflow/infinity A high-performance database designed to support fast search and retrieval of dense vector, sparse vector, tensor, and full-text data 2,641
sinaptik-ai/pandas-ai Makes data analysis conversational using LLMs and natural language 13,516