omniparse
Data processor
A platform for ingesting and processing unstructured data from various sources to generate structured, actionable data optimized for AI applications.
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
6k stars
36 watching
460 forks
Language: Python
last commit: 18 days ago ingestion-apiocromniparserparse-serverparser-libraryvision-transformerweb-crawlerwhisper-api
Related projects:
Repository | Description | Stars |
---|---|---|
quivrhq/quivr | An AI-powered personal assistant framework that integrates various natural language models and databases to provide fast and efficient answers. | 36,681 |
truefoundry/cognita | A modular framework for building production-ready AI applications with integrated data management and model deployment capabilities | 3,305 |
ory/keto | A permission server built using Google's Zanzibar approach, supporting scalable and customizable access control with a flexible language. | 4,838 |
llmware-ai/llmware | A framework for building enterprise LLM-based applications using small, specialized models | 6,651 |
unclecode/crawl4ai | A tool for web crawling and data extraction, designed to work with large language models. | 16,180 |
ombi-app/ombi | Automatically syncs media request information with Plex/Emby servers. | 3,737 |
orhanerday/open-ai | A PHP SDK for accessing the OpenAI API and interacting with its GPT-3 and DALL-E services. | 2,268 |
explodinggradients/ragas | A toolkit for evaluating and optimizing Large Language Model applications with data-driven insights | 7,233 |
scisharp/llamasharp | A C#/.NET library to efficiently run Large Language Models (LLMs) on local devices | 2,673 |
pathwaycom/llm-app | Pre-built templates for integrating large language models into enterprise applications with real-time data APIs and various data sources. | 4,642 |
google-ai-edge/mediapipe | A platform providing pre-built machine learning models and APIs for cross-platform deployment on various devices | 27,608 |
fastai/fastai | Provides high-level components for practical deep learning tasks and low-level components for building new approaches | 26,291 |
langgenius/dify | An open-source LLM app development platform that enables users to build and deploy AI-powered applications quickly and efficiently. | 51,873 |
infiniflow/infinity | A high-performance database designed to support fast search and retrieval of dense vector, sparse vector, tensor, and full-text data | 2,641 |
sinaptik-ai/pandas-ai | Makes data analysis conversational using LLMs and natural language | 13,516 |