omniparse

Data processor

A platform for ingesting and processing unstructured data from various sources to generate structured, actionable data optimized for AI applications.

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

GitHub

6k stars
36 watching
471 forks
Language: Python
last commit: 3 months ago
ingestion-apiocromniparserparse-serverparser-libraryvision-transformerweb-crawlerwhisper-api

Related projects:

Repository Description Stars
quivrhq/quivr An AI-powered personal assistant framework that integrates various natural language models and databases to provide fast and efficient answers. 36,913
truefoundry/cognita A modular framework for building production-ready AI applications with integrated data management and model deployment capabilities 3,401
ory/keto A permission server built using Google's Zanzibar approach, supporting scalable and customizable access control with a flexible language. 4,875
llmware-ai/llmware A framework for building enterprise LLM-based applications using small, specialized models 8,303
unclecode/crawl4ai A web crawling tool designed to extract structured data from the web for use in AI applications 18,541
ombi-app/ombi Automatically syncs media request information with Plex/Emby servers. 3,762
orhanerday/open-ai A PHP SDK for accessing the OpenAI API and interacting with its GPT-3 and DALL-E services. 2,277
explodinggradients/ragas A toolkit for evaluating and optimizing Large Language Model applications with objective metrics, test data generation, and seamless integrations. 7,598
scisharp/llamasharp An efficient C#/.NET library for running Large Language Models (LLMs) on local devices 2,750
pathwaycom/llm-app Provides pre-built AI application templates to integrate Large Language Models (LLMs) with various data sources for scalable RAG and enterprise search. 7,426
google-ai-edge/mediapipe A platform providing pre-built machine learning models and APIs for cross-platform deployment on various devices 27,962
fastai/fastai Provides high-level components for practical deep learning tasks and low-level components for building new approaches 26,390
langgenius/dify An open-source LLM app development platform that enables users to build and deploy AI-powered applications quickly and efficiently. 54,931
infiniflow/infinity A high-performance database designed to support the fast and efficient search of various data types in AI applications 2,780
sinaptik-ai/pandas-ai Makes data analysis conversational using LLMs and natural language 13,714