kor

Text extractor

An open-source wrapper around LLMs to extract structured data from text

LLM(😽)

GitHub

2k stars

15 watching

91 forks

Language: Python

last commit: over 1 year ago

information-extractionllmnatural-languagenatural-language-processingnatural-language-understanding

eyurtsev.github.io/kor/

Related projects:

Repository	Description	Stars
geeks-of-data/knowledge-gpt	Extracts and stores information from various sources using AI models to generate answers.	283
cognesy/instructor-php	A PHP library that simplifies the integration of Large Language Models into applications by providing structured data extraction and validation.	230
karlicoss/kobuddy	Extracts data from Kobo eReader databases for analysis and backup	152
recrm/archivetools	A collection of tools for extracting and analyzing data from web archives	71
monarch-initiative/ontogpt	An LLM-based tool for extracting structured information from text with ontology-based grounding.	626
bikash/documentunderstanding	Research and development of tools and techniques for extracting information from images and PDFs using deep learning and graph neural networks.	96
wse-research/loris-llm-generated-representations-of-sparql-queries	Generates natural language representations of SPARQL queries for knowledge graphs	3
quanteda/spacyr	An R wrapper around spaCy for natural language processing tasks	251
nikolamilosevic86/tabinout	A framework for extracting information from tables in scientific literature using a rule-based approach.	42
richardlitt/lrl	Developing tools and scripts to extract data from low-resource languages, focusing on language processing and machine learning applications.	2
koraykv/fex	A Lua-based library for feature extraction in computer vision applications using the SIFT algorithm	10
philipperemy/stanford-openie-python	Provides a Python interface to extract structured relation triples from plain text using CoreNLP's open information extraction system.	639
microsoft/unicoder	This repository provides pre-trained models and code for understanding and generation tasks in multiple languages.	89
stephenbrannon/iocextractor	Extracts and organizes Indicators of Compromise from unstructured text files into structured formats.	135
lang-uk/ner-uk	A Ukrainian NER corpus and annotation dataset for training and evaluating named entity recognition models.	90