kor

Text extractor

An open-source wrapper around LLMs to extract structured data from text

LLM(😽)

GitHub

2k stars
15 watching
90 forks
Language: Python
last commit: 8 days ago
information-extractionllmnatural-languagenatural-language-processingnatural-language-understanding

Related projects:

Repository Description Stars
geeks-of-data/knowledge-gpt Extracts and stores information from various sources using AI models to generate answers. 282
cognesy/instructor-php A PHP library that simplifies the integration of Large Language Models into applications by providing structured data extraction and validation. 222
karlicoss/kobuddy Extracts data from Kobo eReader databases for analysis and backup 151
recrm/archivetools A collection of tools for extracting and analyzing data from web archives 70
monarch-initiative/ontogpt An LLM-based tool for extracting structured information from text with ontology-based grounding. 613
bikash/documentunderstanding Research and development of tools and techniques for extracting information from images and PDFs using deep learning and graph neural networks. 96
wse-research/loris-llm-generated-representations-of-sparql-queries Generates natural language representations of SPARQL queries for knowledge graphs 3
quanteda/spacyr An R wrapper around spaCy for natural language processing tasks 251
nikolamilosevic86/tabinout A framework for extracting information from tables in scientific literature using a rule-based approach. 42
richardlitt/lrl Developing tools and scripts to extract data from low-resource languages, focusing on language processing and machine learning applications. 2
koraykv/fex A Lua-based library for feature extraction in computer vision applications using the SIFT algorithm 10
philipperemy/stanford-openie-python Provides a Python interface to extract structured relation triples from plain text using CoreNLP's open information extraction system. 637
microsoft/unicoder This repository provides pre-trained models and code for understanding and generation tasks in multiple languages. 88
stephenbrannon/iocextractor Extracts and organizes Indicators of Compromise from unstructured text files into structured formats. 135
lang-uk/ner-uk A Ukrainian NER corpus and annotation dataset for training and evaluating named entity recognition models. 90