lucenerevolution-2013

NLP demos

Demos and examples for utilizing linguistics in natural language processing with Lucene and Solr

Demo examples for linguistics in Lucene and Solr

GitHub

0 stars
2 watching
0 forks
Language: Java
last commit: over 11 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
fielddb/berlin-buzzwords-2013 Demos for multilingual text analysis using Lucene, Solr, ElasticSearch, and OpenNLP 0
jxxcarlson/l1 A language demo project showcasing fault-tolerant parsing techniques for a simple language with a Lisp-like syntax. 0
fielddb/lex4all Tool for automating pronunciation lexicon creation for low-resource languages using speech recognition and machine learning algorithms. 1
fielddb/multilingualcorporaextractor Extracts and formats multilingual corpora from international bibles into XML, JSON, and HTML files for analysis. 0
fielddb/lexiconwebservicesample A Node.js web server implementing a lexicon API for the Drag and Drop FieldLinguistics project 1
hit-scir/elmoformanylangs Provides pre-trained ELMo representations for multiple languages to improve NLP tasks. 1,462
pemistahl/lingua An accurate language detection library for Java and the JVM suitable for both short and long text inputs. 716
openmandrivaassociation/texlive-babel-galician Provides language-specific support for the TeXLive typesetting system 1
lowresourcelanguages/hltdi-morphology Provides morphological analysis tools for various languages, including verb and noun generation, based on archived web pages. 5
dativebase/old Software for creating collaborative databases of language data 1
fielddb/androidlanguagelessons An Android app that allows users to create custom language lessons using audio and visual vocabulary 2
tonianelope/multilingual-bert Investigating multilingual language models for Named Entity Recognition in German and English 14
pld-linux/apertium-dict-en-gl An English-Galician language translation dictionary for the Apertium platform. 1
fielddb/gamifypsycholinguisticsexperiments An experiment management platform designed to facilitate and analyze psychoinguistics experiments 0
lantip/baku-tidak-baku A repository of linguistic data for Indonesian words categorized as either standard or non-standard 29