lucenerevolution-2013

Linguistics toolkit

This project provides demo examples and tools for exploring linguistic features in Lucene and Solr, two popular search engine technologies.

Demo examples for linguistics in Lucene and Solr

GitHub

0 stars
2 watching
0 forks
Language: Java
last commit: over 11 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
fielddb/berlin-buzzwords-2013 Demos for multilingual text analysis using Lucene, Solr, ElasticSearch, and OpenNLP 0
jxxcarlson/l1 A language demo project showcasing fault-tolerant parsing techniques for a simple language with a Lisp-like syntax. 0
fielddb/lex4all Tool for automating pronunciation lexicon creation for low-resource languages using speech recognition and machine learning algorithms. 1
fielddb/multilingualcorporaextractor Extracts and formats multilingual corpora from international bibles into XML, JSON, and HTML files for analysis. 0
fielddb/lexiconwebservicesample A Node.js web server implementing a lexicon API for the Drag and Drop FieldLinguistics project 1
hit-scir/elmoformanylangs Provides pre-trained ELMo representations for multiple languages to improve NLP tasks. 1,463
pemistahl/lingua An accurate language detection library for Java and the JVM suitable for both short and long text inputs. 707
openmandrivaassociation/texlive-babel-galician Provides language-specific support for the TeXLive typesetting system 1
lowresourcelanguages/hltdi-morphology Provides morphological analysis tools for various languages, including verb and noun generation, based on archived web pages. 5
dativebase/old Software for creating collaborative databases of language data 1
fielddb/androidlanguagelessons An Android app that allows users to create custom language lessons using audio and visual vocabulary 2
tonianelope/multilingual-bert Investigating multilingual language models for Named Entity Recognition in German and English 14
pld-linux/apertium-dict-en-gl An English-Galician language translation dictionary for the Apertium platform. 1
fielddb/gamifypsycholinguisticsexperiments An experiment management platform designed to facilitate and analyze psychoinguistics experiments 0
lantip/baku-tidak-baku A repository of linguistic data for Indonesian words categorized as either standard or non-standard 29