awesome-community-curated-nlp

NLP toolkit collection

A curated list of NLP tools and libraries

Community Curated NLP List

GitHub

196 stars
20 watching
33 forks
last commit: over 2 years ago
Linked from 1 awesome list

awesome-listcommunity-drivennlp

Speech NLP

Kaldi
CMU Sphinx
Julius
RWTH ASR
Merlin
Festival
HTS
eSpeak
Covarep 351 over 4 years ago
Free TTS
Ekho
Hidden Markov Model Toolkit
Praat
https://en.wikipedia.org/wiki/Comparison_of_speech_synthesizers
https://en.wikipedia.org/wiki/List_of_speech_recognition_software

Text NLP Suites

NLTK 13,646 12 days ago
Gensim
SpaCy
Stanford CoreNLP
Freeling
OpenNLP
DKPro
PyNLPl 479 about 1 year ago
IXA Pipes
NLP4J
CogComp's NLP libraries 473 over 1 year ago
Stanbol NLP
LIMA
Corpus.Tools
NooJ
SALAT

Language Specific Text NLP Suites

SAFAR : Software Architecture For Arabic language pRocessing
PyCantonese : Cantonese Linguistics and NLP in Python
SnowNLP 6,440 almost 5 years ago : Simplified Chinese Text Processing
Hazm 1,212 4 months ago : Python library for digesting Persian text
Frog : An advanced NLP suite for Dutch
Tint : Lend color to your Italian texts!
KoNLPy : Korean NLP in Python

Pre-processing (Tokenization / Stemming / POS Tagging / etc.)

Colibri Core C++ and Python tools for n-grams and skipgrams
Snowball Stemmers
SerbianStemmer 16 about 4 years ago
Whoosh Stemmers
Elephant : Sequence labeling for word and sentence segmentation
Toktok 28 over 7 years ago : A fast, simple, multilingual tokenizer
Ucto : An advanced rule-based unicode-aware tokenizer

Deep Linguistic Processing

DELPH-IN : Deep Linguistic Processing with HPSG
English Resource Grammar
CCG2PST 11 over 1 year ago : A tool for converting CCG derivations into PTB-style phrase structure trees

Word Embeddings

Word2Vec
GloVe
COMPOSE
Polyglot
FastText 25,950 8 months ago

Twitter

PyTweet 2 over 12 years ago
Twitter4J
Affective Tweets

Task Specific

KNEWS 82 almost 2 years ago : Knowledge Extraction With Semantics
Deep Dive : Relation Extraction
Berkley Coreference Error Analyser 29 over 1 year ago A tool for classifying errors in coreference resolution
Berkley Parse Error Analyser 41 over 1 year ago : A tool for classifying mistakes in the output of parsers

Machine Translation

OpenNMT
Amunmt 1,255 about 1 year ago
Google Seq2Seq 5,604 about 4 years ago
Eske Seq2seq 389 over 5 years ago
Moses MT
Joshua
Jane
Phrasal
Kriya
Apertium MT
Kyoto EBMT
Friends of Moses
Let’s MT
Neural Machine Translation Implementations
A list of Neural MT implementations 359 over 2 years ago @jonsafari

Language Modelling

SRILM
IRSTLM
KenLM
NPLM
RNNLM Mikolov's
Fast-RNNLM 560 over 2 years ago Yandex
Brat Rapid Annotation Tool : Online environment for collaborative text annotation
PyBossa : The ultimate crowdsourcing framework
FLAT 110 5 months ago : FoLiA Linguistic Annotation Tool
TableAnnotator 20 over 7 years ago and
Marvin 13 almost 7 years ago : Semantic text annotation tools using Wordnet and DBPedia

Others

Java Graphical Authorship Attribution Program 261 5 months ago
Gecco 22 about 2 years ago : Generic Environment for Context-Aware Correction of Orthography
Charguana 10 over 6 years ago : Character Vomitting for CJK Unicode
CLAM : Turn command-line applications into RESTful webservices with web front-end
LuigiNLP 21 about 5 years ago : Experimental NLP Pipeline system built on top of SciLuigi
TextFlows /
Vinci generative environment
Timbl Memory-based machine learning
KeLP : Kernel-based Learning Platform

List of Lists of NLP Resources/Tools

Awesome NLP 16,768 about 1 year ago (The original one, curated by @keon and @outpark)
Repo tagged with nlp on Github.com
Java or Python for NLP?
OpenSource Deep QA Resources (also )
Sibawayh Repository for Arabic NLP
La Machine @proycon
Ruby NLP Resources/Tools 1,270 over 1 year ago
NLP Datasets 5,782 almost 2 years ago @niderhoff
NLP Datasets 919 almost 5 years ago @karthikncode

See Also

Corpora List : Your source of all thing computational linguistics / NLP / corpora
LT World : Language Technology World
META Net
LDC : Linguistic Data Consortium
OLAC : Open Language Archives Community
NLSR : Natural Language Software Registry

Backlinks from these awesome lists:

More related projects: