awesome-community-curated-nlp
NLP toolkit collection
A curated list of NLP tools and libraries
Community Curated NLP List
197 stars
20 watching
33 forks
last commit: over 3 years ago
Linked from 1 awesome list
awesome-listcommunity-drivennlp
Speech NLP | |||
| Kaldi | |||
| CMU Sphinx | |||
| Julius | |||
| RWTH ASR | |||
| Merlin | |||
| Festival | |||
| HTS | |||
| eSpeak | |||
| Covarep | 352 | over 5 years ago | |
| Free TTS | |||
| Ekho | |||
| Hidden Markov Model Toolkit | |||
| Praat | |||
| https://en.wikipedia.org/wiki/Comparison_of_speech_synthesizers | |||
| https://en.wikipedia.org/wiki/List_of_speech_recognition_software | |||
Text NLP Suites | |||
| NLTK | 13,694 | 12 months ago | |
| Gensim | |||
| SpaCy | |||
| Stanford CoreNLP | |||
| Freeling | |||
| OpenNLP | |||
| DKPro | |||
| PyNLPl | 479 | about 2 years ago | |
| IXA Pipes | |||
| NLP4J | |||
| CogComp's NLP libraries | 473 | over 2 years ago | |
| Stanbol NLP | |||
| LIMA | |||
| Corpus.Tools | |||
| NooJ | |||
| SALAT | |||
Language Specific Text NLP Suites | |||
| SAFAR | : Software Architecture For Arabic language pRocessing | ||
| PyCantonese | : Cantonese Linguistics and NLP in Python | ||
| SnowNLP | 6,454 | almost 6 years ago | : Simplified Chinese Text Processing |
| Hazm | 1,219 | over 1 year ago | : Python library for digesting Persian text |
| Frog | : An advanced NLP suite for Dutch | ||
| Tint | : Lend color to your Italian texts! | ||
| KoNLPy | : Korean NLP in Python | ||
Pre-processing (Tokenization / Stemming / POS Tagging / etc.) | |||
| Colibri Core | C++ and Python tools for n-grams and skipgrams | ||
| Snowball Stemmers | |||
| SerbianStemmer | 16 | about 5 years ago | |
| Whoosh Stemmers | |||
| Elephant | : Sequence labeling for word and sentence segmentation | ||
| Toktok | 28 | over 8 years ago | : A fast, simple, multilingual tokenizer |
| Ucto | : An advanced rule-based unicode-aware tokenizer | ||
Deep Linguistic Processing | |||
| DELPH-IN | : Deep Linguistic Processing with HPSG | ||
| English Resource Grammar | |||
| CCG2PST | 11 | over 2 years ago | : A tool for converting CCG derivations into PTB-style phrase structure trees |
Word Embeddings | |||
| Word2Vec | |||
| GloVe | |||
| COMPOSE | |||
| Polyglot | |||
| FastText | 25,979 | over 1 year ago | |
| | |||
| PyTweet | 2 | over 13 years ago | |
| Twitter4J | |||
| Affective Tweets | |||
Task Specific | |||
| KNEWS | 82 | almost 3 years ago | : Knowledge Extraction With Semantics |
| Deep Dive | : Relation Extraction | ||
| Berkley Coreference Error Analyser | 29 | over 2 years ago | A tool for classifying errors in coreference resolution |
| Berkley Parse Error Analyser | 41 | over 2 years ago | : A tool for classifying mistakes in the output of parsers |
Machine Translation | |||
| OpenNMT | |||
| Amunmt | 1,262 | about 2 years ago | |
| Google Seq2Seq | 5,607 | about 5 years ago | |
| Eske Seq2seq | 389 | over 6 years ago | |
| Moses MT | |||
| Joshua | |||
| Jane | |||
| Phrasal | |||
| Kriya | |||
| Apertium MT | |||
| Kyoto EBMT | |||
| Friends of Moses | |||
| Let’s MT | |||
| Neural Machine Translation Implementations | |||
| A list of Neural MT implementations | 359 | over 3 years ago | @jonsafari |
Language Modelling | |||
| SRILM | |||
| IRSTLM | |||
| KenLM | |||
| NPLM | |||
| RNNLM | Mikolov's | ||
| Fast-RNNLM | 560 | over 3 years ago | Yandex |
Annotation Related | |||
| Brat Rapid Annotation Tool | : Online environment for collaborative text annotation | ||
| PyBossa | : The ultimate crowdsourcing framework | ||
| FLAT | 111 | over 1 year ago | : FoLiA Linguistic Annotation Tool |
| TableAnnotator | 20 | over 8 years ago | and |
| Marvin | 13 | almost 8 years ago | : Semantic text annotation tools using Wordnet and DBPedia |
Others | |||
| Java Graphical Authorship Attribution Program | 263 | over 1 year ago | |
| Gecco | 22 | about 3 years ago | : Generic Environment for Context-Aware Correction of Orthography |
| Charguana | 10 | over 7 years ago | : Character Vomitting for CJK Unicode |
| CLAM | : Turn command-line applications into RESTful webservices with web front-end | ||
| LuigiNLP | 21 | about 6 years ago | : Experimental NLP Pipeline system built on top of SciLuigi |
| TextFlows | / | ||
| Vinci generative environment | |||
NLP Related Machine Learning Tools | |||
| Timbl | Memory-based machine learning | ||
| KeLP | : Kernel-based Learning Platform | ||
List of Lists of NLP Resources/Tools | |||
| Awesome NLP | 16,830 | almost 2 years ago | (The original one, curated by @keon and @outpark) |
| Repo tagged with nlp on Github.com | |||
| Java or Python for NLP? | |||
| OpenSource Deep QA Resources | (also ) | ||
| Sibawayh Repository for Arabic NLP | |||
| La Machine | @proycon | ||
| Ruby NLP Resources/Tools | 1,272 | over 2 years ago | |
| NLP Datasets | 5,802 | over 2 years ago | @niderhoff |
| NLP Datasets | 919 | almost 6 years ago | @karthikncode |
See Also | |||
| Corpora List | : Your source of all thing computational linguistics / NLP / corpora | ||
| LT World | : Language Technology World | ||
| META Net | |||
| LDC | : Linguistic Data Consortium | ||
| OLAC | : Open Language Archives Community | ||
| NLSR | : Natural Language Software Registry | ||
More related projects:
-
pawangeek/deep-nlp-resources
-
kyubyong/wordvectors
-
danshapero/icepack-py
-
ifm/ifm3d
-
farama-foundation/arcade-learning-environment
-
avsystem/anjay
-
datasciencemasters/data
-
yuki-koyama/mathtoolbox
-
gianlucabertani/machinelearning
-
redditsota/state-of-the-art-result-for-machine-learning-problems
-
apache/pulsar-client-python
-
yannickjadoul/parselmouth
-
jupyter-xeus/xeus-robot