spark-nlp
NLP toolkit
Provides a set of pre-trained models and libraries for natural language processing tasks on top of Apache Spark
State of the Art Natural Language Processing
4k stars
101 watching
715 forks
Language: Scala
last commit: 11 months ago
Linked from 4 awesome lists
bertentity-extractionlanguage-detectionlemmatizerllamacppllmmachine-translationnamed-entity-recognitionnatural-language-processingnlponnxpart-of-speech-taggerpysparkquestion-answeringsentiment-analysissparkspell-checkertensorflowtext-classificationtransformers
Related projects:
| Repository | Description | Stars |
|---|---|---|
| | A Spark-based tool for processing Hungarian text data with Magyarlanc language processing features and optional integration with ElasticSearch. | 4 |
| | Wraps Stanford CoreNLP annotators as Spark DataFrame functions for natural language processing tasks | 422 |
| | Industrial-strength NLP library for Python and Cython | 30,459 |
| | A comprehensive repository tracking progress in NLP tasks and their corresponding datasets. | 22,742 |
| | A comprehensive NLP library for building conversational AI systems with entity extraction, sentiment analysis, language identification, and more. | 6,301 |
| | A PyTorch implementation on Apache Spark for distributed deep learning model training and inference. | 339 |
| | An interactive web-based editor for exploring and analyzing large datasets using Scala, Apache Spark, and other data science tools | 3,155 |
| | A library for building scalable machine learning pipelines on distributed computing frameworks like Apache Spark | 5,083 |
| | A Java-based suite of tools for natural language processing and analysis | 9,727 |
| | A toolkit for creating and using natural language prompts to enable large language models to generalize to new tasks. | 2,718 |
| | Enables parallelization of machine learning tasks on a distributed Spark cluster using the joblib library. | 243 |
| | Wraps the Stanza NLP library to use Stanford models with spaCy | 726 |
| | An analytics engine designed to handle large-scale data processing and analysis | 40,170 |
| | A C# Natural Language Processing library with pre-trained models and tools for building custom models | 752 |
| | A Go-based machine learning library designed to support neural architectures in natural language processing | 1,759 |