spark-corenlp

CoreNLP wrapper

Wraps Stanford CoreNLP annotators as Spark DataFrame functions for natural language processing tasks

Stanford CoreNLP wrapper for Apache Spark

422 stars

51 watching

120 forks

Language: Scala

last commit: over 7 years ago

Linked from 1 awesome list

Backlinks from these awesome lists:

awesome-spark/awesome-spark

Related projects:

Repository	Description	Stars
databricks/tensorframes	Enables manipulation of Apache Spark DataFrames using TensorFlow programs	749
databricks/spark-csv	A library for parsing and querying CSV data with Apache Spark	1,052
gorillalabs/sparkling	A Clojure API for interacting with Apache Spark	448
databricks/spark-xml	A library that parses and queries XML data in Apache Spark	504
microsoft/mobius	Provides a C# API for interacting with Apache Spark	941
apache/spark	An analytics engine designed to handle large-scale data processing and analysis	40,170
ondra-m/ruby-spark	A Ruby wrapper around Apache Spark's functionality for large-scale data processing	227
janeliascicomp/nextflow-spark	Provides a reusable set of Nextflow subworkflows and processes for creating transient Apache Spark clusters on any infrastructure.	14
dotnet/spark	Provides high-performance APIs for using Apache Spark with .NET	2,032
explosion/spacy-stanza	Wraps the Stanza NLP library to use Stanford models with spaCy	726
instaclustr/sample-kafkasparkcassandra	An introductory Scala app using Apache Spark Streaming to process data from Kafka and write summaries to Cassandra.	23
kotlin/kotlin-spark-api	Provides compatibility and extensions between Kotlin and Apache Spark for big data processing	463
stanfordnlp/corenlp	A Java-based suite of tools for natural language processing and analysis	9,727
tubular/sparkly	A set of Python libraries and tools to simplify interactions with various data sources using Apache Spark.	61
rocher/ob-ada-spark	Supports Ada and SPARK programming languages in Emacs org-babel for compiling, running, and formal verification of code	8