spark-corenlp

CoreNLP wrapper

Wraps Stanford CoreNLP annotators as Spark DataFrame functions for natural language processing tasks

Stanford CoreNLP wrapper for Apache Spark

GitHub

422 stars
51 watching
120 forks
Language: Scala
last commit: about 6 years ago
Linked from 1 awesome list


Backlinks from these awesome lists:

Related projects:

Repository Description Stars
databricks/tensorframes Enables manipulation of Apache Spark DataFrames using TensorFlow programs 749
databricks/spark-csv A library for parsing and querying CSV data with Apache Spark 1,053
gorillalabs/sparkling A Clojure API for interacting with Apache Spark 448
databricks/spark-xml A library that parses and queries XML data in Apache Spark 505
microsoft/mobius Provides a C# API for interacting with Apache Spark 942
apache/spark An analytics engine designed to handle large-scale data processing and analysis 39,916
ondra-m/ruby-spark A Ruby wrapper around Apache Spark's functionality for large-scale data processing 227
janeliascicomp/nextflow-spark Provides a reusable set of Nextflow subworkflows and processes for creating transient Apache Spark clusters on any infrastructure. 14
dotnet/spark Provides high-performance APIs for using Apache Spark with .NET 2,023
explosion/spacy-stanza Wraps the Stanza NLP library to use Stanford models with spaCy 725
instaclustr/sample-kafkasparkcassandra An introductory Scala app using Apache Spark Streaming to process data from Kafka and write summaries to Cassandra. 23
kotlin/kotlin-spark-api Provides compatibility and extensions between Kotlin and Apache Spark for big data processing 461
stanfordnlp/corenlp A Java-based suite of tools for natural language processing and analysis 9,704
tubular/sparkly A set of Python libraries and tools to simplify interactions with various data sources using Apache Spark. 60
rocher/ob-ada-spark Supports Ada and SPARK programming languages in Emacs org-babel for compiling, running, and formal verification of code 8