spark-corenlp
CoreNLP wrapper
Wraps Stanford CoreNLP annotators as Spark DataFrame functions for natural language processing tasks
Stanford CoreNLP wrapper for Apache Spark
422 stars
51 watching
120 forks
Language: Scala
last commit: about 6 years ago
Linked from 1 awesome list
Related projects:
Repository | Description | Stars |
---|---|---|
databricks/tensorframes | Enables manipulation of Apache Spark DataFrames using TensorFlow programs | 749 |
databricks/spark-csv | A library for parsing and querying CSV data with Apache Spark | 1,053 |
gorillalabs/sparkling | A Clojure API for interacting with Apache Spark | 448 |
databricks/spark-xml | A library that parses and queries XML data in Apache Spark | 505 |
microsoft/mobius | Provides a C# API for interacting with Apache Spark | 942 |
apache/spark | An analytics engine designed to handle large-scale data processing and analysis | 39,916 |
ondra-m/ruby-spark | A Ruby wrapper around Apache Spark's functionality for large-scale data processing | 227 |
janeliascicomp/nextflow-spark | Provides a reusable set of Nextflow subworkflows and processes for creating transient Apache Spark clusters on any infrastructure. | 14 |
dotnet/spark | Provides high-performance APIs for using Apache Spark with .NET | 2,023 |
explosion/spacy-stanza | Wraps the Stanza NLP library to use Stanford models with spaCy | 725 |
instaclustr/sample-kafkasparkcassandra | An introductory Scala app using Apache Spark Streaming to process data from Kafka and write summaries to Cassandra. | 23 |
kotlin/kotlin-spark-api | Provides compatibility and extensions between Kotlin and Apache Spark for big data processing | 461 |
stanfordnlp/corenlp | A Java-based suite of tools for natural language processing and analysis | 9,704 |
tubular/sparkly | A set of Python libraries and tools to simplify interactions with various data sources using Apache Spark. | 60 |
rocher/ob-ada-spark | Supports Ada and SPARK programming languages in Emacs org-babel for compiling, running, and formal verification of code | 8 |