scattertext

term analyzer

Tool for analyzing and visualizing language differences among document types

Beautiful visualizations of how language differs among document types.

GitHub

2k stars
55 watching
293 forks
Language: Python
last commit: 4 months ago
Linked from 1 awesome list

computational-social-scienced3edaexploratory-data-analysisjapanese-languagemachine-learningnatural-language-processingnlpscatter-plotsemiotic-squaressentimentstylometricstylometrytext-as-datatext-miningtext-visualizationtopic-modelingvisualizationword-embeddingsword2vec

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
bradley/blotter A JavaScript API for creating unconventional text effects on the web using GLSL shaders. 3,058
niderhoff/nlp-datasets A collection of text datasets for use in Natural Language Processing 5,802
wilfred/difftastic A tool for comparing files based on their syntax and structure 21,365
flairnlp/flair A framework for building state-of-the-art NLP models and performing text embeddings with support for multiple languages 13,990
juliasilge/tidytext Provides tools and data to convert text into tidy data formats for natural language processing tasks 1,182
stanfordnlp/corenlp A Java-based suite of tools for natural language processing and analysis 9,727
kpdecker/jsdiff A JavaScript implementation of text differencing algorithm 8,252
fielddb/berlin-buzzwords-2013 Demos for multilingual text analysis using Lucene, Solr, ElasticSearch, and OpenNLP 0
davidmerfield/typeset An HTML pre-processing tool with typographic features traditionally used in fine printing. 2,657
stanfordnlp/glove Provides pre-trained word vector representations and an implementation of the GloVe model for learning word embeddings 6,908
zix99/rare A tool that provides fast and efficient text analysis and visualization capabilities 275
cemoody/lda2vec A framework for creating interpretable natural language models by combining word embeddings and topic modeling. 3,152
ibm/max-news-text-generator Generates English-language text similar to news articles using machine learning and natural language processing techniques. 26
scriban/scriban A scripting language and engine for .NET with support for templating languages like Handlebars, Liquid, and Mustache. 3,246
mortenjust/cleartext-mac A text editor that restricts writing to the 1,000 most common English words, using natural language processing techniques. 3,274