gachifinder

News scraper

An agent for scraping and storing news articles from Korean portals.

GitHub

16 stars
2 watching
4 forks
Language: Go
last commit: almost 3 years ago

Related projects:

Repository Description Stars
disjukr/just-news A userscript project that parses Korean news site and makes the content more readable 191
jakopako/goskyr A tool to simplify web scraping of list-like structured data from web pages 35
davemolk/gogetjs Tools for extracting and analyzing JavaScript files from web pages 40
slotix/dataflowkit A framework for extracting structured data from web pages using CSS selectors. 662
jjelosua/doga_scraper A tool that extracts and converts Galician Official journal documents to different formats based on input year. 0
gushonorato/mechanize A web scraping and automation tool for Elixir. 30
needmorecowbell/giggity A tool to scrape and store hierarchical data about GitHub organizations, users, or repositories. 126
tjatse/node-readability Automates web page scraping and text extraction to make any webpage readable 343
miyagawa/web-scraper A Perl toolkit for extracting structured data from HTML documents using a DSL-like interface. 104
afjoseph/rake.go An algorithm for extracting keywords from text based on word frequency and part-of-speech tagging 117
go-shiori/obelisk Archives a web page as a single HTML file with embedded resources. 263
foolin/pagser A tool for automatically extracting structured data from HTML pages 105
railsmachine/nagiosharder A Ruby API for querying and managing Nagios installations 115
meilisearch/docs-scraper Automates scraping and indexing of documentation content into a search engine 290
s0rg/crawley A utility for systematically extracting URLs from web pages and printing them to the console. 263