HadoopConcatGz

GZIP handler

Provides a custom input format for handling concatenated GZIP files in distributed processing systems like Hadoop

A Splitable Hadoop InputFormat for Concatenated GZIP Files and *.(w)arc.gz

GitHub

9 stars
2 watching
3 forks
Language: Java
last commit: almost 7 years ago
Linked from 1 awesome list

hadoopsparkwarcweb-archivingwebarchive

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
mapbox/gzip-hpp A C++ library for efficient compression and decompression of binary data 325
manyuanrong/wasm_gzip Implementation of Gzip compression and decompression in WebAssembly (WASM) for use with the Deno runtime. 19
gin-contrib/gzip A middleware package to enable GZIP compression in Gin-based web applications. 332
sstadick/gzp A multi-threaded data compression library written in Rust. 154
thejoshwolfe/hexdump-zip Produces an annotated hexdump of the contents of a zip archive 8
wzshiming/gotype A tool for parsing and manipulating Golang source code at compile-time 61
xxjwxc/public A comprehensive utility package for Go programmers 175
sindresorhus/gh-got Convenience wrapper around GitHub's API interaction library 176
goto-bus-stop/ziguid A Zig library for parsing and stringifying GUIDs 7
o0morgan0o/gcode-generative-for-processing Library for generating G-code instructions from Processing code for 3D printing pen plotting 28
g-plane/markup_fmt A formatter for various front-end markup languages and templating engines 102
thegodheehee/hijack-zsh A custom zsh theme with Git integration 1
hejsil/mecha A parser combinator library for the Zig programming language 472
gnarroway/hato An HTTP client library for Clojure that wraps JDK 11's HttpClient for synchronous and asynchronous requests with websockets support. 380
crazy-max/ghaction-import-gpg Tool to easily manage GPG keys in GitHub workflows 321