dumbo
Hadoop tool
A Python module that makes it easy to write and run Hadoop programs.
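As a rough illustration of what writing a Hadoop program in Python can look like, here is a minimal word-count sketch. It assumes the generator-style `mapper(key, value)` / `reducer(key, values)` convention that dumbo programs typically follow. The `run_locally` helper is hypothetical and not part of dumbo; it only simulates the map, shuffle, and reduce steps in memory so the example runs without a Hadoop cluster.

```python
from collections import defaultdict


def mapper(key, value):
    # `value` is one line of input text; `key` (e.g. an offset) is unused here.
    for word in value.split():
        yield word, 1


def reducer(key, values):
    # `values` iterates over every count emitted for this word.
    yield key, sum(values)


def run_locally(lines):
    """Hypothetical stand-in for a Hadoop job: map, group by key, then reduce."""
    grouped = defaultdict(list)
    for offset, line in enumerate(lines):
        for k, v in mapper(offset, line):
            grouped[k].append(v)
    result = {}
    for k in sorted(grouped):
        for out_key, out_value in reducer(k, grouped[k]):
            result[out_key] = out_value
    return result


# With dumbo installed, the same two functions would (as an assumption based
# on dumbo's documented usage) be handed to dumbo.run(mapper, reducer).
counts = run_locally(["to be or not to be", "be quick"])
print(counts)
```

Keeping the mapper and reducer as plain generator functions means they can be unit-tested locally before being submitted to a cluster.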
1k stars
62 watching
146 forks
Language: Python
Last commit: about 7 years ago
Linked from 1 awesome list
Related projects:
| Description | Stars |
|---|---|
| A Python MapReduce library written in Cython for efficient data processing on Hadoop clusters. | 243 |
| A Python project focused on GitHub and DevRel, with the goal of providing resources and support for developers. | 111 |
| A Clojure-based library for writing efficient MapReduce programs on the Hadoop platform | 257 |
| Provides a custom input format for handling concatenated GZIP files in distributed processing systems like Hadoop | 9 |
| An HTTP microframework allowing developers to easily expose scripts as APIs and restrict execution. | 614 |
| A beginner's guide to the Python programming language | 2,322 |
| Tool that supports reproducible workflows with Jupyter Notebooks and SCons. | 160 |
| A collection of Haskell code examples and resources illustrating the language's features and programming techniques. | 114 |
| A Kotlin library that provides useful extensions to eliminate boilerplate code in Android development | 894 |
| Provides a unified interface to various cloud providers | 76 |
| Tools for managing and automating DevOps tasks, data processing, and cloud infrastructure using Python. | 783 |
| An AUR helper and library that automates the process of building and installing Arch Linux packages from source. | 71 |
| A Python wrapper for the Mastodon API allowing developers to interact with the social media platform's public and private APIs. | 889 |
| An integration layer for the kippo SSH honeypot with Django's administrative interface | 12 |
| Tool for managing infrastructure clusters by tracking inventory, connections, and abstracting interactions with infrastructure elements. | 291 |