faust

Data pipeline builder

Builds high-performance distributed systems and real-time data pipelines using asynchronous event processing and in-memory durable key-value stores.

Python Stream Processing

GitHub

7k stars
139 watching
534 forks
Language: Python
last commit: 4 months ago
Linked from 9 awesome lists

asynciodistributed-systemskafkakafka-streamspythonstream-processing

Backlinks from these awesome lists:

Related projects:

Repository Description Stars
tschellenbach/stream-framework A Python library for building activity streams and newsfeeds using distributed data stores 4,733
pathwaycom/pathway An ETL framework that enables real-time data processing and analytics using Python, with support for streaming data, batch processing, machine learning, and integration with various external data sources. 4,324
microsoft/playwright-python A Python library to automate multiple web browsers with a single API. 11,872
pytransitions/transitions An object-oriented finite state machine implementation in Python with many extensions. 5,771
arroyosystems/arroyo A distributed stream processing engine designed to efficiently perform stateful computations on high-volume real-time data streams. 3,787
mongodb/motor A Python driver for MongoDB with non-blocking access and support for asyncio and Tornado applications 2,431
mechanicalsoup/mechanicalsoup Automates interaction with websites by simulating browser behavior and handling HTTP sessions and document navigation. 4,672
astral-sh/ruff A fast and powerful Python code linter and formatter written in Rust. 32,812
infinyon/fluvio A lightweight distributed data streaming system written in Rust and Web Assembly for real-time data processing 3,880
deezer/spleeter A Python library for separating audio sources in real-time with high accuracy and speed. 25,926
redpanda-data/connect Stream processor for connecting various data sources and sinks using Apache V2 or Enterprise builds. 8,137
skelsec/pypykatz An implementation of Mimikatz in pure Python for parsing Windows secrets and registry data. 2,879
cloudtools/troposphere A Python library to generate AWS CloudFormation descriptions in JSON or YAML format 4,931
finos/perspective A component for creating interactive analytics and data visualization applications with support for large datasets and streaming queries. 8,530
windmill-labs/windmill An open-source developer platform that integrates scripts with UIs and workflows, allowing for automation of infrastructure and business processes. 10,864