Numaflow¶

Welcome to Numaflow! A Kubernetes-native, serverless platform for running scalable and reliable event-driven applications. Numaflow decouples event sources and sinks from the processing logic, allowing each component to independently auto-scale based on demand. With out-of-the-box sources and sinks, and built-in observability, developers can focus on their processing logic without worrying about event consumption, writing boilerplate code, or operational complexities. Each step of the pipeline can be written in any programming language, offering unparalleled flexibility in using the best programming language for each step and ease of using the languages you are most familiar with.

Use Cases¶

Event driven applications: Process events as they happen, e.g., updating inventory and sending customer notifications in e-commerce.
Real time analytics: Analyze data instantly, e.g., social media analytics, observability data processing.
Inference on streaming data: Perform real-time predictions, e.g., anomaly detection.
Workflows running in a streaming manner.
Learn more in our User Guide.

Key Features¶

Kubernetes-native: If you know Kubernetes, you already know how to use Numaflow.
Serverless: Focus on your code and let the system scale up and down based on demand.
Language agnostic: Use your favorite programming language.
Exactly-Once semantics: No input element is duplicated or lost even as pods are rescheduled or restarted.
Auto-scaling with back-pressure: Each vertex automatically scales from zero to whatever is needed.

Data Integrity Guarantees¶

Minimally provide at-least-once semantics
Provide exactly-once semantics for unbounded and near real-time data sources
Preserving order is not required

Roadmap¶

Make Rust code the primary for data-plane (1.6)

Demo¶

Getting Started¶

For set-up information and running your first Numaflow pipeline, please see our getting started guide.