3. Introducing Spring Cloud Data Flow

3. Introducing Spring Cloud Data Flow
Prev	Part II. Spring Cloud Data Flow Overview	Next

Spring Cloud Data Flow is a cloud native programming and operating model for composable data microservices on modern runtimes. With Spring Cloud Data Flow, developers can create and orchestrate data pipelines for common use cases such as data ingest, real-time analytics, and data import/export.

Spring Cloud Data Flow is the cloud native redesign of Spring XD – a project that aimed to simplify development of Big Data applications. The streaming and batch modules from Spring XD are refactored into Spring Boot data microservice applications that are now autonomous deployment units and they can "natively" run in modern runtimes such as Cloud Foundry, Apache YARN, Apache Mesos, and Kubernetes.

Spring Cloud Data Flow offers a collection of patterns and best practices for microservices-based distributed streaming and batch data pipelines.

3.1 Features

Orchestrate applications across a variety of distributed modern runtimes including: Cloud Foundry, Apache YARN, Apache Mesos, and Kubernetes
Separate runtime dependencies backed by spring profiles
Consume stream and batch data-microservices as maven dependencies
Develop using: DSL, Shell, REST-APIs, Data Flow Server UI, and Flo
Take advantage of metrics, health checks and remote management of each data microservice
Scale stream and batch pipelines without interrupting data flows