StreamPipes + Apache Pulsar: High-Performance IIoT Messaging

Overview

Apache StreamPipes can use Apache Pulsar as the messaging layer for Industrial IoT (IIoT) data streams: it reads sensor data from Pulsar topics and can publish processed results back to them. Because Pulsar topics are persistent and scale horizontally, the integration suits reliable, scalable data ingestion and distribution in manufacturing environments.

Integration Architecture

StreamPipes connects to Pulsar via its messaging adapter layer:

  1. Data Ingestion — StreamPipes consumes IIoT data from Pulsar topics using the Pulsar adapter
  2. Data Distribution — Processed data can be published back to Pulsar topics for downstream consumers
  3. Multi-Tenancy — Pulsar's tenant/namespace model enables logical separation of different factory lines or sites
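The multi-tenancy point can be made concrete through Pulsar's topic naming scheme, `persistent://tenant/namespace/topic`. The sketch below is one possible mapping, not a StreamPipes convention: it assumes the tenant is the company, the namespace is a site, and the topic combines a line and a stream name. The `LineTopic` class and all example values are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LineTopic:
    """Hypothetical mapping: tenant = company, namespace = site, topic = line + stream."""

    tenant: str
    site: str
    line: str
    stream: str

    def name(self, persistent: bool = True) -> str:
        # Pulsar topic names have the form {domain}://{tenant}/{namespace}/{topic}.
        domain = "persistent" if persistent else "non-persistent"
        return f"{domain}://{self.tenant}/{self.site}/{self.line}-{self.stream}"


topic = LineTopic(tenant="acme", site="plant-berlin", line="line-3", stream="temperature")
print(topic.name())
# persistent://acme/plant-berlin/line-3-temperature
```

With a layout like this, access policies and retention settings can be applied per namespace, so each site is managed separately.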

Use Cases

  • High-throughput sensor ingestion — Ingest data from thousands of sensors via Pulsar's scalable architecture
  • Geo-replicated factories — Use Pulsar's geo-replication to synchronize data across multiple factory locations
  • Event-driven microservices — Trigger manufacturing workflows by publishing events to Pulsar topics
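  • Backpressure handling — Pulsar persists unacknowledged messages in its storage layer, so a temporary spike in data volume builds up as backlog instead of being dropped, within the configured retention and backlog quota limits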
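As a rough illustration of the ingestion and event-driven use cases, the sketch below publishes sensor readings to a Pulsar topic with the `pulsar-client` Python package. It assumes JSON-encoded events; the field names (`sensorId`, `value`, `unit`, `timestamp`) and the helper names are invented for this example and are not a format StreamPipes prescribes.

```python
import json
import time
from typing import Iterable, Optional


def encode_event(sensor_id: str, value: float, unit: str,
                 ts_ms: Optional[int] = None) -> bytes:
    """Serialize one reading as UTF-8 JSON (field names are illustrative)."""
    if ts_ms is None:
        ts_ms = int(time.time() * 1000)
    event = {"sensorId": sensor_id, "value": value, "unit": unit, "timestamp": ts_ms}
    return json.dumps(event, sort_keys=True).encode("utf-8")


def publish(service_url: str, topic: str, payloads: Iterable[bytes]) -> int:
    """Send each payload to the topic and return how many were sent."""
    import pulsar  # imported lazily; requires `pip install pulsar-client`

    client = pulsar.Client(service_url)
    try:
        producer = client.create_producer(topic)
        count = 0
        for payload in payloads:
            producer.send(payload)  # blocks until the broker acknowledges the message
            count += 1
        return count
    finally:
        client.close()
```

A call might look like `publish("pulsar://localhost:6650", "persistent://public/default/sensors", [encode_event("press-07", 21.5, "C")])`, where the service URL and topic are placeholders for a local standalone broker.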

Configuration

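  1. Deploy a Pulsar cluster (standalone for testing, distributed for production)
  2. In StreamPipes Connect, configure the Pulsar adapter with the broker service URL (for example pulsar://localhost:6650) and the topic name
  3. Set the subscription for scalable processing. Pulsar has no consumer groups; consumers that share a subscription name on a Shared or Key_Shared subscription split the messages between them
  4. Register a schema if using structured data (Avro, JSON Schema). Pulsar ships with a built-in schema registry, so no separate service is required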
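Before entering the broker URL and topic in the adapter, it can help to confirm that messages actually arrive on the topic. The following is a minimal smoke test using the `pulsar-client` Python package, not part of StreamPipes itself. The function names are hypothetical, and the subscription name should differ from any subscription used in production, because acknowledged messages are removed from that subscription's backlog.

```python
from urllib.parse import urlparse


def check_service_url(url: str) -> str:
    """Reject URLs that are not Pulsar binary-protocol service URLs."""
    parsed = urlparse(url)
    if parsed.scheme not in ("pulsar", "pulsar+ssl"):
        raise ValueError(f"expected pulsar:// or pulsar+ssl://, got {url!r}")
    if not parsed.hostname:
        raise ValueError(f"missing host in {url!r}")
    return url


def peek(service_url: str, topic: str, subscription: str,
         max_messages: int = 5, timeout_ms: int = 2000) -> list:
    """Read up to max_messages payloads from the topic and return them."""
    import pulsar  # imported lazily; requires `pip install pulsar-client`

    client = pulsar.Client(check_service_url(service_url))
    try:
        consumer = client.subscribe(
            topic,
            subscription_name=subscription,
            consumer_type=pulsar.ConsumerType.Shared,
        )
        seen = []
        for _ in range(max_messages):
            try:
                msg = consumer.receive(timeout_millis=timeout_ms)
            except Exception:
                # Treated as "nothing arrived in time"; recent client
                # versions raise pulsar.Timeout here.
                break
            seen.append(msg.data())
            consumer.acknowledge(msg)
        return seen
    finally:
        client.close()
```

For a local standalone broker, `peek("pulsar://localhost:6650", "persistent://public/default/sensors", "smoke-test")` would return the raw payload bytes of up to five messages; the topic and subscription names here are placeholders.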

Tradeoffs & Considerations

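  • Operational complexity — Pulsar requires more setup than Mosquitto or RabbitMQ: a production cluster runs brokers, BookKeeper storage nodes (bookies), and a metadata store (ZooKeeper by default)
  • Resource requirements — Pulsar's storage layer (Apache BookKeeper) is disk-intensive and benefits from fast storage, ideally SSDs for the journal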
  • Maturity in IIoT — Fewer manufacturing-specific examples than Kafka
  • Alternative — For simpler deployments, StreamPipes also supports Kafka, MQTT (Mosquitto), and RabbitMQ