Getting Started with IBM Streams v4.1 Quick Start Edition


If you haven’t downloaded the Streams QSE, you may download it here:

Streams Overview

For a quick overview of Streams and developing in Streams, see the following video:

Getting Started for the Developer

As a developer, you want to:

  • Learn about Streams
  • Write your first Streams application
  • Work with the development tooling in Streams

Below are some resources to get you up and running!

Java and Scala

Example Streams Application:

package simple;

import com.ibm.streamsx.topology.TStream;
import com.ibm.streamsx.topology.Topology;
import com.ibm.streamsx.topology.context.StreamsContextFactory;

public class HelloWorld {

    public static void main(String[] args) throws Exception {

        /*
         * Create the container for the topology that will
         * hold the streams of tuples.
         */
        Topology topology = new Topology("HelloWorld");

        /*
         * Declare a source stream (hw) with String tuples containing two tuples,
         * "Hello" and "World!".
         */
        TStream<String> hw = topology.strings("Hello", "World!");

        /*
         * Sink hw by printing each of its tuples to System.out.
         */
        hw.print();

        /*
         * At this point the topology is declared with a single
         * stream that is printed to System.out.
         */

        /*
         * Now execute the topology by submitting to a StreamsContext.
         * If no argument is provided then the topology is executed
         * within this JVM (StreamsContext.Type.EMBEDDED).
         * Otherwise the first and only argument is taken as the
         * String representation of the StreamsContext.Type to use.
         */
        if (args.length == 0)
            StreamsContextFactory.getEmbedded().submit(topology).get();
        else
            StreamsContextFactory.getStreamsContext(args[0]).submit(topology)
                    .get();
    }
}

The same application in Scala:

package simple

import com.ibm.streamsx.topology.Topology
import com.ibm.streamsx.topology.context.StreamsContextFactory

object HelloWorldScala {
  def main(args: Array[String]): Unit = {
    // Create the topology that holds the streams of tuples.
    val topology = new Topology("HelloWorldScala")

    // Declare a source stream of two String tuples and print each one.
    val hw = topology.strings("Hello", "World!")
    hw.print()

    // Run the topology within this JVM.
    StreamsContextFactory.getStreamsContext("EMBEDDED").submit(topology).get()
  }
}
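
The comments in the Java example point out that declaring a topology and executing it are separate steps: nothing runs until the topology is submitted to a StreamsContext. As a loose analogy only, the sketch below shows the same declare-then-run split with the JDK's java.util.stream package. It does not use the Streams API, and the class name DeclareThenRun and its run() helper are made up for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Stream;

public class DeclareThenRun {

    /** Records events in order so the lazy execution is visible. */
    static List<String> run() {
        List<String> log = new ArrayList<>();

        // "Declare" the pipeline: a source of two strings and a step that
        // logs each element. No element is processed at this point.
        Stream<String> hw = Stream.of("Hello", "World!")
                .peek(s -> log.add("tuple: " + s));
        log.add("declared");

        // "Submit" the pipeline: the terminal operation pulls the elements
        // through the step declared above.
        hw.forEach(s -> { });
        log.add("finished");
        return log;
    }

    public static void main(String[] args) {
        run().forEach(System.out::println);
    }
}
```

The "declared" entry is logged before either tuple, which mirrors how the HelloWorld topology only starts producing output once submit() is called.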

To get started, follow these development guides:

Streams Processing Language (SPL)

The Streams Processing Language is designed from the ground up for writing streaming applications. To quickly get started:

Streams is shipped with comprehensive development tooling.

Streams Studio

To learn how to develop using Streams Studio (our drag-and-drop IDE):

Writing Java Operators

If you have existing Java code, you can easily reuse it by writing a Java operator or native Java functions.
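
As a rough illustration of code that is easy to reuse this way, the sketch below keeps existing logic in a plain static method and exposes it as a java.util.function.Function, a one-value-in, one-value-out shape that a wrapper can call for each tuple. The names LegacyText, normalize, and asFunction are hypothetical, and the Streams operator wrapper itself is not shown here.

```java
import java.util.Locale;
import java.util.function.Function;

public class LegacyText {

    /** Existing logic: trims the input and upper-cases it; null becomes "". */
    static String normalize(String raw) {
        return raw == null ? "" : raw.trim().toUpperCase(Locale.ROOT);
    }

    /** Exposes the existing method as a per-value function. */
    static Function<String, String> asFunction() {
        return LegacyText::normalize;
    }

    public static void main(String[] args) {
        Function<String, String> f = asFunction();
        System.out.println(f.apply("  hello streams "));
    }
}
```

Keeping the logic stateless and free of I/O, as assumed here, means the same method can be called from a test, a batch job, or a streaming wrapper without changes.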

SparkMLLib in Streams

To get started, follow this development guide:

Getting Started for the Data Engineer

As a Data Engineer, you are responsible for:

  • Designing, building, and managing data and analytic systems to ensure they are secure, reliable, and scalable
  • Making all data, including data in motion, available for analysis by other team members such as data scientists and developers
  • Capturing data in motion and integrating it with data at rest
  • Leveraging the newest technologies for stream computing

Below are some resources to help you get started.

Integrating with Streams

Streams ships with many toolkits that enable integration with popular systems such as HDFS, HBase, Kafka, ActiveMQ, and more. To learn about the toolkits that are shipped as part of the Streams product, refer to the Product Toolkits Overview.

IBMStreams on GitHub is a platform that lets the Streams team rapidly deliver support for emerging technologies. It is also a place for us to share sample applications and helpful utilities. For a list of open-source projects hosted on GitHub, see the IBM Streams GitHub Projects Overview.

Integration with IBM InfoSphere Information Governance Catalog

With this integration, developers can easily discover the data and schemas that are available for use. By building data lineage with your Streams application, you can quickly see and control how data is consumed. To get started, refer to the Streams Governance Quickstart Guide.

Cybersecurity Toolkit

The Cybersecurity Toolkit provides operators that analyze network traffic and detect suspicious behavior. For more information on using the Cybersecurity Toolkit, refer to the Cybersecurity Getting Started Guide.

Streams and SPSS

SPSS is predictive analytics software that lets you build predictive models from your data. Your application can perform real-time predictive scoring by running these models with the SPSS operators.

To learn how Streams can integrate with SPSS, see the Streams and SPSS Lab.

Streams Domain Management and Administration

Streams Console is the web-based administration console for monitoring and managing your Streams domain.

Prior to Streams 4.1, the Streams Console dashboard contained a fixed set of widgets. Starting with Streams 4.1, you can create customized dashboards to monitor your Streams domain, instances, and applications.

Streams Console

To familiarize yourself with the Streams Console, see this video:

Getting Started for the Business User

As a business user, you need to:

  • Identify patterns, trends, risks and opportunities in data
  • Build predictive analytic models
  • Use visualization tools to explore and uncover high-value data

Below are some resources to help you get started.

Streams and Microsoft Excel


IBM Streams integrates with Microsoft Excel, allowing you to see, analyze, and visualize live streaming data in an Excel worksheet. This article helps you get started: Streams for Microsoft Excel.

The following demo shows how to build a marketing dashboard from real-time data using Excel.

Streams Community

The following Streams resources can help you connect with the Streams community and get support when you need it:

  • StreamsDev - This resource is a developer-to-developer website maintained by the Streams Development Team. It contains many useful articles and getting started material. Check back often for new articles, tips, and best practices.
  • Streams Forum - This forum enables you to ask questions about IBM Streams and get answers. If you have questions, start here.
  • IBMStreams on GitHub - Streams is shipped with many useful toolkits out of the box. IBMStreams on GitHub contains many additional open-source toolkits. For a list of toolkits available on GitHub, see this web page: IBMStreams GitHub Toolkits.
  • IBM Streams Support - This website provides information about IBM Streams downloads, technical support tools, documentation, and other resources.
  • IBM Streams Product Site - This website provides a broad range of information and resources about Streams and related topics.