streamsx.topology.context module

Context for submission of applications.

The main function is submit() to submit a Topology to a Streaming Analytics service or IBM® Streams instance for execution.

class streamsx.topology.context.ConfigParams

Bases: object

Configuration options which may be used as keys in submit() config parameter.

FORCE_REMOTE_BUILD = 'topology.forceRemoteBuild'

Force a remote build of the application.

When submitting to STREAMING_ANALYTICS_SERVICE a local build of the Streams application bundle will occur if the environment variable STREAMS_INSTALL is set. Setting this flag to True ignores the local Streams install and forces the build to occur remotely using the service.

JOB_CONFIG = 'topology.jobConfigOverlays'

Key for a JobConfig object representing a job configuration for a submission.

SERVICE_NAME = 'topology.service.name'

Streaming Analytics service name.

Selects the specific Streaming Analytics service from VCAP services information defined by the the environment variable VCAP_SERVICES or the key VCAP_SERVICES in the submit config.

STREAMS_CONNECTION = 'topology.streamsConnection'

Key for a StreamsConnection object for connecting to a running IBM Streams instance.

VCAP_SERVICES = 'topology.service.vcap'

Streaming Analytics service credentials in VCAP_SERVICES format.

Provides the connection credentials when connecting to a Streaming Analytics service using context type STREAMING_ANALYTICS_SERVICE.

The key overrides the environment variable VCAP_SERVICES.

The value can be:
  • Path to a local file containing a JSON representation of the VCAP services information.
  • Dictionary containing the VCAP services information.
class streamsx.topology.context.ContextTypes

Bases: object

Submission context types.

A Topology is submitted using submit() and a context type. Submision of a Topology generally builds the application into a Streams application bundle (sab) file and then submits it for execution in the required context.

The Streams application bundle contains all the artifacts required by an application such that it can be executed remotely (e.g. on a Streaming Analytics service), including distributing the execution of the application across multiple resources (hosts).

The context type defines which context is used for submission.

The main context types result in a running application and are:

  • STREAMING_ANALYTICS_SERVICE - Application is submitted to a Streaming Analytics service running on IBM Bluemix cloud platform.
  • DISTRIBUTED - Application is submitted to an IBM Streams instance.
  • STANDALONE - Application is executed as a local process, IBM Streams standalone application. Typically this is used during development or testing.

The BUNDLE context type compiles the application (Topology) to produce a Streams application bundle (sab file). The bundle is not executed but may subsequently be submitted to a Streaming Analytics service or an IBM Streams instance. A bundle may be submitted multiple times to services or instances, each resulting in a unique job (running application).

ANALYTICS_SERVICE = 'ANALYTICS_SERVICE'

Synonym for STREAMING_ANALYTICS_SERVICE.

BUILD_ARCHIVE = 'BUILD_ARCHIVE'

Creates a build archive.

This context type produces the intermediate code archive used for bundle creation.

Note

BUILD_ARCHIVE is typically only used when diagnosing issues with bundle generation.

BUNDLE = 'BUNDLE'

Create a Streams application bundle.

The Topology is compiled locally to produce Streams application bundle (sab file).

The resultant application can be submitted to:
  • Streaming Analytics service using the Streams console or the Streaming Analytics REST api.
  • IBM Streams instance using the Streams console, JMX api or command line streamtool submitjob.
  • Executed standalone for development or testing (when built with IBM Streams 4.2 or later).

The bundle must be built on the same operating system version and architecture as the intended running environment. For Streaming Analytics service this is currently RedHat/CentOS 6 and x86_64 architecture.

Environment variables:

This environment variables define how the application is built.

  • STREAMS_INSTALL - Location of a IBM Streams installation (4.0.1 or later).
DISTRIBUTED = 'DISTRIBUTED'

Submission to an IBM Streams instance.

The Topology is compiled locally and the resultant Streams application bundle (sab file) is submitted to an IBM Streams instance.

Environment variables:

These environment variables define how the application is built and submitted.

  • STREAMS_INSTALL - Location of a IBM Streams installation (4.0.1 or later).
  • STREAMS_DOMAIN_ID - Domain identifier for the Streams instance.
  • STREAMS_INSTANCE_ID - Instance identifier.
  • STREAMS_ZKCONNECT - (optional) ZooKeeper connection string for domain (when not using an embedded ZooKeeper)
STANDALONE = 'STANDALONE'

Build and execute locally.

Compiles and executes the Topology locally in IBM Streams standalone mode as a separate sub-process. Typically used for devlopment and testing.

The call to submit() return when (if) the application completes. An application completes when it has finite source streams and all tuples from those streams have been processed by the complete topology. If the source streams are infinite (e.g. reading tweets) then the standalone application will not complete.

Environment variables:

This environment variables define how the application is built.

  • STREAMS_INSTALL - Location of a IBM Streams installation (4.0.1 or later).
STANDALONE_BUNDLE = 'STANDALONE_BUNDLE'

Create a Streams application bundle for standalone execution.

The Topology is compiled locally to produce Streams standalone application bundle (sab file).

The resultant application can be submitted to:
  • Executed standalone for development or testing.

The bundle must be built on the same operating system version and architecture as the intended running environment. For Streaming Analytics service this is currently RedHat/CentOS 6 and x86_64 architecture.

Environment variables:

This environment variables define how the application is built.

  • STREAMS_INSTALL - Location of a IBM Streams installation (4.0.1 or 4.1.x).

Deprecated since version IBM: Streams 4.2 Use BUNDLE when compiling with IBM Streams 4.2 or later.

STREAMING_ANALYTICS_SERVICE = 'STREAMING_ANALYTICS_SERVICE'

Submission to Streaming Analytics service running on IBM Bluemix cloud platform.

The Topology is compiled and the resultant Streams application bundle (sab file) is submitted for execution on the Streaming Analytics service.

When STREAMS_INSTALL is not set or the submit() config parameter has FORCE_REMOTE_BUILD set to True the compilation of the application occurs remotely by the service. This allows creation and submission of Streams applications without a local install of IBM Streams.

When STREAMS_INSTALL is set and the submit() config parameter has FORCE_REMOTE_BUILD set to False or not set then the creation of the Streams application bundle occurs locally and the bundle is submitted for execution on the service.

Environment variables:

These environment variables define how the application is built and submitted.

  • STREAMS_INSTALL - (optional) Location of a IBM Streams installation (4.0.1 or later). The install must be running on RedHat/CentOS 6 and x86_64 architecture.
TOOLKIT = 'TOOLKIT'

Creates an SPL toolkit.

Topology applications are translated to SPL applications before compilation into an Streams application bundle. This context type produces the intermediate SPL toolkit that is input to the SPL compiler for bundle creation.

Note

TOOLKIT is typically only used when diagnosing issues with bundle generation.

class streamsx.topology.context.JobConfig(job_name=None, job_group=None, preload=False, data_directory=None, tracing=None)

Bases: object

Job configuration.

JobConfig allows configuration of job that will result from submission of a py:class:Topology (application).

A JobConfig is set in the config dictionary passed to submit() using the key JOB_CONFIG. add() exists as a convenience method to add it to a submission configuration.

Parameters:
  • job_name (str) – The name that is assigned to the job. A job name must be unique within a Streasm instance When set to None a system generated name is used.
  • job_group (str) – The job group to use to control permissions for the submitted job.
  • preload (bool) – Specifies whether to preload the job onto all resources in the instance, even if the job is not currently needed on each. Preloading the job can improve PE restart performance if the PEs are relocated to a new resource.
  • data_directory (str) – Specifies the location of the optional data directory. The data directory is a path within the cluster that is running the Streams instance.
  • tracing – Specify the application trace level. See tracing

Example:

# Submit a job with the name NewsIngester
cfg = {}
job_config = JobConfig(job_name='NewsIngester')
job_config.add(cfg)
context.submit('ANALYTICS_SERVICE', topo, cfg)
add(config)

Add this JobConfig into a submission configuration object.

Parameters:config (dict) – Submission configuration.
Returns:config.
Return type:dict
target_pe_count

Target processing element count.

When submitted against a Streams instance target_pe_count provides a hint to the scheduler as to how to partition the topology across processing elements (processes) for the job execution. When a job contains multiple processing elements (PEs) then the Streams scheduler can distributed the PEs across the resources (hosts) running in the instance.

When set to None (the default) no hint is supplied to the scheduler. The number of PEs in the submitted job will be determined by the scheduler.

The value is only a target and may be ignored when the topology contains isolate() calls.

Note

Only supported in Streaming Analytics service and IBM Streams 4.2 or later.

tracing

Runtime application trace level.

The runtime application trace level can be a string with value error, warn, info, debug or trace.

In addition a level from Python logging module can be used in with CRITICAL and ERROR mapping to error, WARNING to warn, INFO to info and DEBUG to debug.

Setting tracing to None or logging.NOTSET will result in the job submission using the Streams instance application trace level.

The value of tracing is the level as a string (error, warn, info, debug or trace) or None.

class streamsx.topology.context.SubmissionResult(results)

Bases: object

Passed back to the user after a call to submit. Allows the user to use dot notation to access dictionary elements.

job

If able, returns the job associated with the submitted build. If a username/password, StreamsConnection, or vcap file was not supplied, returns None.

NOTE: The @property tag supersedes __getattr__. In other words, this job method is called before __getattr__(self, ‘job’) is called.

streamsx.topology.context.submit(ctxtype, graph, config=None, username=None, password=None)

Submits a Topology (application) using the specified context type.

Used to submit an application for compilation into a Streams application and execution within an Streaming Analytics service or IBM Streams instance.

ctxtype defines how the application will be submitted, see ContextTypes.

The parameters username and password are only required when submitting to an IBM Streams instance and it is required to access the Streams REST API from the code performing the submit. Accessing data from views created by view() requires access to the Streams REST API.

Parameters:
  • ctxtype (str) – Type of context the application will be submitted to. A value from ContextTypes.
  • graph (Topology) – The application topology to be submitted.
  • config (dict) – Configuration for the submission.
  • username (str) – Username for the Streams REST api.
  • password (str) – Password for username.
Returns:

Result of the submission. For details of what is contained see the ContextTypes constant passed as ctxtype.

Return type:

SubmissionResult