Skip to main content

Sources

Sources define where your data comes from — the connection parameters and technical details for inbound data endpoints.

What are Sources?

A Source Asset specifies the technical parameters for connecting to an external system that provides data. This includes:

  • Connection details — endpoints, credentials, and access parameters
  • Location configuration — folders, buckets, queues, or topics to read from
  • Polling behavior — how often to check for new data
  • Failure handling — retry logic and error management

Sources are referenced by Input Processors, which handle the actual message ingestion and processing workflow.

Sources vs Input Processors

Understanding the distinction helps you build maintainable workflows:

SourcesInput Processors
Define where data comes from (connection/location)Define how data is processed (workflow entry)
Reusable across multiple workflowsWorkflow-specific configuration
Configure endpoints, folders, credentialsConfigure format parsing, routing, initial processing
Example: S3 bucket name and regionExample: Which format to parse, where to route

When to define a Source vs using Input Processor directly

Define a Source Asset when:

  • The same connection/location is used across multiple workflows
  • You need to centralize credential management
  • You want to monitor and manage connections independently
  • You're connecting to external systems (S3, Kafka, Email, FTP, etc.)

Input Processor configuration is sufficient when:

  • The data source is internal to the workflow (Message Source, Timer)
  • No external connection parameters are needed
  • The configuration is truly workflow-specific

Sources Overview

SourcePurposeCommon Use Case
EmailConnect to mail servers (IMAP, POP3, MS 365)Process incoming emails and attachments
File SystemRead from local or mounted file systemsPick up files from local directories
FTPConnect to FTP/SFTP serversRetrieve files from FTP endpoints
Google Cloud StoragePoll GCS buckets for objectsCloud storage integration
HTTPDefine HTTP endpoint parametersREST API data ingestion
KafkaConnect to Apache Kafka topicsStream processing from Kafka
MessageDefine inline message sourcesStatic or template messages
NFSConnect to NFS sharesNetwork file system access
OneDriveConnect to Microsoft OneDriveCloud document processing
S3Poll Amazon S3 bucketsObject storage integration
ServiceUse Service Assets as data sourcesJDBC queries, HTTP calls as streams
SharePointConnect to Microsoft SharePointEnterprise document management
SMBConnect to SMB/CIFS sharesWindows file sharing
SQSPoll Amazon SQS queuesMessage queue processing
TCPReceive data over TCP connectionsNetwork socket data ingestion
TimerGenerate scheduled message triggersCron-based workflow initiation
UDPReceive data over UDPLightweight network messaging
Virtual File SystemPoll VFS for filesAbstracted file system access
WebDAVConnect to WebDAV serversWeb-based file management

Common Configuration Patterns

Most Source Assets share these configuration sections:

  • Name & Description — Asset identification
  • Required Roles — Access control for the asset
  • Connection — Reference to a Connection Asset (for external sources)
  • Directories/Folders — Source location configuration
  • Polling & Processing — How often to check for new data
  • Throttling & Failure Handling — Rate limiting and error recovery

See Also

  • Input Processors — Process data from Sources
  • Connections — Reusable connection definitions used by Sources
  • Sinks — The counterpart to Sources — where data goes