GCP Cloud Storage

Reads events from Google Cloud Storage objects. Supports both list-based polling and Pub/Sub notifications.

Configuration

| Field | Type | Required | Default | Description |
|---|---|---|---|---|
| decoding | DecoderConfigEnum (optional) | no | | Optional decoder configuration for input events. |

Mode

Source mode: list for polling or event_stream for Pub/Sub notifications.

| Mode | Description |
|---|---|
| list | List mode: periodically polls for new objects. |
| event_stream | Event stream mode: consumes GCS notifications from Pub/Sub. |

List

| Field | Type | Required | Default | Description |
|---|---|---|---|---|
| auth | GcpConfig (optional) | no | | [Authentication] GCP authentication and project settings. |
| auth.project_id | string (optional) | yes | | GCP Project ID. |
| auth.credentials_file | string (optional) | no | | Path to a Service Account JSON key file. If not provided, Application Default Credentials (ADC) will be used. |
| auth.credentials_json | string (optional) | no | | Raw Service Account JSON key content. If not provided, credentials_file or ADC will be used. (sensitive) |
| bucket | string | yes | | GCS bucket to list objects from. |
| prefix | string (optional) | no | | Key prefix filter. |
| interval_secs | unsigned integer | no | 60 | Polling interval in seconds. |
| delete_after_read | boolean | no | false | Whether to delete objects from GCS after they have been successfully processed. |

Example:

mode:
  mode: list
  bucket: my-gcs-bucket
  interval_secs: 60
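
The optional fields from the table can be combined with the required ones. The following is a minimal sketch, assuming the mode-specific fields nest under the mode key and that auth is a nested block with the sub-fields listed in the table. The bucket name, prefix, project ID, and key file path are placeholders, not values taken from this documentation.

```yaml
# Sketch only: all values are placeholders, and the nesting under `mode:` is assumed.
mode:
  mode: list
  bucket: my-gcs-bucket
  prefix: logs/                # hypothetical key prefix filter
  interval_secs: 300           # poll every 5 minutes instead of the default 60
  delete_after_read: false     # the default: keep objects after processing
  auth:
    project_id: my-project     # hypothetical project ID
    # Omit credentials_file to fall back to Application Default Credentials.
    credentials_file: /path/to/service-account.json
```

Setting delete_after_read to true removes objects once they have been processed successfully, so it is probably best left at false until the pipeline has been verified end to end.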

Event Stream

| Field | Type | Required | Default | Description |
|---|---|---|---|---|
| pubsub | PubSubConfig | yes | | [Event Stream] Pub/Sub configuration for consuming GCS event notifications. |
| pubsub.auth | GcpConfig (optional) | no | | GCP authentication and project settings. |
| pubsub.auth.project_id | string (optional) | yes | | GCP Project ID. |
| pubsub.auth.credentials_file | string (optional) | no | | Path to a Service Account JSON key file. If not provided, Application Default Credentials (ADC) will be used. |
| pubsub.auth.credentials_json | string (optional) | no | | Raw Service Account JSON key content. If not provided, credentials_file or ADC will be used. (sensitive) |
| pubsub.topic | string | yes | | Pub/Sub topic name. |
| pubsub.subscription | string (optional) | no | | Pub/Sub subscription name (required for sources). |
| pubsub.dead_letter_topic | string (optional) | no | | Dead Letter Topic name. |
| pubsub.enable_message_ordering | boolean | no | | Whether to enable message ordering. |
| pubsub.ordering_key_field | string (optional) | no | | Field name to use as the ordering key. |
| bucket | string | yes | | GCS bucket to read objects from. |
| include_metadata_updates | boolean | no | false | Whether to listen for OBJECT_METADATA_UPDATE events. |

Example:

mode:
  mode: event_stream
  bucket: my-gcs-bucket
  pubsub:
    topic: my-topic
    subscription: my-sub
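
A fuller event stream configuration might also set authentication, a dead letter topic, and metadata update handling. This is a sketch under the same assumption about nesting under the mode key, with auth placed inside pubsub as the table indicates. The topic, subscription, and project names are placeholders.

```yaml
# Sketch only: all values are placeholders, and the nesting under `mode:` is assumed.
mode:
  mode: event_stream
  bucket: my-gcs-bucket
  include_metadata_updates: true   # also listen for OBJECT_METADATA_UPDATE events
  pubsub:
    topic: my-topic
    subscription: my-sub           # the table notes this is required for sources
    dead_letter_topic: my-dead-letter-topic   # hypothetical topic name
    auth:
      project_id: my-project       # hypothetical project ID
      # No credentials_file or credentials_json here, so ADC would be used.
```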

Telemetry

This component emits the following standard telemetry metrics and events.

Metrics

| Name | Type | Description |
|---|---|---|
| component_received_events_total | Counter | Total number of Kinetic events created by this source. |
| component_received_event_bytes_total | Counter | Total estimated byte size of events created. |
| component_received_network_bytes_total | Counter | Total number of raw network bytes received. |
| component_errors_total | Counter | Total number of errors encountered during processing. |

Events

| Name | Description |
|---|---|
| ComponentError | Emitted when an error occurs during component execution. |
| ComponentEventsDropped | Emitted when events are intentionally or unintentionally dropped. |

Example

sources:
  my_gcp_cloud_storage:
    type: gcp_cloud_storage
    mode: list
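
The example above shows only the mode selector and leaves out bucket, which the List table marks as required. If the mode-specific fields nest under mode, as the per-mode examples suggest, a more complete source definition might look like the sketch below. Whether the selector is written as a scalar or as a nested block is an assumption here and should be checked against a working configuration.

```yaml
# Sketch only: the nested `mode:` block is an assumption, and the bucket name is a placeholder.
sources:
  my_gcp_cloud_storage:
    type: gcp_cloud_storage
    mode:
      mode: list
      bucket: my-gcs-bucket
      interval_secs: 60      # the documented default, written out for clarity
```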