Braze Cloud Data Ingestion

Braze Cloud Data Ingestion (CDI) allows you to set up a direct connection from your data storage solution to sync relevant user data and other non-user data to Braze. This data can then be used for personalization or segmentation to power your marketing use cases. Cloud Data Ingestion’s flexible integration supports complex data structures, including nested JSON and arrays of objects.

How it works

With Braze Cloud Data Ingestion (CDI), you set up an integration between your data warehouse instance and your Braze workspace to sync data on a recurring basis. The sync runs on a schedule you set, and each integration can have a different schedule. Syncs can run as frequently as every 15 minutes or as infrequently as once per month. If you need syncs to run more often than every 15 minutes, contact your customer success manager or consider using REST API calls for real-time data ingestion.

note

Sync frequency in the dashboard controls how often Braze runs a sync (for example, options such as hourly or more frequent runs within an hour). It does not set a custom interval longer than one hour between runs. To run a sync outside the scheduled cadence (for example, on demand after your warehouse load completes), use the Trigger a sync endpoint with your integration ID.

When a sync runs, Braze directly connects to your data warehouse instance, retrieves all new data from the specified table, and updates the corresponding data on your Braze dashboard. Each time the sync runs, any updated data is reflected in Braze.
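As a rough illustration of the on-demand trigger mentioned in the note above, the following sketch builds (but does not send) a request for the Trigger a sync endpoint. The path `/cdi/integrations/{integration_id}/sync`, the Bearer-token header, and the `rest_endpoint` value are assumptions for illustration; confirm them against the endpoint reference for your Braze instance. The function name is hypothetical.

```python
import urllib.request


def build_trigger_sync_request(rest_endpoint: str, integration_id: str, api_key: str) -> urllib.request.Request:
    """Build a POST request that asks Braze to run a CDI sync now.

    Assumptions (not confirmed by this page): the endpoint path and
    the Bearer-token authorization scheme.
    """
    url = f"{rest_endpoint.rstrip('/')}/cdi/integrations/{integration_id}/sync"
    return urllib.request.Request(
        url,
        method="POST",
        headers={"Authorization": f"Bearer {api_key}"},
    )
```

To send the request after your warehouse load completes, you could pass the returned object to `urllib.request.urlopen` and inspect the response status.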

Finding your integration ID

You can find your integration ID in the URL when viewing an integration in the Braze dashboard. Navigate to Data Settings > Cloud Data Ingestion and select an integration. The integration ID appears in the URL in the format https://[instance].braze.com/integrations/cloud_data_ingestion/[integration_id]. For example, if your URL is https://dashboard-01.braze.com/integrations/cloud_data_ingestion/abc123xyz, your integration ID is abc123xyz. You can use this ID when making API calls to trigger syncs or check sync status.
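If you copy integration URLs into scripts, a small helper can extract the ID for you. This is a minimal sketch that assumes the URL follows the format shown above; the function name is hypothetical.

```python
from urllib.parse import urlparse

# Path prefix taken from the dashboard URL format described above.
CDI_PATH_PREFIX = "/integrations/cloud_data_ingestion/"


def integration_id_from_url(dashboard_url: str) -> str:
    """Return the integration ID from a CDI integration dashboard URL."""
    path = urlparse(dashboard_url).path.rstrip("/")
    if not path.startswith(CDI_PATH_PREFIX):
        raise ValueError(f"not a Cloud Data Ingestion integration URL: {dashboard_url}")
    integration_id = path[len(CDI_PATH_PREFIX):]
    # The ID is expected to be the single remaining path segment.
    if not integration_id or "/" in integration_id:
        raise ValueError(f"could not find an integration ID in: {dashboard_url}")
    return integration_id
```

For the example URL above, `integration_id_from_url("https://dashboard-01.braze.com/integrations/cloud_data_ingestion/abc123xyz")` returns `abc123xyz`.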

Use cases

With Braze Cloud Data Ingestion capabilities, you can:

Create a simple integration directly from your data warehouse or file storage solution to Braze in just a few minutes.
Securely sync user data, including attributes, events, and purchases from your data warehouse to Braze.
Close the data loop with Braze by combining Cloud Data Ingestion with Currents or Snowflake Data Sharing.

In addition, Connected Sources are a zero-copy alternative. You can have Braze directly query your data warehouse or file storage solution to construct CDI segments, all without copying the underlying data to Braze.

Supported data sources

Cloud Data Ingestion can sync data from:

Amazon Redshift
Databricks
Google BigQuery
Microsoft Fabric
Snowflake
Amazon S3

Supported data types

Cloud Data Ingestion supports the following data types:

User data

User attributes, including:
- Nested custom attributes
- Arrays of objects
- Subscription statuses
Custom events
Purchase events
User deletion requests

Non-user objects

Catalog items

Zero-copy messaging

Connected Sources

User identifiers for data ingestion

When syncing user data through Cloud Data Ingestion, you can identify users with any of the following identifier types. Each row in your source table should contain a value for only one identifier type, but the table itself can include columns for any number of the five types.

Identifier	Description
`EXTERNAL_ID`	The external ID that identifies the user profile to create or update. This should match the `external_id` value used in Braze.
`ALIAS_NAME` and `ALIAS_LABEL`	These two columns create a user alias object. `alias_name` should be a unique identifier, and `alias_label` specifies the type of alias. Users may have multiple aliases with different labels but only one `alias_name` per `alias_label`.
`BRAZE_ID`	The Braze user identifier generated by the Braze SDK. New users cannot be created using a Braze ID through Cloud Data Ingestion. To create new users, specify an external user ID or user alias.
`EMAIL`	The user’s email address. If multiple profiles with the same email address exist, the most recently updated profile is prioritized for updates. If you include both email and phone, email is used as the primary identifier.
`PHONE`	The user’s phone number. If multiple profiles with the same phone number exist, the most recently updated profile is prioritized for updates.
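Before loading rows into your source table, you may want to check that each row follows the identifier rules above. The following is a minimal sketch, not a Braze-provided utility: the function name and the dictionary-per-row representation are assumptions, and the column names come from the table.

```python
# Column groups per identifier type, as listed in the table above.
IDENTIFIER_COLUMNS = {
    "external_id": ("EXTERNAL_ID",),
    "alias": ("ALIAS_NAME", "ALIAS_LABEL"),
    "braze_id": ("BRAZE_ID",),
    "email": ("EMAIL",),
    "phone": ("PHONE",),
}


def identifier_type(row: dict) -> str:
    """Return the identifier type a row uses, or raise ValueError."""
    present = []
    for name, columns in IDENTIFIER_COLUMNS.items():
        filled = [c for c in columns if row.get(c) not in (None, "")]
        # An alias needs both ALIAS_NAME and ALIAS_LABEL.
        if filled and len(filled) != len(columns):
            raise ValueError(f"{name} requires all of: {', '.join(columns)}")
        if filled:
            present.append(name)
    # Email and phone together are tolerated; email is the primary identifier.
    if set(present) == {"email", "phone"}:
        return "email"
    if len(present) != 1:
        raise ValueError(f"expected exactly one identifier type, found: {present}")
    return present[0]
```

Rows that mix other identifier types, such as `EXTERNAL_ID` and `BRAZE_ID`, raise an error so you can fix them before the sync runs.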

For detailed information about setting up table columns and payload formatting requirements, see Table setup for Cloud Data Ingestion.

For source-specific setup instructions and SQL examples, see Data Warehouse integrations.

Data point usage

For customers on data points-based billing, data point billing for Cloud Data Ingestion is equivalent to billing for updates through the /users/track endpoint. Refer to Data points for more information.

important

Braze Cloud Data Ingestion counts toward your available rate limit. If you also send data using another method, the rate limit is shared between the Braze API and Cloud Data Ingestion.

Product limitations

Limitation	Description
Number of integrations	There is no limit on how many integrations you can set up. However, you can set up only one integration per table or view.
Number of rows	By default, each run can sync up to 500 million rows. Any syncs with more than 500 million new rows are stopped. If you need a higher limit than this, contact your Braze customer success manager or Braze Support.
Attributes per row	Each row should contain a single user ID and a JSON object with up to 250 attributes. Each key in the JSON object counts as one attribute (that is, an array counts as one attribute).
Payload size	Each row can contain a payload of up to 1 MB. Payloads greater than 1 MB are rejected, and the error “Payload was greater than 1MB” is logged to the sync log along with the associated external ID and truncated payload.
Data type	You can sync user attributes, events, and purchases through Cloud Data Ingestion.
Braze region	This product is available in all Braze regions. Any Braze region can connect to any source data region.
Source region	Braze connects to your data warehouse or cloud environment in any region or cloud provider.
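The per-row limits in the table can be checked before data reaches your source table. The sketch below is illustrative only: the function name is hypothetical, and it assumes that 1 MB means 1,000,000 bytes of UTF-8 encoded JSON, which this page does not specify.

```python
import json

MAX_ATTRIBUTES = 250           # top-level keys per row, from the table above
MAX_PAYLOAD_BYTES = 1_000_000  # assumption: 1 MB interpreted as 10^6 bytes


def check_payload(payload: dict) -> list[str]:
    """Return a list of limit violations for one row's JSON payload."""
    problems = []
    # Each top-level key counts as one attribute; an array counts once.
    if len(payload) > MAX_ATTRIBUTES:
        problems.append(f"{len(payload)} attributes exceeds {MAX_ATTRIBUTES}")
    size = len(json.dumps(payload).encode("utf-8"))
    if size > MAX_PAYLOAD_BYTES:
        problems.append(f"payload of {size} bytes exceeds {MAX_PAYLOAD_BYTES}")
    return problems
```

An empty list means the row is within both limits under these assumptions. A nested payload such as `{"plan": "pro", "devices": [{"os": "ios"}, {"os": "android"}]}` counts as two attributes.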
