
Forwarded Data

Use forwarded data in Statsig Warehouse Native to send events to Statsig and store them in your warehouse for experiment and analytics use.

If you log events or exposures through Statsig's SDK, Statsig forwards that data back to your warehouse in near-real time, on demand.

Setting up tables for forwarded data

By default, when you set up a data connection, Statsig automatically creates tables named exposures and events to receive forwarded SDK data.

To rename these tables, go to the data connection page, open the Advanced tab, and update the table names.

Forwarded data table configuration interface

If you change a table name after Statsig has already exported data, Statsig writes future data to the new table.

Statsig doesn't forward non-production exposures or log events to external warehouses.

Exposures

Logging exposures with Statsig provides real-time diagnostics on the Statsig console and real-time aggregations like exposures by hour.

When you run Pulse analysis, Statsig fast-forwards raw exposures to catch up with the real-time stream. As a result, the analysis includes all users in your experiment, and Pulse results can be as fresh as approximately 15 minutes, assuming events and metrics arrive at the same speed.

Statsig performs a just-in-time update of exposures in your warehouse when Pulse is loaded, for the first 1 million exposures logged to the experiment. After that threshold, Statsig batches, deduplicates, and writes exposures to your warehouse once a day.

Statsig doesn't deduplicate fast-forwarded exposures, and some fields are missing (notably user_dimensions). These fields are included in the subsequent daily load.

Each day, Statsig exports a deduplicated digest to your warehouse to ensure consistency. The digest is deduplicated against the fast-forwarded data as part of the standard Pulse pipeline.

Duplicate exposure records

Statsig retains exposure data for 30 days and can't deduplicate against exposures older than that window. As a result, if a unit that was first exposed more than 30 days ago is exposed again on a given day, Statsig re-sends an exposure for that unit. Statsig deduplicates these records correctly during analysis in your warehouse. However, don't assume the exposures table contains one row per unit: it can contain duplicates even after you filter out fast-forwarded exposures, which can themselves include duplicate records.
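
If you query the exposures table directly, outside of Pulse, you may want to collapse duplicates yourself. The following is a minimal Python sketch of one way to do that: keep the earliest exposure per experiment and unit. The field names experiment_id, unit_id, and timestamp are hypothetical placeholders, not the actual forwarded-table schema, and this is not Statsig's own deduplication logic.

```python
from datetime import datetime
from typing import Dict, Iterable, List, Tuple


def first_exposures(rows: Iterable[dict]) -> List[dict]:
    """Keep the earliest exposure per (experiment_id, unit_id).

    Field names are hypothetical; map them to your table's columns.
    """
    earliest: Dict[Tuple[str, str], dict] = {}
    for row in rows:
        key = (row["experiment_id"], row["unit_id"])
        kept = earliest.get(key)
        # Replace the stored row only if this one is strictly earlier.
        if kept is None or row["timestamp"] < kept["timestamp"]:
            earliest[key] = row
    # Sort for a stable, predictable output order.
    return sorted(
        earliest.values(),
        key=lambda r: (r["experiment_id"], r["unit_id"]),
    )


# Illustrative rows: u1 is re-sent after the 30-day window,
# and u2 appears twice with the same timestamp.
rows = [
    {"experiment_id": "exp_a", "unit_id": "u1", "timestamp": datetime(2024, 2, 15)},
    {"experiment_id": "exp_a", "unit_id": "u1", "timestamp": datetime(2024, 1, 1)},
    {"experiment_id": "exp_a", "unit_id": "u2", "timestamp": datetime(2024, 1, 3)},
    {"experiment_id": "exp_a", "unit_id": "u2", "timestamp": datetime(2024, 1, 3)},
]
deduped = first_exposures(rows)
```

In a warehouse, the same idea would typically be expressed as a grouped minimum or a window function over your own column names.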

For gates with 0% or 100% rollout, Statsig doesn't forward exposures to your warehouse by default. If you need them, contact the support team, your sales contact, or the Slack community.

Events

If you use Statsig SDKs to capture custom events, Statsig exports those events to your warehouse hourly. Pulse results on metrics derived from those events are as fresh as approximately 1 hour.
