On this page

Databricks

Configure Statsig data warehouse ingestion from Databricks, including authentication, scheduled queries, and mapping to events and user properties.

How Databricks ingestion works

To set up a connection with Databricks, Statsig needs the following:

  • API Key
  • Server Hostname
  • HTTP Path
You can use any cluster in your project to connect to your data, but Statsig recommends using a Databricks SQL warehouse/endpoint so that the cluster doesn't need to start for every pull.

API Key

To generate a new API key, go to User Settings in your Databricks console and generate a new token as shown below.

databricks info

You can also use a personal access token for a service principal. Generate one by following the steps in Manage personal access tokens for a service principal.

Server Hostname & HTTP Path

Find your Server Hostname and HTTP Path in your Databricks console by navigating to your cluster, selecting the Configuration tab, and expanding Advanced options.

credentials

Was this helpful?