Forward Proxy

Statsig Forward Proxy

The Statsig Forward Proxy is a service that we developed to be hosted and run in your own infrastructure. If SDKs are configured to use the forward proxy, it provides a closer, but also more reliable hop for retrieving Statsig dependent configurations from within your infrastructure instead of reaching out to our networks. The expected benefits of using the proxy are:

Reduced dependency on Statsig Infrastructure Availability
Improved performance through localization of download_config_spec to your cluster
Improved cost savings and reduced network overhead through minimizing requests to Statsig Infrastructure
Improved consistency of configurations, as the forward proxy becomes an articulation point for serving configs
Support for cool features such as a streaming API through GRPC

If you have any questions about how to deploy, or need assistance, feel free to reach out to us on our slack channel or create a new github issue.

Installation

Helm Installation

The recommended way to install Statsig Forward Proxy in a Kubernetes environment is using the official Helm chart. Follow these steps to deploy:

# Add the Statsig Helm repository
helm repo add statsig https://statsig-helm.storage.googleapis.com
helm repo update

# Install the chart
helm install statsig-forward-proxy statsig/statsig-forward-proxy

The Helm chart provides extensive configuration options for customizing your deployment. For detailed configuration options, refer to the Statsig Forward Proxy Helm chart documentation.

Manual Deployment

For environments where Helm is not available or for more customized deployments, Statsig Forward Proxy can be deployed manually. The proxy is available as a pre-built Docker image, and you can also build your own binary from source. For detailed instructions on manual deployment options and available configuration parameters, refer to the Manual Deployment section in the GitHub repository.

How Forward Proxy works

The forward proxy works by setting up a local http or grpc server. When requests are made to the forward proxy, only the initial request is blocking, after that, a background loop is spawned which keeps configurations up to date without impacting the hot path for serving configurations. While providing this capability, the forward proxy also provides other benefits such the ability to enable backup caches and monitoring through technologies such as Redis and Statsd.

Integration with SDK

Legacy Java/Kotlin, Legacy Python, Python Server Core and Node Server Core SDKs provide the option for Forward Proxy integration with GRPC streaming. For other server SDKs, you can integrate with the Forward Proxy via HTTP by overriding the initialize endpoints in StatsigOptions. Feel free to consult us in Slack on this approach.

You can now configure SDK network using different network protocol to integrate with Statsig Forward Proxy for different endpoints. There are 3 network endpoints SDK make requests to download_config_specs,get_id_lists, and log_events, and you can currently configure the download_config_specs to use the proxy. We are actively developing the latter two.

Network protocol for different endpoints

For download_config_specs endpoint, where we get specs on evaluating gates/layers/experiments/configs

http: the default protocol. If SDK is initialized with http, it will poll config updates in background thread.
grpc_websocket: Establish a grpc streaming from SDK to statsig forward proxy, and listen to updates being pushed from proxy servers. You have to use statsig forward proxy or have a similar proxy server in order to use grpc streaming. Note: Not all SDKs support GRPC yet, if you have an SDK you want support for please reach out to our support channel on slack.
grpc: Unary RPC, behave similarly to HTTP protocol, after initialization, SDK poll changes from forward proxy server in background thread.

Listen to Config Change Example

const statsigOptions = {
  proxyConfigs: {
    download_config_specs:  {
      proxyAddress: proxyAddress // Your proxy address
      protocol: "grpc_websocket"
    }
  }
}
Statsig.initialize(server_key, statsigOptions)
// Statsig will use listen for config updates from statsig forward proxy using grpc_websocket protocol. And use http to get idlists and post log events from statsig servers.

For information on your specific SDK language, see the language specific docs in the left hand column.

Failover behavior

Initialization SDK remains the same behavior when forward proxy is being used. But there are several configurations you can use to improve the data availability during initialization. Default behavior is SDK will get from forward proxy. If DataAdapter is presented, get from DataAdapter first, and fallback to Forward Proxy if there is no value served from data adapter. There are several ways you can configure the behavior here.

set fallbackToStatsigAPI = true. If you want to fallback to statsig api when initialization or config sync from original sources is false
If you want to customize your own order of initialization set initializeSources to have multiple sources,for example initializeSources=[DataSource.Network, DataSource.DataAdapter, DataSource.StatsigNetwork] will fetch from 3 sources sequentially until it successfully find one

Config Sync (Post initialization) If sdk is using pulling model, grpc or http

Set fallbackToStatsigAPI=true
Set configSyncSources to have multiple sources,for example initializeSources=[DataSource.Network, DataSource.StatsigNetwork] will fetch from 3 sources sequentially until it successfully find one (Note if you are using streaming, config sync sources does not matter here)

If sdk is using grpc_websocket protocol, meaning sdk is in a listening mode for any config changes, by default it will

Retry connecting to forward proxy, with exponential backoff, 10s, 50s, 250s, 1250s … until it reconnects, limit fo r retry is 10 times.
If 4th retry still fails, sdk will start polling from from Statsig endpoint. Use StatsigOptions.pollingInterval to control how often it
While sdk polling from Statsig endpoint, sdk will keep retrying until it connected, Everything mentioned above is configurable with StatsigOptions

Streaming failover in depth

Within SDK, when exceptions being thrown, for example, connection is dropped or forward proxy server is down, grpc will return error to SDK and we start falling back behavior. Python will check if the channel is idle every 2 hours to ensure channel is not idle, this is to prevent some unknown corner case behavior within python grpc library.

TLS — Advanced network authentication

Forward proxy support TLS and mTLS when use GRPC server, so all grpc network (streaming and unary call) are encrypted To enable it:

Setup forward proxy server with valid certifications, more details to be found here
Setup SDK with valid certifications. more details to be found within SDK page python example. a. If certifications are misconfigured, SDK will treat as exception and started failing over behavior you configured. For example, fallback to statsig api, retry to connect and start falling back behavior.

Coordinate SDK DataAdapter / DataStore with Forward Proxy Cache service

In order to increase reliability of SDK during initialization, setup DataStore will help in case Forward Proxy is not available when initialized. If DataStore is set, the SDK will try to initialize with values from dataStore, and sync (listen or polling depends on setup) from forward proxy in background after initialization. It’s recommended for SDK and Forward Proxy to share the same cache service for consistency and reduce cache I/O. Example on DataStore setup in python:

class DataAdapter(IDataStore):
def __init__(self):
  self.redis_cache = ExampleCache()

def get(self, key: str):
  # IDlist isn't currently supported by proxy, so do normal lookup
  if "statsig.id_lists" in key:
    return self.cache.get(key)
  # This logic must stay in sync with statsig-forward-proxy
  else:
    hashed_key = "statsig::" + hashlib.sha256(key.encode()).hexdigest()
    return self.cache.hget(hashed_key, 'config')

def set(self, key: str):
  # Don't implement set method if you are share the same cache between forward proxy and sdk, forward proxy will write to cache 
  pass

def shutdown(self):
  self.cache.shutdown()

Deployment Notes

Given that every use case for the proxy is different, we don’t have strict recommendations. However, we do have general recommendations:

Understand what your general QPS to Statsig is. If you need help understanding this, feel free to reach out.
If possibly using your payload, get a sense of what throughput can be supported by a single pod and then calculate how much you need to scale out by to support your traffic.
Pre-scale the proxy for your max QPS and then begin to gradually rollout the proxy to services.
Configure your auto-scaling, for most folks, scaling up at 70% utilization for CPU and memory is typically sufficient, but depending on your usage this might need to be tweaked.

If you are seeing scaling issues that can’t be resolved by scaling out horizontally, we recommend fronting it with nginx to help with flow control. If you need any help, we are happy to collaborate and work closely, just send a message to our slack channel to get further assistance.

Workspace Management

Integrations

Collaboration

Privacy

Infrastructure Operations

Statsig Forward Proxy

Installation

Helm Installation

Manual Deployment

How Forward Proxy works

Integration with SDK

Network protocol for different endpoints

Listen to Config Change Example

Failover behavior

Streaming failover in depth

TLS — Advanced network authentication

Coordinate SDK DataAdapter / DataStore with Forward Proxy Cache service

Deployment Notes

Workspace Management

Integrations

Collaboration

Privacy

Infrastructure Operations

​Statsig Forward Proxy

​Installation

​Helm Installation

​Manual Deployment

​How Forward Proxy works

​Integration with SDK

​Network protocol for different endpoints

​Listen to Config Change Example

​Failover behavior

​Streaming failover in depth

​TLS — Advanced network authentication

​Coordinate SDK DataAdapter / DataStore with Forward Proxy Cache service

​Deployment Notes

Statsig Forward Proxy

Installation

Helm Installation

Manual Deployment

How Forward Proxy works

Integration with SDK

Network protocol for different endpoints

Listen to Config Change Example

Failover behavior

Streaming failover in depth

TLS — Advanced network authentication

Coordinate SDK DataAdapter / DataStore with Forward Proxy Cache service

Deployment Notes