Question 1

How does OSO Kafka Backup authenticate to Google Cloud Storage?

Accepted Answer

Three ways: a service account JSON key set via service_account_json in the storage config, the GOOGLE_APPLICATION_CREDENTIALS environment variable, or ambient credentials — Workload Identity on GKE and the metadata service on GCE — with no key file at all.

Question 2

Can I use GCS lifecycle rules with Kafka backups?

Accepted Answer

Yes. Backups are written under a configurable object prefix, so you can attach lifecycle rules that transition older backups to Nearline, Coldline, or Archive storage classes, or delete them in line with your retention policy.

Question 3

Does this work with dual-region or multi-region GCS buckets?

Accepted Answer

Yes. The gcs backend addresses the bucket by name, so regional, dual-region, and multi-region buckets all work unchanged. Dual-region buckets give the backup data itself geographic redundancy without a second backup job.

Question 4

Can I back up Kafka on GKE without managing key files?

Accepted Answer

Yes. Bind the backup pod’s Kubernetes service account to a Google service account with Workload Identity, grant it access to the bucket, and omit credentials from the config entirely — the setup guide walks through the binding commands.

Question 5

How is backup data compressed?

Accepted Answer

Topic data is compressed with Zstd or LZ4 before upload, independent of the compression producers used, which typically reduces storage cost substantially compared with raw log segments.

Back up Apache Kafka to Google Cloud Storage

Why Google Cloud Storage for Kafka backups

Configuration

What gets backed up

Frequently asked questions

Ready to protect your Kafka data?

Why Google Cloud Storage for Kafka backups​

Configuration​

What gets backed up​

Frequently asked questions

Related reading

Ready to protect your Kafka data?

Why Google Cloud Storage for Kafka backups

Configuration

What gets backed up