snowplow/bigquery-loader-pubsub该Docker镜像提供Snowplow加载器功能,专门用于从Google Cloud Platform (GCP) 数据管道读取事件数据并加载到BigQuery数据仓库。作为Snowplow数据处理生态的重要组件,它简化了事件数据从GCP管道(如Pub/Sub、Cloud Storage)到BigQuery的集成流程,支持高效的数据传输与存储。
bashdocker run -d \ --name snowplow-bigquery-loader \ -e GCP_PROJECT_ID="your-gcp-project-id" \ -e BIGQUERY_DATASET="your-dataset" \ -e BIGQUERY_TABLE="your-table" \ -e INPUT_SOURCE="pubsub" \ -e PUBSUB_TOPIC="projects/your-project/topics/your-topic" \ -v /path/to/gcp-credentials.json:/app/credentials.json \ -e GOOGLE_APPLICATION_CREDENTIALS="/app/credentials.json" \ snowplow/bigquery-loader:latest
| 环境变量 | 描述 | 示例 |
|---|---|---|
| GCP_PROJECT_ID | GCP项目ID | "my-gcp-project" |
| BIGQUERY_DATASET | BigQuery目标数据集名称 | "snowplow_events" |
| BIGQUERY_TABLE | BigQuery目标表名称 | "page_views" |
| INPUT_SOURCE | 输入数据源类型(支持"pubsub"或"gcs") | "pubsub" |
| PUBSUB_TOPIC | Pub/Sub主题完整路径(INPUT_SOURCE为"pubsub"时必填) | "projects/my-project/topics/snowplow-events" |
| GCS_BUCKET | Cloud Storage桶名称(INPUT_SOURCE为"gcs"时必填) | "my-snowplow-bucket" |
| GOOGLE_APPLICATION_CREDENTIALS | GCP认证凭据文件路径 | "/app/credentials.json" |
yamlversion: '3' services: snowplow-loader: image: snowplow/bigquery-loader:latest container_name: snowplow-bigquery-loader environment: - GCP_PROJECT_ID=my-gcp-project - BIGQUERY_DATASET=snowplow_events - BIGQUERY_TABLE=page_views - INPUT_SOURCE=pubsub - PUBSUB_TOPIC=projects/my-project/topics/snowplow-events - GOOGLE_APPLICATION_CREDENTIALS=/app/credentials.json volumes: - ./gcp-credentials.json:/app/credentials.json restart: unless-stopped

manifest unknown 错误
TLS 证书验证失败
DNS 解析超时
410 错误:版本过低
402 错误:流量耗尽
身份认证失败错误
429 限流错误
凭证保存错误
来自真实用户的反馈,见证轩辕镜像的优质服务