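`appdynamics/machine-agent-netviz` is a machine agent image built for Kubernetes clusters. Its core function is network visibility: it monitors cluster network traffic, collects performance metrics, and visualizes network behavior, so users can track cluster network state in real time and quickly locate and resolve network issues.

Deploy the agent as a DaemonSet so that one agent instance runs on every node and network monitoring covers the whole cluster.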
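
| Environment variable | Description | Default | Allowed values |
|---|---|---|---|
| KUBE_API_SERVER | Kubernetes API server address, used to fetch cluster resource information | [***] | In-cluster API address |
| METRICS_INTERVAL | Network metrics collection interval | 30s | 10s/30s/1m, etc. |
| LOG_LEVEL | Log output level | info | debug/info/warn/error |
| NETWORK_INTERFACE | Host network interface to monitor | eth0 | The node's actual network interface (e.g. ens3) |
| VISIBILITY_MODE | Network visibility mode (basic or advanced) | basic | basic/advanced |
| PROMETHEUS_EXPORTER_PORT | Port on which Prometheus metrics are exposed | 9090 | Any unused port in 1024-65535 |
| ALERT_THRESHOLD_LATENCY | Network latency alert threshold (milliseconds) | 500ms | Custom value (e.g. 1000ms) |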

```yaml
apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: machine-agent-network-visibility
  namespace: monitoring
  labels:
    app: machine-agent
spec:
  selector:
    matchLabels:
      app: machine-agent
  template:
    metadata:
      labels:
        app: machine-agent
    spec:
      hostNetwork: true  # use the host network to capture node traffic
      containers:
        - name: machine-agent
          image: [image-name]  # replace with the actual image address
          ports:
            - containerPort: 9090  # Prometheus metrics port
              name: metrics
          env:
            - name: KUBE_API_SERVER
              value: "[***]"
            - name: METRICS_INTERVAL
              value: "30s"
            - name: LOG_LEVEL
              value: "info"
            - name: NETWORK_INTERFACE
              value: "eth0"
            - name: VISIBILITY_MODE
              value: "advanced"
            - name: PROMETHEUS_EXPORTER_PORT
              value: "9090"
          resources:
            limits:
              cpu: "500m"
              memory: "512Mi"
            requests:
              cpu: "200m"
              memory: "256Mi"
          volumeMounts:
            - name: var-log
              mountPath: /var/log
            - name: sys
              mountPath: /sys
            - name: run-xtables-lock
              mountPath: /run/xtables.lock
              readOnly: false
      volumes:
        - name: var-log
          hostPath:
            path: /var/log
        - name: sys
          hostPath:
            path: /sys
        - name: run-xtables-lock
          hostPath:
            path: /run/xtables.lock
            type: FileOrCreate
```
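
To deploy and verify:

1. Save the manifest above as `machine-agent-daemonset.yaml` and replace the `image` placeholder (`[image-name]`) with the actual image address.
2. Apply it: `kubectl apply -f machine-agent-daemonset.yaml -n monitoring`
3. Check the pods with `kubectl get pods -n monitoring -l app=machine-agent` and confirm they are in the `Running` state.
4. Run `kubectl port-forward <pod-name> 9090:9090 -n monitoring`, then open http://localhost:9090/metrics to verify the metrics output.
5. Run `kubectl logs <pod-name> -n monitoring` and confirm that no errors are logged.

Notes:

- Advanced mode (`VISIBILITY_MODE=advanced`) collects network traffic at a finer granularity and may increase resource usage; evaluate it against the size of your cluster.
- The agent needs permission to read cluster resources from the Kubernetes API (for example, the `cluster-reader` role).
- To find the node's network interface name, run `kubectl exec -it <pod-name> -n monitoring -- ip link`. `kubectl exec` targets a pod rather than a node, but because the agent pod uses `hostNetwork`, the output lists the node's interfaces (provided the image ships the `ip` tool).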
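
The API access mentioned in the notes could be granted with a dedicated ServiceAccount and a read-only ClusterRole. The following is a minimal sketch, not taken from the image's documentation: the names `machine-agent` and `machine-agent-read` are hypothetical, and the resource list is an assumption about what the agent reads, so trim or extend it as needed.

```yaml
# Sketch only: all names and the resource list below are assumptions.
apiVersion: v1
kind: ServiceAccount
metadata:
  name: machine-agent          # hypothetical ServiceAccount name
  namespace: monitoring
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: machine-agent-read     # hypothetical read-only role
rules:
  - apiGroups: [""]
    resources: ["nodes", "pods", "services", "endpoints", "namespaces"]
    verbs: ["get", "list", "watch"]   # read-only verbs
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  name: machine-agent-read
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: machine-agent-read
subjects:
  - kind: ServiceAccount
    name: machine-agent
    namespace: monitoring
```

For the binding to take effect, the DaemonSet's pod template would also need `serviceAccountName: machine-agent` under `spec.template.spec`.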