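Provided by Data Mechanics, a cloud-native Spark platform for data engineers. Read our blog post to learn why we built these images, or follow the tutorial to get started quickly.
When Spark runs on Kubernetes, the Spark driver and executors run as Docker containers, and those containers use an image built specifically for Spark. The image contains the Spark distribution, and you can add your application code and dependencies to it.
The image can also be used to run Spark locally, for example in single-container driver mode, or on Kubernetes in a local minikube cluster.
The image contains an unmodified open-source Spark distribution and ships with the following built in: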
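- Connectors for data stored under the following URI schemes: s3a://, gs://, wasbs://, adls://, abfss://
- Python, PySpark, pip, and conda, which make it easy to install additional Python packages. If you do not need PySpark, use the lighter image whose tags carry the jvm-only prefix.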
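As a small illustration of the connector list, the sketch below checks whether a data path uses one of the URI schemes named above. The function name `uses_listed_scheme` and the constant `LISTED_SCHEMES` are hypothetical helpers introduced here, not part of the image; the check says nothing about credentials or connector configuration.

```python
from urllib.parse import urlparse

# URI schemes named in the connector list above (hypothetical constant name).
LISTED_SCHEMES = {"s3a", "gs", "wasbs", "adls", "abfss"}


def uses_listed_scheme(path: str) -> bool:
    """Return True if the path's URI scheme is one of the schemes listed above."""
    # urlparse returns an empty scheme for plain local paths such as /tmp/x.
    return urlparse(path).scheme.lower() in LISTED_SCHEMES
```

A check like this can run before submitting a job, to catch paths written with a scheme the list does not mention (for example s3:// instead of s3a://).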
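Images are published for combinations of the following component versions (not every combination exists):
| Component | Available versions |
|---|---|
| Spark | 2.4.5 to 3.2.1 |
| Hadoop | 3.1, 3.2, 3.3 |
| Java | 8 and 11 |
| Scala | 2.11 and 2.12 |
| Python | 3.7 and 3.8 |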
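Each image is published under several tags. Taking the image with Spark 3.1.2, Java 8, Scala 2.12, Hadoop 3.2.0, and Python 3.8 as an example, the tags are:
- datamechanics/spark:3.1.2-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, which spells out the version of every component
- datamechanics/spark:3.1.2-latest, which fixes only the Spark version; the other dependencies are the latest compatible versions, so they may be upgraded over time
- datamechanics/spark:3.1-latest, which points to the latest patch release of that minor version (for example, once Spark 3.1.3 is released, this tag points to the same image as 3.1.3-latest)

We recommend the short tags (such as datamechanics/spark:3.1-latest or datamechanics/spark:3.1.2-latest). Use a long tag only when you need a specific combination of versions, for example when migrating an existing Scala/Java project; for new JVM projects and for PySpark projects, a short tag is enough.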
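The naming scheme described above can be sketched as two small helpers that assemble the long and short tags from component versions. The function names `long_tag` and `short_tags` are hypothetical, and the code only builds strings following the pattern shown here; it does not check that the resulting image exists.

```python
from typing import List

IMAGE = "datamechanics/spark"


def long_tag(spark: str, hadoop: str, java: str, scala: str, python: str,
             jvm_only: bool = False) -> str:
    """Build the long -latest tag that names every component version."""
    prefix = "jvm-only-" if jvm_only else ""
    return (f"{IMAGE}:{prefix}{spark}-hadoop-{hadoop}-java-{java}"
            f"-scala-{scala}-python-{python}-latest")


def short_tags(spark: str, jvm_only: bool = False) -> List[str]:
    """Build the short -latest tags: full Spark version, then minor version."""
    prefix = "jvm-only-" if jvm_only else ""
    minor = ".".join(spark.split(".")[:2])  # "3.1.2" -> "3.1"
    # The minor-version tag follows the newest patch release, so it may
    # point to a different image than the full-version tag.
    return [f"{IMAGE}:{prefix}{spark}-latest", f"{IMAGE}:{prefix}{minor}-latest"]


print(long_tag("3.1.2", "3.2.0", "8", "2.12", "3.8"))
print(short_tags("3.1.2"))
```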
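By default, we recommend tags ending in -latest, which always point to the most recent image for the given versions. Whenever a bug is fixed (for example, an issue with spark.jars.packages) or a connector version is upgraded, a new image is published with a revision suffix of the form -dmXX, where XX is the revision number.
To pin a specific revision, for example revision 17 of the PySpark image for Spark 3.1.2, use the tag datamechanics/spark:3.1.2-dm17. Release notes are available on the official website.
The images are built with Earthly rather than Docker, and all of them are generated from a single Earthfile. The build context and the Earthfile are generated automatically; the code will be open-sourced soon.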
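To make the difference between floating -latest tags and pinned -dmXX tags concrete, here is a minimal sketch of a tag parser. The regular expression is an assumption inferred from the tag examples in this document, not an official grammar, and `parse_tag` is a hypothetical helper name.

```python
import re
from typing import Dict, Optional

# Pattern inferred from the tag examples in this document (an assumption).
TAG_RE = re.compile(
    r"^(?P<jvm_only>jvm-only-)?"
    r"(?P<spark>\d+\.\d+(?:\.\d+)?)"
    r"(?:-hadoop-(?P<hadoop>[\d.]+)-java-(?P<java>\d+)"
    r"-scala-(?P<scala>[\d.]+)-python-(?P<python>[\d.]+))?"
    r"-(?:latest|dm(?P<revision>\d+))$"
)


def parse_tag(tag: str) -> Optional[Dict[str, object]]:
    """Split a tag into its components; return None if it does not match."""
    tag = tag.rsplit(":", 1)[-1]  # drop a leading "datamechanics/spark:" if present
    match = TAG_RE.match(tag)
    if match is None:
        return None
    info: Dict[str, object] = match.groupdict()
    info["jvm_only"] = info["jvm_only"] is not None
    # A tag is pinned when it carries a -dmXX revision instead of -latest.
    info["pinned"] = info["revision"] is not None
    return info


print(parse_tag("datamechanics/spark:3.1.2-dm17"))
```

A deployment script could use the `pinned` flag to warn when a production manifest references a floating -latest tag.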
Tags on the same line are aliases of the same image.

spark (includes PySpark and Python)

Spark 3
- 3.2.1-hadoop-3.3.1-java-8-scala-2.12-python-3.8-dm17, 3.2.1-hadoop-3.3.1-java-8-scala-2.12-python-3.8-latest, 3.2-dm17, 3.2-latest, 3.2.1-dm17, 3.2.1-latest
- 3.2.0-hadoop-3.3.1-java-8-scala-2.12-python-3.8-dm17, 3.2.0-hadoop-3.3.1-java-8-scala-2.12-python-3.8-latest, 3.2.0-dm17, 3.2.0-latest
- 3.1.3-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, 3.1.3-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, 3.1-dm17, 3.1-latest, 3.1.3-dm17, 3.1.3-latest
- 3.1.2-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, 3.1.2-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, 3.1.2-dm17, 3.1.2-latest
- 3.1.1-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, 3.1.1-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, 3.1.1-dm17, 3.1.1-latest
- 3.0.3-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, 3.0.3-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, 3.0-dm17, 3.0-latest, 3.0.3-dm17, 3.0.3-latest
- 3.0.2-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, 3.0.2-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, 3.0.2-dm17, 3.0.2-latest
- 3.0.1-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, 3.0.1-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, 3.0.1-dm17, 3.0.1-latest
- 3.0.0-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, 3.0.0-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, 3.0.0-dm17, 3.0.0-latest
- 3.2.1-hadoop-3.3.1-java-11-scala-2.12-python-3.8-dm17, 3.2.1-hadoop-3.3.1-java-11-scala-2.12-python-3.8-latest
- 3.2.0-hadoop-3.3.1-java-11-scala-2.12-python-3.8-dm17, 3.2.0-hadoop-3.3.1-java-11-scala-2.12-python-3.8-latest
- 3.1.3-hadoop-3.2.0-java-11-scala-2.12-python-3.8-dm17, 3.1.3-hadoop-3.2.0-java-11-scala-2.12-python-3.8-latest
- 3.1.2-hadoop-3.2.0-java-11-scala-2.12-python-3.8-dm17, 3.1.2-hadoop-3.2.0-java-11-scala-2.12-python-3.8-latest
- 3.1.1-hadoop-3.2.0-java-11-scala-2.12-python-3.8-dm17, 3.1.1-hadoop-3.2.0-java-11-scala-2.12-python-3.8-latest
- 3.0.2-hadoop-3.2.0-java-11-scala-2.12-python-3.8-dm17, 3.0.2-hadoop-3.2.0-java-11-scala-2.12-python-3.8-latest
- 3.0.1-hadoop-3.2.0-java-11-scala-2.12-python-3.8-dm17, 3.0.1-hadoop-3.2.0-java-11-scala-2.12-python-3.8-latest
- 3.0.0-hadoop-3.2.0-java-11-scala-2.12-python-3.8-dm17, 3.0.0-hadoop-3.2.0-java-11-scala-2.12-python-3.8-latest

Spark 2
- 2.4.7-hadoop-3.1.0-java-8-scala-2.12-python-3.7-dm17, 2.4.7-hadoop-3.1.0-java-8-scala-2.12-python-3.7-latest, 2.4.7-dm17, 2.4.7-latest, 2.4-dm17, 2.4-latest
- 2.4.6-hadoop-3.1.0-java-8-scala-2.12-python-3.7-dm17, 2.4.6-hadoop-3.1.0-java-8-scala-2.12-python-3.7-latest, 2.4.6-dm17, 2.4.6-latest
- 2.4.5-hadoop-3.1.0-java-8-scala-2.12-python-3.7-dm17, 2.4.5-hadoop-3.1.0-java-8-scala-2.12-python-3.7-latest, 2.4.5-dm17, 2.4.5-latest
- 2.4.7-hadoop-3.1.0-java-8-scala-2.11-python-3.7-dm17, 2.4.7-hadoop-3.1.0-java-8-scala-2.11-python-3.7-latest
- 2.4.6-hadoop-3.1.0-java-8-scala-2.11-python-3.7-dm17, 2.4.6-hadoop-3.1.0-java-8-scala-2.11-python-3.7-latest
- 2.4.5-hadoop-3.1.0-java-8-scala-2.11-python-3.7-dm17, 2.4.5-hadoop-3.1.0-java-8-scala-2.11-python-3.7-latest

spark:jvm-only (lightweight version without PySpark)

Spark 3
- jvm-only-3.2.1-hadoop-3.3.1-java-8-scala-2.12-python-3.8-dm17, jvm-only-3.2.1-hadoop-3.3.1-java-8-scala-2.12-python-3.8-latest, jvm-only-3.2-dm17, jvm-only-3.2-latest, jvm-only-3.2.1-dm17, jvm-only-3.2.1-latest
- jvm-only-3.2.0-hadoop-3.3.1-java-8-scala-2.12-python-3.8-dm17, jvm-only-3.2.0-hadoop-3.3.1-java-8-scala-2.12-python-3.8-latest, jvm-only-3.2.0-dm17, jvm-only-3.2.0-latest
- jvm-only-3.1.3-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, jvm-only-3.1.3-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, jvm-only-3.1-dm17, jvm-only-3.1-latest, jvm-only-3.1.3-dm17, jvm-only-3.1.3-latest
- jvm-only-3.1.2-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, jvm-only-3.1.2-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, jvm-only-3.1.2-dm17, jvm-only-3.1.2-latest
- jvm-only-3.1.1-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, jvm-only-3.1.1-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, jvm-only-3.1.1-dm17, jvm-only-3.1.1-latest
- jvm-only-3.0.3-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, jvm-only-3.0.3-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, jvm-only-3.0-dm17, jvm-only-3.0-latest, jvm-only-3.0.3-dm17, jvm-only-3.0.3-latest
- jvm-only-3.0.2-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, jvm-only-3.0.2-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, jvm-only-3.0.2-dm17, jvm-only-3.0.2-latest
- jvm-only-3.0.1-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, jvm-only-3.0.1-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, jvm-only-3.0.1-dm17, jvm-only-3.0.1-latest
- jvm-only-3.0.0-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, jvm-only-3.0.0-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, jvm-only-3.0.0-dm17, jvm-only-3.0.0-latest
- jvm-only-3.2.1-hadoop-3.3.1-java-11-scala-2.12-python-3.8-dm17, jvm-only-3.2.1-hadoop-3.3.1-java-11-scala-2.12-python-3.8-latest
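Because each line of the tag list groups aliases of one image, the list can be turned into a lookup that answers whether two tags refer to the same image. The sketch below is illustrative only: `alias_index` and `are_aliases` are hypothetical helpers, and `SAMPLE_LINES` copies just two lines from the list.

```python
from typing import Dict, FrozenSet, List

# Two lines copied from the tag list, one alias group per line.
SAMPLE_LINES = [
    "3.1.3-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, "
    "3.1.3-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, "
    "3.1-dm17, 3.1-latest, 3.1.3-dm17, 3.1.3-latest",
    "3.1.2-hadoop-3.2.0-java-8-scala-2.12-python-3.8-dm17, "
    "3.1.2-hadoop-3.2.0-java-8-scala-2.12-python-3.8-latest, "
    "3.1.2-dm17, 3.1.2-latest",
]


def alias_index(lines: List[str]) -> Dict[str, FrozenSet[str]]:
    """Map every tag to the set of tags listed on the same line."""
    index: Dict[str, FrozenSet[str]] = {}
    for line in lines:
        group = frozenset(t.strip() for t in line.split(",") if t.strip())
        for tag in group:
            index[tag] = group
    return index


def are_aliases(index: Dict[str, FrozenSet[str]], a: str, b: str) -> bool:
    """Return True if both tags appear on the same line of the list."""
    return a in index and b in index[a]


index = alias_index(SAMPLE_LINES)
print(are_aliases(index, "3.1-latest", "3.1.3-dm17"))
print(are_aliases(index, "3.1-latest", "3.1.2-latest"))
```

In this sample, 3.1-latest and 3.1.3-dm17 share a line while 3.1.2-latest sits on a different one, which matches the rule that the minor-version tag follows the newest patch release.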