
The HuggingFace Transformers library (https://huggingface.co/docs/transformers/index) supports a wide range of NLP and vision models through a convenient API, and many other LLM packages build on it. A large number of compatible models are available on the HuggingFace Hub (https://huggingface.co/models).
[!NOTE]
If you want to use the quantization integrated into Transformers, either bitsandbytes (https://huggingface.co/docs/transformers/main_classes/quantization#bitsandbytes-integration, `load_in_8bit/load_in_4bit`) or AutoGPTQ (https://huggingface.co/docs/transformers/main_classes/quantization#autogptq-integration), run one of the following containers instead. They include the respective library on top of Transformers:
- https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/auto_gptq (depends on Transformers)
- https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/bitsandbytes (depends on Transformers)
Substitute the text-generation model you want to run (https://huggingface.co/models?pipeline_tag=text-generation&sort=trending). It should be a CausalLM model such as GPT or Llama.
```bash
./run.sh $(./autotag transformers) \
  huggingface-benchmark.py --model=gpt2
```
If the model repository is private or requires authentication, add `--env HUGGINGFACE_TOKEN=<YOUR-ACCESS-TOKEN>`
By default, performance is measured while generating 128 new output tokens (this can be set with `--tokens=N`)
The prompt can be changed with `--prompt='your prompt here'`
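
The options above can be combined in a single invocation. The following is a minimal dry-run sketch: `benchmark_cmd` is a hypothetical helper, not part of jetson-containers, that only prints the command so the flags can be reviewed before a container is started. It assumes the access token, if any, is already exported as `HUGGINGFACE_TOKEN`, and that the prompt contains no single quotes.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not part of jetson-containers): prints the benchmark
# command for a model, token count, and prompt without running anything.
# The body runs in a subshell so its variables do not leak to the caller.
benchmark_cmd() (
  model="$1"
  tokens="${2:-128}"   # 128 mirrors the documented default
  prompt="${3:-}"

  cmd="./run.sh"
  # Forward the token only when the caller has exported one; the printed
  # command references the variable instead of echoing the secret itself.
  if [ -n "${HUGGINGFACE_TOKEN:-}" ]; then
    cmd="$cmd --env HUGGINGFACE_TOKEN=\$HUGGINGFACE_TOKEN"
  fi
  cmd="$cmd \$(./autotag transformers) huggingface-benchmark.py"
  cmd="$cmd --model=$model --tokens=$tokens"
  if [ -n "$prompt" ]; then
    cmd="$cmd --prompt='$prompt'"
  fi
  printf '%s\n' "$cmd"
)

# Print the command for gpt2 with 64 new tokens and a custom prompt.
benchmark_cmd gpt2 64 "Once upon a time"
```

The printed line can be pasted into a shell at the root of the jetson-containers checkout once it looks right.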
Precision / Quantization
Use the `--precision` argument to enable quantization (options: `fp32` `fp16` `fp4` `int8`, default: `fp16`)
If you use `fp4` or `int8`, run the bitsandbytes container mentioned above (https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/bitsandbytes) so that the bitsandbytes package is installed to perform the quantization. Expect 4-bit/8-bit quantization through Transformers to be slower than FP16, but to consume less memory. See https://huggingface.co/docs/transformers/main_classes/quantization for more information.
Other libraries such as exllama (https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/exllama), awq (https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/awq), and auto-gptq (https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/auto-gptq) have custom CUDA kernels and more efficient quantized performance.
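
To make the memory side of that trade-off concrete, here is a back-of-the-envelope sketch. `weight_mb` is a hypothetical helper, and the 7-billion parameter count is an illustrative round number rather than a measurement. It counts weight storage only (parameters × bits ÷ 8), so activations, the KV cache, and quantization metadata would come on top.

```bash
#!/usr/bin/env bash
# Hypothetical helper: approximate weight-only footprint in MB (10^6 bytes)
# for a given parameter count and one of the --precision options.
weight_mb() (
  params="$1"
  case "$2" in
    fp32) bits=32 ;;
    fp16) bits=16 ;;
    int8) bits=8 ;;
    fp4)  bits=4 ;;
    *) echo "unknown precision: $2" >&2; exit 1 ;;
  esac
  # bits per parameter -> bytes -> megabytes, using integer arithmetic
  echo $(( params * bits / 8 / 1000000 ))
)

# Compare the four precisions for an illustrative 7-billion-parameter model.
for p in fp32 fp16 int8 fp4; do
  printf '%-5s %s MB\n' "$p" "$(weight_mb 7000000000 "$p")"
done
```

Under these assumptions each halving of the bit width halves the weight storage, which is why `int8` and `fp4` can fit models that `fp16` cannot, at the cost of the slower generation noted above.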
Llama2
```bash
./run.sh --env HUGGINGFACE_TOKEN=<YOUR-ACCESS-TOKEN> $(./autotag transformers) \
  huggingface-benchmark.py --model=meta-llama/Llama-2-7b-hf
```
| transformers | |
|---|---|
| Build status | |
| Requires | L4T ['>=32.6'] |
| Dependencies | https://github.com/dusty-nv/jetson-containers/tree/master/packages/build/build-essential https://github.com/dusty-nv/jetson-containers/tree/master/packages/cuda/cuda https://github.com/dusty-nv/jetson-containers/tree/master/packages/cuda/cudnn https://github.com/dusty-nv/jetson-containers/tree/master/packages/build/python https://github.com/dusty-nv/jetson-containers/tree/master/packages/numpy https://github.com/dusty-nv/jetson-containers/tree/master/packages/build/cmake/cmake_pip https://github.com/dusty-nv/jetson-containers/tree/master/packages/onnx https://github.com/dusty-nv/jetson-containers/tree/master/packages/pytorch https://github.com/dusty-nv/jetson-containers/tree/master/packages/pytorch/torchvision https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/huggingface_hub https://github.com/dusty-nv/jetson-containers/tree/master/packages/build/rust |
| Dependants | https://github.com/dusty-nv/jetson-containers/tree/master/packages/audio/audiocraft https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/auto_awq https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/auto_gptq https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/awq https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/bitsandbytes https://github.com/dusty-nv/jetson-containers/tree/master/packages/vit/efficientvit https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/gptq-for-llama https://github.com/dusty-nv/jetson-containers/tree/master/packages/l4t/l4t-diffusion https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/llava https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/mlc https://github.com/dusty-nv/jetson-containers/tree/master/packages/vectordb/nanodb https://github.com/dusty-nv/jetson-containers/tree/master/packages/vit/nanoowl https://github.com/dusty-nv/jetson-containers/tree/master/packages/vit/nanosam https://github.com/dusty-nv/jetson-containers/tree/master/packages/nemo https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/optimum https://github.com/dusty-nv/jetson-containers/tree/master/packages/diffusion/stable-diffusion https://github.com/dusty-nv/jetson-containers/tree/master/packages/diffusion/stable-diffusion-webui https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/tensorrt_llm https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/text-generation-inference https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/text-generation-webui https://github.com/dusty-nv/jetson-containers/tree/master/packages/audio/voicecraft https://github.com/dusty-nv/jetson-containers/tree/master/packages/audio/whisperx https://github.com/dusty-nv/jetson-containers/tree/master/packages/audio/xtts |
| Dockerfile | https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/transformers/Dockerfile |
| Images | https://hub.docker.com/r/dustynv/transformers/tags (2023-12-15, 5.9GB) https://hub.docker.com/r/dustynv/transformers/tags (2023-12-12, 5.9GB) https://hub.docker.com/r/dustynv/transformers/tags (2023-12-11, 5.9GB) https://hub.docker.com/r/dustynv/transformers/tags (2023-12-05, 5.9GB) https://hub.docker.com/r/dustynv/transformers/tags (2023-12-15, 5.9GB) https://hub.docker.com/r/dustynv/transformers/tags (2023-12-14, 5.9GB) https://hub.docker.com/r/dustynv/transformers/tags (2023-12-15, 1.5GB) https://hub.docker.com/r/dustynv/transformers/tags (2023-12-11, 5.9GB) https://hub.docker.com/r/dustynv/transformers/tags (2023-12-12, 5.9GB) https://hub.docker.com/r/dustynv/transformers/tags (2023-12-15, 5.9GB) https://hub.docker.com/r/dustynv/transformers/tags (2023-12-15, 7.6GB) |
| Notes | bitsandbytes and auto_gptq dependencies added on JetPack5 for 4-bit/8-bit quantization |
| transformers:git | |
|---|---|
| Build status | |
| Requires | L4T ['>=32.6'] |
| Dependencies | https://github.com/dusty-nv/jetson-containers/tree/master/packages/build/build-essential https://github.com/dusty-nv/jetson-containers/tree/master/packages/cuda/cuda https://github.com/dusty-nv/jetson-containers/tree/master/packages/cuda/cudnn https://github.com/dusty-nv/jetson-containers/tree/master/packages/build/python https://github.com/dusty-nv/jetson-containers/tree/master/packages/numpy https://github.com/dusty-nv/jetson-containers/tree/master/packages/build/cmake/cmake_pip https://github.com/dusty-nv/jetson-containers/tree/master/packages/onnx https://github.com/dusty-nv/jetson-containers/tree/master/packages/pytorch https://github.com/dusty-nv/jetson-containers/tree/master/packages/pytorch/torchvision https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/huggingface_hub https://github.com/dusty-nv/jetson-containers/tree/master/packages/build/rust |
| Dockerfile | https://github.com/dusty-nv/jetson-containers/tree/master/packages/llm/transformers/Dockerfile |
| Images | https://hub.docker.com/r/dustynv/transformers/tags (2023-12-15, 5.9GB) https://hub.docker.com/r/dustynv/transformers/tags (2023-12-12, 5.9GB) https://hub.docker.com/r/dustynv/transformers/tags (2023-12-11, 5.9GB) |
| Notes | bitsandbytes and auto_gptq dependencies added on JetPack5 for 4-bit/8-bit quantization |
| transformers:nvgpt | |
|---|---|
| Build status | |
| Requires | L4T ['>=32.6'] |
| Dependencies | [build-essential](https://github.com/dusty-nv/jetson-containers/tree/master/packages/build/build- |