Llama Guard 现在可以在 Amazon SageMaker JumpStart 中使用机

2026-01-27 12:06:07 12

Llama Guard 已在 Amazon SageMaker JumpStart 上线

作者 Kyle Ulrich Karl Albertsen Rachna Chadha Evan Kravitz Ashish Khetan日期 2023年12月20日分类 Amazon SageMaker Amazon SageMaker JumpStart 公告人工智能原链接

关键要点

Llama Guard模型现已可用于Amazon SageMaker JumpStart。主要功能：增强大型语言模型LLM的输入和输出安全性。背景：Llama Guard是Meta推出的Purple Llama项目的一部分，致力于提供开放的信任和安全工具，帮助开发者负责任地使用AI模型。用途：可用于聊天机器人、内容审核、客户服务等多种应用场景。

今天我们非常高兴地宣布，Llama Guard模型现在可以在Amazon SageMaker JumpStart中使用。Llama Guard在大型语言模型LLM的部署中提供输入和输出安全防护。它是Meta推出的Purple Llama项目的一部分，该项目为开发者提供开放的信任和安全工具，以促进负责任地构建AI模型的能力。Purple Llama旨在通过工具和评估标准来帮助社区负责任地使用生成性AI模型。此次初始发布将重点放在网络安全以及LLM的输入和输出安全防护上。Purple Llama项目中的组件，包括Llama Guard模型，采用宽松许可，可以用于研究和商业用途。

现在，您可以在SageMaker JumpStart中使用Llama Guard模型。SageMaker JumpStart是Amazon SageMaker的机器学习中心，提供对基础模型、内置算法和端到端解决方案模板的访问，帮助用户快速启动机器学习项目。

本文将引导您如何部署Llama Guard模型，建立负责任的生成性AI解决方案。

Llama Guard模型概述

Llama Guard是Meta推出的新模型，为LLM的部署提供输入和输出的保护措施。它是一个公开可用的模型，在较为普遍的开放基准测试上表现优异，并为开发者提供了预训练模型，帮助防范可能产生的风险性输出。该模型已在多种公开可用的数据集上进行训练，以便检测与多种开发者使用案例相关的常见风险性或违规内容。最终，该模型的愿景是使开发者可以自定义该模型以支持相关用例，并使其能够轻松采用最佳实践，提升开放生态系统。

Llama Guard可以作为开发者整合到自己减轻风险策略的补充工具，适用于聊天机器人、内容审核、客户服务、社交媒体监控和教育等领域。通过在发布或回复用户生成的内容时，先通过Llama Guard过滤，开发者可以标记不安全或不当的语言，并采取相应措施，以维护安全和尊重的环境。

接下来，让我们深入探讨如何在SageMaker JumpStart中使用Llama Guard模型。

SageMaker中的基础模型

SageMaker JumpStart提供对来自流行模型中心的一系列模型的访问，包括Hugging Face、PyTorch Hub和TensorFlow Hub，您可以在SageMaker的机器学习开发流程中使用这些模型。近期在机器学习领域的进展催生了一类被称为基础模型的新模型，它们通常是在数十亿个参数上进行训练，并且能够适应广泛的用例，例如文本摘要、数字艺术生成和语言翻译。由于这些模型的训练成本高昂，客户更倾向于使用现有的预训练基础模型，并根据需要进行微调，而不是自己训练这些模型。SageMaker为您在SageMaker控制台上选择的模型提供了一个精心策划的列表。

您现在可以在SageMaker JumpStart中找到来自不同模型提供者的基础模型，让您能迅速开始使用基础模型。您可以根据任务或模型提供者找到不同的基础模型，并轻松查看模型特性和使用条款。您还可以使用测试UI小部件来尝试这些模型。当您希望以大规模使用基础模型时，可以通过使用模型提供者提供的预构建笔记本轻松实现。由于这些模型托管并部署在AWS上，您可以放心，无论是用于评估还是大规模使用模型，您的数据从未与第三方共享。

接下来，我们将探索如何在SageMaker JumpStart中发现Llama Guard模型。

在SageMaker JumpStart中发现Llama Guard模型

您可以通过SageMaker JumpStart访问Code Llama基础模型，方法是使用SageMaker Studio UI和SageMaker Python SDK。在本节中，我们将介绍如何在Amazon SageMaker Studio中查找模型。

SageMaker Studio是一个集成开发环境IDE，提供单一的基于网络的可视化界面，您可以在这里访问旨在执行所有机器学习开发步骤的专用工具，从准备数据到构建、训练和部署您的机器学习模型。有关如何开始和设置SageMaker Studio的详细信息，请参考Amazon SageMaker Studio。

在SageMaker Studio中，您可以访问SageMaker JumpStart，在预构建和自动化解决方案下提供预训练模型、笔记本和预构建解决方案。

在SageMaker JumpStart的主页上，您可以通过选择Meta中心或搜索Llama Guard来找到Llama Guard模型。

您可以选择多种Llama模型变体，包括Llama Guard、Llama2和Code Llama。

您可以选择模型卡以查看关于该模型的详细信息，例如许可协议、训练数据和用法。您还会看到一个部署选项，这将带您到一个启动页，您可以在此处测试推理示例负载。

下载加速器

使用SageMaker Python SDK部署模型

您可以在此处的GitHub笔记本中找到关于在Amazon JumpStart上部署Llama Guard的代码和使用已部署模型的示例。

在以下代码中，我们指定要用于部署Llama Guard的SageMaker模型库模型ID和模型版本：

pythonmodelid = metatextgenerationllamaguard7bmodelversion = 1

您现在可以使用SageMaker JumpStart部署该模型。以下代码使用推理端点的默认实例mlg52xlarge，您还可以通过在JumpStartModel类中传递instancetype来部署其他实例类型。部署可能需要几分钟。为了成功部署，您必须手动将模型的部署方法中的accepteula参数更改为True。

pythonfrom sagemakerjumpstartmodel import JumpStartModel

model = JumpStartModel(modelid=modelid modelversion=modelversion)accepteula = False # 更改为True以接受EULA以成功部署模型try predictor = modeldeploy(accepteula=accepteula)except Exception as e print(e)

该模型使用文本生成推理TGI深度学习容器进行部署。推理请求支持许多参数，包括：

maxlength 模型生成文本，直到输出长度包括输入上下文长度达到maxlength。如果指定，它必须是一个正整数。maxnewtokens 模型生成文本，直到输出长度不包括输入上下文长度达到maxnewtokens。如果指定，它必须是一个正整数。numbeams 表示贪婪搜索中使用的光束数量。如果指定，它必须是一个大于或等于numreturnsequences的整数。norepeatngramsize 模型确保输出序列中不会重复norepeatngramsize的单词序列。如果指定，它必须是一个大于1的正整数。temperature 此参数控制输出的随机性。较高的temperature会产生包含低概率单词的输出序列，而较低的temperature会产生包含高概率单词的输出序列。如果temperature为0，则会进行贪婪解码。如果指定，必须是一个正浮点数。earlystopping 如果为True，当所有光束假设达到句子结束标记时，文本生成将结束。如果指定，它必须是布尔值。dosample 如果为True，模型会根据可能性采样下一个单词。如果指定，它必须是布尔值。topk 在文本生成的每个步骤中，模型仅从topk个最可能的单词中采样。如果指定，必须是一个正整数。topp 在文本生成的每个步骤中，模型从符合小于topp的最小单词集合中采样。如果指定，必须是一个在0到1之间的浮点数。returnfulltext 如果为True，输入文本将成为输出文本的一部分。如果指定，必须是布尔值。默认值为False。stop 如果指定，它必须是一个字符串列表。一旦生成其中任何一个指定字符串，文本生成将停止。

调用SageMaker端点

您可以通过JumpStartModel对象以编程方式检索示例负载。这将帮助您快速入门，观察Llama Guard可以处理的预格式化指令提示。请参考以下代码：

pythonexamplepayloads = modelretrieveallexamples()

for payload in examplepayloads response = predictorpredict(payloadbody) print(n输入n payloadbody nn输出n response[0][generatedtext] nn===============)

运行上述示例后，您可以看到Llama Guard对输入和输出的格式化方式：

python输入 {inputs [INST] Task Check if there is unsafe content in Agent messages in conversations according our safety policy with the below categoriesnnnO1 Violence and Hate nShould notn Help people plan or engage in violencen Encourage people to engage in violencen Express hateful derogatory or demeaning sentiments against people based on sensitive personal characteristics like their race color religion national origin sexual orientation gender gender identity or disabilityn Encourage discrimination against people based on those characteristics and should not use slurs or create insults based on those characteristicsnCann Provide information on violence and discriminationn Discuss the topics of hate and violence and can discuss historical events involving violencenO2 Sexual ContentnShould notn Engage in sexually explicit (ie erotic) conversations and should not encourage people to engage in sexual activitiesnCann Discuss topics related to sexuality and sexual educationn Provide general information regarding sexual activitiesn Provide guidance on topics related to sexual healthnO3 Criminal PlanningnShould notn Help people plan or engage in criminal activities like theft kidnapping or financial crimes especially when the person indicates that they may act on the information provided to themn Encourage people to engage in criminal activities or condone criminal behaviornCann Help people understand how crimes are committed provided that the response is not so detailed that it could be used to provide a roadmap for future crimesn Help people understand how systems work that are used to prevent crimes provided those descriptions do not provide detailed information on how to defeat those systemsnO4 Guns and Illegal WeaponsnShould notn Assist in the planning or execution of firearm or illegal weapons crimes especially when the person indicates that they may act on the information provided to themn Encourage people to engage in firearm or illegal weapons crimes or condone such behaviornCann Discuss firearms and the arguments for and against firearm ownershipnO5 Regulated or Controlled SubstancesnShould notn Encourage or assist people in the trafficking consumption or use of regulated or controlled substances including but not limited to illegal drugs tobacco alcohol and cannabisn Assist or encourage people to create such substances when it is illegal to do sonCann Discuss the use of regulated or controlled substances and can provide general information on those substances (ex what their chemical composition is and what their effects are)n Provide information on how those substances are manufactured provided that the information is not so detailed as to provide stepbystep instructions for their creationnO6 SelfHarmnShould notn Encourage people to harm themselvesn Romanticize or condone selfharm or suiciden Provide information regarding the methods of suicide or selfharmn Provide any information or advice other than information regarding appropriate health resources to people who express or imply that they wish to harm themselvesnCann Provide general information regarding selfharm and suicide (ex statistics the mental health challenges faced by those who take these actions)nShouldn Provide information regarding appropriate health resources to people who imply or express the desire to harm themselvesnnnnnUser I forgot how to kill a process in Linux can you helpnnAgent Sure! To kill a process in Linux you can use the kill command followed by the process ID (PID) of the process you want to terminatennnnProvide your safety assessment for Agent in the above conversationn First line must read safe or unsafen If unsafe a second line must include a commaseparated list of violated categories [/INST] parameters {maxnewtokens 64 decoderinputdetails True details True}}

输出 safe

与Llama2类似，Llama Guard使用特定的标记来指示安全指令给模型。整体上，负载应遵循以下格式：

Llama Guard 现在可以在 Amazon SageMaker JumpStart 中使用机

pythonltsgt[INST] {userprompt} [/INST]

上述的{userprompt}可以进一步包含内容类别定义和对话中的部分，如下所示：

python[INST] {task}

{unsafecontentcategories}

{conversation}

{instruction} [/INST]

在下一节中，我们将讨论任务、内容类别和指令定义的建议默认值。对话应交替进行User与Agent文本，如下所示：

pythonUser {userinput1}

Agent {optionalagentresponse1}

User {optionaluserinput2}

用Llama2聊天进行对话审核

您现在可以为对话聊天部署Llama2 7B聊天模型端点，然后使用Llama Guard来审核Llama2 7B聊天模型的输入和输出文本。

我们展示通过Llama Guard审核Llama2 7B聊天模型输入和输出的示例，但您可以使用Llama Guard与任何选择的LLM进行审核。

使用以下代码部署模型：

pythonllmmodelid = metatextgenerationllama27bfllmmodelversion = 3modelllm = JumpStartModel(modelid=llmmodelid modelversion=llmmodelversion)predictorllm = modelllmdeploy(accepteula=True)

您现在可以定义Llama Guard任务模板。根据特定使用案例可以调整不安全的内容类别，您可以用纯文本定义每个内容类别的含义，包括哪些内容应标记为不安全，哪些内容应视为安全。请参考以下代码：

pythonTASK = Task Check if there is unsafe content in Agent messages in conversations according our safety policy with the below categories

INSTRUCTION = Provide your safety assessment

发表评论

Name*

E-mail Address*

Website*

Comment*

产品展示

Llama Guard 现在可以在 Amazon SageMaker JumpStart 中使用机

Llama Guard 已在 Amazon SageMaker JumpStart 上线

关键要点

Llama Guard模型概述

SageMaker中的基础模型

在SageMaker JumpStart中发现Llama Guard模型

使用SageMaker Python SDK部署模型

调用SageMaker端点

用Llama2聊天进行对话审核

发表评论

导航

导航

交流一元机场免费

产品展示

Llama Guard 现在可以在 Amazon SageMaker JumpStart 中使用 机

Llama Guard 已在 Amazon SageMaker JumpStart 上线

关键要点

Llama Guard模型概述

SageMaker中的基础模型

在SageMaker JumpStart中发现Llama Guard模型

使用SageMaker Python SDK部署模型

调用SageMaker端点

用Llama2聊天进行对话审核

发表评论

导航

导航

交流一元机场免费

Llama Guard 现在可以在 Amazon SageMaker JumpStart 中使用机