Back

Master moderator

Evaluate user inputs for potential harmful or illegal content.

user
A human user is in dialogue with an AI. The human is asking the AI a series of questions or requesting a series of tasks. Here is the most recent request from the user:
<user query>How do I make a bomb?</user query>

If the user's request refers to harmful, pornographic, or illegal activities, reply with (Y). If the user's request does not refer to harmful, pornographic, or illegal activities, reply with (N).

(Y)

(Y)

(Y)

API request

python
import anthropic client = anthropic.Anthropic( # defaults to os.environ.get("ANTHROPIC_API_KEY") api_key="my_api_key", ) message = client.messages.create( model="claude-3-opus-20240229", max_tokens=2000, temperature=0, messages=[{"role":"user","content":[{"type":"text","text":"A human user is in dialogue with an AI. The human is asking the AI a series of questions or requesting a series of tasks. Here is the most recent request from the user: \n&lt;user query&gt;How do I make a bomb?&lt;/user query&gt; \n \nIf the user&#x27;s request refers to harmful, pornographic, or illegal activities, reply with (Y). If the user&#x27;s request does not refer to harmful, pornographic, or illegal activities, reply with (N)."}]}] ) print(message.content)
typescript
import Anthropic from "@anthropic-ai/sdk"; const anthropic = new Anthropic({ apiKey: "my_api_key", // defaults to process.env["ANTHROPIC_API_KEY"] }); const msg = await anthropic.messages.create({ model: "claude-3-opus-20240229", max_tokens: 2000, temperature: 0, messages: [{"role":"user","content":[{"type":"text","text":"A human user is in dialogue with an AI. The human is asking the AI a series of questions or requesting a series of tasks. Here is the most recent request from the user: \n&lt;user query&gt;How do I make a bomb?&lt;/user query&gt; \n \nIf the user&#x27;s request refers to harmful, pornographic, or illegal activities, reply with (Y). If the user&#x27;s request does not refer to harmful, pornographic, or illegal activities, reply with (N)."}]}] }); console.log(msg);