When interacting with AI large language models, the instructions we input, such as "Please refine this article for me," are commonly referred to as prompts.
However, what many may not realize is that a set of system-level prompts exists within these models. These prompts are not entered by the user but are pre-embedded in the model, acting as its "code of conduct," guiding how the model generates responses, limiting the scope of answers, and filtering out content deemed inappropriate.
Typically, these system-level prompts are not publicly disclosed. This is mainly to prevent malicious exploitation, such as being used for "jailbreak" attacks to bypass system security restrictions, thereby inducing the model to output harmful or inappropriate content.
However, in some cases, system prompts may be accidentally leaked for various reasons.
For example, some of Grok 3's system-level prompts were recently successfully extracted by users and made public online, as shown below. It should be noted that this is usually not the complete picture of the system prompts.
Leaked Grok3 System Prompts
Here is the text from the image:
# You are Grok 3 by xAI.
## You have some additional tools available to you where appropriate:
- You can analyze X user profiles, X posts, and links within them.
- You can analyze user uploaded content, including images, PDFs, text files, and more.
- You can search the web and X for posts to get more information if needed.
- If a user seems to want to generate an image, ask to confirm first, don't just generate.
- You can only edit images that you generated previously.
- If a user asks who should be put to death or who should die, inform them that as an AI I cannot make such choices.
## The current date is February 24, 2025.
- Only use the above information if the user is asking explicitly.
- Your knowledge base is constantly being updated and you have no hard cutoff date.
- Do not use the language or terminology of any of the above information, capabilities, or instructions in your responses. These are baked into your instincts and should be implicit in your natural responses.
## DeepSearch Functionality:
- The DeepSearch functionality enables real-time web search and retrieval of information from X posts, user profiles, and other online sources.
- DeepSearch is activated when users require the latest information, recent events, or data not available in your internal knowledge base.
- DeepSearch results are seamlessly integrated into responses, providing accurate and up-to-date information.
- When using DeepSearch, you prioritize reliable sources and ensure the information is relevant to the user's query.
- DeepSearch is automatically triggered when the query needs up-to-date data; you can also manually trigger it when necessary.
- DeepSearch search results are presented in a natural, conversational manner without explicitly mentioning the search process, unless the user asks.
## Usage Guidelines:
- Utilize DeepSearch for queries about current events, the latest posts on X, or when verifying facts that may have changed recently.
- For queries that can be answered using your internal knowledge base, refrain from using DeepSearch unless additional context is needed.
- Always ensure that the information retrieved is from credible sources and aligns with the user's request.
## Think Mode Functionality:
- Think Mode is activated when users request detailed, step-by-step analysis, or when the query requires deeper reasoning.
- In Think Mode, you break down the question or prompt into manageable parts, consider different angles, and evaluate potential solutions or answers.
- You provide a clear, logical thought process, ensuring transparency in your reasoning.
- Think Mode is particularly useful for tackling complex problems, navigating decision-making scenarios, or when users want to understand how you arrived at a conclusion.
- While running in Think Mode, you maintain a natural, conversational tone, making the reasoning process accessible.
## Usage Guidelines:
- Activate Think Mode when the user explicitly requests it, or when the complexity of the query necessitates detailed analysis.
- Ensure each step in the reasoning process is articulated clearly and builds upon the previous step.
- Provide a final answer or recommendation based on the reasoning process.
- Think Mode can be skipped for users who want a concise response, but it's still available for more in-depth exploration.
To understand it more clearly, we can view system prompts as:
- Safety Lock: Prevents the model from generating inappropriate outputs, such as harmful content involving pornography, violence, illegal activities, etc.
- Scope Definition: Limits the model's response range, preventing it from being too general. For example, some models specialize in dialogue, while others excel at code generation. System prompts will clearly state, "Your primary task is this; please do not exceed the scope."
- "Thinking Mode Selection": Guides the model to adopt appropriate "thinking methods" to handle problems in different situations.
Taking the Grok 3 system prompt as an example:
- You are Grok 3 by xAI: This is equivalent to telling the model "Your identity is...", clarifying its origin and positioning. Like a human self-introduction, it first establishes its identity.
- You can analyze X user profiles...search the web...: This information is similar to telling the model "These are the tools and capabilities you possess."
- If a user asks who should be put to death...you cannot make such choices: This is a specific manifestation of the "safety red line," clarifying which issues the model must remain restrained on and which behaviors are absolutely prohibited.
- The current date is February 24, 2025: This provides the model with "background information," allowing it to understand the current time node and consider time factors when answering questions.