Mastering Prompt Engineering
Prompt engineering has emerged as a key skill for leveraging the full potential of LLMs. It's not just about asking questions; it's the art of guiding AI to produce insightful, accurate, and relevant answers. Let's walk through the strategies and tactics of prompt engineering that will help you bend AI to your will.
What is Prompt Engineering?
Prompt engineering is the process of crafting inputs (prompts) for AI language models to elicit desired outputs. It's a crucial skill for anyone working with LLMs, such as GPT-4, Claude, Bard, or Mistral, because the quality of the prompt directly influences the model's response.
Core Strategies for Prompt Engineering
The following sections are taken from OpenAI's Prompt Engineering docs, but they are applicable to all LLMs. Each section includes a good and a bad example, followed by an explanation.
1. Clear Instructions: The Foundation
Good Example: "Write a summary of the latest climate change research in approximately 200 words, focusing on key findings and implications."
Bad Example: "Talk about climate change."
The good example is specific, giving the AI a clear task (summarizing research), a topic (climate change research), a length guideline (200 words), and areas to focus on (key findings and implications). The bad example is vague, lacking direction and specificity, which can lead to irrelevant or overly broad responses and, more importantly, can be frustrating to the end user.
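The ingredients of a clear instruction (task, topic, length, focus) can be assembled programmatically, which is handy when you generate prompts from user input or templates. This is a minimal sketch; the `build_prompt` helper and its parameter names are illustrative, not part of any library.

```python
def build_prompt(task: str, topic: str, length_words: int, focus: str) -> str:
    """Assemble a specific, well-scoped prompt from its core ingredients."""
    return (
        f"{task} of {topic} in approximately {length_words} words, "
        f"focusing on {focus}."
    )

prompt = build_prompt(
    task="Write a summary",
    topic="the latest climate change research",
    length_words=200,
    focus="key findings and implications",
)
print(prompt)
```

Keeping each ingredient as a separate parameter makes it harder to accidentally ship a vague prompt: an empty `focus` or missing `length_words` fails loudly instead of silently producing "Talk about climate change."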
2. Context Matters
Good Example: "Considering the recent changes in tax laws in 2023, provide a detailed guide for small businesses on managing their taxes."
Bad Example: "How do I do taxes?"
In the good example, the prompt is contextualized with recent changes in tax laws and tailored for a specific audience (small businesses), resulting in a more relevant and targeted response. The bad example lacks context and specificity, making it difficult for the AI to provide a useful answer.
3. Simplifying Complex Tasks
Good Example: "Step 1: List the main causes of urban pollution. Step 2: Discuss the impact of each cause on public health."
Bad Example: "Tell me everything about urban pollution."
By breaking down the complex topic of urban pollution into manageable steps, the good example prompts the AI to organize its response methodically. The bad example is overly broad, likely leading to an unfocused and overwhelming response.
4. Giving AI Time to 'Think'
Good Example: "First, list the most popular programming languages in 2023. Next, provide a brief comparison of their main features."
Bad Example: "Programming languages comparison."
The good example guides the AI through a logical sequence of thought, first identifying popular programming languages, then comparing them. The bad example is too abrupt, lacking a structured approach for the AI to follow.
5. Leveraging External Tools
Good Example: "Using data from the World Health Organization's latest report, summarize the current global health trends."
Bad Example: "Health trends."
The good example directs the AI to a specific, reputable source for its response, leading to more accurate and credible information. The bad example is unspecific and lacks a reference point, which can lead to generic or outdated responses.
6. Testing and Evaluating Changes
Consistently testing various prompts and evaluating their outputs is key. Adjusting the prompt based on the model's responses can lead to more refined and effective results over time.
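One lightweight way to compare prompt variants is to score each response against criteria you care about. The keyword-coverage scorer below is a hypothetical sketch; real evaluations often use held-out test cases or model-graded rubrics, but even a crude metric makes "did this prompt change help?" measurable.

```python
def keyword_coverage(response: str, required_keywords: list[str]) -> float:
    """Fraction of required keywords present in the response (case-insensitive)."""
    text = response.lower()
    hits = sum(1 for kw in required_keywords if kw.lower() in text)
    return hits / len(required_keywords)

# Compare two (mock) responses produced by different prompt variants.
keywords = ["emissions", "sea level", "mitigation"]
response_a = "Emissions continue to rise, driving sea level increases."
response_b = "The climate is changing."

print(keyword_coverage(response_a, keywords))  # covers 2 of 3 keywords
print(keyword_coverage(response_b, keywords))  # covers 0 of 3 keywords
```

Run the same scorer over several test inputs per prompt variant and keep the variant with the higher average; a single anecdotal comparison is rarely reliable.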
Practical Tactics for Enhanced Performance
Now let’s discuss specific tactics that can further enhance the performance of your prompts, making them more effective and tailored to your needs.
1. Detailed Queries for Relevance
Good Example: "Provide a step-by-step guide for beginners on how to create a basic website using HTML and CSS, including examples of code."
Bad Example: "How do I make a website?"
By specifying the audience (beginners), the task (creating a basic website), and the tools (HTML and CSS), the good example ensures a relevant and detailed response. The bad example lacks detail, which can lead to a generic or incomplete answer.
2. Adopting Personas in Prompts
Good Example: "As a nutrition expert, suggest a balanced diet plan for an athlete preparing for a marathon."
Bad Example: "Diet plan for athlete."
Framing the AI as a 'nutrition expert' in the good example provides a frame for the response, leading to more specialized and credible advice. The bad example lacks this framing, which might result in a less tailored response.
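In chat-style APIs, a persona is typically set in the system message rather than inside the user prompt. The sketch below uses the role/content message convention common to most chat APIs; the `persona_messages` helper itself is illustrative.

```python
def persona_messages(persona: str, user_request: str) -> list[dict]:
    """Build a chat message list that frames the model with a persona."""
    return [
        {"role": "system", "content": f"You are {persona}."},
        {"role": "user", "content": user_request},
    ]

messages = persona_messages(
    "a nutrition expert",
    "Suggest a balanced diet plan for an athlete preparing for a marathon.",
)
```

Putting the persona in the system message keeps it in force across every turn of a conversation, instead of having to repeat "As a nutrition expert..." in each user message.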
3. Using Delimiters for Clarity
Good Example: "Question: What are the latest trends in renewable energy? Answer: [AI's response]"
Bad Example: "Tell me about renewable energy trends."
The use of 'Question' and 'Answer' as delimiters in the good example provides a clear structure, guiding the AI to format its response accordingly. The bad example is less structured, potentially leading to a less organized response.
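The Question/Answer framing can be generated with a tiny helper so every prompt in an application gets the same structure. The `qa_prompt` name is illustrative; the trailing "Answer:" cue is the part doing the work, since it tells the model exactly where its response should begin.

```python
def qa_prompt(question: str) -> str:
    """Wrap a question in explicit Question/Answer delimiters."""
    return f"Question: {question}\nAnswer:"

print(qa_prompt("What are the latest trends in renewable energy?"))
```

The same delimiter idea scales up: triple quotes or XML-style tags around quoted or user-supplied text help the model treat that text as content to process rather than as further instructions.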
4. Specifying Steps for Task Completion
Good Example: "1. Outline the history of the internet. 2. Discuss its impact on global communication. 3. Predict future developments."
Bad Example: "Internet history and future."
The good example breaks down the broad topic into specific, sequential steps, leading to a comprehensive and well-structured response. The bad example's lack of structure can result in a disjointed or incomplete answer.
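Numbered steps like these can be produced from a plain list of subtasks, which keeps prompts consistent when the steps vary at runtime. A minimal sketch; `stepwise_prompt` is an assumed helper name.

```python
def stepwise_prompt(steps: list[str]) -> str:
    """Turn a list of subtasks into an explicit numbered prompt."""
    return " ".join(f"{i}. {step}" for i, step in enumerate(steps, start=1))

prompt = stepwise_prompt([
    "Outline the history of the internet.",
    "Discuss its impact on global communication.",
    "Predict future developments.",
])
print(prompt)
```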
5. Providing Examples for Style or Format
Good Example: "Write a poem about nature in the style of Robert Frost, focusing on imagery and emotion."
Bad Example: "Poem about nature."
The good example specifies a style (Robert Frost) and elements to focus on (imagery and emotion), guiding the AI to mimic a specific poetic approach. The bad example lacks this guidance, which could lead to a generic response.
6. Defining Output Length
Good Example: "In 100 words, explain the concept of blockchain technology."
Bad Example: "Explain blockchain."
The good example's word limit ensures a concise, focused explanation suited to quick understanding. The bad example could result in a response that is either too brief or overly complex.
7. Embeddings-Based Search for Information Retrieval
Good Example: "Using embeddings-based search, find and summarize articles about AI ethics."
Bad Example: "Find articles on AI ethics."
The good example names a specific retrieval method, signaling the intended workflow. It's worth noting, though, that embeddings-based search is typically something the application performs before prompting: documents and the query are converted to vectors, the closest matches are retrieved, and those passages are inserted into the prompt for the model to summarize. Mentioning the method in the prompt alone doesn't change how the model searches. The bad example is vague about both the sources and the method, potentially affecting the quality of the retrieved information.
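The retrieval step can be sketched with cosine similarity over document vectors. The three-dimensional vectors below are toy values chosen for illustration; a real pipeline would obtain high-dimensional embeddings from an embedding model, then place the best-matching passages into the prompt.

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; higher means more similar."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Toy embeddings standing in for real model output.
documents = {
    "AI ethics overview": [0.9, 0.1, 0.0],
    "Cooking basics": [0.0, 0.2, 0.9],
}
query_embedding = [0.8, 0.2, 0.1]  # pretend embedding of "articles about AI ethics"

# Retrieve the most relevant document, then feed it to the model for summarizing.
best = max(documents, key=lambda name: cosine_similarity(documents[name], query_embedding))
print(best)
```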
Advanced Techniques and Considerations
As we dive deeper into prompt engineering, it's important to explore advanced techniques that can elevate the effectiveness of your prompts, as well as key considerations to keep in mind.
1. Combining Strategies for Complex Scenarios
Good Example: "As a travel expert, list five hidden gem destinations in Europe for solo travelers, providing a brief history and main attractions for each, in a blog post format."
This example combines multiple strategies: adopting a persona (travel expert), targeting a specific audience (solo travelers), and structuring the response (list format with history and attractions). It illustrates how blending various techniques can result in a rich, detailed, and audience-specific response.
2. Balancing Detail and Conciseness in Prompts
Striking the right balance between providing enough detail and keeping prompts concise is crucial. Overly lengthy prompts can confuse the AI, while too brief prompts may lack necessary guidance. Regular testing and refinement of your prompts will help find this balance.
Final Thoughts
Prompt engineering is an iterative process that requires persistence, creativity, and a willingness to experiment. It often takes many tries adjusting the wording and approach before landing on the prompt that unlocks the response you seek. Even advanced LLMs have limitations in their knowledge and reasoning capacities. By regularly assessing the model’s strengths and weaknesses, you can better target your prompts and set reasonable expectations.
But prompt engineering also opens up tremendous possibilities for utilizing language models to their full potential. With thoughtful human guidance, these systems can produce remarkable insights, creative ideas, and helpful information. Start by applying some of the key strategies covered in this guide, testing prompts related to your needs. Track what works well or falls short. As the craft develops further, so will the breadth of problems AI can help solve.
The prompt engineering journey is just beginning.