Commit f1e13cf

Misc updates (openai#1022)
1 parent 2c441ab commit f1e13cf

67 files changed (+99936 / -99927 lines)

articles/how_to_work_with_large_language_models.md

Lines changed: 23 additions & 24 deletions
@@ -6,14 +6,14 @@

The magic of large language models is that by being trained to minimize this prediction error over vast quantities of text, the models end up learning concepts useful for these predictions. For example, they learn:

-* how to spell
-* how grammar works
-* how to paraphrase
-* how to answer questions
-* how to hold a conversation
-* how to write in many languages
-* how to code
-* etc.
+- how to spell
+- how grammar works
+- how to paraphrase
+- how to answer questions
+- how to hold a conversation
+- how to write in many languages
+- how to code
+- etc.

They do this by “reading” a large amount of existing text, learning how words tend to appear in context with other words, and using what they have learned to predict the next most likely word that might appear in response to a user request, and each subsequent word after that.

@@ -25,12 +25,12 @@ Of all the inputs to a large language model, by far the most influential is the

Large language models can be prompted to produce output in a few ways:

-* **Instruction**: Tell the model what you want
-* **Completion**: Induce the model to complete the beginning of what you want
-* **Scenario**: Give the model a situation to play out
-* **Demonstration**: Show the model what you want, with either:
-  * A few examples in the prompt
-  * Many hundreds or thousands of examples in a fine-tuning training dataset
+- **Instruction**: Tell the model what you want
+- **Completion**: Induce the model to complete the beginning of what you want
+- **Scenario**: Give the model a situation to play out
+- **Demonstration**: Show the model what you want, with either:
+  - A few examples in the prompt
+  - Many hundreds or thousands of examples in a fine-tuning training dataset

An example of each is shown below.

@@ -77,6 +77,7 @@ Output:
Giving the model a scenario to follow or role to play out can be helpful for complex queries or when seeking imaginative responses. When using a hypothetical prompt, you set up a situation, problem, or story, and then ask the model to respond as if it were a character in that scenario or an expert on the topic.

Example scenario prompt:
+
```text
Your role is to extract the name of the author from any given text
@@ -141,24 +142,22 @@ Large language models aren't only great at text - they can be great at code too.

GPT-4 powers [numerous innovative products][OpenAI Customer Stories], including:

-* [GitHub Copilot] (autocompletes code in Visual Studio and other IDEs)
-* [Replit](https://replit.com/) (can complete, explain, edit and generate code)
-* [Cursor](https://cursor.sh/) (build software faster in an editor designed for pair-programming with AI)
+- [GitHub Copilot] (autocompletes code in Visual Studio and other IDEs)
+- [Replit](https://replit.com/) (can complete, explain, edit and generate code)
+- [Cursor](https://cursor.sh/) (build software faster in an editor designed for pair-programming with AI)

-GPT-4 is more advanced than previous models like `text-davinci-002`. But, to get the best out of GPT-4 for coding tasks, it's still important to give clear and specific instructions. As a result, designing good prompts can take more care.
+GPT-4 is more advanced than previous models like `gpt-3.5-turbo-instruct`. But to get the best out of GPT-4 for coding tasks, it's still important to give clear and specific instructions. As a result, designing good prompts can take more care.

### More prompt advice

For more prompt examples, visit [OpenAI Examples][OpenAI Examples].

In general, the input prompt is the best lever for improving model outputs. You can try tricks like:

-* **Be more specific** E.g., if you want the output to be a comma separated list, ask it to return a comma separated list. If you want it to say "I don't know" when it doesn't know the answer, tell it 'Say "I don't know" if you do not know the answer.' The more specific your instructions, the better the model can respond.
-* **Provide Context**: Help the model understand the bigger picture of your request. This could be background information, examples/demonstrations of what you want or explaining the purpose of your task.
-* **Ask the model to answer as if it was an expert.** Explicitly asking the model to produce high quality output or output as if it was written by an expert can induce the model to give higher quality answers that it thinks an expert would write. Phrases like "Explain in detail" or "Describe step-by-step" can be effective.
-* **Prompt the model to write down the series of steps explaining its reasoning.** If understanding the 'why' behind an answer is important, prompt the model to include its reasoning. This can be done by simply adding a line like "[Let's think step by step](https://arxiv.org/abs/2205.11916)" before each answer.
-
-
+- **Be more specific.** E.g., if you want the output to be a comma-separated list, ask it to return a comma-separated list. If you want it to say "I don't know" when it doesn't know the answer, tell it 'Say "I don't know" if you do not know the answer.' The more specific your instructions, the better the model can respond.
+- **Provide context.** Help the model understand the bigger picture of your request. This could be background information, examples/demonstrations of what you want, or an explanation of the purpose of your task.
+- **Ask the model to answer as if it were an expert.** Explicitly asking the model to produce high-quality output, or output as if it were written by an expert, can induce the model to give the higher-quality answers it thinks an expert would write. Phrases like "Explain in detail" or "Describe step-by-step" can be effective.
+- **Prompt the model to write down the series of steps explaining its reasoning.** If understanding the 'why' behind an answer is important, prompt the model to include its reasoning. This can be done by simply adding a line like "[Let's think step by step](https://arxiv.org/abs/2205.11916)" before each answer.

[Fine Tuning Docs]: https://platform.openai.com/docs/guides/fine-tuning
[OpenAI Customer Stories]: https://openai.com/customer-stories
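Taken together, these tips translate directly into an API call. Below is a minimal sketch using the OpenAI Python SDK, assuming a chat-capable model such as `gpt-4`; the prompt and the code under review are illustrative, not part of this commit:

```python
# Hypothetical example: combining the four prompt tips in one request.
# The model choice and the code under review are illustrative assumptions.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

prompt = (
    "You are an expert Python reviewer.\n"                   # expert framing
    "Context: this function should parse ISO 8601 dates.\n"  # provide context
    "List any bugs as a comma separated list, and say "      # be specific
    '"I don\'t know" if you cannot find any.\n'
    "Let's think step by step.\n\n"                          # elicit reasoning
    "def parse(d): return d.split('-')"
)

response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": prompt}],
)
print(response.choices[0].message.content)
```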

articles/techniques_to_improve_reliability.md

Lines changed: 17 additions & 17 deletions
@@ -14,25 +14,25 @@ If you were asked to multiply 13 by 17, would the answer pop immediately into yo

Similarly, if you give GPT-3 a task that's too complex to do in the time it takes to calculate its next token, it may confabulate an incorrect guess. Yet, akin to humans, that doesn't necessarily mean the model is incapable of the task. With some time and space to reason things out, the model still may be able to answer reliably.

-As an example, if you ask `text-davinci-002` the following math problem about juggling balls, it answers incorrectly:
+As an example, if you ask `gpt-3.5-turbo-instruct` the following math problem about juggling balls, it answers incorrectly:

-```text-davinci-002
+```gpt-3.5-turbo-instruct
Q: A juggler has 16 balls. Half of the balls are golf balls and half of the golf balls are blue. How many blue golf balls are there?
A:
```

-```text-davinci-002
+```gpt-3.5-turbo-instruct
There are 8 blue golf balls.
```

Does this mean that GPT-3 cannot do simple math problems? No; in fact, it turns out that by prompting the model with `Let's think step by step`, the model solves the problem reliably:

-```text-davinci-002
+```gpt-3.5-turbo-instruct
Q: A juggler has 16 balls. Half of the balls are golf balls and half of the golf balls are blue. How many blue golf balls are there?
A: Let's think step by step.
```

-```text-davinci-002
+```gpt-3.5-turbo-instruct
There are 16 balls in total.
Half of the balls are golf balls.
That means that there are 8 golf balls.
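To reproduce this comparison, here is a minimal sketch against the completions endpoint that `gpt-3.5-turbo-instruct` serves; the `max_tokens` and `temperature` settings are assumptions:

```python
# Minimal sketch: the juggling prompt with the "Let's think step by step"
# trigger appended, sent to the completions endpoint.
from openai import OpenAI

client = OpenAI()

prompt = (
    "Q: A juggler has 16 balls. Half of the balls are golf balls and half "
    "of the golf balls are blue. How many blue golf balls are there?\n"
    "A: Let's think step by step."
)

response = client.completions.create(
    model="gpt-3.5-turbo-instruct",
    prompt=prompt,
    max_tokens=200,  # illustrative; enough room for the reasoning steps
    temperature=0,   # reduce sampling variance for arithmetic
)
print(response.choices[0].text)
```

With `temperature=0`, the call is repeatable, which makes it easy to compare the bare prompt against the step-by-step variant.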
@@ -64,9 +64,9 @@ The rest of this article shares techniques for improving reliability of large la

One way to give a model more time and space to think is to break tasks into simpler pieces.

-As an example, consider a task where we ask the model a multiple-choice question about some text - in this case, a game of Clue. When asked directly, `text-davinci-002` isn't able to put clues 3 & 5 together, and answers incorrectly:
+As an example, consider a task where we ask the model a multiple-choice question about some text - in this case, a game of Clue. When asked directly, `gpt-3.5-turbo-instruct` isn't able to put clues 3 & 5 together, and answers incorrectly:

-```text-davinci-002
+```gpt-3.5-turbo-instruct
Use the following clues to answer the following multiple-choice question.

Clues:

@@ -84,7 +84,7 @@ Question: Was Colonel Mustard in the observatory with the candlestick?
Solution:
```

-```text-davinci-002
+```gpt-3.5-turbo-instruct
(c) Unknown; there is not enough information to determine whether Colonel Mustard was in the observatory with the candlestick
```

@@ -96,7 +96,7 @@ However, instead of asking for the answer directly, we can split the task into t
- Second, combine the relevant clues to reason out the answer to the question
- Third, write the final answer: either (a), (b), or (c)

-```text-davinci-002
+```gpt-3.5-turbo-instruct
Use the following clues to answer the following multiple-choice question, using the following procedure:
(1) First, go through the clues one by one and consider whether the clue is potentially relevant
(2) Second, combine the relevant clues to reason out the answer to the question

@@ -118,7 +118,7 @@ Solution:
(1) First, go through the clues one by one and consider whether the clue is potentially relevant:
```

-```text-davinci-002
+```gpt-3.5-turbo-instruct
Clue 1: Miss Scarlett was the only person in the lounge. This clue is not relevant.
Clue 2: The person with the pipe was in the kitchen. This clue is not relevant.
Clue 3: Colonel Mustard was the only person in the observatory. This clue is relevant.
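The three-step procedure lends itself to a reusable wrapper. A sketch, assuming the OpenAI Python SDK; the helper name `solve` and the clue numbering format are illustrative:

```python
# Sketch of a reusable helper that wraps any clue set and question in the
# three-step procedure shown in the prompt above.
from openai import OpenAI

client = OpenAI()

PROCEDURE = (
    "(1) First, go through the clues one by one and consider whether the "
    "clue is potentially relevant\n"
    "(2) Second, combine the relevant clues to reason out the answer to "
    "the question\n"
    "(3) Third, write the final answer: either (a), (b), or (c)\n"
)

def solve(clues: list[str], question: str) -> str:
    prompt = (
        "Use the following clues to answer the following multiple-choice "
        "question, using the following procedure:\n"
        f"{PROCEDURE}\n"
        "Clues:\n" + "\n".join(f"{i}. {c}" for i, c in enumerate(clues, 1)) +
        f"\n\nQuestion: {question}\n\n"
        "Solution:\n"
        "(1) First, go through the clues one by one and consider whether "
        "the clue is potentially relevant:"
    )
    response = client.completions.create(
        model="gpt-3.5-turbo-instruct", prompt=prompt,
        max_tokens=500, temperature=0,
    )
    return response.choices[0].text
```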
@@ -136,9 +136,9 @@ By giving the model more time and space to think, and guiding it along a reasoni

Another benefit of splitting complex instructions into smaller subtasks is that it can help keep the model focused on each subtask.

-For example, if we ask `text-davinci-002` to summarize a text in its original language, the model can lapse back into English:
+For example, if we ask `gpt-3.5-turbo-instruct` to summarize a text in its original language, the model can lapse back into English:

-```text-davinci-002
+```gpt-3.5-turbo-instruct
Summarize the text using the original language of the text. The summary should be one sentence long.

Text:

@@ -149,13 +149,13 @@ La estadística (la forma femenina del término alemán Statistik, derivado a su
Summary:
```

-```text-davinci-002
+```gpt-3.5-turbo-instruct
The text explains that statistics is a science that studies the variability, collection, organization, analysis, interpretation, and presentation of data, as well as the random process that generates them following the laws of probability.
```

However, if we first ask the model to identify the language of the text, and then summarize the text, it becomes more reliable:

-```text-davinci-002
+```gpt-3.5-turbo-instruct
First, identify the language of the text. Second, summarize the text using the original language of the text. The summary should be one sentence long.

Text:

@@ -166,7 +166,7 @@ La estadística (la forma femenina del término alemán Statistik, derivado a su
Language:
```

-```text-davinci-002
+```gpt-3.5-turbo-instruct
Spanish

La estadística es una ciencia que estudia la variabilidad, colección, organización, análisis, interpretación, y presentación de los datos, así como el proceso aleatorio que los genera siguiendo las leyes de la probabilidad.
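The prompt above performs both steps inside a single completion. As a variant, the split can be made explicit across two API calls, feeding the detected language into the second prompt; a sketch, with illustrative wording:

```python
# Sketch: making the identify-then-summarize split explicit as two calls.
from openai import OpenAI

client = OpenAI()

def complete(prompt: str) -> str:
    response = client.completions.create(
        model="gpt-3.5-turbo-instruct", prompt=prompt,
        max_tokens=200, temperature=0,
    )
    return response.choices[0].text.strip()

text = "La estadística es una ciencia..."  # placeholder for the full passage

# Step 1: identify the language of the text
language = complete(
    f"Identify the language of the following text.\n\nText:\n{text}\n\nLanguage:"
)

# Step 2: summarize the text in that language
summary = complete(
    f"Summarize the following text in {language}. "
    f"The summary should be one sentence long.\n\nText:\n{text}\n\nSummary:"
)
print(summary)
```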
@@ -203,7 +203,7 @@ To learn more, read the [full paper](https://arxiv.org/abs/2205.11916).

If you apply this technique to your own tasks, don't be afraid to experiment with customizing the instruction. `Let's think step by step` is rather generic, so you may find better performance with instructions that hew to a stricter format customized to your use case. For example, you can try more structured variants like `First, think step by step about why X might be true. Second, think step by step about why Y might be true. Third, think step by step about whether X or Y makes more sense.`. And you can even give the model an example format to help keep it on track, e.g.:

-```text-davinci-002
+```gpt-3.5-turbo-instruct
Using the IRS guidance below, answer the following questions using this format:
(1) For each criterion, determine whether it is met by the vehicle purchase
- {Criterion} Let's think step by step. {explanation} {yes or no, or if the question does not apply then N/A}.

@@ -229,7 +229,7 @@ Solution:
- Does the vehicle have at least four wheels? Let's think step by step.
```

-```text-davinci-002
+```gpt-3.5-turbo-instruct
The Toyota Prius Prime has four wheels, so the answer is yes.
- Does the vehicle weigh less than 14,000 pounds? Let's think step by step. The Toyota Prius Prime weighs less than 14,000 pounds, so the answer is yes.
- Does the vehicle draw energy from a battery with at least 4 kilowatt hours that may be recharged from an external source? Let's think step by step. The Toyota Prius Prime has a battery with at least 4 kilowatt hours that may be recharged from an external source, so the answer is yes.
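Scaffolds like this are easiest to reuse when kept as a template. A small sketch; the `{guidance}` and `{vehicle}` fields are assumptions, and the elided IRS guidance is left elided:

```python
# Sketch: keeping the customized step-by-step scaffold as a reusable template.
# Doubled braces survive .format() as literal {Criterion}-style placeholders.
TEMPLATE = """Using the IRS guidance below, answer the following questions using this format:
(1) For each criterion, determine whether it is met by the vehicle purchase
- {{Criterion}} Let's think step by step. {{explanation}} {{yes or no, or if the question does not apply then N/A}}.

IRS guidance:
{guidance}

Vehicle: {vehicle}

Solution:
- Does the vehicle have at least four wheels? Let's think step by step."""

prompt = TEMPLATE.format(guidance="...", vehicle="Toyota Prius Prime")
```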

articles/text_comparison_examples.md

Lines changed: 10 additions & 10 deletions
@@ -8,8 +8,8 @@ Embeddings can be used for semantic search, recommendations, cluster analysis, n

For more information, read OpenAI's blog post announcements:

-* [Introducing Text and Code Embeddings (Jan 2022)](https://openai.com/blog/introducing-text-and-code-embeddings/)
-* [New and Improved Embedding Model (Dec 2022)](https://openai.com/blog/new-and-improved-embedding-model/)
+- [Introducing Text and Code Embeddings (Jan 2022)](https://openai.com/blog/introducing-text-and-code-embeddings/)
+- [New and Improved Embedding Model (Dec 2022)](https://openai.com/blog/new-and-improved-embedding-model/)

For comparison with other embedding models, see [Massive Text Embedding Benchmark (MTEB) Leaderboard](https://huggingface.co/spaces/mteb/leaderboard)

@@ -19,14 +19,14 @@ Embeddings can be used for search either by themselves or as a feature in a larg

The simplest way to use embeddings for search is as follows:

-* Before the search (precompute):
-  * Split your text corpus into chunks smaller than the token limit (8,191 tokens for `text-embedding-ada-002`)
-  * Embed each chunk of text
-  * Store those embeddings in your own database or in a vector search provider like [Pinecone](https://www.pinecone.io), [Weaviate](https://weaviate.io) or [Qdrant](https://qdrant.tech)
-* At the time of the search (live compute):
-  * Embed the search query
-  * Find the closest embeddings in your database
-  * Return the top results
+- Before the search (precompute):
+  - Split your text corpus into chunks smaller than the token limit (8,191 tokens for `text-embedding-3-small`)
+  - Embed each chunk of text
+  - Store those embeddings in your own database or in a vector search provider like [Pinecone](https://www.pinecone.io), [Weaviate](https://weaviate.io) or [Qdrant](https://qdrant.tech)
+- At the time of the search (live compute):
+  - Embed the search query
+  - Find the closest embeddings in your database
+  - Return the top results

An example of how to use embeddings for search is shown in [Semantic_text_search_using_embeddings.ipynb](../examples/Semantic_text_search_using_embeddings.ipynb).
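A minimal sketch of the precompute/live-compute split described above, using the `text-embedding-3-small` model named in this diff and cosine similarity for ranking; the corpus and query are illustrative:

```python
# Sketch of embeddings-based search: embed chunks ahead of time, then embed
# the query and rank chunks by cosine similarity.
import numpy as np
from openai import OpenAI

client = OpenAI()

def embed(texts: list[str]) -> np.ndarray:
    response = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return np.array([item.embedding for item in response.data])

# Before the search (precompute)
chunks = ["Embeddings measure how similar two texts are.", "GPT-4 can write code."]
chunk_vectors = embed(chunks)

# At the time of the search (live compute)
query_vector = embed(["How do I compare two pieces of text?"])[0]
scores = chunk_vectors @ query_vector / (
    np.linalg.norm(chunk_vectors, axis=1) * np.linalg.norm(query_vector)
)
for i in np.argsort(-scores):  # top results first
    print(f"{scores[i]:.3f}  {chunks[i]}")
```

At corpus scale, the ranking step would move into one of the vector stores listed above rather than in-memory numpy.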

examples/Classification_using_embeddings.ipynb

Lines changed: 11 additions & 13 deletions
Large diffs are not rendered by default.
