LLMs are pretty amazing at understanding natural language. When you combine them with RAG (Retrieval-Augmented Generation), it’s like a match made in heaven. You can use this combo for all sorts of things, like answering questions over a knowledge base, building interactive chatbots, powering smart coding assistants, and more.
But if you’re worried about privacy, shipping your data off to cloud-based LLMs like GPT or Gemini can be a problem.
What if you could build an astonishingly good RAG pipeline using a small LLM?