2024-05-25LangChain Agentの仕組み@機械学習社会実装勉強会第35回

LangChain Agentの仕組み 2024/05/25 那珂将人

内容 1. 「Agentをよく使うが実際にどうなっているのかよくわからない」を全体の流れをわかるようになる 2. AgentのTypeが多いので一つを例にでも理解を深めておくと今後役立つ 3. 今後AgentなどLangChainの中身を見るきっかけになる (今回はLangChain 0.1までのAgentsに関してです。
LangChain 0.2では、AgentsもLangGraphに移行するように奨励されています。Ref: How to migrate from legacy LangChain agents to LangGraph)

Agentとは The core idea of agents is to use a
language model to choose a sequence of actions to take. 言語モデルを使って一連の実行アクションを選択する Agent Tool Google検索 Tool Graph可視化日本の人口の推移をグラフにして

Concepts 1. Schema: AgentAction, AgentFinish, Intermediate Steps a. Agentから返されるのは次の AgentのアクションかAgentのアクションの終了サインか
b. 中間Stepsの結果はIntermediate stepsに格納 2. Agent: Agent inputs & Agent outputs a. AgentのInputに合わせてAgentを実行してOutputを取得 3. AgentExecutor: Agentを実際に呼び出し、Actionを実行し、Outputをベースに次の Actionを実行していく 4. Tools: Agentが実行できる選択肢 a. 例. TavilySearchResults https://tavily.com/ (月1000コールまで無料)、自分で書いた関数を Toolとして使うこともできる

全体像 1. AgentExecutor 2. Agent

Example 1. tool: a. TavilySearchResults: https://docs.tavily.com/docs/welcome (search engine for AI
agents) b. multiply: custom tool 2. prompt: hub 3. llm: ChatOpenAI 4. agent: create_react_agent 5. agent_executor

Step1: Toolの準備 1. TavilySearchResults a. API keyが必要 b. 1000 requests
per monthまで無料 2. multiply a. 2 つの数字の掛け算をする関数 + @tool

Step2: Chat model & Prompt 1. ChatOpenAI を設定 2. Promptを準備
a. https://smith.langchain.com/hub/hwchase17/openai-functions-agent b. langchain hubで公開されているpromptを仕様できる

Step3: Agentを設定 1. create_tool_calling_agentにllm, tools, promptを入れる 2. agent.invokeすると、ToolAgentActionが返される create_tool_calling_agentは後ほど

Step3: Agentをinvoke例: “hi” 1. input: hi 2. output: AgentFinish(return_values{‘output’: ‘Hello!
How can I assist you today?’})

Step3: Agentをinvoke例: “What’s the weather in SF?” 1. input: What’s
the weather in SF? 2. output: ToolAgentAction(tool=’tavily_search_results_json’, tool_input={‘query’: ‘current weather in San Francisco’}})

Step3: Agentをinvoke例: “What’s 3 times 2?” 1. input: What’s 3
times 2? 2. output: ToolAgentAction(tool=’multiply’, tool_input={‘a’: 3, ‘b’: 2}})

Step4: AgentExecutor 1. AgentExecutorにagent, toolsを渡して初期化

Step4: AgentExecutor 実行してみる 1. What’s the weather in SF? 2.
What’s 2 times 3?

Step4: AgentExecutor 「what’s 2 times 3の答えとSFのセ氏での気温とどちらの数字が大きいですか」

Agentの中身

Agent: 作成create_tool_calling_agent 全体の流れ 1. バリデーション 2. llm.bind_tools(tools) 3. agentの作成

Agentのステップ Agentは4つのステップからなっている

Agent.steps[0] format_to_tool_messages: AgentActionとツールのアウトプットからFunctionMessageへ変換する 1. Input: intermediate_steps: Sequence[Tuple[AgentAction, str]] 2.
Output: List[BaseMessage] AgentActionを実行し、ToolのOutputを取得してから、それらを中間ステップ (intermediate_steps)にBaseMessageのリストとしてためていくためのステップ

Agent: agent.steps[1].prompt 一つ目のステップで生成した、過去のAgentActionとToolのOutputの歴史が入ったものも含めて、次のllm呼び出しで使うpromptを決める 1. input_variables: a. agent_scratchpad: １つ目のstepで生成されたagentとToolのOutputの歴史 b.
input: もともとの質問 2. input_types 3. messages: a. SystemMessagePromptTemplate: “You are a helpful assistant” b. MessagesPlaceholder: chat_history c. HumanMessagePromptTemplate: input d. MessagesPlaceholder: agent_scratchpad

Agent: agent.steps[2] 1. 次のActionをllmに聞く a. tool_calls に name: tavily_search_results_jsonと引数argsが返される 2.
返ってくるのはAIMessage

Agent: agent.steps[2]: llm.bind_tools 1. llm.bind_tools -> Runnableを返す 2. llmを呼ぶときにtoolを一緒に渡して、ツールと Argsを取得できる
3. 例.

llm.bind_tools 1. llm.bind_tools(tools, tool_choice, **kwargs) a. Runnableを返す 2. toolを追加 3.
BaseChatOpenAI.bind_tools(tools, tool_choice) a. 4. BaseChatModel.bind(**kwargs) a. Useful when a runnable in a chain requires an argument that is not in the output of the previous runnable or included in the user input. b. kwargsにはtool 5. Runnable.bind() -> Runnable 6.

agent.steps[3]: Output Parser 1. ToolsAgentOutputParser() 前のStepで返されたAIMessageをToolAgentAction/AgentFinishに変換 ※Toolはまだ実行されてない

ここまでのまとめ 1. AgentのStepは４つ a. steps[0]: AgentActionとToolの結果からMessageに変換してagent_scratchpadへ格納 b. steps[1]: prompt (input,
agent_scratchpad)をinput variablesにもつ c. steps[2]: RunnableBinding llmにtoolsをbindしたもので、inputからどのツールを使えばよいかを llm に聞くステップ d. steps[3]: Output parser 上のステップでllmから返ってきた結果を ToolAgentAction/AgentFinishに変換する 2. 注意 a. 実際のToolの実行はしない b. Agentで定義されているのは、次の Actionを決定すること

全体像 1. AgentExecutor 2. Agent

AgentExecutor

AgentExecutor: AgentとToolを使って最後の答えを出す AgentExecutorの仕事は、 1. Agentを呼び出して、次のアクションを聞く 2. ツールを使う場合は、実際のツールの呼び出しをする 3. その結果を含めて、Agentを呼び出して次のアクションを聞く 4.
… 5. Agentに終了と言われたら、最後の答えを返す

AgentExecutorの作成 AgentExecutorをagent, toolsで初期化

AgeneExecutorはChainの子クラス

Chainのinvokeが呼ばれ、その中のself._callが呼ばれる ChainのInvoke _callは子クラスで実装される

AgentExecutorに戻って _call 1. whileループ: continueの条件を満たす限り 2. self._take_next_step: agentを呼び Next
Actionを取得し、 AgentActionがあればtoolを呼び実行した結果を返す 3. next_step_outputに値よって終了か、intermediate_stepsに追加 4. (_get_tool_return: agentからの結果を直接返すかどうか) 5. _return(): 最終的な結果を返す

まとめ 1. Agent a. 質問の内容と今までの Agentの実行履歴から、次の Actionを返す b. Input: 最初のユーザからの質問、
intermediate_steps c. Output: 次のAction (AgentFinish or AgentAction) 2. AgentExecutor a. AgentがFinishを返すか、実行最大数 or最大時間に到達するまで、Agentを繰り返し呼ぶ b. Agentに選ばれたツールを実行し、その結果を intermediate_stepsを更新しながらAgentを呼び続ける

2024-05-25LangChain Agentの仕組み@機械学習社会実装勉強会第35回

2024-05-25LangChain Agentの仕組み@機械学習社会実装勉強会第35回

Naka Masato

More Decks by Naka Masato

Other Decks in Technology

Featured

Transcript

LangChain Agentの仕組み 2024/05/25 那珂将人

Agentとは The core idea of agents is to use a

Concepts 1. Schema: AgentAction, AgentFinish, Intermediate Steps a. Agentから返されるのは次の AgentのアクションかAgentのアクションの終了サインか

全体像 1. AgentExecutor 2. Agent

Example 1. tool: a. TavilySearchResults: https://docs.tavily.com/docs/welcome (search engine for AI

Step1: Toolの準備 1. TavilySearchResults a. API keyが必要 b. 1000 requests

Step2: Chat model & Prompt 1. ChatOpenAI を設定 2. Promptを準備

Step3: Agentを設定 1. create_tool_calling_agentにllm, tools, promptを入れる 2. agent.invokeすると、ToolAgentActionが返される create_tool_calling_agentは後ほど

Step3: Agentをinvoke例: “hi” 1. input: hi 2. output: AgentFinish(return_values{‘output’: ‘Hello!

Step3: Agentをinvoke例: “What’s the weather in SF?” 1. input: What’s

Step3: Agentをinvoke例: “What’s 3 times 2?” 1. input: What’s 3

Step4: AgentExecutor 1. AgentExecutorにagent, toolsを渡して初期化

Step4: AgentExecutor 実行してみる 1. What’s the weather in SF? 2.

Step4: AgentExecutor 「what’s 2 times 3の答えとSFのセ氏での気温とどちらの数字が大きいですか」

Agentの中身

Agent: 作成create_tool_calling_agent 全体の流れ 1. バリデーション 2. llm.bind_tools(tools) 3. agentの作成

Agentのステップ Agentは4つのステップからなっている

Agent.steps[0] format_to_tool_messages: AgentActionとツールのアウトプットからFunctionMessageへ変換する 1. Input: intermediate_steps: Sequence[Tuple[AgentAction, str]] 2.

Agent: agent.steps[2] 1. 次のActionをllmに聞く a. tool_calls に name: tavily_search_results_jsonと引数argsが返される 2.

Agent: agent.steps[2]: llm.bind_tools 1. llm.bind_tools -> Runnableを返す 2. llmを呼ぶときにtoolを一緒に渡して、ツールと Argsを取得できる

llm.bind_tools 1. llm.bind_tools(tools, tool_choice, **kwargs) a. Runnableを返す 2. toolを追加 3.

agent.steps[3]: Output Parser 1. ToolsAgentOutputParser() 前のStepで返されたAIMessageをToolAgentAction/AgentFinishに変換 ※Toolはまだ実行されてない

ここまでのまとめ 1. AgentのStepは４つ a. steps[0]: AgentActionとToolの結果からMessageに変換してagent_scratchpadへ格納 b. steps[1]: prompt (input,

全体像 1. AgentExecutor 2. Agent

AgentExecutor

AgentExecutorの作成 AgentExecutorをagent, toolsで初期化

AgeneExecutorはChainの子クラス

Chainのinvokeが呼ばれ、その中のself._callが呼ばれる ChainのInvoke _callは子クラスで実装される

AgentExecutorに戻って _call 1. whileループ: continueの条件を満たす限り 2. self._take_next_step: agentを呼び Next

まとめ 1. Agent a. 質問の内容と今までの Agentの実行履歴から、次の Actionを返す b. Input: 最初のユーザからの質問、