The video mentions using the Pydantic library to define data models for structured output.
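As a minimal sketch of that idea (using Pydantic v2; the field names here are illustrative, not necessarily the ones used in the video):

from pydantic import BaseModel

class CalendarEvent(BaseModel):
    # Target schema for the LLM's structured output.
    name: str
    date: str
    participants: list[str]

# Validate and parse a JSON string returned by the LLM into a typed object.
raw_json = '{"name": "Team sync", "date": "2024-06-07", "participants": ["Alice", "Bob"]}'
event = CalendarEvent.model_validate_json(raw_json)
print(event.name, event.participants)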
In the "tool use" example, the application handles the direct call to the external API. The LLM's role is to identify that a tool needs to be called and to provide the specific parameters (like latitude and longitude for a weather API) required for that tool. The Python script then takes these parameters and makes the actual API call.
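As a rough illustration of that division of labor (a sketch only; the weather service here is assumed to be the free Open-Meteo forecast endpoint, not necessarily the one used in the video):

import requests

def get_weather(latitude: float, longitude: float) -> dict:
    # The LLM only supplies latitude and longitude; this function makes the real HTTP call.
    response = requests.get(
        "https://api.open-meteo.com/v1/forecast",  # assumed public weather API
        params={
            "latitude": latitude,
            "longitude": longitude,
            "current": "temperature_2m,wind_speed_10m",
        },
        timeout=10,
    )
    response.raise_for_status()
    return response.json()["current"]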
This video is a beginner's course on building AI agents using pure Python and directly interacting with large language model (LLM) APIs, such as OpenAI. It demonstrates how to leverage fundamental building blocks like basic API calls, structured output, and tool usage to create effective AI systems without relying on external frameworks. The course also covers workflow patterns like prompt chaining, routing, and parallelization, offering practical examples and insights for developers.
Yes, the video demonstrates several other functionalities of AI agents beyond just generating text, such as producing structured output with Pydantic models, calling external tools like a weather API, and orchestrating workflow patterns like prompt chaining, routing, and parallelization.
In the context of the video, the LLM response itself doesn't directly call a function. Instead, the LLM is configured to output a specific format that indicates which function should be called and with what arguments.
Here's how it works and what triggers the function call:
The LLM is configured (via its tools definition, response_format, or similar settings) to output a structured response that signifies a tool call. This response typically includes:

The name of the tool (function) it wants to call.
The arguments it has determined for that function, formatted according to the tool's definition.
A finish_reason that indicates tool_calls. This is the primary trigger.

Your application then:

Reads the tool name and arguments provided by the LLM.
Uses conditional logic (e.g., if tool_name == "get_weather":) to match the LLM's request to the actual Python function defined in your code.
Calls that function with the arguments that the LLM extracted.

So, the LLM generates a structured request to call a function, and your application's logic acts upon that request to actually execute the function. The "trigger" is the application detecting the specific tool-call format in the LLM's response.
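For orientation, a tool-call response has roughly this shape (shown here in the structure used by the OpenAI Chat Completions API; the values are purely illustrative):

llm_response = {
    "choices": [{
        "finish_reason": "tool_calls",  # the primary trigger
        "message": {
            "role": "assistant",
            "content": None,
            "tool_calls": [{
                "id": "call_abc123",
                "type": "function",
                "function": {
                    "name": "get_weather",
                    # arguments arrive as a JSON string, not a dict
                    "arguments": '{"latitude": 52.52, "longitude": 13.41}',
                },
            }],
        },
    }]
}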
You're touching on a crucial aspect of advanced LLM usage: handling more complex reasoning processes like Chain-of-Thought (CoT) alongside structured outputs. The key is to design your prompts and parsing logic to accommodate these intermediate thinking steps.
Here's how you can approach handling "thinking models" or CoT outputs before a final structured output:
Prompt Engineering for CoT + Structured Output:
You are an AI assistant tasked with extracting event details.
First, think step-by-step to determine if the user's request describes a calendar event.
Show your reasoning.
Once you have completed your reasoning and confirmed it's a calendar event, extract the following information and provide it as a JSON object with keys: "name", "date", "participants".
If it's not a calendar event, simply state "Not a calendar event."
User request: [user_input_here]
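One way to send that prompt (a sketch assuming the official openai Python client; COT_EXTRACTION_PROMPT is a placeholder holding the system prompt above, and the model name is illustrative):

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

completion = client.chat.completions.create(
    model="gpt-4o",  # placeholder model name
    messages=[
        {"role": "system", "content": COT_EXTRACTION_PROMPT},
        {"role": "user", "content": "Lunch with Sarah and Tom next Friday at noon."},
    ],
)
print(completion.choices[0].message.content)  # reasoning text followed by JSON, or "Not a calendar event."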
LLM Configuration for "Function Calling" or Tool Use:
You still define your tools (functions) as before. The LLM's goal is still to identify if a tool call is appropriate.

Handling the LLM's Response: This is where your application logic becomes more sophisticated:
Scenario A: LLM provides the structured tool call directly (most common with function calling)
The LLM's response contains a tool_calls object, specifying the function name and arguments. Your application parses this as described previously. The CoT is usually internal to the LLM's generation process and not directly exposed in the structured output intended for function calls.

Scenario B: LLM provides CoT and then structured output (less common for direct function calls, more for general structured data)

The LLM first writes out its reasoning as free text and then emits a JSON object (starting with {) or the specific structured format you requested. Your application has to locate that trailing structured portion and parse it separately from the reasoning (see the parsing sketch after these scenarios).

Scenario C: Hybrid Approach (using tool use for the final step)

The LLM reasons internally and then responds with a tool_calls object. If you want to also capture the CoT, you might need to prompt the model to include its reasoning explicitly, for example as an additional argument of the tool or in a separate preliminary response.
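For Scenario B, a minimal parsing sketch (assuming the prompt shown earlier, i.e., free-text reasoning followed by a single JSON object; extract_trailing_json is a hypothetical helper):

import json

def extract_trailing_json(text: str):
    # Pull out the JSON object that follows the model's free-text reasoning.
    # Assumes the reasoning itself contains no curly braces; returns None if parsing fails.
    start = text.find("{")
    end = text.rfind("}")
    if start == -1 or end == -1 or end <= start:
        return None
    try:
        return json.loads(text[start:end + 1])
    except json.JSONDecodeError:
        return None

reply = (
    "The request mentions a meeting with Alice on 2024-06-07, so it is a calendar event.\n"
    '{"name": "Meeting with Alice", "date": "2024-06-07", "participants": ["Alice"]}'
)
print(extract_trailing_json(reply))  # parsed dict, or None for "Not a calendar event." replies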
What Triggers a Function Call?
The "trigger" for a function call, especially when using LLM APIs with built-in function calling capabilities, is the LLM's output being structured to indicate a tool call. Specifically, the API response often contains a field (like finish_reason: "tool_calls") and a tool_calls array detailing the function name and arguments.
If you're not using a dedicated function-calling API feature and are just parsing a general JSON output after CoT, the trigger is your application's code: it detects the structured portion of the response, parses it, and decides which of your functions to call with the extracted values.
In essence, the LLM generates a description of what needs to be done (including reasoning and desired final output/action), and your application code interprets that description and executes the corresponding actions or function calls.
Parsing the LLM's output to call a function typically involves these steps, especially when using APIs designed for function calling or tool use:
Receive the LLM Response: Your application receives a response object (often JSON) from the LLM API.
Check for Tool Calls: The first crucial step is to determine if the LLM decided to call a tool. Many LLM APIs indicate this in the response. For example, OpenAI's API has a finish_reason field that might be set to "tool_calls". You'll need to check this field.
Access the tool_calls Array: If the finish_reason indicates tool calls, you'll then access the specific part of the response that contains the list of tool calls. In OpenAI's API, this is typically under a key like message['tool_calls'].
Iterate Through Tool Calls (if multiple): A single LLM response might contain instructions to call multiple tools. You'll likely iterate through each item in the tool_calls array.
Extract Function Name: For each tool call, extract the intended function's name. This is usually found under a key like function['name'].
Extract Function Arguments: Also extract the arguments the LLM determined for that function. These are typically found under function['arguments']. This is often a JSON string that needs to be parsed further.
Parse the Arguments String: Since the arguments are usually provided as a JSON string, you'll need to use a JSON parser (like Python's built-in json module) to convert this string into a Python dictionary or object.
import json

arguments_string = tool_call['function']['arguments']
try:
    arguments_dict = json.loads(arguments_string)
except json.JSONDecodeError as e:
    print(f"Error decoding arguments JSON: {e}")
    # Handle the error, perhaps by logging or skipping this call
    continue
Map Function Name to Actual Python Function: You'll need a way to map the function name provided by the LLM (e.g., "get_weather") to the actual Python function defined in your codebase (e.g., def get_weather(latitude: float, longitude: float): ...). A common way to do this is:
available_tools = {
    "get_weather": get_weather_function,
    "schedule_event": schedule_event_function,
    # ... other tools
}
Alternatively, you can use if/elif/else statements based on the tool_name.

Call the Python Function with Parsed Arguments: Once you have the Python function object and the parsed arguments dictionary, you can call the function. You'll need to ensure the arguments from the dictionary match the expected parameters of your Python function. Python's **kwargs can be very helpful here.
tool_name = tool_call['function']['name']

if tool_name in available_tools:
    function_to_call = available_tools[tool_name]
    try:
        # Safely call the function with arguments
        tool_response = function_to_call(**arguments_dict)
        # Now you have the result from your tool
    except Exception as e:
        print(f"Error calling function {tool_name}: {e}")
        # Handle execution errors
else:
    print(f"Unknown tool: {tool_name}")
    # Handle cases where the LLM requested a tool your app doesn't recognize
Handle the Tool's Output: The tool_response you get back from calling your Python function is then typically sent back to the LLM in a subsequent API call to allow the LLM to formulate a final user-facing response.
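A minimal sketch of that round trip (assuming the official openai Python client, plus the messages list, tools definition, tool_call, and tool_response from the previous steps):

# Record the assistant's tool-call turn and your tool's result,
# then ask the LLM for the final user-facing answer.
messages.append(llm_response["choices"][0]["message"])  # the assistant message containing tool_calls
messages.append({
    "role": "tool",
    "tool_call_id": tool_call["id"],  # links the result back to the specific call
    "content": str(tool_response),
})

final = client.chat.completions.create(
    model="gpt-4o",   # placeholder model name
    messages=messages,
    tools=tools,      # same tool definitions as the first call
)
print(final.choices[0].message.content)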
Simplified Example Flow (Python):
import json

# Assume 'llm_response' is the JSON object from the API
# Assume 'available_tools' is a dictionary mapping tool names to functions
# Assume 'get_weather' and 'schedule_event' are your Python functions

if llm_response.get("choices")[0].get("finish_reason") == "tool_calls":
    for tool_call in llm_response["choices"][0]["message"]["tool_calls"]:
        function_name = tool_call["function"]["name"]
        arguments_string = tool_call["function"]["arguments"]

        try:
            arguments = json.loads(arguments_string)
        except json.JSONDecodeError:
            print(f"Failed to parse arguments for tool: {function_name}")
            continue  # Skip to the next tool call

        if function_name in available_tools:
            function_to_execute = available_tools[function_name]
            try:
                # Call the actual Python function with the parsed arguments
                tool_result = function_to_execute(**arguments)
                print(f"Successfully called {function_name} with result: {tool_result}")
                # In a real app, you'd append this result back to the conversation history
                # to send back to the LLM for a final response.
            except TypeError as e:
                print(f"Argument mismatch for function {function_name}: {e}")
            except Exception as e:
                print(f"Error executing function {function_name}: {e}")
        else:
            print(f"LLM requested unknown tool: {function_name}")
This process ensures that the LLM's intent to use a tool is correctly interpreted by your application, and the appropriate Python function is invoked with the right parameters.
To chain more functions, concatenate their results, and pass them to the LLM for further processing or a final response, you'll build upon the principles of handling single tool calls. The core idea is to create a loop or a sequence that manages multiple function calls and their outputs.
Here's a breakdown of how to chain functions and concatenate results for the LLM:
Define All Functions (Tools):
Build your tool registry (e.g., the available_tools dictionary) so that it includes all these functions, mapping their LLM-recognizable names to your Python function objects.

Handle Multiple Tool Calls in the LLM Response:
The LLM may return several tool_calls in a single response. Your parsing logic needs to handle this by iterating through the tool_calls array as shown previously.

Sequential Function Calling with LLM Orchestration:
Your application parses the tool_calls and executes the first function. You then append a message containing the tool_result from its execution to the conversation history. This is crucial for the LLM to understand what happened.
# Example structure to append to message history
message_history.append({
    "role": "tool",
    "tool_call_id": tool_call["id"],  # Keep track of which call this answers
    "name": function_name,
    "content": str(tool_result)  # Convert result to string
})
Send the updated message_history (including the tool result) back to the LLM. Ask it: "Based on the previous steps and the results, what should I do next?" The LLM now sees the tool_result and may decide to call the next tool in the chain, ask for clarification, or generate the final response.
Concatenating Results for the Prompt (for the next LLM call):
Every time you append a tool_result to the message history, you're effectively concatenating the results. The LLM receives the full context: the original user request, its own earlier tool-call messages, and every tool result appended so far. The content of the "tool" role message is how the results are concatenated and fed back into the LLM's "prompt" (which is technically the entire conversation history).

Example: Chaining Two Functions
Let's say you have get_user_location(user_id) and get_weather(latitude, longitude).
You define both tools: get_user_location and get_weather.
The LLM's first response contains tool_calls for get_user_location with user_id="Bob".
Your application parses the tool_calls for get_user_location and calls get_user_location(user_id="Bob"). Let's say it returns {"latitude": 34.0522, "longitude": -118.2437}.
You append this result to the message history:

message_history.append({
    "role": "tool",
    "tool_call_id": "call_abc123",  # ID from LLM response
    "name": "get_user_location",
    "content": json.dumps({"latitude": 34.0522, "longitude": -118.2437})
})
You call the LLM again; the updated message_history is sent.
The LLM now decides to call get_weather using the latitude and longitude it just received. It might return a tool_calls for get_weather(latitude=34.0522, longitude=-118.2437).
Your application parses the tool_calls for get_weather and calls get_weather(latitude=34.0522, longitude=-118.2437). Let's say it returns {"temperature": 75, "unit": "Fahrenheit", "description": "Sunny"}.
You append this result to the message history as well:

message_history.append({
    "role": "tool",
    "tool_call_id": "call_def456",  # ID from LLM response
    "name": "get_weather",
    "content": json.dumps({"temperature": 75, "unit": "Fahrenheit", "description": "Sunny"})
})
You call the LLM one final time; the full message_history is sent again. With no further tools to call, the LLM can now formulate the final user-facing response.

Key Considerations:
State management: maintain the conversation state (message_history) throughout the chain.
Termination: stop looping once the LLM's finish_reason is not tool_calls (e.g., stop).
Robust parsing: use try-except blocks when parsing JSON arguments, as LLM output can sometimes be malformed.

By iteratively calling the LLM and feeding back the results of executed functions, you can orchestrate complex workflows that chain multiple functions together.
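Putting it together, a compact orchestration loop could look like this (a sketch, assuming the official openai Python client plus the tools definitions and available_tools registry from earlier; the model name and turn limit are placeholders):

import json
from openai import OpenAI

client = OpenAI()

def run_agent(messages, tools, available_tools, model="gpt-4o", max_turns=5):
    # Call the LLM repeatedly, executing any requested tools, until it stops asking for them.
    for _ in range(max_turns):
        completion = client.chat.completions.create(model=model, messages=messages, tools=tools)
        choice = completion.choices[0]

        if choice.finish_reason != "tool_calls":
            return choice.message.content  # final user-facing answer

        messages.append(choice.message)  # record the assistant's tool-call turn
        for tool_call in choice.message.tool_calls:
            name = tool_call.function.name
            try:
                arguments = json.loads(tool_call.function.arguments)
                result = available_tools[name](**arguments)
            except (json.JSONDecodeError, KeyError, TypeError) as e:
                result = f"Error running {name}: {e}"
            messages.append({
                "role": "tool",
                "tool_call_id": tool_call.id,
                "content": str(result),
            })
    return "Stopped after reaching the turn limit."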