How-To
The Kernel SDK provides all the building blocks needed to create sophisticated AI applications. If you want to include any dependencies in your Skill, have a look here
Completion
The base building block of any AI framework is the ability to do completion requests:
from pharia_skill import CompletionParams, Csi, skill
from pydantic import BaseModel

# define Input & Output models
# ...

@skill
def complete(csi: Csi, input: Input) -> Output:
    prompt = f"""<|begin_of_text|><|start_header_id|>system<|end_header_id|>

You are a poet who strictly speaks in haikus.<|eot_id|><|start_header_id|>user<|end_header_id|>

{input.topic}<|eot_id|><|start_header_id|>assistant<|end_header_id|>"""
    params = CompletionParams(max_tokens=64)
    completion = csi.complete("llama-3.1-8b-instruct", prompt, params)
    return Output(haiku=completion.text)
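The Input and Output models referenced above are plain Pydantic models. For this example they might simply carry the topic to write about and the resulting haiku (field names taken from the snippet above):

class Input(BaseModel):
    topic: str

class Output(BaseModel):
    haiku: str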
RAG
Skills can access knowledge from external documents. You can query the DocumentIndex:
from pharia_skill import Csi, IndexPath, skill

@skill
def rag(csi: Csi, input: Input) -> Output:
    # specify the index we query against
    index = IndexPath(
        namespace="my-team-namespace",
        collection="confluence",
        index="asym-256",
    )
    # search for the input topic in the confluence collection
    documents = csi.search(index, query=input.topic)
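From here, the retrieved passages can be folded into a prompt. As a rough sketch, the body of rag could continue along these lines (the content attribute on the search results and the answer field on Output are assumptions for illustration; CompletionParams is imported from pharia_skill as in the completion example):

    # stuff the retrieved passages into a prompt and answer with a completion
    context = "\n\n".join(result.content for result in documents)
    prompt = f"Answer the question based on the context below.\n\nContext:\n{context}\n\nQuestion: {input.topic}"
    params = CompletionParams(max_tokens=256)
    completion = csi.complete("llama-3.1-8b-instruct", prompt, params)
    return Output(answer=completion.text)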
Conversational Search
Conversational search means having a chat conversation with an LLM that has access to a knowledge base. To implement this, we first need a Skill that exposes a chat interface.
The OpenAI Chat API is emerging as a standard for exposing conversational interfaces of LLMs. This API is also offered in the Csi via the chat method. Leveraging this, you can easily expose your own custom-flavoured chat API as a Kernel Skill.
Note that you can expose internal datatypes in the interface of your Skill as long as they are wrapped in a Pydantic model:
from pharia_skill import Csi, Message, skill
from pydantic import BaseModel

class ChatInterface(BaseModel):
    """A chat input that is compatible with the OpenAI chat API."""

    messages: list[Message]

@skill
def conversational_search(csi: Csi, input: ChatInterface) -> ChatInterface:
    # Alter the input messages in any way to apply your own flavour.
    # You could add a search lookup to allow conversational search, or just
    # prepend a custom system prompt.
    input = do_search_lookup(input)
    output = csi.chat("llama-3.1-8b-instruct", input.messages)
    return ChatInterface(messages=input.messages + [output.message])
You only need to define the do_search_lookup function and augment the incoming messages with some context.
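What this lookup does is up to you. A minimal sketch, assuming the Csi is passed in alongside the input (so the Skill above would call do_search_lookup(csi, input)), that the search results expose their text via a content attribute, and that a Message can be constructed from a role and a content string:

from pharia_skill import Csi, IndexPath, Message

def do_search_lookup(csi: Csi, input: ChatInterface) -> ChatInterface:
    # query the DocumentIndex with the latest message of the conversation
    index = IndexPath(
        namespace="my-team-namespace",
        collection="confluence",
        index="asym-256",
    )
    results = csi.search(index, query=input.messages[-1].content)
    # prepend the retrieved passages as additional context for the model
    context = "\n\n".join(result.content for result in results)
    context_message = Message(role="user", content=f"Context:\n{context}")
    return ChatInterface(messages=[context_message, *input.messages])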
Function Calling
The llama3 module provides support for function calling, for both user-defined and built-in tools.
Tool Definition
You can define a tool by inheriting from the Tool class, which is a wrapper around a pydantic base model.
For example, suppose we want to give our model the ability to get the readme of a github repository. We can define a tool like this:
from pharia_skill.llama3 import Tool

class GetGithubReadme(Tool):
    """Get the readme of a github repository."""

    repository: str
You can provide default values for the arguments and even add a description for each field by using pydantic's Field class:
from pharia_skill.llama3 import Tool
from pydantic import Field

class GetGithubReadme(Tool):
    """Get the readme of a github repository."""

    repository: str = Field(
        description="The github repository to get the readme of.",
        default="https://github.com/aleph-alpha/pharia-kernel",
    )
The name of the tool is the snake_case version of the class name, and the docstring is passed to the LLM to describe the tool. For example, the GetGithubReadme class above is exposed to the model as get_github_readme.
Tool Usage
You can pass all available tools to the LLM by using the tools argument of the ChatRequest class.
from pharia_skill.llama3 import ChatRequest, UserMessage

message = UserMessage(content="How do I install the kernel?")
request = ChatRequest(
    model="llama-3.1-8b-instruct",
    messages=[message],
    tools=[GetGithubReadme],
)
If the model decides to use a tool, it will reply with an AssistantMessage containing the tool call. A ToolCall consists of the name of the tool and the parameters to pass to it. If you have provided the tool definition as a Pydantic model, then the parameters field will be an instance of the model. In this way, you get a type-safe way to pass parameters to your tools.
Now it is up to you to execute the tool call. Once you have executed the tool, you can pass the result to the LLM by extending the ChatRequest with a ToolMessage.
You can then trigger another round of chat with the LLM to get the final result:
from pharia_skill import Csi, skill
from pharia_skill.llama3 import ChatRequest, ToolMessage, UserMessage

@skill
def github_skill(csi: Csi, input: Input) -> Output:
    # The input has a question field, which we pass to the LLM
    message = UserMessage(content=input.question)
    request = ChatRequest(
        model="llama-3.3-70b-instruct",
        messages=[message],
        tools=[GetGithubReadme],
    )
    response = request.chat(csi)
    if not response.message.tool_calls:
        return Output(answer=str(response.message.content))

    tool_call = response.message.tool_calls[0].parameters
    assert isinstance(tool_call, GetGithubReadme)

    # execute the tool call
    readme = get_github_readme(tool_call.repository)

    # pass the result to the LLM
    request.extend(ToolMessage(readme))

    # chat again, and return the output
    response = request.chat(csi)
    return Output(answer=str(response.message.content))
Note that outbound HTTP requests are currently not supported in the Kernel. This means tools that need to make HTTP requests can only be executed in a local environment with the DevCsi class and cannot be deployed to the Kernel.
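For such local runs, get_github_readme could be implemented against the public GitHub API, for example with the requests library (both the endpoint and the dependency are illustration choices, not part of the SDK):

import requests

def get_github_readme(repository: str) -> str:
    # turn "https://github.com/aleph-alpha/pharia-kernel" into "aleph-alpha/pharia-kernel"
    repo = repository.removeprefix("https://github.com/").strip("/")
    response = requests.get(
        f"https://api.github.com/repos/{repo}/readme",
        headers={"Accept": "application/vnd.github.raw+json"},
    )
    response.raise_for_status()
    return response.text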
Code Interpreter
The CodeInterpreter tool is a built-in tool that allows the LLM to execute python code. This tool is available in the llama3 module. Here is an example of how to use it:
from typing import Any

from pydantic import BaseModel
from pharia_skill import Csi, skill
from pharia_skill.llama3 import ChatRequest, CodeInterpreter, ToolMessage, UserMessage

class Input(BaseModel):
    question: str

class Output(BaseModel):
    answer: str | None
    executed_code: str | None = None
    code_result: Any | None = None

@skill
def code(csi: Csi, input: Input) -> Output:
    """A skill that optionally executes python code to answer a question"""
    message = UserMessage(content=input.question)
    request = ChatRequest(
        model="llama-3.3-70b-instruct", messages=[message], tools=[CodeInterpreter]
    )
    response = request.chat(csi)
    if not response.message.tool_calls:
        return Output(answer=response.message.content)

    # we know that it will be code interpreter
    tool_call = response.message.tool_calls[0].parameters
    assert isinstance(tool_call, CodeInterpreter)

    output = tool_call.run()
    request.extend(ToolMessage(output))

    # chat again, and return the output
    response = request.chat(csi)
    return Output(
        answer=response.message.content,
        executed_code=tool_call.src,
        code_result=output,
    )
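To try a Skill like this locally, you can call the decorated function directly with a DevCsi instance (the pharia_skill.testing import path and the required environment configuration, such as a Pharia AI token, are assumptions; check the testing docs for your SDK version):

from pharia_skill.testing import DevCsi

csi = DevCsi()
result = code(csi, Input(question="What is the 42nd Fibonacci number?"))
print(result.answer)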