Rig Agents：高阶LLM编排框架

发布日期：2025-03-11 05:06:27 浏览次数： 2628

作者：馬喽编程笔记

微信搜一搜，关注“馬喽编程笔记”

在LLM应用开发中，如何高效管理模型、上下文和工具，构建强大的智能体（Agent）Rig提供了一种高阶的LLM编排框架，帮助开发者轻松集成RAG（检索增强生成）、工具调用和自定义配置。本文将深入解析Rig Agents的核心概念、使用方式及最佳实践，助你构建从基础聊天机器人到复杂RAG知识问答系统的AI应用。

什么是 Rig Agents？

Rig Agents是Rig框架中针对LLM高阶封装的核心组件，它提供了一种模块化、可扩展的方式来管理AI代理的行为。一个Agent结合了模型、上下文、工具和配置，能够适应各种AI应用场景。

Agent的核心结构

一个Agent主要由以下部分组成：

1.基础组件

Completion Model（如 GPT-4, Claude）
System Prompt（系统指令）
Configuration（温度、最大 token 等参数）

2.上下文管理

静态上下文：始终可用的文档
动态上下文：通过RAG从知识库检索相关信息
向量存储集成：用于语义搜索

3.工具集成

静态工具：始终可用的能力（如计算器、翻译）
动态工具：根据上下文需求动态提供的工具
工具集（ToolSet）统一管理工具

Rig Agents的使用方式

1. 创建基础Agent

以下代码展示了如何使用Rig创建一个简单的AI助手：

use rig::{providers::openai};use rig::completion::Prompt;
#[tokio::main]async fn main() -> Result<(), Box<dyn std::error::Error>> {    let client = openai::Client::from_url("ollama", "http://localhost:11434/v1");
    let agent = client.agent("qwen2:7b")        .preamble("you are a helpful assistant.")        .build();
    let response = agent.prompt("halo").await?;
    println!("{:#?}", response);
    Ok(())}

运行结果如下所示:

2. 创建 RAG Agent

RAG（Retrieval-Augmented Generation，检索增强生成）可以让AI从知识库中动态检索信息，提高回答的准确性。

use rig::{providers::openai, Embed};use serde::{Serialize};use rig::completion::Prompt;use rig::embeddings::EmbeddingsBuilder;use rig::vector_store::in_memory_store::InMemoryVectorStore;
#[derive(Embed, Serialize, Clone, Debug, Eq, PartialEq, Default)]struct WordDefinition {    id: String,    word: String,    #[embed]    definitions: Vec<String>,}
#[tokio::main]async fn main() -> Result<(), Box<dyn std::error::Error>> {    let client = openai::Client::from_url("ollama", "http://localhost:11434/v1");
    let embedding_model = client.embedding_model("qwen2:7b");
    let embeddings = EmbeddingsBuilder::new(embedding_model.clone())        .documents(vec![            WordDefinition {                id: "doc0".to_string(),                word: "flurbo".to_string(),                definitions: vec![                    "1. *flurbo* (name): A flurbo is a green alien that lives on cold planets.".to_string(),                    "2. *flurbo* (name): A fictional digital currency that originated in the animated series Rick and Morty.".to_string()                ]            },            WordDefinition {                id: "doc1".to_string(),                word: "glarb-glarb".to_string(),                definitions: vec![                    "1. *glarb-glarb* (noun): A glarb-glarb is a ancient tool used by the ancestors of the inhabitants of planet Jiro to farm the land.".to_string(),                    "2. *glarb-glarb* (noun): A fictional creature found in the distant, swampy marshlands of the planet Glibbo in the Andromeda galaxy.".to_string()                ]            },            WordDefinition {                id: "doc2".to_string(),                word: "linglingdong".to_string(),                definitions: vec![                    "1. *linglingdong* (noun): A term used by inhabitants of the far side of the moon to describe humans.".to_string(),                    "2. *linglingdong* (noun): A rare, mystical instrument crafted by the ancient monks of the Nebulon Mountain Ranges on the planet Quarm.".to_string()                ]            },        ])?        .build()        .await?;
    let vector_store = InMemoryVectorStore::from_documents(embeddings);
    let index = vector_store.index(embedding_model);
    let rag_agent = client.agent("qwen2:7b")        .preamble("            You are a dictionary assistant here to assist the user in understanding the meaning of words.            You will find additional non-standard word definitions that could be useful below.        ")        .dynamic_context(1, index)        .build();
    let resp = rag_agent.prompt("what does \"glarb-glarb\" mean?").await?;
    println!("{}", resp);

    Ok(())}

在这个Agent中，每次用户提问时，LLM会自动查询向量存储，检索最相关的文档，并将其与问题一起发送给模型，提高回答的专业度和上下文相关性。

⚠️注意: 在安装rig-core时需要开启derive这个特性，不要会报找不到Embed宏的错误。

运行结果如下所示:

3. 创建带工具的Agent

除了RAG，Rig还支持工具增强（Tool-Augmented）模式，使Agent能够调用外部API 或执行计算。

use rig::{Agent, Tool};
// 创建支持工具调用的 Agentlet agent = openai.agent("gpt-4")    .preamble("You are a capable assistant with tools.")    .tool(calculator)   // 静态工具    .tool(web_search)   // 静态工具    .dynamic_tools(2, tool_index, toolset)  // 动态工具    .build();

工具系统的作用：

静态工具：始终可用的核心能力，如数学计算、天气查询等。
动态工具：基于语义搜索，按需提供，比如智能合约查询、数据库查询等。

Rig Agents的核心特性

1. 动态上下文解析

在RAG场景中，Agent需要：

解析用户问题
查询向量存储，找到最相关的文档
结合检索结果，生成最终回答

2. 工具管理

Agent的工具系统支持：

静态 & 动态工具管理
自动解析 LLM 的工具调用
错误处理与异常恢复

最佳实践

1. 上下文管理

保持静态上下文最小化，减少token消耗。
使用动态上下文（RAG） 处理大规模知识库，提高问答质量。

2. 工具集成

静态工具 适用于核心功能，如数学计算、翻译等。
动态工具 适用于上下文相关的功能，如API查询、数据库操作。
确保工具的错误处理完备，防止运行时崩溃。

3. 性能优化

调整动态内容的采样数量，减少不必要的计算。
根据任务需求调整温度参数，如问答任务使用 0.3-0.5，创意写作使用 0.7-0.9。
监控token使用情况，避免超出LLM限制。

常见应用模式

1. 对话式AI助手

let chat_agent = openai.agent("gpt-4")    .preamble("You are a conversational assistant.")    .temperature(0.9)    .build();

2. RAG知识库

let kb_agent = openai.agent("gpt-4")    .preamble("You are a knowledge base assistant.")    .dynamic_context(5, document_store)    .temperature(0.3)    .build();

3. 以工具为核心的Agent

let tool_agent = openai.agent("gpt-4")    .preamble("You are a tool-using assistant.")    .tool(calculator)    .tool(web_search)    .dynamic_tools(2, tool_store, toolset)    .temperature(0.5)    .build();

总结

Rig Agents提供了一种模块化、可扩展的方式来管理LLM应用，支持RAG、工具调用和灵活配置，使其成为构建高级AI代理的理想选择。无论是构建聊天助手、知识库问答系统，还是工具增强型智能体，Rig都能提供强大的支持。