微信扫码
添加专属顾问
我要投稿
快速构建AI应用,Higress是您的不二选择。 核心内容: 1. AI时代对API网关的新要求 2. Higress的AI原生功能与开源优势 3. 实战演示:基于Higress搭建完整的LLM应用
一、前言
二、AI 代理
官方文档:https://help.aliyun.com/zh/mse/user-guide/ai-agent?spm=a2c4g.11186623.0.0.2927178eciPER4
应用架构
provider:type: qwenapiTokens:- sk-xxxxxxxxxxxxxxxxxxxxxxtimeout: 1200000modelMapping:'gpt-3.5-turbo': qwen-turbo'gpt-4': qwen-max'*': qwen-max
三、AI 可观测
enable: true
配置 AI 内容安全插件后,应用架构如下图所示:
serviceSource: dnsserviceName: green-cipservicePort: 443domain: green-cip.cn-hangzhou.aliyuncs.comak: xxxxxxxxxxxxxxxxxsk: xxxxxxxxxxxxxxxxx
创建一个 redis 服务并且在网关进行配置:
rule_name: default_rulerule_items:- limit_by_per_ip: from-remote-addrlimit_keys:- key: 0.0.0.0/0token_per_minute: 100redis:service_name: redis.staticservice_port: 6379username: xxxxxxpassword: xxxxxxrejected_code: 429rejected_msg: 您的请求频率过高,请稍后再试。
redis:serviceName: redis.staticservicePort: 6379timeout: 2000username: xxxxxx password: xxxxxx
dashscope:apiKey: sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxserviceName: qwenservicePort: 443domain: dashscope.aliyuncs.comdashvector:apiKey: sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxserviceName: dashvectorservicePort: 443domain: vrs-cn-xxxxxxxxxxxxxx.dashvector.cn-hangzhou.aliyuncs.comcollection: xxxxxxxxxxxxxx
prompt 模板[3]
templates:- name: "developer-chat"template:model: gpt-3.5-turbomessages:- role: systemcontent: "你是一个 {{program}} 专家, 你平时使用的编程语言为 {{language}}"- role: user content: "帮我写一个 {{program}} 程序, 你的返回结果里面应该只包含python代码"请求 body 示例如下:
{"template": "developer-chat","properties": {"program": "冒泡排序","language": "python"}}Prompt 装饰器允许用户在网关定义对 prompt 的修改操作,包括在原始请求之前和之后插入 message,配置示例如下,请求 body 与 openai 的请求一致。
prepend:- role: systemcontent: "请使用英语回答问题."append:- role: usercontent: "每次回答完问题,尝试进行反问"
response: enable: trueprompt: "帮我修改以下HTTP应答信息,要求:1. content-type修改为application/json;2. body由xml转化为json;3. 移除content-length。"provider: serviceName: qwendomain: dashscope.aliyuncs.com apiKey: sk-xxxxxxxxxxxxxxxxxxxxxxxxxxx
<?xml version='1.0' encoding='us-ascii'?><!--A SAMPLE set of slides--><slideshowtitle="Sample Slide Show"date="Date of publication"author="Yours Truly"><!-- TITLE SLIDE --><slide type="all"><title>Wake up to WonderWidgets!</title></slide><!-- OVERVIEW --><slide type="all"><title>Overview</title><item>Why <em>WonderWidgets</em> are great</item><item/><item>Who <em>buys</em> WonderWidgets</item></slide></slideshow>
使用以上配置,通过网关访问 httpbin 的 /xml 接口,结果为:
{"slideshow": {"title": "Sample Slide Show","date": "Date of publication","author": "Yours Truly","slides": [{"type": "all","title": "Wake up to WonderWidgets!"},{"type": "all","title": "Overview","items": ["Why <em>WonderWidgets</em> are great","","Who <em>buys</em> WonderWidgets"]}]}}53AI,企业落地大模型首选服务商
产品:场景落地咨询+大模型应用平台+行业解决方案
承诺:免费POC验证,效果达标后再合作。零风险落地应用大模型,已交付160+中大型企业
2026-05-17
开源、零依赖、R@5 精度 95%:agentmemory 凭什么比 mem0 更值得用
2026-05-16
Hermes Agent 深度解析:为什么它能“越用越懂你”?
2026-05-15
再见 Hermes、小龙虾! 面向 DeepSeek V4 的终端原生编程智能体来了
2026-05-15
GenericAgent 实测:Token 少用 89.6%,还能打赢 Claude Code?上下文密度才是关键
2026-05-14
腾讯开源Agent Memory,让Token消耗降低61%
2026-05-14
agents-hive 开源了:一个面向生产的Harness Agent 工程
2026-05-12
Hermes Agent 完整安装指南
2026-05-11
对话OpenClacky李亚飞:把Harness做透,Token账单就不是问题了
2026-03-30
2026-04-03
2026-03-23
2026-04-09
2026-03-31
2026-03-03
2026-04-01
2026-02-22
2026-03-04
2026-03-09
2026-05-16
2026-04-22
2026-04-21
2026-04-15
2026-04-09
2026-04-01
2026-03-17
2026-03-13