## Current Problems
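
- The latest trace no longer carries the information we used to classify workload type. Previously we classified by plugin call: a request counted as file, image, or search depending on whether it called `pdf_extractor`, `wanx`, or `tongyi_nlp_web_search`.

- The definition of a turn has become ambiguous.
manus: in agent mode, when the context grows so long that the LLM forgets earlier content, the agent has the LLM restate some of that content itself ("Let me think, I looked at xxx earlier"). This effectively moves distant context back to the end.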
```
## Old
+---------+
|  Human  |
+---------+
   |   /|\
   |    |
  \|/   |
+---------+
|   LLM   |
+---------+

## New
+---------+
|  Human  |
+---------+
   |   /|\
   |    |
  \|/   |
+---------+
|   LLM   |------+
+---------+      |
   /|\           | ----> "<Web_search> Do you understand?" "Yes, sir!"
    |            |
    +------------+
```
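
To make the ambiguity concrete, here is a minimal sketch of two ways to count turns over the same sequence of LLM calls. The `Call` type and its `caller` field are hypothetical and are not columns of the trace; they only mark whether a call was triggered by the human or by the LLM looping back on itself, as in the "New" diagram.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Call:
    """One LLM invocation. `caller` is a hypothetical label: "human" or "llm"."""
    caller: str


def count_turns_by_human(calls: List[Call]) -> int:
    # Convention A: a turn starts only when the human sends a message.
    return sum(1 for c in calls if c.caller == "human")


def count_turns_by_invocation(calls: List[Call]) -> int:
    # Convention B: every LLM invocation counts as a turn, self-calls included.
    return len(calls)


# Old flow: every call comes from the human, so both conventions agree.
old_flow = [Call("human"), Call("human")]
# New flow: the LLM calls itself twice between two human messages.
new_flow = [Call("human"), Call("llm"), Call("llm"), Call("human")]

print(count_turns_by_human(old_flow), count_turns_by_invocation(old_flow))
print(count_turns_by_human(new_flow), count_turns_by_invocation(new_flow))
```

Under the old flow the two conventions give the same number; under the new flow they diverge, which is the uncertainty noted above.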

- The output token length is missing.
Earlier traces had a `usage` field.

![[projects/kvcachecache/Trace-Qwen3.figs/250812-140723.png]]

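
For reference, the plugin-call classification used on the earlier traces (first problem above) can be sketched roughly as follows. The plugin-to-type mapping comes from the note; the function name, the "first match wins" rule, and the `"unknown"` fallback are assumptions for illustration only.

```python
from typing import Iterable

# Plugin name -> workload type, as described in the note.
PLUGIN_TO_WORKLOAD = {
    "pdf_extractor": "file",
    "wanx": "image",
    "tongyi_nlp_web_search": "search",
}


def classify_workload(plugin_calls: Iterable[str], default: str = "unknown") -> str:
    """Return the workload type of one request from the plugins it called.

    Assumptions: the first matching plugin wins, and a request that calls
    none of the known plugins falls back to `default`.
    """
    for name in plugin_calls:
        workload = PLUGIN_TO_WORKLOAD.get(name)
        if workload is not None:
            return workload
    return default


print(classify_workload(["pdf_extractor"]))          # file
print(classify_workload(["tongyi_nlp_web_search"]))  # search
print(classify_workload([]))                         # unknown
```

With the latest trace, the list of plugin calls is exactly the input that is no longer available, so every request would fall through to the fallback.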
## fields

```
__source__,__tag__:__hostname__,__tag__:__pack_id__,__tag__:__path__,__tag__:__receive_time__,__tag__:__service_name__,__tag__:__user_defined_id__,__tag__:_container_ip_,__tag__:_container_name_,__tag__:_image_name_,__tag__:_namespace_,__tag__:_pod_name_,__tag__:_pod_uid_,__tag__:eci_id,__time__,__topic__,code,context,ds_service_id,ds_service_name,interval,message,model,request_id,service_id,service_name,span_id,step,task_id,time,trace_id,user_id

['__source__', '__tag__:__hostname__', '__tag__:__pack_id__', '__tag__:__path__', '__tag__:__receive_time__', '__tag__:__service_name__', '__tag__:__user_defined_id__', '__tag__:_container_ip_', '__tag__:_container_name_', '__tag__:_image_name_', '__tag__:_namespace_', '__tag__:_pod_name_', '__tag__:_pod_uid_', '__tag__:eci_id', '__time__', '__topic__', 'code', 'context', 'ds_service_id', 'ds_service_name', 'interval', 'message', 'model', 'request_id', 'service_id', 'service_name', 'span_id', 'step', 'task_id', 'time', 'trace_id', 'user_id']

__source__
__tag__:__hostname__
__tag__:__pack_id__
__tag__:__path__
__tag__:__receive_time__
__tag__:__service_name__
__tag__:__user_defined_id__
__tag__:_container_ip_
__tag__:_container_name_
__tag__:_image_name_
__tag__:_namespace_
__tag__:_pod_name_
__tag__:_pod_uid_
__tag__:eci_id
__time__
__topic__
code
context
ds_service_id
ds_service_name
interval
message
model
request_id
service_id
service_name
span_id
step
task_id
time
trace_id
user_id
```
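
A quick way to check a trace file's header against what we expect, sketched with the standard `csv` module. Treating columns that start with `__` as logging metadata is an assumption, and `SAMPLE_HEADER` is an abbreviated subset of the header above, used only to keep the example short.

```python
import csv
from typing import List, Sequence, Tuple


def parse_header(header_line: str) -> List[str]:
    """Parse one CSV header line into a list of column names."""
    return next(csv.reader([header_line]))


def split_columns(header_line: str) -> Tuple[List[str], List[str]]:
    """Split columns into (metadata, payload).

    Assumption: names starting with "__" are logging metadata.
    """
    columns = parse_header(header_line)
    metadata = [c for c in columns if c.startswith("__")]
    payload = [c for c in columns if not c.startswith("__")]
    return metadata, payload


def missing_fields(header_line: str, required: Sequence[str] = ("usage",)) -> List[str]:
    """Return the required column names that the header does not contain."""
    columns = set(parse_header(header_line))
    return [f for f in required if f not in columns]


# Abbreviated subset of the real header, for illustration only.
SAMPLE_HEADER = "__source__,__tag__:__hostname__,__time__,__topic__,code,context,message,model,step,trace_id,user_id"

metadata, payload = split_columns(SAMPLE_HEADER)
print(payload)
print(missing_fields(SAMPLE_HEADER))
```

Run against the full header, `missing_fields` reports `usage`, matching the missing output token length noted earlier.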


- qwen-chat: 345339
- tongyi: 421551
- tob: 740251 + 740393