fix: 自动提取 LLM 返回内容中的 JSON，避免 pydantic 校验失败 by chinaguole · Pull Request #37 · protectai/vulnhuntr

chinaguole · 2025-06-18T07:57:06Z

变更内容

在 LLMs.py 的 _validate_response 方法中，增加了自动提取 JSON 的逻辑。
解决了 LLM 返回内容包含非 JSON 部分（如标签等）时，pydantic 校验失败的问题。

变更原因

某些 LLM 返回内容可能包含额外的注释、标签或 markdown，导致 pydantic 的 model_validate_json 解析失败。
通过正则提取第一个合法 JSON，有效提升了兼容性和健壮性。

测试说明

本地测试通过，LLM 返回内容包含非 JSON 部分时，依然可以正常解析和校验。

如有需要可进一步完善单元测试。

## 变更内容 - 在 LLMs.py 的 _validate_response 方法中，增加了自动提取 JSON 的逻辑。 - 解决了 LLM 返回内容包含非 JSON 部分（如 <think> 标签等）时，pydantic 校验失败的问题。 ## 变更原因 - 某些 LLM 返回内容可能包含额外的注释、标签或 markdown，导致 pydantic 的 model_validate_json 解析失败。 - 通过正则提取第一个合法 JSON，有效提升了兼容性和健壮性。 ## 测试说明 - 本地测试通过，LLM 返回内容包含非 JSON 部分时，依然可以正常解析和校验。 --- 如有需要可进一步完善单元测试。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: 自动提取 LLM 返回内容中的 JSON，避免 pydantic 校验失败#37

fix: 自动提取 LLM 返回内容中的 JSON，避免 pydantic 校验失败#37
chinaguole wants to merge 1 commit into
protectai:mainfrom
chinaguole:prog.le

chinaguole commented Jun 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

chinaguole commented Jun 18, 2025

变更内容

变更原因

测试说明

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant