Skip to content

[Feature]: Optimize Code Retrieve #274

@MaojiaSheng

Description

@MaojiaSheng

Problem Statement

我们希望以代码检索效果和效率为标杆,改善 OpenViking 索引和检索机制。

Proposed Solution

  1. 对代码仓库的数据处理策略进行优化:包括代码理解、代码结构解析、摘要生成等;
  2. 对代码仓库的数据向量化策略进行优化,对代码文件转向量,考虑先进行辅助处理,以支持不同大小的代码文件;
  3. 对代码问答和问题排查,建立初步的测评覆盖;
  4. 讨论和实现代码仓库的增量同步机制;

Alternatives Considered

No response

Feature Area

Core (Client/Engine)

Use Case

ov add-resource https://github.com/volcengine/OpenViking
ov find "OpenViking 的向量化链路是怎样的"
ov chat "向量化模型的调用并发度怎么修改"

Example API (Optional)

Additional Context

No response

Contribution

  • I am willing to contribute to implementing this feature

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    Status

    Backlog

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions