Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Kernel API OCR returns text coordinate information #11738

Merged
merged 4 commits into from
Jun 16, 2024

Conversation

2234839
Copy link
Member

@2234839 2234839 commented Jun 16, 2024

#11584
ocr 接口返回结果如下

{
    "code": 0,
    "msg": "",
    "data": {
        "ocrJSON": [
            {
                "block_num": "0",
                "conf": "-1",
                "height": "116",
                "left": "0",
                "level": "1",
                "line_num": "0",
                "page_num": "1",
                "par_num": "0",
                "text": "",
                "top": "0",
                "width": "352",
                "word_num": "0"
            }
        ],
        "text": "ocrJSON数组内的所有 text 字段的拼接"
    }
}

@2234839
Copy link
Member Author

2234839 commented Jun 16, 2024

@88250 88250 added this to the 3.1.0 milestone Jun 16, 2024
@88250 88250 changed the title Issue 11584 Kernel API OCR returns text coordinate information Jun 16, 2024
@88250 88250 merged commit c0bd645 into siyuan-note:dev Jun 16, 2024
2 checks passed
copialot pushed a commit to copialot/siyuan that referenced this pull request Aug 3, 2024
* 优化setImageOCRText接口调用

* 扩展 ocr 接口,添加 ocrJSON 返回字段

* 过滤不可见字符

* 返回的ocr文本添加空格
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants