feat: Added LaTeX error modification using deepseek-reasoner #15

Gavin-WangSC · 2025-04-10T15:35:39Z

No description provided.

q1zhen · 2025-04-10T15:37:45Z

N.B., INTENTIONALLY LEFT AS A DRAFT PULL REQUEST FOR THE IMMATURE LLM-BASED CORRECTION APPROACH.

at-wr · 2025-04-10T16:55:22Z

Why did you remove the guideline “Keep spacing between Chinese-English characters”?

Gavin-WangSC · 2025-04-10T17:53:55Z

> Why did you remove the guideline “Keep spacing between Chinese-English characters”?

Oh sorry, that was unintentional.

RadioNoiseE · 2025-04-11T00:59:45Z

> Why did you remove the guideline “Keep spacing between Chinese-English characters”?

This was actually removed by @q1zhen in 0e531c8. We decided to use a post-pass rather than rely on the LLM's unreliable "constrained" output.
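
A minimal sketch of what such a spacing post-pass could look like, not the code from 0e531c8. It assumes that "Chinese" can be approximated by the CJK Unified Ideographs block and "English" by ASCII letters and digits; the function name `space_cjk_latin` is hypothetical.

```python
import re

# Assumption: CJK Unified Ideographs stand in for "Chinese characters",
# and ASCII letters/digits stand in for "English characters".
_CJK = r"\u4e00-\u9fff"
_LATIN = r"A-Za-z0-9"

_CJK_THEN_LATIN = re.compile(rf"([{_CJK}])([{_LATIN}])")
_LATIN_THEN_CJK = re.compile(rf"([{_LATIN}])([{_CJK}])")


def space_cjk_latin(text: str) -> str:
    """Insert one space at each boundary between a CJK ideograph and an ASCII letter or digit."""
    text = _CJK_THEN_LATIN.sub(r"\1 \2", text)
    text = _LATIN_THEN_CJK.sub(r"\1 \2", text)
    return text
```

Because the patterns only match directly adjacent characters, running the pass on text that is already spaced leaves it unchanged, so it should be safe to apply regardless of what the model emitted.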

RadioNoiseE · 2025-04-11T01:02:19Z

> > Why did you remove the guideline “Keep spacing between Chinese-English characters”?
>
> Oh sorry, that was unintentional.

Hi, @Gavin-WangSC

I saw that your fork is 1 commit ahead of and 7 commits behind upstream. Please make sure it is in sync with upstream before creating a PR (i.e., resolve the conflicts).

RadioNoiseE · 2025-04-11T01:07:30Z

One piece of advice: you are calling pdflatex to detect possible errors, which means a TeX environment and various macro packages are required.

For error checking, something like chktex, as I mentioned before, will do; it only requires an ANSI C compliant compiler to build.
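
A rough sketch of how chktex could be called instead of pdflatex, assuming a `chktex` executable is on `PATH`. The names `LintWarning`, `parse_chktex_output`, and `run_chktex` are hypothetical; the `-q` and `-v0` options ask chktex for quiet, colon-separated output. Note that chktex is a linter, so it also reports stylistic issues, and some filtering by warning number would probably be needed.

```python
import re
import subprocess
from typing import NamedTuple


class LintWarning(NamedTuple):
    file: str
    line: int
    column: int
    code: int
    message: str


# With -v0, chktex prints one colon-separated record per warning:
#   file:line:column:warning-number:message
_RECORD = re.compile(r"^(.*?):(\d+):(\d+):(\d+):(.*)$")


def parse_chktex_output(output: str) -> list[LintWarning]:
    """Turn chktex -v0 output into structured records, skipping lines that do not match."""
    warnings = []
    for raw in output.splitlines():
        m = _RECORD.match(raw)
        if m:
            file, line, col, code, msg = m.groups()
            warnings.append(LintWarning(file, int(line), int(col), int(code), msg))
    return warnings


def run_chktex(path: str) -> list[LintWarning]:
    """Lint one .tex file. Assumes `chktex` is installed and on PATH."""
    result = subprocess.run(
        ["chktex", "-q", "-v0", path],
        capture_output=True,
        text=True,
        check=False,  # do not raise if chktex exits with a nonzero status
    )
    return parse_chktex_output(result.stdout)
```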

Anyway, thanks for contributing.

RadioNoiseE · 2025-04-11T01:11:17Z

> N.B., INTENTIONALLY LEFT AS A DRAFT PULL REQUEST FOR THE IMMATURE LLM-BASED CORRECTION APPROACH.

I actually think an LLM-based approach is unavoidable, since I don't think any tool exists that can repair a broken LaTeX equation (from panic).

However, if the problem is narrowed to balancing curly braces, I think we can supply our own tool, though that is still not easy.
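
As a starting point, detecting unbalanced braces is the easy half; deciding where the missing brace belongs is the hard part. A minimal detection sketch, assuming that a backslash escapes the next character and ignoring `%` comments and verbatim content (the name `unbalanced_braces` is hypothetical):

```python
def unbalanced_braces(tex: str) -> list[int]:
    """Return the indices of unmatched, unescaped curly braces in `tex`."""
    open_stack: list[int] = []  # positions of "{" still waiting for a "}"
    unmatched: list[int] = []   # positions of "}" with no opening partner
    i = 0
    while i < len(tex):
        ch = tex[i]
        if ch == "\\":
            i += 2  # skip the escaped character (covers \{, \} and \\)
            continue
        if ch == "{":
            open_stack.append(i)
        elif ch == "}":
            if open_stack:
                open_stack.pop()
            else:
                unmatched.append(i)
        i += 1
    # Whatever is left on the stack was never closed.
    return sorted(unmatched + open_stack)
```

An empty result means the braces balance. A non-empty result gives character offsets that could be reported or handed to a narrower repair step.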

q1zhen · 2025-04-11T03:16:13Z

> > N.B., INTENTIONALLY LEFT AS A DRAFT PULL REQUEST FOR THE IMMATURE LLM-BASED CORRECTION APPROACH.
>
> I actually think an LLM-based approach is unavoidable, since I don't think any tool exists that can repair a broken LaTeX equation (from panic).
>
> However, if the problem is narrowed to balancing curly braces, I think we can supply our own tool, though that is still not easy.

Agreed. However, he is using the LLM to fix the article as a whole rather than only the erroneous parts. That's why I marked this PR as a draft.
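
To illustrate the narrower alternative, here is a sketch that isolates only the math spans that fail some check, so that just those spans would need to go to the LLM. It assumes math is delimited by `$$...$$` or single-line `$...$` with no escaped dollar signs; `MathSpan`, `suspicious_spans`, and the `looks_ok` callback are hypothetical names, and the check itself is left to the caller.

```python
import re
from typing import Callable, NamedTuple

# Assumption: display math is $$...$$ (may span lines), inline math is $...$
# on a single line, and neither contains escaped dollar signs.
_MATH = re.compile(r"\$\$(.+?)\$\$|\$([^$\n]+?)\$", re.DOTALL)


class MathSpan(NamedTuple):
    start: int  # offset of the opening delimiter in the document
    end: int    # offset just past the closing delimiter
    body: str   # formula text without delimiters


def suspicious_spans(markdown: str, looks_ok: Callable[[str], bool]) -> list[MathSpan]:
    """Return only the math spans that the supplied check rejects."""
    spans = []
    for m in _MATH.finditer(markdown):
        body = m.group(1) if m.group(1) is not None else m.group(2)
        if not looks_ok(body):
            spans.append(MathSpan(m.start(), m.end(), body))
    return spans
```

A caller could pass a crude check such as `lambda s: s.count("{") == s.count("}")`, send each returned `body` for correction, and splice the results back using `start` and `end`, leaving the rest of the article untouched.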

q1zhen · 2025-04-11T03:17:57Z

I would suggest suspending this fix for now. Watch the frequency of LaTeX errors to see whether it is high enough to be worth the time to fix them in an automatic pipeline.

RadioNoiseE · 2025-04-11T03:35:43Z

> I would suggest suspending this fix for now. Watch the frequency of LaTeX errors to see whether it is high enough to be worth the time to fix them in an automatic pipeline.

Fair enough. Or maybe just change the model composing the article.

RadioNoiseE · 2025-04-11T03:39:16Z

DeepSeek seems to constantly ignore my prompt telling it the correct way to integrate equations into Markdown. It keeps outputting things like $<random figure>$<unit immediately following>, which gets parsed as \$<random figure>\$<unit immediately following>.
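
One possible workaround, sketched under assumptions: a post-pass that inserts a space between a purely numeric `$...$` formula and a unit glued to its closing dollar sign. Whether a space is enough to make the formula parse as math depends on the Markdown parser in use, which is not named here, so treat this as an experiment. The name `separate_units` is hypothetical.

```python
import re

# Assumption: the problematic pattern is a purely numeric inline formula
# whose closing dollar sign is immediately followed by a letter.
_NUMBER_THEN_UNIT = re.compile(r"\$(\d+(?:\.\d+)?)\$(?=[A-Za-z])")


def separate_units(markdown: str) -> str:
    """Insert a space between a numeric $...$ formula and a unit glued to it."""
    return _NUMBER_THEN_UNIT.sub(r"$\g<1>$ ", markdown)
```

The pattern is deliberately narrow (digits with an optional decimal part) so that ordinary prose containing currency amounts is left alone.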

RadioNoiseE · 2025-04-11T03:40:16Z

The underlying issue is that Markdown is a weak markup language that lacks a standard way of writing even math.

Gavin-WangSC · 2025-04-11T12:27:58Z

> > Why did you remove the guideline “Keep spacing between Chinese-English characters”?
>
> This was actually removed by @q1zhen in 0e531c8. We decided to use a post-pass rather than rely on the LLM's unreliable "constrained" output.
>
> > Why did you remove the guideline “Keep spacing between Chinese-English characters”?
>
> Oh sorry, that was unintentional.
>
> Hi, @Gavin-WangSC
>
> I saw that your fork is 1 commit ahead of and 7 commits behind upstream. Please make sure it is in sync with upstream before creating a PR (i.e., resolve the conflicts).
>
> > N.B., INTENTIONALLY LEFT AS A DRAFT PULL REQUEST FOR THE IMMATURE LLM-BASED CORRECTION APPROACH.
>
> I actually think an LLM-based approach is unavoidable, since I don't think any tool exists that can repair a broken LaTeX equation (from panic).
>
> However, if the problem is narrowed to balancing curly braces, I think we can supply our own tool, though that is still not easy.
>
> I would suggest suspending this fix for now. Watch the frequency of LaTeX errors to see whether it is high enough to be worth the time to fix them in an automatic pipeline.

Understood👌


Gavin-WangSC closed this Apr 11, 2025

Gavin-WangSC force-pushed the main branch from c922e11 to 3ff46a0 on April 11, 2025 12:32
