QA Parsing Becomes Unstable When Using Local vLLM with Markdown-Formatted Inputs

### Problem Description

When using the **local vLLM backend** to generate QA pairs (e.g. atomic / aggregated QA), inputs that contain **Markdown-formatted content** (such as `#`, ``` , `*`, `>`, `---`) may cause **QA parsing failures**.

---

### Reproduction Conditions

* Use **local vLLM backend**
* Input text is Markdown-formatted and contains one or more of:

  * Headings (`#`, `##`)
  * Separators (`---`, `***`)
  * Code blocks (```)
  * Comment-style or annotation-heavy text
* Generate and parse QA pairs (e.g. atomic / aggregated QA)

---

### Bad Case Example

Below is two **real input example** that reliably triggers the issue.
Although the overall request is valid JSON, the `content` field contains a large amount of Markdown separators and annotation-style text:

<img width="2535" height="525" alt="Image" src="https://github.com/user-attachments/assets/c400f02d-7a7b-490f-b980-e11311487494" />

<img width="2531" height="729" alt="Image" src="https://github.com/user-attachments/assets/b0445d9f-265f-492f-b8f3-8dcba1e40960" />

#### Observed Behavior

When processed by local vLLM, this input may result in:

* Markdown separators such as `---` being interpreted as semantic or structural boundaries
* Question and Answer delimiters being duplicated, shifted, or merged
* QA extraction logic failing to reliably identify the true Q/A boundaries
* Final QA outputs becoming malformed or unparseable

This behavior is especially prominent in prompts that contain **annotation-heavy Markdown content**.

---

### Root Cause Analysis

#### 1. Markdown Special Characters Are Not Handled During Input Construction

In `VLLMWrapper._build_inputs`, conversation history and prompts are constructed via **plain string concatenation**:

```python
@staticmethod
def _build_inputs(prompt: str, history: Optional[List[str]] = None) -> str:
    msgs = history or []
    lines = []
    for m in msgs:
        if isinstance(m, dict):
            role = m.get("role", "")
            content = m.get("content", "")
            lines.append(f"{role}: {content}")
        else:
            lines.append(str(m))
    lines.append(prompt)
    return "\n".join(lines)
```

This implementation:

* Does not escape or normalize Markdown structural symbols
* Directly injects Markdown syntax into the model context

As a result, the model may misinterpret Markdown markers as semantic or QA boundaries, leading to unstable QA generation and parsing failures.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

QA Parsing Becomes Unstable When Using Local vLLM with Markdown-Formatted Inputs #134

Problem Description

Reproduction Conditions

Bad Case Example

Observed Behavior

Root Cause Analysis

1. Markdown Special Characters Are Not Handled During Input Construction

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

QA Parsing Becomes Unstable When Using Local vLLM with Markdown-Formatted Inputs #134

Description

Problem Description

Reproduction Conditions

Bad Case Example

Observed Behavior

Root Cause Analysis

1. Markdown Special Characters Are Not Handled During Input Construction

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions