fix: Panic on CJK character truncation in MCP prompt descriptions #3227

mzkmnk · 2025-10-22T13:13:54Z

Summary

Fixes panic when truncating MCP prompt descriptions containing CJK (Chinese, Japanese, Korean) characters.

Issues

Closes #3117
Closes #3170
Related to #3136, #3086

Problem

The truncate_description function in prompts.rs was using unsafe byte-index slicing (&text[..n]), which panics when the index falls in the middle of a multibyte UTF-8 character. CJK characters typically use 3 bytes in UTF-8, causing crashes when truncating at byte boundaries.

Error example:

byte index 37 is not a char boundary; it is inside '한' (bytes 36..39)

Solution

Replaced unsafe byte-index slicing with UTF-8 safe char_indices() iteration
Finds the last valid character boundary before the target length
Handles edge cases (empty strings, very short max_length, emojis)
Maintains backward compatibility with ASCII text

Changes

Modified truncate_description() function in crates/chat-cli/src/cli/chat/cli/prompts.rs
Added comprehensive test cases for CJK characters
Updated existing tests to reflect UTF-8 safe behavior

Note

Cargo.lock was updated to allow local testing and verification of the changes.

Testing

Verified with test cases from reported issues:

✅ Korean text: "사용자가 작성한 글의 어색한 표현이나..." (Issue Amazon Q CLI crashes when there is long CJK MCP Prompts description #3117)
✅ Chinese text: "移除 eagleeye-ec-databases 任務狀況確認..." (Issue Panic when byte index is not a char boundary on non-ASCII strings #3170)
✅ Japanese text: "これは日本語のテキストです..."
✅ ASCII text (backward compatibility)
✅ Edge cases (empty strings, emojis)

All tests pass without panics, respecting character boundaries.

Impact

Scope: MCP prompt description display (/prompts list command)
Compatibility: Fully backward compatible
Risk: Low - only affects truncation logic for long descriptions

evanliu048 · 2025-10-22T17:47:48Z

crates/chat-cli/src/cli/chat/cli/prompts.rs

+        // If we found a valid boundary, use it; otherwise use the last character start
+        if truncate_at == 0 && !text.is_empty() {
+            // Edge case: even the first character is too long
+            truncate_at = text.char_indices().next().map(|(i, _)| i).unwrap_or(0);


qq: Should we delete this if check? char_indices().next() always returns the first character at index 0, so this line doesn't modify truncate_at

@evanliu048

You're absolutely right. Removed in c029a35. All tests still pass.

mzkmnk · 2025-10-23T01:35:21Z

@evanliu048

I've made the corrections. Please review 🙇

- Replace unsafe byte-index slicing with UTF-8 safe char_indices() - Fixes aws#3117, aws#3170 where truncate_description panicked on multibyte characters - Ensures truncation respects character boundaries for CJK languages - Maintains backward compatibility with ASCII text

- Consolidate ASCII and CJK test cases into single test function - Reduces diff size while maintaining comprehensive coverage - Ensures backward compatibility verification

Add comprehensive test coverage for edge cases: - Very small max_length values - CJK characters that don't fit in target length - Emoji (4-byte UTF-8 characters) - Mixed ASCII and CJK text - Single CJK character within limit All tests verify UTF-8 safe truncation behavior.

Remove the unnecessary if-check that was doing nothing. char_indices().next() always returns index 0 for the first character, so this code was just reassigning truncate_at = 0 without any effect. All tests pass without this code, confirming it was redundant.

Updated Cargo.lock to enable local testing and verification of the fix.

Replace custom truncate logic in truncate_description with the existing truncate_safe_in_place utility function to ensure consistency across the codebase and leverage tested UTF-8 safe truncation logic.

mzkmnk · 2025-10-25T02:54:36Z

crates/chat-cli/src/cli/chat/cli/prompts.rs

-    }
+    let mut result = text.to_string();
+
+    truncate_safe_in_place(&mut result, max_length, "...");


I found that truncate_safe_in_place already exists in the codebase, so I used that instead

mzkmnk marked this pull request as draft October 22, 2025 13:15

mzkmnk force-pushed the fix/cjk-truncate-panic-3117-3170 branch from 19411fa to 44bd8a3 Compare October 22, 2025 13:18

mzkmnk marked this pull request as ready for review October 22, 2025 13:46

evanliu048 reviewed Oct 22, 2025

View reviewed changes

mzkmnk force-pushed the fix/cjk-truncate-panic-3117-3170 branch from d9403af to 03d4d0d Compare October 22, 2025 22:05

mzkmnk changed the title ~~Fix: Panic on CJK character truncation in MCP prompt descriptions~~ fix: Panic on CJK character truncation in MCP prompt descriptions Oct 22, 2025

mzkmnk force-pushed the fix/cjk-truncate-panic-3117-3170 branch from 03d4d0d to c5d5b5e Compare October 24, 2025 13:30

mzkmnk added 7 commits October 25, 2025 10:57

Regenerate Cargo.lock to fix TOML parse error

de72ad7

Restore original test_truncate_description test

6e1a86c

- Consolidate ASCII and CJK test cases into single test function - Reduces diff size while maintaining comprehensive coverage - Ensures backward compatibility verification

refactor: code format

2295051

chore: regenerate Cargo.lock

a070120

Updated Cargo.lock to enable local testing and verification of the fix.

mzkmnk force-pushed the fix/cjk-truncate-panic-3117-3170 branch from 9300ee6 to a070120 Compare October 25, 2025 01:58

refactor: use truncate_safe_in_place for UTF-8 safe truncation

188bb5b

Replace custom truncate logic in truncate_description with the existing truncate_safe_in_place utility function to ensure consistency across the codebase and leverage tested UTF-8 safe truncation logic.

mzkmnk commented Oct 25, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

fix: Panic on CJK character truncation in MCP prompt descriptions #3227

fix: Panic on CJK character truncation in MCP prompt descriptions #3227

mzkmnk commented Oct 22, 2025 •

edited

Loading

Uh oh!

evanliu048 Oct 22, 2025

Uh oh!

mzkmnk Oct 22, 2025 •

edited

Loading

Uh oh!

mzkmnk commented Oct 23, 2025 •

edited

Loading

Uh oh!

mzkmnk Oct 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

fix: Panic on CJK character truncation in MCP prompt descriptions #3227

Are you sure you want to change the base?

fix: Panic on CJK character truncation in MCP prompt descriptions #3227

Conversation

mzkmnk commented Oct 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Issues

Problem

Solution

Changes

Note

Testing

Impact

Uh oh!

evanliu048 Oct 22, 2025

Choose a reason for hiding this comment

Uh oh!

mzkmnk Oct 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mzkmnk commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mzkmnk Oct 25, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mzkmnk commented Oct 22, 2025 •

edited

Loading

mzkmnk Oct 22, 2025 •

edited

Loading

mzkmnk commented Oct 23, 2025 •

edited

Loading