
Update prompt for generating/rewriting follow up questions #809

Open · wants to merge 2 commits into base: main
Conversation

@colelandolt commented Mar 13, 2025

Description

As described in #767, when the user asks a follow-up question, Vanna AI sometimes combines it with the previous question even when the new question is fully independent and can be answered on its own. This PR improves the prompt for follow-up questions in the following ways:

  • Refers to the questions as "previous/new question" instead of "first/second question"
  • Emphasizes the decision-making process for determining whether a rewrite/combination is necessary, to mitigate the current behavior of combining questions too eagerly
  • Adds an instruction to focus on the intent of the new question
  • Adds an instruction to pay attention to specific parameters mentioned in both questions
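The points above describe the revised prompt's structure without quoting it. A minimal sketch of what such a system prompt might look like (the wording here is hypothetical, not the PR's actual text):

```python
# Hypothetical sketch of a system prompt following the PR's description;
# the PR's actual wording may differ.
def build_rewrite_system_prompt() -> str:
    return (
        "The user has asked a new question after a previous question. "
        "First decide whether the new question depends on the previous one at all. "
        "If the new question is fully self-contained, return it unchanged. "
        "Only combine the two questions when the new question's intent cannot be "
        "understood without context from the previous question. "
        "Pay close attention to specific parameters (dates, names, filters) "
        "mentioned in either question and preserve them exactly. "
        "Return only the resulting question, with no explanation."
    )
```

Note the closing instruction to return only the question itself; the review below flags the risk of the LLM appending explanations to its output.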

Risks

My linter ran on save; feel free to re-lint to your own standards.


@llamapreview llamapreview bot left a comment


Auto Pull Request Review from LlamaPReview

1. Overview

1.1 Core Changes

  • Primary purpose and scope: Improve question handling logic to prevent over-eager combination of independent follow-up questions
  • Key components modified: generate_rewritten_question method prompt structure
  • Cross-component impacts: Affects core question processing pipeline and LLM interaction patterns
  • Business value alignment: Enhances conversational SQL reliability per issue #767 (Allowing Follow-up Questions Without Unnecessary Merging)

1.2 Technical Architecture

  • System design modifications: Enhanced prompt engineering for context analysis
  • Component interaction changes: Modified LLM instruction set while maintaining output channels
  • Integration points impact: Localized changes to question rewriting component
  • Dependency changes: No new dependencies introduced

2. Critical Findings

2.1 Must Fix (P0🔴)

Issue: Terminology mismatch in user message construction

  • Impact: Causes LLM confusion between "previous/new" vs "first/second" terminology, undermining core PR purpose
  • Resolution: Update user message labels to match system prompt terminology

Issue: Missing response validation

  • Impact: Risk of returning unprocessed LLM output containing explanations
  • Resolution: Add response sanitization logic

2.2 Should Fix (P1🟡)

Issue: Lack of test coverage

  • Impact: Unable to verify behavior for key use cases
  • Suggested Solution: Add unit tests for independent/related question scenarios

Issue: No error handling for empty responses

  • Impact: Potential system failures on malformed LLM outputs
  • Suggested Solution: Implement fallback to original question
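The fallback suggested here could be sketched as follows; `submit_prompt` is a stand-in for Vanna's actual method, and the helper name is illustrative:

```python
# Illustrative sketch: fall back to the user's original question whenever
# the LLM returns an empty or malformed response, or raises an error.
def rewrite_with_fallback(submit_prompt, prompt, new_question: str) -> str:
    try:
        response = submit_prompt(prompt)
    except Exception:
        return new_question  # on any LLM failure, keep the user's question as-is
    if not response or not response.strip():
        return new_question  # empty response: nothing usable came back
    return response.strip().splitlines()[0]  # keep only the first line

# usage sketch (names assumed):
# rewritten = rewrite_with_fallback(vn.submit_prompt, prompt, new_question)
```

Falling back to `new_question` keeps the pipeline functional even when the LLM misbehaves, at the cost of occasionally skipping a rewrite that would have helped.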

2.3 Consider (P2🟢)

Area: Prompt maintenance

  • Improvement Opportunity: Externalize prompts to template files for better maintainability

Area: Parameter handling

  • Improvement Opportunity: Explicit instruction for parameter preservation in combined questions

2.4 Summary of Action Items

  1. Immediate (P0): Fix terminology mismatch and add response validation
  2. High Priority (P1): Implement test suite and error handling
  3. Future (P2): Externalize prompts and enhance monitoring

3. Technical Analysis

3.1 Code Logic Analysis

📁 src/vanna/base/base.py - generate_rewritten_question

  • Submitted PR Code:
    self.user_message(
        "First question: " + last_question + "\nSecond question: " + new_question
    ),
  • Analysis:
    • Current logic uses inconsistent terminology vs system prompt
    • Creates cognitive dissonance for LLM between "first/second" and "previous/new"
    • Directly impacts core functionality of question independence detection
  • LlamaPReview Suggested Improvements:
    self.user_message(
        "Previous question: " + last_question + "\nNew question: " + new_question
    ),
  • Improvement rationale:
    • Maintains terminology consistency with system prompt
    • Critical for proper LLM instruction following
    • High-risk impact on feature effectiveness

📁 src/vanna/base/base.py - generate_rewritten_question

  • Submitted PR Code:
    return self.submit_prompt(prompt=prompt, **kwargs)
  • Analysis:
    • Returns raw LLM response without validation
    • Risk of including explanations or multiple responses
    • No handling of empty/malformed responses
  • LlamaPReview Suggested Improvements:
    response = self.submit_prompt(prompt=prompt, **kwargs)
    return response.strip().split("\n")[0]  # Take first line only
  • Improvement rationale:
    • Ensures clean single-question output
    • Mitigates risk of unexpected response formats
    • Maintains API contract stability

3.2 Key Quality Aspects

  • Testing strategy: Critical need for test cases validating:
    • Independent question preservation
    • Contextual question merging
    • Parameter handling scenarios
  • Documentation needs: Prompt engineering decisions should be documented for future maintainers
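The test cases called for above could be sketched with a stubbed LLM so no network calls are needed (pytest-style; the class and method names mirror the PR's description but are assumptions about the surrounding code, not Vanna's verified API):

```python
# Sketch of unit tests for the rewriting behavior, using a fake LLM.
class FakeVanna:
    def __init__(self, canned_response: str):
        self.canned_response = canned_response

    def submit_prompt(self, prompt, **kwargs):
        return self.canned_response  # stand-in for a real LLM call

    def generate_rewritten_question(self, last_question, new_question, **kwargs):
        prompt = f"Previous question: {last_question}\nNew question: {new_question}"
        response = self.submit_prompt(prompt, **kwargs)
        # first-line sanitization plus empty-response fallback, as suggested above
        return response.strip().splitlines()[0] if response.strip() else new_question


def test_independent_question_preserved():
    vn = FakeVanna("What were total sales in 2024?")
    out = vn.generate_rewritten_question(
        "How many customers do we have?", "What were total sales in 2024?"
    )
    assert out == "What were total sales in 2024?"


def test_contextual_question_merged():
    vn = FakeVanna("How many of those customers are in Europe?")
    out = vn.generate_rewritten_question(
        "How many customers do we have?", "How many are in Europe?"
    )
    assert "Europe" in out
```

Because the LLM is faked, these tests pin down the surrounding plumbing (terminology, sanitization, fallback) rather than the model's judgment, which would need separate evaluation.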

4. Overall Evaluation

  • Technical assessment: Valuable improvements needing critical terminology fixes
  • Business impact: High-value fix for user-reported issue when P0 resolved
  • Risk evaluation: Medium risk without response validation
  • Notable positive aspects:
    • Clear prompt structure improvements
    • Enhanced context analysis guidelines
  • Implementation quality: Generally well-scoped with needed documentation
  • Final recommendation: Request Changes (due to P0 terminology mismatch)

