🧪 Add integration test for rule_based_extract pipeline by wjohns989 · Pull Request #113 · wjohns989/Muninn

wjohns989 · 2026-05-13T23:05:00Z

🎯 What: Added an integration-style test case to cover the previously untested rule_based_extract orchestrator function in muninn/extraction/rules.py.
📊 Coverage: The new test test_rule_based_extract_complete_pipeline asserts the proper integration of extract_entities_rule_based, extract_relations_rule_based, and extract_temporal_context by validating that all entities, generated relations, and temporal indicators are present and correctly structured in the resulting ExtractionResult. Note: it intentionally avoids verifying the summary property to adhere to code review bounds, given that the issue snippet does not explicitly define or declare the logic for the summary attribute.
✨ Result: Improved test coverage for the core pipeline entrypoint, ensuring the components work together correctly.

PR created automatically by Jules for task 9123088751118843617 started by @wjohns989

Adds `test_rule_based_extract_complete_pipeline` in `tests/test_extraction_rules.py` to test the full pipeline behavior of `rule_based_extract`. This includes verifying correct extraction of entities, relations, and temporal context in a single integration test case, ensuring high-level validation of the rule-based subsystem without depending on LLMs. Co-authored-by: wjohns989 <56205870+wjohns989@users.noreply.github.com>

google-labs-jules · 2026-05-13T23:05:01Z

👋 Jules, reporting for duty! I'm here to lend a hand with this pull request.

When you start a review, I'll add a 👀 emoji to each comment to let you know I've read it. I'll focus on feedback directed at me and will do my best to stay out of conversations between you and other bots or reviewers to keep the noise down.

I'll push a commit with your requested changes shortly after. Please note there might be a delay between these steps, but rest assured I'm on the job!

For more direct control, you can switch me to Reactive Mode. When this mode is on, I will only act on comments where you specifically mention me with @jules. You can find this option in the Pull Request section of your global Jules UI settings. You can always switch back!

New to Jules? Learn more at jules.google/docs.

For security, I will only act on instructions from the user who triggered this task.

gemini-code-assist

Code Review

This pull request introduces a comprehensive integration test, test_rule_based_extract_complete_pipeline, to verify the rule-based extraction pipeline's handling of entities, relations, and temporal context. The reviewer suggests improving the test's robustness by asserting specific entity types and counts, and replacing next() calls with list-based assertions to ensure clearer error messages and better account for existing extraction logic limitations.

gemini-code-assist · 2026-05-13T23:06:52Z

+        entity_names = [e.name for e in result.entities]
+        assert "Python" in entity_names
+        assert "Docker" in entity_names


The current entity assertions only verify the presence of names but do not check the entity_type or ensure that no unexpected entities were extracted. For a robust integration test, it is better to verify the exact count and the attributes of the extracted entities.

Suggested change

entity_names = [e.name for e in result.entities]

assert "Python" in entity_names

assert "Docker" in entity_names

assert len(result.entities) == 2

entities_by_name = {e.name: e for e in result.entities}

assert entities_by_name["Python"].entity_type == "tech"

assert entities_by_name["Docker"].entity_type == "tech"

References

Every contribution must meet SOTA+ standards of precision, quality, and logic. All code must be production-ready and robust. ^(link)

gemini-code-assist · 2026-05-13T23:06:52Z

+        assert len(result.relations) == 2
+
+        pref_relation = next(r for r in result.relations if r.predicate == "prefers")
+        assert pref_relation.subject == "user"
+        assert pref_relation.object == "Python"
+
+        dep_relation = next(r for r in result.relations if r.predicate == "depends_on")
+        assert dep_relation.subject == "Python"
+        assert dep_relation.object == "Docker"


Using next() on a generator in a test can lead to a StopIteration error if no match is found, which provides a less helpful error message than a standard pytest assertion failure. Furthermore, the assertion len(result.relations) == 2 confirms that the "built" relation is missing; this is because CREATION_PATTERNS are defined in muninn/extraction/rules.py but are not currently utilized in the extract_relations_rule_based function. Consider using more robust assertions that verify both the count and the specific attributes of each relation.

Suggested change

assert len(result.relations) == 2

pref_relation = next(r for r in result.relations if r.predicate == "prefers")

assert pref_relation.subject == "user"

assert pref_relation.object == "Python"

dep_relation = next(r for r in result.relations if r.predicate == "depends_on")

assert dep_relation.subject == "Python"

assert dep_relation.object == "Docker"

# Note: Currently only 2 relations are extracted because CREATION_PATTERNS are ignored in rules.py

assert len(result.relations) == 2

prefers = [r for r in result.relations if r.predicate == "prefers"]

assert len(prefers) == 1

assert prefers[0].subject == "user"

assert prefers[0].object == "Python"

depends = [r for r in result.relations if r.predicate == "depends_on"]

assert len(depends) == 1

assert depends[0].subject == "Python"

assert depends[0].object == "Docker"

References

Every contribution must meet SOTA+ standards of precision, quality, and logic. All code must be production-ready and robust. ^(link)

gemini-code-assist Bot reviewed May 13, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🧪 Add integration test for rule_based_extract pipeline#113

🧪 Add integration test for rule_based_extract pipeline#113
wjohns989 wants to merge 1 commit into
mainfrom
jules-test-rule-based-extract-9123088751118843617

wjohns989 commented May 13, 2026

Uh oh!

google-labs-jules Bot commented May 13, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 13, 2026

Uh oh!

gemini-code-assist Bot May 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

-        entity_names = [e.name for e in result.entities]
-        assert "Python" in entity_names
-        assert "Docker" in entity_names
+        assert len(result.entities) == 2
+        entities_by_name = {e.name: e for e in result.entities}
+        assert entities_by_name["Python"].entity_type == "tech"
+        assert entities_by_name["Docker"].entity_type == "tech"

-        assert len(result.relations) == 2
-        pref_relation = next(r for r in result.relations if r.predicate == "prefers")
-        assert pref_relation.subject == "user"
-        assert pref_relation.object == "Python"
-        dep_relation = next(r for r in result.relations if r.predicate == "depends_on")
-        assert dep_relation.subject == "Python"
-        assert dep_relation.object == "Docker"
+        # Note: Currently only 2 relations are extracted because CREATION_PATTERNS are ignored in rules.py
+        assert len(result.relations) == 2
+        prefers = [r for r in result.relations if r.predicate == "prefers"]
+        assert len(prefers) == 1
+        assert prefers[0].subject == "user"
+        assert prefers[0].object == "Python"
+        depends = [r for r in result.relations if r.predicate == "depends_on"]
+        assert len(depends) == 1
+        assert depends[0].subject == "Python"
+        assert depends[0].object == "Docker"

Conversation

wjohns989 commented May 13, 2026

Uh oh!

google-labs-jules Bot commented May 13, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 13, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 13, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant