Fix: Validate File Paths and Parsing Logic #3

Prabhat-Thapa45 · 2025-03-25T08:24:21Z

Resolves the review comments on the PR "Fix: Handle Trailing Commas and Empty Strings in File Paths" (python/mypy#18728).
Fixes the parsing logic.

x612skm · 2025-03-27T18:27:03Z

Hey @Prabhat-Thapa45, thanks for the collaboration! I appreciate it! Did you find time to check on the bug mentioned?

It's failing on the line you added: section["files"].strip(). The error AttributeError: 'list' object has no attribute 'strip' suggests that section["files"] contains a list, not a string.

Prabhat-Thapa45 · 2025-03-29T12:36:33Z

Hi, @x612skm. Yes, I did fix that. It had to do with the file names being given as a list rather than a str in pyproject.toml, so I had to make sure the trailing-comma removal logic is only applied when the value is a str. Just a normal fix. I have also reformatted the test cases, taking notes from the feedback on your changes. I just need to push it now.
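For illustration, a minimal sketch of the kind of guard described above. The helper name normalize_files_option is hypothetical and this is not the code from the PR; it only assumes that an INI-style value arrives as one comma-separated string while a pyproject.toml value arrives as a list.

```python
from __future__ import annotations


def normalize_files_option(raw_files: str | list[str]) -> list[str]:
    """Return the configured paths as a list without empty entries.

    Hypothetical helper for illustration; not the code from this PR.
    """
    if isinstance(raw_files, str):
        # INI-style value: one comma-separated string that may end in a comma.
        candidates = raw_files.split(",")
    else:
        # TOML-style value: already a list, so calling .strip() on it would fail.
        candidates = list(raw_files)
    # Strip each entry and drop the empty ones left by trailing commas.
    return [path.strip() for path in candidates if path.strip()]
```

Calling the helper with either "file1.py, file2.py," or ["file1.py", "file2.py"] goes through the same clean-up step, and only the string form is split.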

webknjaz · 2025-03-29T16:00:36Z

@x612skm looks like you don't have the CI enabled here. Go to https://github.com/x612skm/mypy-dev/actions and correct that. After doing that, you can close the PR and then re-open it right away, which should trigger the CI run for the first time.

webknjaz · 2025-03-29T16:06:21Z

mypy/config_parser.py

@@ -287,6 +284,22 @@ def _find_config_file(

    return None

+def parse_and_validate_filenames(


When a function has an "and" in its name, this usually indicates that it's doing too many things. Validation is usually implied, though, so I'd drop its mention from the name. Additionally, it's possible to name it more accurately:

Suggested change

def parse_and_validate_filenames(

def convert_raw_files_string_to_list(

It may be reasonable to make it private, too:

Suggested change

def parse_and_validate_filenames(

def _convert_raw_files_string_to_list(

webknjaz · 2025-03-29T16:09:09Z

mypy/config_parser.py

+def parse_and_validate_filenames(
+    raw_files: str
+) -> list[str]:
+    # Split and strip filenames


No need to use code comments to retell what's already written on the following line. You're just repeating the same thing twice, and that has no value. It's distracting at best. Instead, only use code comments to document motivation/justification for things that code is doing. As a very rare exception, code comments can be used to explain very obscure unobvious logic. But then again, it's best to avoid that and find a way to communicate that through structuring code better, naming the variables, defining abstraction layers clearly.
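To make the distinction concrete, here is a small sketch. The function split_files and its body are assumptions made up for this example, not the PR's code. The comment inside records why empty fragments are dropped, which the code alone does not reveal, instead of restating what the line does.

```python
from __future__ import annotations


def split_files(raw_files: str) -> list[str]:
    """Split a comma-separated value into individual paths."""
    # A comment like "Split and strip filenames" would only restate the
    # return statement. The motivation is what deserves a comment:
    # INI values may end with a comma or span several lines, which leaves
    # empty fragments behind after splitting, so those are filtered out.
    return [part.strip() for part in raw_files.split(",") if part.strip()]
```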

webknjaz · 2025-03-29T16:09:26Z

mypy/config_parser.py

@@ -287,6 +284,22 @@ def _find_config_file(

    return None

+def parse_and_validate_filenames(
+    raw_files: str
+) -> list[str]:


Plz add a PEP 257-compliant docstring to this function.
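As a rough sketch of what such a docstring could look like: a one-line summary in the imperative mood ending with a period, a blank line, then the details. The function name is taken from the suggestion above, but the body is an assumption for illustration only.

```python
from __future__ import annotations


def convert_raw_files_string_to_list(raw_files: str) -> list[str]:
    """Split a comma-separated ``files`` value into a list of paths.

    Surrounding whitespace is stripped from every entry, and empty
    entries (for example, those left by a trailing comma) are dropped.
    """
    # Illustrative body only; the real implementation may differ.
    return [part.strip() for part in raw_files.split(",") if part.strip()]
```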

webknjaz · 2025-03-29T16:14:32Z

mypy/config_parser.py

@@ -109,9 +109,6 @@ def split_and_match_files_list(paths: Sequence[str]) -> list[str]:
    expanded_paths = []

    for path in paths:
-        if not path:


Why is this needed?

webknjaz · 2025-03-29T16:17:59Z

mypy/config_parser.py

-                )
-
-            options.files = files_split
+        if "files" in section and isinstance(raw_files := section["files"], str):


Is this only a string in the case of INI configs? Does TOML not hit this code path? Would the string check be unnecessary when it's known it's going through the INI code path?

It seems to me that a better place for injecting this conversion would be inside _parse_individual_file() where it has the differentiation between is_toml() and not. And you'd only apply it in the else-branch there.

This would likely let you drop the str check for good.
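A hedged sketch of that idea follows. It assumes, as the questions above suggest but do not confirm, that only the INI path yields a raw string and that the TOML path already yields a list. The function extract_files and its from_toml flag are hypothetical stand-ins; this is not the actual body of _parse_individual_file().

```python
from __future__ import annotations

from typing import Any, Mapping


def extract_files(section: Mapping[str, Any], *, from_toml: bool) -> list[str]:
    """Return the ``files`` entries of one already-parsed config section."""
    if from_toml:
        # Assumption: the TOML loader already produced a list of strings.
        return list(section.get("files", []))
    # INI branch: the value is known to be one raw string here, so no
    # isinstance() check is needed before splitting it.
    raw_files = section.get("files", "")
    return [part.strip() for part in raw_files.split(",") if part.strip()]
```

With the format decided once at the branching point, the conversion never sees a list, which is what would make the str check unnecessary.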

webknjaz · 2025-03-29T16:19:05Z

mypy/test/testconfigparser.py

+        ("[mypy]\nfiles = file1.py,\nfile2.py,\nfile3.py", ["file1.py", "file2.py", "file3.py"], None),
+    ]
+)
+def test_parse_config_file(tmp_path, config_content, expected_files, expected_exception):


@x612skm do you remember if function-based tests are being picked up? I remember we had some issues with these, but I don't recall the detail. Was it just the filename that was problematic, or having the class was necessary as well?

webknjaz · 2025-03-29T16:20:50Z

mypy/test/testconfigparser.py

+    ]
+)
+def test_parse_config_file(tmp_path, config_content, expected_files, expected_exception):
+    """Parameterized test for parse_config_file handling various configurations."""


Function docstrings should start with a verb so that they reference the action being performed. Only constants / objects need to be described as terms because they don't do anything.

Additionally, it should probably be more accurate. Generic statements aren't very useful because they don't include any specifics and so it's normally pointless to have them like that.

webknjaz · 2025-03-29T16:22:16Z

mypy/test/testconfigparser.py


-if __name__ == "__main__":
-    main()
+    if expected_exception:


It's a bad idea to have logic in tests. Introducing branching makes tests more fragile and unreliable. The more complex structures there are, the more uncertainty there is. You'd have to write tests for tests to make sure they function at all.

Negative and positive scenarios are two distinct tests. They shouldn't co-exist in the same test function.

webknjaz · 2025-03-29T16:22:43Z

mypy/test/testconfigparser.py

-if __name__ == "__main__":
-    main()
+    if expected_exception:
+        with pytest.raises(expected_exception) as exc_info:


pytest.raises() must always have a match= regexp set.
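A sketch of how the last two remarks could be combined: the positive and the negative scenario live in separate test functions with no branching, and the negative one pins the expected message with match=. The parse_files function and its error message are hypothetical stand-ins, not mypy's parser.

```python
from __future__ import annotations

import pytest


def parse_files(raw_files: str) -> list[str]:
    """Split a comma-separated value; hypothetical stand-in for the parser."""
    files = [part.strip() for part in raw_files.split(",") if part.strip()]
    if not files:
        raise ValueError("files option contains no paths")
    return files


def test_parse_files_drops_trailing_comma() -> None:
    """Check that a trailing comma does not produce an empty entry."""
    assert parse_files("file1.py, file2.py,") == ["file1.py", "file2.py"]


def test_parse_files_rejects_empty_value() -> None:
    """Check that a value without any path is reported as an error."""
    # match= is applied with re.search() to the string form of the exception.
    with pytest.raises(ValueError, match=r"^files option contains no paths$"):
        parse_files(" , ")
```

Without match=, the negative test would also pass on an unrelated ValueError raised somewhere else in the call chain.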

webknjaz · 2025-03-29T16:24:36Z

mypy/test/testconfigparser.py

-                    """
-                )
+@pytest.mark.parametrize(
+    "config_content, expected_files, expected_exception",


It's usually nicer to have an iterable of strings for the params list. Sparse structures read better.

Suggested change

"config_content, expected_files, expected_exception",

("config_content", "expected_files", "expected_exception"),
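For context, a small sketch of a parametrized test written with a tuple of argument names. The test name, the parameter names, and the inline splitting logic are invented for this example; only pytest.mark.parametrize and pytest.param are real pytest APIs.

```python
from __future__ import annotations

import pytest


@pytest.mark.parametrize(
    ("raw_files", "expected_files"),
    (
        pytest.param("file1.py", ["file1.py"], id="single-file"),
        pytest.param("file1.py,file2.py,", ["file1.py", "file2.py"], id="trailing-comma"),
    ),
)
def test_split_files(raw_files: str, expected_files: list[str]) -> None:
    """Check that comma-separated values are split into clean paths."""
    assert [part.strip() for part in raw_files.split(",") if part.strip()] == expected_files
```

Each name is its own string, so adding, removing, or reordering a parameter touches one element rather than a comma-separated blob.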

webknjaz · 2025-03-29T16:25:02Z

mypy/test/testconfigparser.py

It wouldn't hurt to have a module docstring here.

webknjaz · 2025-03-29T16:26:59Z

@Prabhat-Thapa45 when writing commit messages, make sure to include descriptions, not just titles. This helps others understand why you did what you did. Here are some more materials on the topic: https://gist.github.com/webknjaz/cb7d7bf62c3dda4b1342d639d0e78d79.

webknjaz · 2025-03-29T16:28:21Z

> Yes, I did fix that.

Is there a test that can prove this?

Prabhat-Thapa45 · 2025-03-31T15:06:58Z

python#18621: this issue seems to be resolved. I think we should close our PRs, @x612skm, and issue python#11171 can be closed as well, @webknjaz.

webknjaz · 2025-03-31T22:46:17Z

Looks like it might be addressing the ini cfg trailing comma bit, but not empty strings post parsing. Has anyone checked the main branch against the repros?

Fix: Validate file paths and parsing logic

eb9b2ee

Prabhat-Thapa45 marked this pull request as ready for review March 25, 2025 08:27

test: collapse test data with parametrize

540acaa

x612skm mentioned this pull request Mar 29, 2025

Fix: Handle Trailing Commas and Empty Strings in File Paths python/mypy#18728

Open

arvi18 mentioned this pull request Apr 23, 2025

Fix: Handle Trailing Commas and Empty Strings in File Paths coderabbit-test/mypy#3

Open
