GH1173 Experiment with fuller typing #1193

loicdiridollou · 2025-04-20T18:47:39Z

Closes type kwargs in DataFrame.query according to DataFrame.eval #1173
[] Tests added: Please use assert_type() to assert the type of any return value

@MarcoGorelli I took a try at your idea of type-hinting the whole set of arguments in pd.DataFrame.query, but I am stumbling on an issue when the user passes a dictionary (which is still an allowed behavior).
Wondering if there is something I am missing here, please let me know!

Dr-Irv · 2025-04-21T01:41:11Z

You'd still need to have the **kwargs as an overload. If you want to allow the other arguments, you can have that (although I don't think we should do that because it's not documented), but you'd still need the **kwargs overload
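
A minimal sketch of that layout, assuming a hypothetical `Frame` class rather than the real pandas-stubs `DataFrame`: one overload spells out two of the keywords that `pandas.eval` documents (`parser`, `engine`), and a second catch-all overload keeps `**kwargs`. As far as I understand, type checkers compare the value type of an unpacked `dict[str, str]` against every keyword parameter it might fill, so without the catch-all a call such as `frame.query(expr, **options)` would be rejected. The toy implementation ignores the extra keywords; it exists only to make the example runnable.

```python
from typing import Any, Literal, Optional, overload


class Frame:
    """Hypothetical stand-in for a DataFrame; not the pandas-stubs code."""

    def __init__(self, rows: list[dict[str, int]]) -> None:
        self.rows = rows

    # Overload 1: explicitly typed keywords (values as documented for pandas.eval).
    @overload
    def query(
        self,
        expr: str,
        *,
        parser: Literal["pandas", "python"] = ...,
        engine: Optional[Literal["python", "numexpr"]] = ...,
    ) -> "Frame": ...

    # Overload 2: catch-all, so that unpacking a plain dict still type-checks.
    @overload
    def query(self, expr: str, **kwargs: Any) -> "Frame": ...

    def query(self, expr: str, **kwargs: Any) -> "Frame":
        # Toy implementation: only understands "<column> > <column>"
        # and ignores every extra keyword.
        left, _, right = expr.split()
        return Frame([row for row in self.rows if row[left] > row[right]])


frame = Frame([{"col1": 3, "col2": 1}, {"col1": 0, "col2": 5}])

# Individual keywords are meant to resolve against the first overload.
explicit = frame.query("col1 > col2", parser="pandas", engine="numexpr")

# A dictionary of options relies on the catch-all overload.
options = {"parser": "pandas", "engine": "numexpr"}
unpacked = frame.query("col1 > col2", **options)
```

Both calls behave identically at runtime; the overloads only change what a type checker accepts.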

loicdiridollou · 2025-05-05T02:09:43Z

I was able to make some progress here, let me know if this is what you envisioned and I will clean it up.

MarcoGorelli · 2025-05-05T11:13:13Z

Hey - yeah, this is what I was thinking - I don't really understand why **kwargs would be needed at all, given that the arguments accepted by eval are limited (https://pandas.pydata.org/docs/reference/api/pandas.eval.html#pandas.eval), although based on

If you want to allow the other arguments, you can have that (although I don't think we should do that because it's not documented)

it looks like Irv disagrees and that what I've suggested may be against the pandas-stubs philosophy - which is fair enough

Dr-Irv · 2025-05-05T13:55:56Z

So I looked more carefully at the pandas docs, which do say that the accepted **kwargs are those of eval(). The docs are not great on this point and could be improved. Maybe one of you could create an issue there?

So I'll look more carefully at this PR now with that in mind.
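
For illustration, a sketch of that forwarding relationship, using hypothetical `toy_query` and `toy_eval` functions rather than pandas itself. Because every extra keyword ends up in `toy_eval`, the set of valid keywords is exactly the callee's parameter list, and anything else fails at call time. The parameter names `parser`, `engine` and `level` come from the `pandas.eval` documentation; the function bodies are made up.

```python
from typing import Any, Optional


def toy_eval(
    expr: str,
    parser: str = "pandas",
    engine: Optional[str] = None,
    level: int = 0,
) -> str:
    """Hypothetical stand-in for eval(); only these keywords are accepted."""
    return f"{expr!r} via parser={parser}, engine={engine}, level={level}"


def toy_query(expr: str, **kwargs: Any) -> str:
    """Hypothetical query() that forwards every extra keyword to toy_eval()."""
    return toy_eval(expr, **kwargs)


# A documented keyword is forwarded unchanged.
accepted = toy_query("a > b", parser="python")

# An unknown keyword is rejected by the callee, not by toy_query itself.
try:
    toy_query("a > b", not_an_option=1)
    rejected = False
except TypeError:
    rejected = True
```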

Dr-Irv · 2025-05-05T14:02:33Z

pandas-stubs/core/frame.pyi

+    @overload
+    def query(
+        self,
+        expr: _str,
+        *,
+        inplace: Literal[True],
+        **kwargs: Any,
    ) -> None: ...


Not clear why you need this overload when the previous overload covers it?

loicdiridollou · May 6, 2025

This test would raise a mypy/pyright error without the other overload.

pandas-stubs/tests/test_frame.py, lines 532 to 536 in d19ac89:

    kwargs = {"parser": "pandas", "engine": "numexpr"}
    check(
        assert_type(df.query("col1 > col2", inplace=False, **kwargs), pd.DataFrame),
        pd.DataFrame,
    )

Dr-Irv · 2025-05-05T14:02:59Z

pandas-stubs/core/frame.pyi

+    def query(
+        self,
+        expr: _str,
+        *,
+        inplace: Literal[False] = ...,
+        **kwargs: Any,
    ) -> Self: ...


same comment - not sure why we need this overload when the previous one covers it.

loicdiridollou · May 6, 2025

The goal was to still allow someone to pass kwargs as a dictionary instead of as individual arguments, since that is the form shown in the docs: df.query("col1 > col2", **kwargs), where kwargs may be passed in from another function.

The idea is that someone may write a wrapper like:

    def my_own_query(df, expr, **kwargs):
        return df.query(expr, **kwargs)

If you drop the second overload, this will raise a type-checker error.
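
To make the wrapper case concrete, here is a self-contained sketch with a hypothetical `Table` class (not the actual stubs) that mirrors the two `inplace` overloads, each keeping `**kwargs: Any`. The class, its toy filtering logic, and the sample data are assumptions for illustration only.

```python
from typing import Any, Literal, Optional, overload


class Table:
    """Hypothetical stand-in for a DataFrame, used only for illustration."""

    def __init__(self, rows: list[dict[str, int]]) -> None:
        self.rows = rows

    # inplace=True mutates the object and returns nothing.
    @overload
    def query(self, expr: str, *, inplace: Literal[True], **kwargs: Any) -> None: ...

    # inplace=False (the default) returns a new object.
    @overload
    def query(
        self, expr: str, *, inplace: Literal[False] = ..., **kwargs: Any
    ) -> "Table": ...

    def query(
        self, expr: str, *, inplace: bool = False, **kwargs: Any
    ) -> Optional["Table"]:
        # Toy implementation: only understands "<column> > <column>"
        # and ignores every extra keyword.
        left, _, right = expr.split()
        kept = [row for row in self.rows if row[left] > row[right]]
        if inplace:
            self.rows = kept
            return None
        return Table(kept)


def my_own_query(table: Table, expr: str, **kwargs: Any) -> Optional[Table]:
    # The wrapper does not know which keywords it receives; it just forwards them.
    return table.query(expr, **kwargs)


table = Table([{"col1": 3, "col2": 1}, {"col1": 0, "col2": 5}])
filtered = my_own_query(table, "col1 > col2", parser="pandas", engine="numexpr")
in_place_result = my_own_query(table, "col1 > col2", inplace=True)
```

Because the wrapper forwards an arbitrary keyword dictionary, it can only rely on a signature that still accepts `**kwargs`.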

Dr-Irv · 2025-05-05T14:32:59Z

tests/test_frame.py

+    check(
+        assert_type(
+            df.query("col1 > col2", parser="pandas", engine="numexpr"), pd.DataFrame
+        ),
+        pd.DataFrame,
+    )
+    check(
+        assert_type(
+            df.query("col1 > col2", parser="pandas", engine="numexpr"), pd.DataFrame
+        ),
+        pd.DataFrame,
+    )


duplicate tests

loicdiridollou · 2025-05-06T00:25:52Z

I will raise an issue on the pandas side to ask whether it would be simpler to have all the arguments directly in the function signature. It is a good point, since it should not be too hard to maintain.

GH1173 Experiment with fuller typing

a970e3f

Dr-Irv reviewed Apr 21, 2025

loicdiridollou added 3 commits May 4, 2025 21:49

GH1173 PR feedback

36bc58e

Merge branch 'main' of github.com:loicdiridollou/pandas-stubs into gh…

271cf8d

…1173_query

GH1173 PR feedback

34ea623

Dr-Irv requested changes May 5, 2025

GH1173 PR Feedback

d19ac89
