chore(typing): add `np.ndarray` aliases #1977

dangotbanned · 2025-02-09T13:12:41Z

Closes #1972

What type of PR is this? (check all applicable)

Related issues

Closes #1972 (add np.ndarray alias(es))

Checklist

Code follows style guide (ruff)
Tests added
Documented the changes

If you have comments or can explain your changes, please do so below

Resolves warnings that look like this:

narwhals/_pandas_like/dataframe.py:858: error: Missing type parameters for generic type "ndarray"  [type-arg]

On main, this accounts for 25 warnings.
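
As a rough illustration of the warning above: mypy reports `[type-arg]` for a bare generic when `disallow_any_generics` is on (it is part of `--strict`), and a parameterised alias avoids it. The alias name `_AnyArray` and both functions are hypothetical, not taken from the PR.

```python
from __future__ import annotations

from typing import TYPE_CHECKING, Any

if TYPE_CHECKING:
    import numpy as np
    from typing_extensions import TypeAlias

    # Hypothetical alias: np.ndarray takes (shape, dtype) type parameters.
    _AnyArray: TypeAlias = "np.ndarray[Any, Any]"


def first_row_bare(arr: np.ndarray) -> Any:
    # mypy (with disallow_any_generics) flags the annotation above:
    # Missing type parameters for generic type "ndarray"  [type-arg]
    return arr[0]


def first_row(arr: _AnyArray) -> Any:
    # The alias supplies both type parameters, so no [type-arg] error.
    return arr[0]
```

Because the annotations are only evaluated by the type checker, the module runs even when NumPy is not installed.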

dangotbanned · 2025-02-09T13:16:18Z

narwhals/_arrow/dataframe.py

    import pandas as pd
    import polars as pl
+    from pyarrow._stubs_typing import Indices  # type: ignore  # noqa: PGH003


The import refers to https://github.com/zen-xu/pyarrow-stubs/blob/d97063876720e6a5edda7eb15f4efe07c31b8296/pyarrow-stubs/_stubs_typing.pyi#L27

That type is causing a lot of the headaches with pyarrow:

Indices: TypeAlias = list[int] | NDArray[np.integer] | IntegerArray

Where we are using `Sequence[int]`, it causes a conflict, but AFAICT `list[int]` is overly narrow on their end
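
A small sketch of the conflict, assuming a hypothetical `take` function annotated with only the `list[int]` member of the alias (the NumPy and pyarrow members are left out to keep it self-contained). A caller holding a `Sequence[int]` has to materialise a list to satisfy the type checker, even though any sequence of ints would work at runtime.

```python
from __future__ import annotations

from typing import Sequence


def take(indices: list[int]) -> list[str]:
    # Hypothetical stand-in for a stubbed function that accepts `Indices`.
    data = ["a", "b", "c", "d"]
    return [data[i] for i in indices]


def select_rows(rows: Sequence[int]) -> list[str]:
    # take(rows) would be rejected by a type checker:
    # Sequence[int] is wider than list[int] (a tuple is a Sequence, not a list).
    return take(list(rows))  # copying into a list satisfies the annotation
```

Calling `select_rows((0, 2))` type-checks because the tuple is converted before reaching `take`.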

dangotbanned · 2025-02-09T13:19:09Z

narwhals/_arrow/dataframe.py

@@ -207,28 +217,30 @@ def __getitem__(
            isinstance(item, tuple)
            and len(item) == 2
            and is_sequence_but_not_str(item[1])
+            and not isinstance(item[0], str)


I struggled to follow the logic here, but this seems like it should've been there?

@MarcoGorelli I really think all these CompliantDataFrame.__getitem__ methods would benefit from some reusable code (per #1942)
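
For reference, the condition from the diff can be sketched in isolation. The body of `is_sequence_but_not_str` here is an assumed stand-in for the helper named in the diff, and `is_rows_and_column_sequence` is a hypothetical wrapper used only for this illustration.

```python
from __future__ import annotations

from collections.abc import Sequence
from typing import Any


def is_sequence_but_not_str(obj: Any) -> bool:
    # Assumed behaviour of the helper referenced in the diff.
    return isinstance(obj, Sequence) and not isinstance(obj, str)


def is_rows_and_column_sequence(item: Any) -> bool:
    # Mirrors the condition in the diff, including the added check that
    # the first element (the row selector) is not a string.
    return (
        isinstance(item, tuple)
        and len(item) == 2
        and is_sequence_but_not_str(item[1])
        and not isinstance(item[0], str)
    )
```

With the extra check, a key such as `("a", ["b"])` no longer matches this branch, while `([0, 1], ["a", "b"])` still does.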

dangotbanned · 2025-02-09T13:21:38Z

narwhals/dependencies.py

+def is_numpy_array_1d(arr: Any) -> TypeIs[_1DArray]:
+    """Check whether `arr` is a 1D NumPy Array without importing NumPy."""
+    return is_numpy_array(arr) and arr.ndim == 1
+
+
+def is_numpy_array_2d(arr: Any) -> TypeIs[_2DArray]:
+    """Check whether `arr` is a 2D NumPy Array without importing NumPy."""
+    return is_numpy_array(arr) and arr.ndim == 2


Suffixing the current guard name seemed reasonable, but it is a bit odd having:
array_1d -> _1DArray
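
A minimal sketch of how guards like these narrow a value for a type checker. The alias definitions, the `sys.modules` lookup inside `is_numpy_array`, and the `describe` caller are assumptions made for illustration; the actual definitions are not shown in this thread.

```python
from __future__ import annotations

import sys
from typing import TYPE_CHECKING, Any

if TYPE_CHECKING:
    import numpy as np
    from typing_extensions import TypeAlias, TypeIs

    # Assumed alias shapes: the first parameter of np.ndarray is the shape.
    _1DArray: TypeAlias = "np.ndarray[tuple[int], Any]"
    _2DArray: TypeAlias = "np.ndarray[tuple[int, int], Any]"


def is_numpy_array(arr: Any) -> TypeIs[np.ndarray[Any, Any]]:
    # Look NumPy up in sys.modules rather than importing it: if the module
    # was never imported, `arr` cannot be an ndarray.
    np_mod = sys.modules.get("numpy")
    return np_mod is not None and isinstance(arr, np_mod.ndarray)


def is_numpy_array_1d(arr: Any) -> TypeIs[_1DArray]:
    return is_numpy_array(arr) and arr.ndim == 1


def is_numpy_array_2d(arr: Any) -> TypeIs[_2DArray]:
    return is_numpy_array(arr) and arr.ndim == 2


def describe(obj: Any) -> str:
    # Hypothetical caller: each branch sees `obj` narrowed to the alias.
    if is_numpy_array_1d(obj):
        return f"1D array of length {obj.shape[0]}"
    if is_numpy_array_2d(obj):
        rows, cols = obj.shape
        return f"2D array with {rows} rows and {cols} columns"
    return "not a 1D or 2D NumPy array"
```

One call replaces the `is_numpy_array(x) and x.ndim == ...` pattern, and the `TypeIs` return type carries the dimensionality into the narrowed type.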

MarcoGorelli

thanks @dangotbanned ! this 1DArray / 2DArray is pretty nice, I think it would be useful in pandas-stubs too, i think i'll suggest it there

MarcoGorelli · 2025-02-09T17:58:49Z

narwhals/_arrow/dataframe.py

@@ -207,28 +217,30 @@ def __getitem__(
            isinstance(item, tuple)
            and len(item) == 2
            and is_sequence_but_not_str(item[1])
+            and not isinstance(item[0], str)


this should already have been validated at the narwhals/dataframe.py level - but sure, doesn't hurt to be more explicit about what we're dealing with when we get here 👍

EdAbati · 2025-02-09T18:00:53Z

narwhals/_pandas_like/utils.py

@@ -819,7 +820,7 @@ def calculate_timestamp_date(s: pd.Series, time_unit: str) -> pd.Series:

 def select_columns_by_name(
    df: T,
-    column_names: Sequence[str],


❤️ yes I needed this too

dangotbanned · 2025-02-09T18:38:20Z

> thanks @dangotbanned ! this 1DArray / 2DArray is pretty nice, I think it would be useful in pandas-stubs too, i think i'll suggest it there

Thanks @MarcoGorelli!

I can't take all the credit there, I'm only simplifying parts of numpy's stubs.

AFAICT, we only care about .ndim == 1 and .ndim == 2, so we can forgo a lot of the complexities pandas-stubs may have to reckon with

dangotbanned added 8 commits February 9, 2025 10:45

chore(typing): add np.ndarray aliases

85e6e18

Will close #1972

feat(typing): add dim guard variants of is_numpy_array

5778201

Noticed a few cases that use the original guard combined with an `ndim == ...` check. Now we get that in one call, with type narrowing

refactor(typing): utilize in top-level modules

5d91483

refactor(typing): backport to v1

8b1d0d2

refactor(typing): utilize in _polars

c820a80

refactor(typing): utilize in _pandas_like

63906f5

Tried to resolve a variance issue in `PandasLikeDataFrame.to_numpy`, but abandoned for now

refactor(typing): utilize in _arrow

271d0b3

- Sub-package has a huge amount of unrelated warnings
- Tried to fix *only* those that are related to `np.ndarray`

Merge remote-tracking branch 'upstream/main' into typing-ndarray

20dc41b

dangotbanned added the typing label Feb 9, 2025

dangotbanned added 6 commits February 9, 2025 13:31

fix(typing): resolve pre-commit warnings _arrow

db511c5

https://results.pre-commit.ci/run/github/760058710/1739106763.KUYdGvWdRC6nL6ED4bqzFQ

fix(typing): resolve pre-commit warnings _pandas_like

71f8fd9

https://results.pre-commit.ci/run/github/760058710/1739106763.KUYdGvWdRC6nL6ED4bqzFQ

ci(typing): 3.8 compat?

e39ac0b

https://github.com/narwhals-dev/narwhals/actions/runs/13226030807/job/36916993522

chore(typing): ignore unused ignore

eaa4213

`pre-commit` inconsistent with local `mypy` https://results.pre-commit.ci/run/github/760058710/1739108468.gewbSXmfRu2T2Ml-tFh6BQ

revert(typing): undo overload reordering

80e0605

That'll teach me for simply following what `mypy` asked me to do 🫠 https://results.pre-commit.ci/run/github/760058710/1739106763.KUYdGvWdRC6nL6ED4bqzFQ https://results.pre-commit.ci/run/github/760058710/1739108788.r92OEb-RTUejKTFBisjIfg

refactor: get coverage for is_numpy_array_2d

0023883

https://github.com/narwhals-dev/narwhals/actions/runs/13226357012/job/36917718175?pr=1977

dangotbanned marked this pull request as ready for review February 9, 2025 14:38

Merge branch 'main' into typing-ndarray

f1c0aa1

dangotbanned mentioned this pull request Feb 9, 2025

fix: use mypy pre-commit in local environment #1966

Merged

10 tasks

MarcoGorelli approved these changes Feb 9, 2025

MarcoGorelli added the internal label Feb 9, 2025

MarcoGorelli merged commit 7b2db00 into main Feb 9, 2025
28 checks passed

MarcoGorelli deleted the typing-ndarray branch February 9, 2025 18:00

EdAbati reviewed Feb 9, 2025
