-
-
Notifications
You must be signed in to change notification settings - Fork 3k
[mypyc] feat: cache len for iterating over immutable types and expressions with length known at compile time #19503
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
Looks like my tests failed only on 32-bit python, because the generated IR uses 32-bit size numbers? Not sure what to do here, I don't think this failure is indicative of an issue in the PR. |
@ilevkivskyi this one is also ready for review if and when you get a chance, though its definitely a bit more involved than #19497 |
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
07f40fc
to
622f38f
Compare
@@ -1147,3 +1187,33 @@ def gen_step(self) -> None: | |||
def gen_cleanup(self) -> None: | |||
for gen in self.gens: | |||
gen.gen_cleanup() | |||
|
|||
|
|||
def get_expr_length(expr: Expression) -> int | None: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These 2 helper functions can be extended to cover more cases and used for other length-based optimizations I have in mind
for more information, see https://pre-commit.ci
…/mypy into for-loop-len-cache
Currently, if a user has a tuple and uses it as the sequence input for a for loop, the tuple length is checked once at each iteration which, while necessary for some container types such as list and dictionaries, is not necessary for iterating over immutable types tuple, str, and bytes.
This PR modifies the codebase such that the length is only checked at the first iteration, and reused from there.
Also, in cases where a simple genexp is the input argument for a tuple, the length is currently checked one additional time before entering the iteration. In those cases, we don't even need a length check at the first iteration step, and can read the result of that first
len
call instead.Lastly, in cases where a tuple or list is created and the length is knowable at compile time, this PR replaces length checks with constants.