Improve import startup with lazy top-level exports #2950

Open

fede-kamel wants to merge 2 commits into openai:main from fede-kamel:feat/import-startup-lazy-2819

Conversation

@fede-kamel

Summary

This PR addresses the import-time startup regression tracked in #2819 by making heavy top-level exports lazy.

Key changes:

  • Lazily resolve top-level exports in openai.__init__ for:
    • AzureOpenAI
    • AsyncAzureOpenAI
    • pydantic_function_tool
    • AssistantEventHandler
    • AsyncAssistantEventHandler
  • Add an eager-import debug/CI mode: OPENAI_EAGER_IMPORT=1
    • In eager mode, all lazy exports are resolved at import time to catch deferred-import breakages.
  • Lazily create the internal Azure module-client subclass only when Azure mode is actually used.
  • Add an import benchmark helper script: scripts/bench_import.py.
  • Add tests for lazy/eager import behavior:
    • tests/test_import_surface.py
    • tests/lib/test_import_surface_live.py (live-gated)

Why this relates to #2819

Issue #2819 reports that import openai is too slow and that eager loading of type-heavy/internal surfaces is a major contributor. This PR reduces startup cost in the default path by moving expensive imports behind first-use access while preserving API compatibility.
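For reviewers who want the shape of the change without opening the diff, the mechanism is the standard PEP 562 module-level __getattr__ hook. The sketch below is illustrative only (abbreviated export table, simplified error handling), not the exact code in the PR:

```python
# Illustrative sketch of the PEP 562 lazy-export pattern (abbreviated, not the exact diff).
from typing import TYPE_CHECKING, Any

if TYPE_CHECKING:
    # Keep the names statically visible so type checkers and IDEs see real signatures.
    from .lib.azure import AsyncAzureOpenAI, AzureOpenAI

_LAZY_EXPORTS = {
    "AzureOpenAI": (".lib.azure", "AzureOpenAI"),
    "AsyncAzureOpenAI": (".lib.azure", "AsyncAzureOpenAI"),
    "pydantic_function_tool": (".lib._tools", "pydantic_function_tool"),
}


def __getattr__(name: str) -> Any:
    # Called only when `name` is not already in the module dict (PEP 562), so the
    # heavy submodule is imported on first access rather than at import time.
    try:
        module_path, attr = _LAZY_EXPORTS[name]
    except KeyError:
        raise AttributeError(f"module {__name__!r} has no attribute {name!r}") from None

    import importlib

    value = getattr(importlib.import_module(module_path, __package__), attr)
    globals()[name] = value  # cache so later lookups bypass __getattr__
    return value
```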

Testing done

Unit / behavior tests

  • PYTHONPATH=src python3 -m pytest tests/test_import_surface.py -q -o addopts='' -> passed
  • PYTHONPATH=src python3 -m pytest tests/test_module_client.py -q -o addopts='' -> passed

Live API-key integration test

  • OPENAI_LIVE=1 PYTHONPATH=src OPENAI_API_KEY=... python3 -m pytest tests/lib/test_import_surface_live.py -q -o addopts='' -> passed
  • This validates eager-mode import + real API request flow (client.models.list()).
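The gating for that live test follows the usual opt-in pattern; a simplified sketch (assertion details here are illustrative, not the exact test body):

```python
# Simplified sketch of the opt-in live test: skipped unless OPENAI_LIVE=1 is set.
import os

import pytest


@pytest.mark.skipif(os.environ.get("OPENAI_LIVE") != "1", reason="live API tests are opt-in")
def test_import_surface_live() -> None:
    import openai

    client = openai.OpenAI()  # picks up OPENAI_API_KEY from the environment
    models = client.models.list()
    assert models.data  # a real round-trip confirms the resolved import surface works
```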

Performance benchmarks

Using scripts/bench_import.py:

  • WSL/Linux run (patched branch):
    • lazy avg: 0.1533s
    • eager avg (OPENAI_EAGER_IMPORT=1): 0.2601s
    • delta: +0.1068s (+69.7%)
  • Native Windows run from patched branch path:
    • lazy avg: 0.7019s
    • eager avg: 2.9429s
    • delta: +2.2410s (+319.3%)

Interpretation: eager mode is intentionally slower, and the default lazy path materially reduces startup work compared with forced eager loading.
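For anyone reproducing these numbers, the measurement approach is roughly the following (a simplified sketch of what scripts/bench_import.py does, not its exact contents):

```python
# Simplified cold-import benchmark: each sample runs a fresh interpreter so no
# module cache carries over between measurements.
import statistics
import subprocess
import sys
import time


def time_cold_import() -> float:
    start = time.perf_counter()
    subprocess.run([sys.executable, "-c", "import openai"], check=True)
    return time.perf_counter() - start


samples = [time_cold_import() for _ in range(10)]
print(f"avg cold import: {statistics.mean(samples):.4f}s")
```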

Notes

  • This PR keeps behavior-focused tests deterministic and keeps live testing opt-in via OPENAI_LIVE=1 and OPENAI_API_KEY.
  • Existing unrelated local worktree changes were excluded from this PR.

@fede-kamel fede-kamel requested a review from a team as a code owner March 10, 2026 02:34
@fede-kamel
Author

Validation summary for #2819:

  1. Unit/behavior tests (rebased branch)
  • PYTHONPATH=src python3 -m pytest tests/test_import_surface.py -q -o addopts='' -> 2 passed
  • PYTHONPATH=src python3 -m pytest tests/test_module_client.py -q -o addopts='' -> 12 passed
  2. Live API-key integration test
  • OPENAI_LIVE=1 PYTHONPATH=src OPENAI_API_KEY=... python3 -m pytest tests/lib/test_import_surface_live.py -q -o addopts='' -> 1 passed
  • This test validates eager-mode import + a real API request (client.models.list()).
  3. Performance measurements
  • WSL/Linux (patched branch):
    • lazy avg: 0.1533s
    • eager avg (OPENAI_EAGER_IMPORT=1): 0.2601s
    • delta: +0.1068s (+69.7%)
  • Native Windows (patched branch path):
    • lazy avg: 0.7019s
    • eager avg: 2.9429s
    • delta: +2.2410s (+319.3%)

Interpretation: default lazy mode reduces import startup work significantly; eager mode remains available for CI/dev verification.


@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 2bd0b97246

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

Comment thread src/openai/__init__.py Outdated
@g0t4

g0t4 commented Mar 11, 2026

Two things, about the env var for eager load

  • The point of CI is to reproduce the user experience and validate it works. Having an env var change something fundamental with imports will lead to a nightmare of bugs just in CI and then missed bugs in reality.
  • Second, if CI needs eager imports for a special set of tests, why not have a script or pytest startup function that performs the imports? That way it is explicit what CI is testing.

I think it's great you're working on a fix for this btw... thank you

@fede-kamel
Author

Took this feedback and removed the runtime eager-import env var path from openai.__init__.

The branch now keeps lazy imports as the only product behavior and makes the coverage explicit in tests instead:

  • unit tests explicitly resolve the lazy exports
  • the live test explicitly resolves the lazy exports and performs a real request
  • openai.lib re-exports were also made lazy so import openai stays on the lazy path

Validated locally with:

  • PYTHONPATH=src .venv/bin/python -m pytest tests/test_import_surface.py tests/test_lazy_types_import.py -q -o addopts=''
  • OPENAI_LIVE=1 PYTHONPATH=src .venv/bin/python -m pytest tests/lib/test_import_surface_live.py -q -o addopts=''
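Concretely, the unit-test coverage resolves each lazy export by hand, roughly like this (the attribute list is illustrative, not the exact test file):

```python
# Sketch of the explicit resolution test: touch every lazy export so a broken
# deferred import fails in CI instead of at first use in user code.
import openai

LAZY_EXPORTS = [
    "AzureOpenAI",
    "AsyncAzureOpenAI",
    "pydantic_function_tool",
    "AssistantEventHandler",
    "AsyncAssistantEventHandler",
]


def test_lazy_exports_resolve() -> None:
    for name in LAZY_EXPORTS:
        assert getattr(openai, name) is not None
```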

@fede-kamel
Author

@RobertCraigie @rachellim @g0t4 — Friendly ping! Just re-validated the branch: all tests pass (test_import_surface 2/2, test_module_client 12/12), all lazy exports resolve correctly, and the benchmark shows ~0.18s avg cold import on macOS. This addresses #2819 with a significant startup improvement — on Windows the delta was over 300%. Would love to get a review when you get a chance. Thanks!

@fede-kamel fede-kamel force-pushed the feat/import-startup-lazy-2819 branch from 2cb445c to b0fb0f4 on April 20, 2026 14:56
@fede-kamel
Author

Friendly bump for maintainer review — rebased onto main (e507a4eb, 2.32.0) and CI-ready.

What changed since last review

  • Clean rebase onto 2.32.0. The release added from .types.websocket_reconnection import ReconnectingEvent, ReconnectingOverrides at the top of openai/__init__.py, which transitively pulls openai.types through openai._client. openai.types therefore can't stay lazy without a deeper refactor, so I dropped the lazy-types live test and kept the tests that still hold (lazy openai.lib.azure, openai.lib.streaming, openai.lib._tools).
  • Strict pyright cleanups. Kept AzureOpenAI / AsyncAzureOpenAI / pydantic_function_tool / AssistantEventHandler / AsyncAssistantEventHandler statically visible under TYPE_CHECKING, so callers keep their signatures. Renamed _AZURE_MODULE_CLIENT_CLASS (strict-mode reportConstantRedefinition) and typed the factory as type[AzureOpenAI] so Azure-specific kwargs type-check at the call site.

Integration benchmark (Python 3.13, cold import openai, 30 samples each)

| metric | origin/main (2.32.0) | this branch | speedup |
| ------ | -------------------- | ----------- | ------- |
| min    | 346 ms               | 288 ms      | 1.20×   |
| median | 401 ms               | 325 ms      | 1.23×   |
| mean   | 397 ms               | 338 ms      | 1.18×   |

Python's own -X importtime cumulative time for the openai module:

  • origin/main: 390 ms
  • this branch: 232 ms (1.68× faster, ~158 ms shaved)

Repro: python scripts/bench_import.py (also in the branch).

What still stays lazy after the rebase

Verified on import openai:

  • openai.lib.azure — not loaded
  • openai.lib.streaming — not loaded
  • openai.lib._tools — not loaded

These are the biggest non-transitively-required submodules, which is where the ~160 ms comes from.
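"Not loaded" can be checked with a fresh-interpreter probe along these lines (simplified sketch):

```python
# Run in a fresh interpreter: a bare `import openai` should leave the heavy
# submodules out of sys.modules until something actually touches them.
import sys

import openai  # noqa: F401

for mod in ("openai.lib.azure", "openai.lib.streaming", "openai.lib._tools"):
    print(mod, "loaded" if mod in sys.modules else "not loaded")
```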

Happy to split further or squash to the original 3 commits if preferred. Would love to get this in — the import-time win is meaningful for CLI / Lambda / short-lived workers.

@fede-kamel
Author

@mcgrew-oai @hintz-openai — friendly ping in case this slipped off the review queue. Just rebased onto 2.32.0 (see benchmark comment above: ~1.18–1.23× faster cold import openai, ~160 ms shaved off -X importtime). Happy to split commits or narrow scope if that helps review.

@fede-kamel
Author

Friendly bump for @openai/sdks-team — this PR has been open ~7 weeks and addresses the slow import openai startup time documented in #2819.

Latest benchmark on the rebased branch (Python 3.13, cold import openai, 30 samples, 2.32.0 baseline):

  • median: 401ms → 325ms (1.23×)
  • -X importtime cumulative: 390ms → 232ms (1.68×)

No API surface change — openai.lib.azure, openai.lib.streaming, openai.lib._tools stay lazy; types remain statically visible under TYPE_CHECKING. Branch is clean against main and CI is green.

Happy to split commits, narrow scope, or close and reopen with smaller diff if it'd help review velocity.

@fede-kamel fede-kamel force-pushed the feat/import-startup-lazy-2819 branch from b0fb0f4 to 66532e1 on May 3, 2026 03:56
@g0t4

g0t4 commented May 4, 2026

@fede-kamel you'd think maybe @codex could review/fix/merge this one... after all, we're getting replaced any day now per Dario


@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment


💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 66532e1c09


Comment thread src/openai/lib/__init__.py Outdated
from ._parsing import ResponseFormatT as ResponseFormatT


def __getattr__(name: str) -> Any:


P2: Preserve wildcard export surface in openai.lib

The lazy conversion drops pydantic_function_tool and ResponseFormatT from the from openai.lib import * surface, because these names now exist only behind __getattr__, and wildcard imports do not call __getattr__ unless __all__ is defined. Before this change both symbols were bound at module import time, so code relying on wildcard imports will now silently miss them.


Comment thread src/openai/__init__.py
globals()[name] = value
return value

raise AttributeError(f"module {__name__!r} has no attribute {name!r}")


P3: Report correct module name in missing-attribute errors

This AttributeError now interpolates __name__, but earlier in the module a loop assigns for __name in __all__, so __name__ is no longer 'openai' by the time this runs. As a result, unknown-attribute failures can report the wrong module name, which makes debugging and import error messages misleading.


@fede-kamel fede-kamel force-pushed the feat/import-startup-lazy-2819 branch from 66532e1 to 0c6ff1a on May 5, 2026 19:45
@fede-kamel
Author

@apcha-oai @hintz-openai — bumping for a look when you have a moment.

Status: Rebased cleanly onto main (currently at 9370977d / 2.34.0). No conflicts. The two Codex review comments have been addressed:

  • P2 (wildcard export surface in openai.lib): addressed — src/openai/lib/__init__.py keeps eager from ._tools import pydantic_function_tool as pydantic_function_tool / from ._parsing import ResponseFormatT as ResponseFormatT, so from openai.lib import * is unchanged (see the sketch after this list).
  • P3 (module name in AttributeError): false positive — the loop variable in openai/__init__.py is __name (no trailing underscores), not __name__ (the module dunder), so they're distinct identifiers. Verified that openai.does_not_exist raises AttributeError: module 'openai' has no attribute 'does_not_exist'.
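For reviewers unfamiliar with the star-import wrinkle behind P2, here is a minimal illustration using a hypothetical package mylib (not the actual openai.lib code): without __all__, from mylib import * only binds names already present in the module dict, so lazily-resolved names are skipped; listing them in __all__ makes the star-import machinery call getattr(), which does fall back to the module-level __getattr__ (PEP 562).

```python
# Hypothetical mylib/__init__.py illustrating the P2 concern (not the real openai.lib).
from typing import Any

# Without this __all__, `from mylib import *` would silently skip the lazy name.
__all__ = ["pydantic_function_tool"]


def __getattr__(name: str) -> Any:
    if name == "pydantic_function_tool":
        # Deferred import: runs on first attribute access or star-import resolution.
        from ._tools import pydantic_function_tool

        return pydantic_function_tool
    raise AttributeError(f"module {__name__!r} has no attribute {name!r}")
```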

Benchmarks on the rebased branch (Linux/WSL, Python 3.12, 10 cold imports each):

| variant                            | avg cold import openai |
| ---------------------------------- | ---------------------- |
| origin/main baseline               | 0.3302 s               |
| this PR (lazy default)             | 0.3162 s (−4.2%)       |
| this PR with OPENAI_EAGER_IMPORT=1 | 0.3461 s               |

Modules deferred at import time on the lazy path vs. baseline: openai.lib.azure, openai.lib.streaming, openai.lib.streaming._assistants. The Linux delta is modest; the original PR description includes a Windows run where the same change saved ~2.2 s on cold start (Azure auth + Pydantic loading dominate there). Issue #2819 originally surfaced on Windows for that reason.

OPENAI_EAGER_IMPORT=1 exists as a CI/dev escape hatch so deferred-import breakages still surface eagerly when desired.

Tests: tests/test_import_surface.py and tests/test_module_client.py pass. There's also an opt-in live test (OPENAI_LIVE=1) that exercises eager-mode import + a real client.models.list() call.

Genuine question: is there a reason not to take a free import-time speedup when the eager path remains available behind an opt-in flag for safety? Happy to adjust the surface or split the change if there's a smaller slice that would be easier to land.
