Skip to content

Conversation

@yakov-g
Copy link
Collaborator

@yakov-g yakov-g commented Nov 21, 2025

This repository is public. Do not put here any private DataRobot or customer's data: code, datasets, model artifacts, .etc.

Summary

Bump drum in envs to 1.17.8

Rationale


Note

Upgrades dependencies and env versions across drop-in environments (including DRUM 1.17.8) and updates tests to set OMP_NUM_THREADS for XGBoost when using multiple workers.

  • Environments:
    • Dependency upgrades across drop-ins: bump datarobot-drum to 1.17.8, datarobot to 3.9.1, OpenTelemetry stack to 1.38.0/0.59b0, pillow to 12.0.0, AWS/Google SDKs, and other libs; update env_info.json environmentVersionId and tags.
    • vLLM GPU: refresh deps (e.g., aiohttp, openai, jiter) while keeping label v0.10.0 and updating env version metadata.
  • Framework-specific pins:
    • PyTorch: update to torch 2.9.1, triton 3.5.1, and newer CUDA/NCCL/NVSHMEM libs.
    • ONNX: bump onnxruntime to 1.23.2.
    • XGBoost: bump xgboost to 3.1.2; add Dockerfile comments re OMP_NUM_THREADS for multi-worker stability.
  • Tests:
    • In prediction server tests, keep Keras multi-processing skip; for XGBoost with max_workers > 1, set OMP_NUM_THREADS=1 and unset after tests.

Written by Cursor Bugbot for commit 83e077e. This will update automatically on new commits. Configure here.

@engprod-2 engprod-2 bot changed the title Yakov/raptor 15042 reconcile [RAPTOR-15042] Yakov/raptor 15042 reconcile Nov 21, 2025
@yakov-g yakov-g changed the title [RAPTOR-15042] Yakov/raptor 15042 reconcile [RAPTOR-15042] hard upgrade of DRUM and other deps in all the envs except agents Nov 21, 2025
@yakov-g yakov-g requested a review from s-gavrenkov November 21, 2025 22:23
@devexp-slackbot
Copy link

The Needs Review labels were added based on the following file changes.

Team @datarobot/core-modeling (#predictive-ai) was assigned because of changes in files:

public_dropin_environments/python3_keras/env_info.json
public_dropin_environments/python3_keras/requirements.txt
public_dropin_environments/python3_pytorch/env_info.json
public_dropin_environments/python3_pytorch/requirements.txt
public_dropin_environments/python3_sklearn/env_info.json
public_dropin_environments/python3_sklearn/requirements.txt
public_dropin_environments/python3_xgboost/env_info.json
public_dropin_environments/python3_xgboost/requirements.txt
public_dropin_environments/r_lang/env_info.json
public_dropin_environments/r_lang/requirements.txt

Team @datarobot/genai-systems (#genai-systems) was assigned because of changes in files:

model_templates/python3_xgboost/xgb_reg.pkl
public_dropin_environments/java_codegen/env_info.json
public_dropin_environments/java_codegen/requirements.txt
public_dropin_environments/python311/env_info.json
public_dropin_environments/python311/requirements.txt
public_dropin_environments/python3_keras/env_info.json
public_dropin_environments/python3_keras/requirements.txt
public_dropin_environments/python3_onnx/env_info.json
public_dropin_environments/python3_onnx/requirements.txt
public_dropin_environments/python3_pytorch/env_info.json
public_dropin_environments/python3_pytorch/requirements.txt
public_dropin_environments/python3_sklearn/env_info.json
public_dropin_environments/python3_sklearn/requirements.txt
public_dropin_environments/python3_xgboost/env_info.json
public_dropin_environments/python3_xgboost/requirements.txt
public_dropin_environments/r_lang/env_info.json
public_dropin_environments/r_lang/requirements.txt
public_dropin_gpu_environments/vllm/env_info.json
public_dropin_gpu_environments/vllm/requirements.txt
tests/fixtures/drop_in_model_artifacts/xgb_bin.pkl
tests/fixtures/drop_in_model_artifacts/xgb_multi.pkl
tests/fixtures/drop_in_model_artifacts/xgb_reg.pkl

If you think that there are some issues with ownership, please discuss with C&A domain at #sdtk slack channel and create PR to update DRCODEOWNERS\CODEOWNERS file.

@devexp-slackbot
Copy link

Label Needs Review: GenAI Systems was removed because @s-gavrenkov is part of GenAI Systems domain.

@yakov-g yakov-g force-pushed the yakov/raptor-15042-reconcile branch from 18d8d89 to 88de0e5 Compare November 24, 2025 23:46
# Setting OMP_NUM_THREADS=1 forces XGBoost to use only one thread per process.
# If we need to use wekzeug max_workers > 1 in DRUM, we must set OMP_NUM_THREADS=1 to avoid potential hangs.
# Maybe the same would apply to gunicorn, when we test.
# ENV OMP_NUM_THREADS=1
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@s-gavrenkov
@nullspoon
@klichukb

Just heads up that we may need this env var configured if we ever use werkzeug max_workers>1
For now I set it only in the related tests.

Copy link

@mbromanowsky mbromanowsky left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM for Predictive AI

@yakov-g yakov-g force-pushed the yakov/raptor-15042-reconcile branch from 88de0e5 to 83e077e Compare November 25, 2025 00:09
@yakov-g yakov-g merged commit 8b2dd30 into master Nov 25, 2025
32 checks passed
@devexp-slackbot devexp-slackbot bot deleted the yakov/raptor-15042-reconcile branch November 25, 2025 03:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants