This example demonstrates how to build a natural language-driven data science and machine learning agent powered by NVIDIA GPUs. The agent allows users to perform data exploration, model training, and hyperparameter optimization interactively using RAPIDS cuDF and cuML for GPU acceleration.
The Data Science ML Agent enables natural language interaction for common data science workflows, including dataset loading, target selection, model training, and performance optimization. It combines the flexibility of LLMs with GPU-accelerated computation to simplify and speed up end-to-end machine learning pipelines.
NVIDIA Nemotron Nano-9B-v2: A compact, open-source large language model optimized for reasoning and data analysis tasks.
Download it to your local folder using the llm_download.py script. The model is saved permanently in .cache, for example /home/your_username/.cache/huggingface/hub/models--nvidia--NVIDIA-Nemotron-Nano-9B-v2/snapshots/bce37e25324449f9be5b6a03c69a15244d27ee6e
vLLM is a high-performance, open-source library designed for fast inference and serving of large language models (LLMs).
- Natural language interface for running data exploration and ML workflows
- GPU acceleration with cuDF (DataFrame operations) and cuML (ML algorithms)
- Fast inference using the locally downloaded LLM model
- Support for CPU mode using pandas and scikit-learn
- Simple setup and execution through Streamlit interface
- Compatible with both small and large-scale datasets
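The GPU/CPU duality works because cudf.pandas and cuml.accel are drop-in accelerators: the agent's code is written against the pandas and scikit-learn APIs, and the `-m` flags reroute execution to the GPU without code changes. A minimal sketch of what that means in practice:

```python
# Plain pandas code: run as-is on CPU, or launch with
# `python -m cudf.pandas this_script.py` to back it with cuDF on the GPU.
import pandas as pd

df = pd.DataFrame({"fare": [7.25, 71.28, 8.05], "survived": [0, 1, 0]})

# Same pandas API either way; only the execution backend changes.
mean_fare = df.groupby("survived")["fare"].mean()
print(mean_fare)
```

The column names here are illustrative (Titanic-style); the point is that no cuDF-specific imports appear in user code.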
- RAPIDS 25.10
- Python 3.10, 3.11, 3.12, or 3.13
- vLLM 0.9.x or later
- CUDA 12.0 or 13.0 compatible NVIDIA GPU (for GPU mode)
- streamlit, optuna, joblib, transformers, accelerate
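The remaining Python dependencies can be installed with pip in the agent environment (package names as listed above; pin versions as needed for your setup):

```shell
pip install streamlit optuna joblib transformers accelerate
```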
Please refer to the official RAPIDS installation documentation for detailed instructions.
For stability on Blackwell systems, it is highly recommended to use two separate environments: one for the Inference Server (vLLM) and one for the Agent (RAPIDS).
This environment hosts the LLM. It must be kept clean to avoid compiler conflicts with RAPIDS.
conda create -n vllm_server python=3.12
pip install vllm --extra-index-url https://download.pytorch.org/whl/cu130
This environment runs the Streamlit UI and RAPIDS libraries.
conda create -n rapids-25.10 -c rapidsai -c conda-forge -c nvidia \
    rapids=25.10 python=3.11 'cuda-version=13.0'
Open a terminal, activate the server environment, and launch the model.
conda activate vllm_server
export TRITON_PTXAS_PATH=/usr/local/cuda/bin/ptxas
export TORCH_CUDA_ARCH_LIST="12.1a"
export PATH=/usr/local/cuda/bin:$PATH
export LD_LIBRARY_PATH=/home/your_username/miniconda3/envs/vllm_server/lib:$LD_LIBRARY_PATH
vllm serve /home/your_username/.cache/huggingface/hub/models--nvidia--NVIDIA-Nemotron-Nano-9B-v2/snapshots/bce37e25324449f9be5b6a03c69a15244d27ee6e \
--served-model-name nemotron-9b \
--dtype bfloat16 \
--max-model-len 8192 \
--gpu-memory-utilization 0.6 \
--max-num-seqs 32 \
--port 8000 \
--trust-remote-code \
--enable-auto-tool-choice \
--tool-call-parser llama3_json
Open a second terminal.
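Once the server is up, you can verify it from another terminal before launching the agent. A minimal stdlib sketch, assuming the port and served model name from the command above (the endpoint shape follows vLLM's OpenAI-compatible API):

```python
import json
import urllib.request

# Assumed from the serve command above: port 8000, served model "nemotron-9b".
BASE_URL = "http://localhost:8000/v1"


def build_chat_payload(prompt: str, model: str = "nemotron-9b") -> dict:
    """Build an OpenAI-compatible chat-completions request body."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 128,
        "temperature": 0.2,
    }


def chat(prompt: str) -> str:
    """Send one chat request to the local vLLM server and return the reply text."""
    req = urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(build_chat_payload(prompt)).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]
```

Call chat("hello") once the server reports it is ready; an error here usually means the server is still loading the model or the port differs.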
conda activate rapids-25.10
# GPU-accelerated mode
python -m cudf.pandas -m cuml.accel -m streamlit run user_interface.py
# CPU-based mode
streamlit run user_interface.py
load dataset Titanic-Dataset.csv/Titanic-Dataset-test.csv
describe the dataset
preview the first n rows
set target variable to be 'Survived'
train classification/regression model
optimize svc with 50 trials
optimize forest regressor with 30 trials
show best model by r2
make inference for the test dataset
...
This project provides sample datasets. You can download the Kaggle Titanic datasets train.csv and test.csv into the data folder. You can also create train-1M.csv, a version of train.csv extrapolated to 1M rows, using the extrapolation.py script.
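The bundled extrapolation.py script is not reproduced here, but the idea can be sketched with plain pandas: tile the source rows until the target size is reached (the real script may instead shuffle or add noise to avoid exact duplicates):

```python
import pandas as pd


def extrapolate(df: pd.DataFrame, target_rows: int) -> pd.DataFrame:
    """Repeat the rows of df (with a fresh index) until target_rows is reached.
    A minimal sketch of what a script like extrapolation.py could do."""
    reps = -(-target_rows // len(df))  # ceiling division
    big = pd.concat([df] * reps, ignore_index=True)
    return big.head(target_rows)


# Usage (paths assumed):
# extrapolate(pd.read_csv("data/train.csv"), 1_000_000).to_csv(
#     "data/train-1M.csv", index=False)
```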
Note:
- Ensure you have the appropriate dependencies installed for each mode.
- GPU mode requires a supported NVIDIA GPU and the RAPIDS ecosystem installed.