This project uses the Kaggle Titanic training dataset, extrapolated to a dataset size of 1M.
Download the dataset and place it in the data/ directory before running the agent.
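Before running the agent, it can help to confirm that the dataset actually landed in data/. The following is a minimal sketch, not part of the project: it assumes the dataset is stored as one or more CSV files (the file name is not specified here), and `find_dataset_files` is a hypothetical helper.

```python
from pathlib import Path


def find_dataset_files(data_dir="data"):
    """Return the CSV files found in data_dir, sorted by path.

    Assumption: the dataset is a CSV file. Adjust the glob pattern
    if the download uses a different format.
    """
    directory = Path(data_dir)
    if not directory.is_dir():
        # A missing directory is treated the same as an empty one.
        return []
    return sorted(p for p in directory.glob("*.csv") if p.is_file())


if __name__ == "__main__":
    files = find_dataset_files()
    if files:
        print("Found dataset file(s):", ", ".join(p.name for p in files))
    else:
        print("No CSV files found in data/; download the dataset first.")
```

Run it from the project root so that the relative data/ path resolves correctly.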
You can run the agent in one of two modes, GPU or CPU:
GPU mode: uses NVIDIA's RAPIDS libraries (cudf.pandas and cuml.accel) for faster data processing and model training.
python -m cudf.pandas -m cuml.accel run.py
CPU mode: uses standard pandas and scikit-learn for data processing and modeling.
python run.py
Note:
- Ensure you have the appropriate dependencies installed for each mode.
- GPU mode requires a supported NVIDIA GPU and the RAPIDS libraries (cuDF and cuML) to be installed.
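One way to check which mode your environment can support is to test whether the RAPIDS packages are installed and choose the launch command accordingly. The sketch below is illustrative only: `module_available` and `build_command` are hypothetical helpers, and the check only confirms that the `cudf` and `cuml` packages can be located. It does not verify that a supported GPU is present.

```python
import importlib.util


def module_available(name):
    """Return True if a top-level module can be located without importing it."""
    return importlib.util.find_spec(name) is not None


def build_command(script="run.py", finder=module_available):
    """Return the GPU launch command if cudf and cuml are installed, else the CPU one.

    Assumption: having both packages installed is treated as "GPU mode
    is available"; an actual GPU check is out of scope for this sketch.
    """
    if finder("cudf") and finder("cuml"):
        return ["python", "-m", "cudf.pandas", "-m", "cuml.accel", script]
    return ["python", script]


if __name__ == "__main__":
    print(" ".join(build_command()))
```

The `finder` parameter exists only so the selection logic can be exercised without RAPIDS installed; in normal use the default is enough.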