I am building my own custom workflow to test and iterate on ideas quickly. Ideally, we can fire off elastic runs serverless-ly.
the workflow is: implement new method in train_gpt.py, run the run.py command --> launches to modal with wandb logging --> results saved to my own local leaderboard state.
Notes:
- Need to make sure we are auto-constraining the limits with modal. 10min train + 10min eval, 16mb memory footprint.
I am building my own custom workflow to test and iterate on ideas quickly. Ideally, we can fire off elastic runs serverless-ly.
Notes: