Use multi-thread for cloud experiments and multi-process for local ones #29

DonggeLiu · 2024-01-29T05:35:05Z

With cloud experiments enabled, the local program is left with no computationally heavy tasks. Most of the time, it only waits for cloud build results and writes them to a file.
Using multiple threads instead of multiple processes can achieve higher parallelism (e.g., by avoiding overhead in process creation, management, and context switching) and requires fewer resources (e.g., reduced memory usage).
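To illustrate the reasoning above, here is a minimal sketch of waiting-dominated tasks running in threads. The function `wait_for_cloud_build` and the benchmark names are hypothetical stand-ins (the real tasks in this PR are not shown here); the sleep only simulates blocking on a remote build.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def wait_for_cloud_build(benchmark, wait_seconds=0.1):
    """Hypothetical stand-in for a task that only waits on a cloud build."""
    # time.sleep blocks without holding the GIL, like a network wait would,
    # so other threads keep running in the meantime.
    time.sleep(wait_seconds)
    return f"{benchmark}: done"


def run_in_threads(benchmarks):
    """Runs one waiting task per benchmark, each in its own thread."""
    # max_workers must be at least 1, even for an empty input list.
    with ThreadPoolExecutor(max_workers=max(1, len(benchmarks))) as executor:
        # executor.map returns results in the same order as the inputs.
        return list(executor.map(wait_for_cloud_build, benchmarks))
```

Because every task spends its time blocked rather than computing, the threads overlap their waits, and no extra interpreter processes need to be started.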

oliverchang · 2024-01-30T03:52:05Z

run_all_experiments.py

+  elif args.cloud_experiment_name:
+    # Use multi-threads for cloud experiments, because each thread only needs to
+    # wait for cloud build results or conduct simple I/O tasks.
+    with ThreadPoolExecutor(max_workers=NUM_EXP) as executor:


Is there a way we can consolidate the two code paths more?

e.g. something like:

if ...:
  pool = ThreadPool
else:
  pool = Pool

Not sure what the difference is exactly between ThreadPoolExecutor and ThreadPool

See https://docs.python.org/3/library/multiprocessing.html#multiprocessing.pool.ThreadPool
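A minimal sketch of the consolidation idea above, assuming a single flag decides between the two pool types. `multiprocessing.pool.ThreadPool` mirrors the `multiprocessing.Pool` interface, so the rest of the code can stay identical. The names `run_one`, `run_with_pool`, and `use_cloud` are hypothetical and not taken from the PR.

```python
from multiprocessing import Pool
from multiprocessing.pool import ThreadPool


def run_one(name):
    """Hypothetical placeholder for running a single experiment."""
    return f"finished {name}"


def run_with_pool(names, use_cloud, workers=2):
    """Picks the pool type once, then shares a single code path."""
    # ThreadPool and Pool both accept `processes` and both provide map().
    pool_factory = ThreadPool if use_cloud else Pool
    with pool_factory(processes=workers) as pool:
        # map() blocks until every result is ready, so leaving the
        # `with` block (which terminates the pool) is safe here.
        return pool.map(run_one, names)
```

With `use_cloud=False` the same function takes the process path; on platforms that spawn worker processes, that call should sit under an `if __name__ == "__main__":` guard.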

I reckon the doc implies that ThreadPoolExecutor is more modern and better supports thread-level parallelism?
I can definitely try to consolidate the two code paths to make them look better.

Made an attempt to clean up the related code in run_all_experiments.py a bit; I will do the same in run_one_experiment.py.
The code should now be more readable and less repetitive, but I did not find a clean way to consolidate the two paths further without over-engineering this.

Please let me know if there are other options.

oliverchang · 2024-01-30T09:11:56Z

Can we use https://docs.python.org/3/library/concurrent.futures.html#concurrent.futures.ProcessPoolExecutor instead to make the code paths more consistent for the process case?

DonggeLiu · 2024-01-30T23:39:31Z

Yep, sure.
Thanks!
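For reference, a minimal sketch of how the two paths might share one interface through `concurrent.futures`, since `ThreadPoolExecutor` and `ProcessPoolExecutor` both implement `Executor`. This is an illustration under assumptions, not the code from this PR: `run_experiment`, `make_executor`, and `use_cloud` are hypothetical names, and the value of `NUM_EXP` is assumed because its definition is not shown in the diff.

```python
from concurrent.futures import ProcessPoolExecutor, ThreadPoolExecutor

NUM_EXP = 2  # Assumed value; the PR's actual NUM_EXP is not shown.


def run_experiment(name):
    """Hypothetical placeholder for running a single experiment."""
    return f"finished {name}"


def make_executor(use_cloud):
    """Returns a thread executor for cloud runs, a process executor otherwise."""
    executor_cls = ThreadPoolExecutor if use_cloud else ProcessPoolExecutor
    return executor_cls(max_workers=NUM_EXP)


def run_experiments(names, use_cloud):
    """Submits every experiment and collects results in submission order."""
    with make_executor(use_cloud) as executor:
        futures = [executor.submit(run_experiment, name) for name in names]
        # result() blocks until the corresponding task has finished.
        return [future.result() for future in futures]
```

Only `make_executor` knows which kind of worker is in use; the submit-and-collect logic is written once. As with any process pool, the `use_cloud=False` path should be called from under an `if __name__ == "__main__":` guard on platforms that spawn worker processes.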

DonggeLiu · 2024-02-26T23:53:04Z

Not high priority; converting this to a draft for now, and I will come back to it later.

Use multi-thread for cloud experiments and multi-process for local ones

c3b75cc

DonggeLiu requested a review from oliverchang January 29, 2024 05:35

Justify sleep(30)

2524e83

oliverchang reviewed Jan 30, 2024

Cleanup thread/process usage

55886e0

DonggeLiu marked this pull request as draft February 26, 2024 23:52
