def enable_parallelism(
    self,
    *,
    config: Union[ParallelConfig, ContextParallelConfig],
    cp_plan: Optional[Dict[str, ContextParallelModelPlan]] = None,
    mesh: Optional[DeviceMesh] = None,  # Add this parameter
):
Users who have already initialized a device mesh for other parallelism strategies (FSDP, tensor parallelism, pipeline parallelism) can pass it in through `mesh` and reuse it instead of having a new one created. This avoids conflicting meshes and redundant initialization.
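The intended fallback logic could look roughly like the sketch below. It is only an illustration in plain Python: `FakeMesh`, `build_default_mesh`, `resolve_mesh`, and the `"cp"` dimension name are hypothetical stand-ins invented for this example, not the actual `DeviceMesh` class or the library's real implementation.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class FakeMesh:
    """Hypothetical stand-in for a device mesh: a shape plus one name per dimension."""
    shape: Tuple[int, ...]
    dim_names: Tuple[str, ...]


def build_default_mesh(cp_degree: int) -> FakeMesh:
    # Assumed fallback: a fresh 1-D mesh that only covers context parallelism.
    return FakeMesh(shape=(cp_degree,), dim_names=("cp",))


def resolve_mesh(cp_degree: int, mesh: Optional[FakeMesh] = None) -> FakeMesh:
    """Reuse the caller's mesh when one is given, otherwise create a new one."""
    if mesh is None:
        return build_default_mesh(cp_degree)
    # Assumption: a reused mesh must expose a dimension named "cp".
    if "cp" not in mesh.dim_names:
        raise ValueError("provided mesh has no 'cp' dimension")
    cp_size = mesh.shape[mesh.dim_names.index("cp")]
    if cp_size != cp_degree:
        raise ValueError(f"mesh 'cp' size {cp_size} != requested degree {cp_degree}")
    return mesh


# A mesh the user already built for data parallelism plus context parallelism.
shared = FakeMesh(shape=(2, 4), dim_names=("dp", "cp"))
reused = resolve_mesh(4, mesh=shared)   # the same object is returned, nothing new is built
created = resolve_mesh(4)               # no mesh passed, so a default one is created
```

Validating the provided mesh against the requested degree is one possible design choice; it surfaces a mismatch at setup time rather than during the first forward pass.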