colossalai.context.process_group_initializer.initializer_3d
- class colossalai.context.process_group_initializer.initializer_3d.Initializer_3D_Input(num_group, depth, *args)
3D tensor parallel initialization among input.
- Parameters
num_group (int) – The number of all tensor groups
depth (int) – Depth of 3D parallelism
args – Args used in base class
- init_dist_group()
Initialize 3D tensor parallel groups among input, and assign local_ranks and groups to each gpu.
- Returns
3D tensor parallelism’s information among input
- Return type
Tuple(local_rank, group_world_size, process_group, ranks_in_group, mode)
- class colossalai.context.process_group_initializer.initializer_3d.Initializer_3D_Weight(num_group, depth, *args)
3D tensor parallel initialization among weight.
- Parameters
num_group (int) – The number of all tensor groups
depth (int) – Depth of 3D parallelism
args – Args used in base class
- init_dist_group()
Initialize 3D tensor parallel groups among weight, and assign local_ranks and groups to each gpu.
- Returns
3D tensor parallelism’s information among weight
- Return type
Tuple(local_rank, group_world_size, process_group, ranks_in_group, mode)
- class colossalai.context.process_group_initializer.initializer_3d.Initializer_3D_Output(num_group, depth, *args)
3D tensor parallel initialization among output.
- Parameters
num_group (int) – The number of all tensor groups
depth (int) – Depth of 3D parallelism
args – Args used in base class
- init_dist_group()
Initialize 3D tensor parallel groups among output, and assign local_ranks and groups to each gpu.
- Returns
3D tensor parallelism’s information among output
- Return type
Tuple(local_rank, group_world_size, process_group, ranks_in_group, mode)
- class colossalai.context.process_group_initializer.initializer_3d.Initializer_3D(*args)
Serve as the single entry point to 3D parallel initialization. :param args: Args used to initialize ProcessGroupInitializer
- init_dist_group()
Initialize 3D tensor parallel groups, and assign local_ranks and groups to each gpu. :return: 3D tensor parallelism’s information :rtype: list of Tuples (local_rank, group_world_size, process_group, ranks_in_group, mode)