[QUESTION] variable tensor shape when pipeline parallelism (pp) #1072
KookHoiKim asked this question in Q&A (unanswered)
I am working with the LLaVA code and have a question about sequence length when using pipeline parallelism.
In my understanding, the tensor shape used for send/recv between pipeline stages is fixed to args.seq_length in pipeline_parallel/schedules.py.
If padding tokens make up most of the input, this becomes very inefficient in terms of both memory and execution speed.
Is there any way to use variable input lengths when using pipeline parallelism?
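For context, the kind of workaround I am imagining is to exchange the activation's shape between stages before sending the tensor itself, so the receiving stage can allocate a buffer sized to the real (unpadded) sequence length instead of args.seq_length. Below is a rough sketch of that idea using plain torch.distributed point-to-point calls; the function names (`send_with_shape`, `recv_with_shape`) are my own and not part of Megatron's API. I may also have missed an existing option in the codebase that already does this.

```python
# Sketch only: exchange the shape first, then the tensor, so the receiver
# does not need to know seq_length in advance. Assumes a distributed process
# group has already been initialized (e.g. via torch.distributed.init_process_group).
import torch
import torch.distributed as dist


def send_with_shape(tensor: torch.Tensor, dst: int) -> None:
    # Send the number of dimensions, then the shape, then the payload.
    shape = torch.tensor(tensor.shape, dtype=torch.int64, device=tensor.device)
    ndims = torch.tensor([shape.numel()], dtype=torch.int64, device=tensor.device)
    dist.send(ndims, dst)
    dist.send(shape, dst)
    dist.send(tensor.contiguous(), dst)


def recv_with_shape(src: int, dtype: torch.dtype, device: torch.device) -> torch.Tensor:
    # Receive the number of dimensions and the shape, then allocate a buffer
    # of exactly that size and receive the payload into it.
    ndims = torch.empty(1, dtype=torch.int64, device=device)
    dist.recv(ndims, src)
    shape = torch.empty(int(ndims.item()), dtype=torch.int64, device=device)
    dist.recv(shape, src)
    buf = torch.empty(tuple(shape.tolist()), dtype=dtype, device=device)
    dist.recv(buf, src)
    return buf
```

Is something along these lines feasible inside the pipeline schedule, or does the fixed shape assumption run deeper (e.g. in how the send/recv buffers are pre-allocated for overlapping communication)?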
Thanks.