Add a tensorrt backend #33
base: master
Conversation
I'll add the rest of my comments once there's a clear story for multi-GPU support.
self.context = self.trt_engine.create_execution_context()
# create buffers for inference
self.buffers = {}
for batch_size in range(1, self.trt_engine.max_batch_size + 1):
I'm leaving a comment here to remind me:
We need to do some memory measurement to figure out if all of these buffers are necessary. I wonder if allocating buffers only for batch sizes [1, 2, 5, 10] or other combinations might be better.
Things to test:
- How many TensorRT models can the NX hold?
- How much extra memory does this allocate (relating to "Use an s3 bucket to store large files instead of Git LFS" #1)?
- What's the speed if we do [1, 2, 10] vs [1, 2, 5, 10] vs [1, 2, 3, 4, 5, 6, ..., 10]?
Another question we'll have to figure out: Should this be configurable via the init?
I'll do some tests to figure out how much memory is needed for those buffers. Another thought: if we don't get a performance improvement from a larger batch size, we don't have to do this at all. Based on my tests, a larger batch size improves inference time by about 10% but lowers preprocessing performance, so the overall performance is actually a little lower than with a small batch size.
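To make the idea concrete, here is a minimal sketch of what a configurable, sparse buffer allocation could look like. The helper names (`_create_buffers`, `_allocate_buffers`, `_buffer_for`) and the default size list are assumptions for illustration, not the API in this PR:

```python
# Sketch only: allocate buffers for a configurable subset of batch sizes
# instead of every size in range(1, max_batch_size + 1).
DEFAULT_BUFFER_BATCH_SIZES = [1, 2, 5, 10]  # assumed default, to be tuned


def _create_buffers(self, buffer_batch_sizes=None):
    """Allocate host/device buffers only for the requested batch sizes."""
    if buffer_batch_sizes is None:
        buffer_batch_sizes = DEFAULT_BUFFER_BATCH_SIZES
    self.buffers = {}
    for batch_size in buffer_batch_sizes:
        if batch_size > self.trt_engine.max_batch_size:
            continue  # skip sizes the engine cannot serve
        # _allocate_buffers is a hypothetical helper returning
        # (inputs, outputs, bindings, stream) for the given batch size
        self.buffers[batch_size] = self._allocate_buffers(batch_size)


def _buffer_for(self, batch_size):
    """Return the smallest pre-allocated buffer that fits the batch."""
    usable = [size for size in self.buffers if size >= batch_size]
    return self.buffers[min(usable)]
```

If we go this way, the `buffer_batch_sizes` list could also be exposed through `__init__`, which would answer the configurability question above.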
self.ctx.pop()
return final_outputs

def _prepare_post_process(self):
I'm starting to think that there are too many constants and GridNet-specific functions here, and it might be easier to make a separate class specifically for parsing GridNet bounding boxes.
For now, let's clean up the rest of the code first, then discuss how that would work.
These constants are only necessary for detectors; maybe we need another parameter like is_detector in the constructor to indicate whether this capsule is a detector or a classifier?
Or we could check whether these constants exist before calling the post-process function.
Yeah, but I'm thinking that this is super duper specific to GridNet detectors in particular. Maybe we can just offer a function for parsing GridNet detector outputs, and name it as such.
class GridNetParser:
    def __init__(self, parameters):
        ...

    def parse_detection_results(self, prediction):
        ...


class BaseTensorRTBackend:
    ...
The benefit would be to separate all of these GridNet-specific parameters out of the BaseTensorRTBackend 🤔
Great idea, we should have separate parsers for different architectures.
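To make the split concrete, a rough usage sketch (the class name, constructor arguments, and method names here are placeholders, not the final API):

```python
# Hypothetical capsule that composes the two classes, keeping
# BaseTensorRTBackend free of GridNet-specific parameters.
class MyDetectorCapsule:
    def __init__(self):
        self.backend = BaseTensorRTBackend("detector.trt")  # placeholder args
        self.parser = GridNetParser(parameters={})          # placeholder params

    def process(self, frames):
        raw_outputs = self.backend.batch_predict(frames)
        return [self.parser.parse_detection_results(out) for out in raw_outputs]
```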
outputs.append(HostDeviceMem(host_mem, device_mem))
return inputs, outputs, bindings, stream

def do_inference(self, bindings: List[int], inputs: List[HostDeviceMem], outputs: List[HostDeviceMem],
Suggested change:
def _do_inference(self, bindings: List[int],
                  inputs: List[HostDeviceMem],
                  outputs: List[HostDeviceMem],
                  stream: cuda.Stream,
                  batch_size: int = 1) -> List[List[float]]:
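For context, the body of this method could follow the usual pycuda + TensorRT implicit-batch pattern. This is a sketch rather than the code in this PR; it assumes `HostDeviceMem` exposes `.host` and `.device` attributes as in the NVIDIA samples, and that `self.context` is the execution context created earlier:

```python
import pycuda.driver as cuda


def _do_inference(self, bindings, inputs, outputs, stream, batch_size=1):
    # Copy input data from host to device asynchronously
    for inp in inputs:
        cuda.memcpy_htod_async(inp.device, inp.host, stream)
    # Run inference on the CUDA stream
    self.context.execute_async(batch_size=batch_size, bindings=bindings,
                               stream_handle=stream.handle)
    # Copy predictions back from device to host
    for out in outputs:
        cuda.memcpy_dtoh_async(out.host, out.device, stream)
    # Wait for all queued work on the stream to finish
    stream.synchronize()
    return [out.host for out in outputs]
```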
def batch_predict(self, input_data_list: List[Any]) -> List[Any]:
    task_size = len(input_data_list)
    curr_index = 0
    while curr_index < task_size:
This logic may need to be revisited if we decide not to have buffers [0->10], and instead have combinations of [1, 2, 5, 10], for example
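For example, the loop could pick the largest pre-allocated batch size that still fits the remaining work (a sketch; `_process_batch` is a hypothetical helper that runs one batch through the buffers for that size):

```python
def batch_predict(self, input_data_list):
    task_size = len(input_data_list)
    curr_index = 0
    results = []
    while curr_index < task_size:
        remaining = task_size - curr_index
        # Largest pre-allocated batch size that doesn't exceed the remaining
        # work; fall back to the smallest buffer if nothing fits (the final
        # partial batch then underfills that buffer).
        candidates = [size for size in self.buffers if size <= remaining]
        batch_size = max(candidates) if candidates else min(self.buffers)
        batch = input_data_list[curr_index:curr_index + batch_size]
        results.extend(self._process_batch(batch, batch_size))
        curr_index += batch_size
    return results
```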
Co-authored-by: Alex Thiel <[email protected]>
…ules into tensorrt_backend
@apockill I resolved most of your comments except the post-process stuff. The code is tested and working, and the performance is almost the same. I'll continue to work on the GridParser stuff. In the meantime, you can take another look.
out_lists = [out_array.tolist() for out_array in out_array_by_batch]
batch_outputs.append(out_lists)
final_outputs = list(zip(*batch_outputs))
final_outputs = [list(item) for item in final_outputs]
Is there a reason we need to cast each item to a list? After zip, each item is already held as a tuple.
I just wanted to match the original type hint. I can also change the type hint instead.
Got it. Yeah, just change the type hint. Tuples are cheaper and faster anyway.
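For reference, keeping the tuples would just mean adjusting the annotation along these lines (a sketch; the real signature may differ):

```python
from typing import Any, List, Tuple


def _collect_outputs(batch_outputs: List[List[Any]]) -> List[Tuple[Any, ...]]:
    # zip already yields tuples, so no per-item list() conversion is needed
    return list(zip(*batch_outputs))
```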
Add BaseTensorRTBackend that provides support for TensorRT models