[VM] [Hexagon] Add buffers to `dma_wait` builtin #16706

abhikran-quic · 2024-03-12T06:43:32Z

When DMA operations are introduced at the graph level, the Relax KillAfterLastUse pass inserts a kill_tensor operation right after dma_copy. As a result, memory can be deallocated while the asynchronous copy is still in progress. This change therefore passes the input/output buffers to dma_wait as well, so that kill_tensor is inserted after dma_wait at the graph level.
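The lifetime issue can be illustrated with a small standalone model. This is only a sketch, not TVM code: `Op`, `LastUseIndex`, `SafeToKill`, `BeforeChange`, and `AfterChange` are hypothetical names, and the "pass" is reduced to finding the last operation that mentions a tensor.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical model of one graph-level operation: a name plus the
// tensors it takes as arguments. Not a TVM data structure.
struct Op {
  std::string name;
  std::vector<std::string> tensors;
};

// Index of the last op that references `tensor`, or -1 if none does.
// A kill-after-last-use style pass would free the tensor right after it.
int LastUseIndex(const std::vector<Op>& ops, const std::string& tensor) {
  int last = -1;
  for (std::size_t i = 0; i < ops.size(); ++i) {
    for (const std::string& t : ops[i].tensors) {
      if (t == tensor) last = static_cast<int>(i);
    }
  }
  return last;
}

// Index of the first op with the given name, or -1 if it is absent.
int IndexOf(const std::vector<Op>& ops, const std::string& name) {
  for (std::size_t i = 0; i < ops.size(); ++i) {
    if (ops[i].name == name) return static_cast<int>(i);
  }
  return -1;
}

// True when the last use of `tensor` is at or after dma_wait, i.e. the
// tensor cannot be freed while the asynchronous copy is still running.
bool SafeToKill(const std::vector<Op>& ops, const std::string& tensor) {
  int wait = IndexOf(ops, "dma_wait");
  assert(wait >= 0 && "sequence must contain a dma_wait");
  return LastUseIndex(ops, tensor) >= wait;
}

// Before the change: dma_wait does not mention the buffers.
std::vector<Op> BeforeChange() {
  return {{"dma_copy", {"src", "dst"}}, {"dma_wait", {}}};
}

// After the change: dma_wait also takes the buffers as arguments.
std::vector<Op> AfterChange() {
  return {{"dma_copy", {"src", "dst"}}, {"dma_wait", {"src", "dst"}}};
}
```

In this model, `SafeToKill(BeforeChange(), "src")` is false because the last use of `src` is the copy itself, while `SafeToKill(AfterChange(), "src")` is true because the wait now counts as a use.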
The size calculation is also updated to use the GetDataSize function.
The test case is updated to use offsets instead of allocating separate storage in VTCM.
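As a rough illustration of what a data-size helper of this kind computes, the sketch below multiplies the element count by the bytes per element. It assumes the usual "product of shape times rounded-up element width" rule; `TensorDesc` and `DataSizeBytes` are hypothetical names and this is not the TVM implementation.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a tensor descriptor; the field names are
// illustrative only.
struct TensorDesc {
  std::vector<std::int64_t> shape;
  std::uint8_t bits;    // bits per scalar, e.g. 8 for int8
  std::uint16_t lanes;  // vector lanes per element, usually 1
};

// Total payload size in bytes: number of elements times bytes per element,
// with the element width rounded up to a whole number of bytes.
std::size_t DataSizeBytes(const TensorDesc& t) {
  assert(t.bits > 0 && t.lanes > 0);
  std::size_t size = 1;
  for (std::int64_t dim : t.shape) {
    assert(dim >= 0);
    size *= static_cast<std::size_t>(dim);
  }
  size *= (static_cast<std::size_t>(t.bits) * t.lanes + 7) / 8;
  return size;
}
```

Deriving the byte count from the tensor's own shape and dtype avoids hard-coding an element width at each DMA call site.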


abhikran-quic · 2024-03-13T02:48:17Z

cc: @Hzfengsy @quic-sanirudh

quic-sanirudh

LGTM, apart from the minor comment about the unused arguments. Thanks.

quic-sanirudh · 2024-03-14T02:55:55Z

src/runtime/relax_vm/hexagon/builtin.cc

+    .set_body_typed([](TVMArgValue vm_ptr, int queue_id, int inflight_dma, NDArray src_arr,
+                       NDArray dst_arr) {


For `src_arr` and `dst_arr`, perhaps we should add the `[[maybe_unused]]` attribute.

Thank you @quic-sanirudh ! I have fixed this in the latest patch.
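For reference, the C++17 `[[maybe_unused]]` attribute can be applied directly to lambda parameters. The sketch below shows the pattern on a simplified wait function; `Buffer` and `MakeDmaWait` are hypothetical stand-ins, and this is not the code from the patch.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-in for a tensor handle such as NDArray.
using Buffer = std::vector<int>;

// Returns a wait-style callable. The buffers are accepted only so that the
// caller keeps them alive until the wait; the body never reads them, so they
// are marked [[maybe_unused]] to silence unused-parameter warnings.
inline auto MakeDmaWait() {
  return [](int queue_id, int inflight_dma, [[maybe_unused]] const Buffer& src,
            [[maybe_unused]] const Buffer& dst) {
    assert(queue_id >= 0 && inflight_dma >= 0);
    // A real implementation would block here until at most `inflight_dma`
    // transfers remain on `queue_id`; this sketch just echoes the queue id.
    return queue_id;
  };
}
```

The attribute documents that the parameters are intentionally unused in the body, which is the case when they exist only to extend buffer lifetimes.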


quic-sanirudh approved these changes Mar 14, 2024


Fix review comments

34a4d2e

Hzfengsy merged commit 94866f7 into apache:main Mar 15, 2024

abhikran-quic deleted the abhikran/dma_builtin_fix branch March 15, 2024 08:41

ysh329 mentioned this pull request Apr 21, 2024

[Release] v0.16.0 Release Candidate Notes #16911

Closed
