Cudastf #794
Draft
sidelnik wants to merge 41 commits into NVIDIA:main from sidelnik:cudastf
Commits
d6dc01d  Update build config to pull CUDASTF (sidelnik)
245b20f  remove const expr (sidelnik)
9b35ec8  Updates to get basic cudastf functionality working with matx (sidelnik)
7d298d4  move to void_interface (sidelnik)
154b3f9  add stf executor (sidelnik)
c8ef988  support for cgsolve operator and a few examples (sidelnik)
52b18c9  make the sync() that is part of stfexecutor call ctx.task_fence() (sidelnik)
d726b10  fix typo (sidelnik)
5e7576c  Added test case (sidelnik)
1373699  Fixes to the sync (sidelnik)
92e7204  add support for cgsolve (sidelnik)
a608f3f  update to the simple radar code (sidelnik)
b062577  minor typo fix (sidelnik)
bbf9abc  update version of stf (sidelnik)
3e831ea  cleanup constexpr case for stfexecutor (sidelnik)
702fe79  cleanup constexpr case for stfexecutor (sidelnik)
5bfe21e  add conditional support for cudagraph to the stf executor (sidelnik)
f407256  update to latest cudastf (sidelnik)
221599c  switch to use logical token (sidelnik)
7a5bb6c  update parameters for radar code (sidelnik)
0c2432f  update to radar code to work with command line args (sidelnik)
3ae267b  cleanup to support different executor (sidelnik)
6a75794  cleanup radar code to emit stf and cuda versions (sidelnik)
f1facca  test script that runs simple radar with different input sizes. output… (sidelnik)
0199e75  enable cuda graphs as a command line argument enableGraphs (sidelnik)
39b16f4  add support for the random/randomOp generator (sidelnik)
9b7c4b0  get the basic spectrogram code working with stf (sidelnik)
f9e09f1  get spectrogram cudagraph code working with stf (sidelnik)
6c9a791  add assert in the case stream capture is turned on if creating a plan (sidelnik)
a1efd1c  Merge branch 'cudastf' into cudastf_latest (sidelnik)
6437eab  Merge pull request #2 from sidelnik/cudastf_latest (sidelnik)
bbb9aae  Apps using matx with stf should get these flags (caugonnet)
e13c9b6  fix constructor (caugonnet)
7244399  fix typo/bug (sidelnik)
66f6850  update to example code to fix compile error (sidelnik)
89e2a43  update to example code to fix compile error (sidelnik)
973886b  update test script for radar code (sidelnik)
92885e7  temp fix to the allocator dtor (sidelnik)
8607840  remove warning to work with latest stf (sidelnik)
14e0985  replace logical token with token (sidelnik)
92e04d5  update version to use cccl from main (sidelnik)
move to void_interface
commit 7d298d405f4937af5657f4d79079924b18b15b38
A comment would be welcome here :) This creates a logical data with a void data interface: we don't rely on CUDASTF for transfers or allocation, only for synchronization.
Passing a value here, rather than the shape of a void interface, means we don't have to issue a "write" task in CUDASTF.
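To make the distinction in this comment concrete, here is a toy model in plain C++. It is only a sketch of the idea, not CUDASTF code: `ToyToken`, `token_from_shape`, `token_from_value`, `submit` and `demo` are hypothetical names invented for illustration. The assumption it encodes, taken from the comment above, is that a handle created from a shape alone has no valid instance until some task writes it, whereas a handle created from a value can be read right away. Throwing on a premature read is the toy's own choice; how CUDASTF reacts in that case is not shown in this thread.

```cpp
#include <cassert>
#include <stdexcept>
#include <string>
#include <vector>

// Toy model only: these names are hypothetical and are NOT the CUDASTF API.
enum class Access { Read, Write };

// A payload-free handle used purely to order tasks.
struct ToyToken {
    bool has_valid_instance;           // true once something counts as "written"
    std::vector<std::string> history;  // names of accepted tasks, in order
};

// Created from a shape only: nothing valid exists yet.
inline ToyToken token_from_shape() { return ToyToken{false, {}}; }

// Created from a value: a valid instance exists from the start.
inline ToyToken token_from_value() { return ToyToken{true, {}}; }

// Records a task on the token. The toy rejects a read when nothing valid exists.
inline void submit(ToyToken& t, const std::string& name, Access mode) {
    if (mode == Access::Read && !t.has_valid_instance) {
        throw std::logic_error("read before any write: " + name);
    }
    if (mode == Access::Write) {
        t.has_valid_instance = true;
    }
    t.history.push_back(name);
}

// Exercises both construction paths.
inline void demo() {
    ToyToken v = token_from_value();
    submit(v, "consumer", Access::Read);  // accepted immediately
    assert(v.history.size() == 1);

    ToyToken s = token_from_shape();
    submit(s, "init", Access::Write);     // extra task needed before any read
    submit(s, "consumer", Access::Read);
    assert(s.history.size() == 2);
}
```

In this model the value-constructed token needs one task where the shape-constructed token needs two, which is the saving the comment points out.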