[GSK-1953] Max data size as param #1636
GSK-1953 Allow users to set maximum number of samples in loss-based scan detectors
Currently we limit to |
mattbit left a comment
All good, one small change
    _needs_target = True

    def __init__(self, max_dataset_size: int = MAX_DATASET_SIZE):
inline this and probably divide by ~10?
Ok, also maybe it still makes sense to divide by n_features if the param is not specified?
yep, you could set Optional[int] = None and calculate the default at runtime if not provided
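A minimal sketch of the suggestion above, assuming the default is derived by dividing a fixed sample budget by the number of features. The class name `LossBasedDetectorSketch`, the method `resolve_max_dataset_size`, and the value of `DEFAULT_SAMPLE_BUDGET` are hypothetical and only illustrate the `Optional[int] = None` pattern; they are not taken from the PR.

```python
from typing import Optional

# Hypothetical budget; the real constant's value is not shown in this thread.
DEFAULT_SAMPLE_BUDGET = 100_000


class LossBasedDetectorSketch:
    """Illustrative stand-in, not the project's actual detector class."""

    _needs_target = True

    def __init__(self, max_dataset_size: Optional[int] = None):
        # None means "not specified": the default is computed at runtime.
        self.max_dataset_size = max_dataset_size

    def resolve_max_dataset_size(self, n_features: int) -> int:
        # An explicit user value always takes precedence.
        if self.max_dataset_size is not None:
            return self.max_dataset_size
        # Otherwise scale the budget down by the number of features,
        # guarding against zero features and a zero-sized result.
        return max(1, DEFAULT_SAMPLE_BUDGET // max(1, n_features))
```

With this shape, `LossBasedDetectorSketch(max_dataset_size=500)` always uses 500 samples, while `LossBasedDetectorSketch()` picks a limit that shrinks as the dataset gets wider.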
Broken test!
mattbit left a comment
Ok, but please fix conflicts
Kudos, SonarCloud Quality Gate passed!
This PR allows users to set the maximum number of samples (max_dataset_size) in loss-based scan detectors.