[GSK-1275] Importance of metrics calculated on partial data slice #1169

mattbit · 2023-06-13T07:45:35Z

linear · 2023-06-13T07:45:39Z

GSK-1275 Importance of metrics calculated on partial data slice

User KD_A on reddit pointed out that

And does the # samples refer to the # samples in the data slice? It should be the denominator of the metric for the slice. For example, if that second row's recall of 0.111 is 1 predicted positive / 9 true positive, it's debatable whether to flag that.

This is right. We may have 1000 samples in our data slice, but to calculate for example the recall we only use the positive samples, which may be just a few samples out of the total, making the detection a false positive.

…GSK-1279] - [GSK-1275] Fixes problems with metrics calculated on small samples - [GSK-1279] Experimental support for false discovery rate control via Benjamini–Hochberg procedure

andreybavt

Also LGTM with some questions

python-client/giskard/scanner/performance/performance_bias_detector.py

andreybavt · 2023-06-21T14:13:04Z

python-client/giskard/scanner/performance/metrics.py

+    def _calculate_affected_samples(self, y_true: np.ndarray, y_pred: np.ndarray, model: BaseModel) -> int:
+        if model.is_binary_classification:
+            # F1 score will not be affected by true negatives
+            neg = model.meta.classification_labels[0]


Why do we only do it for binary classification and not for all cases?

Because the way the F1 is computed for multiclass is different. In our case it will use the total count of true positives, false positives, and false negatives in a one-vs-rest way for each class. So in the end it will use all the samples.

… into task/fix-scan-metrics

sonarqubecloud · 2023-06-21T14:59:34Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities
0 Security Hotspots
3 Code Smells

100.0% Coverage
0.0% Duplication

mattbit force-pushed the task/fix-scan-metrics branch from 240413c to a5bf2d4 Compare June 19, 2023 09:51

mattbit changed the base branch from main to task/GSK-1078 June 20, 2023 09:38

mattbit added 3 commits June 20, 2023 15:25

Fix performance metrics calculated on small size samples [GSK-1275] […

2e7f8b4

…GSK-1279] - [GSK-1275] Fixes problems with metrics calculated on small samples - [GSK-1279] Experimental support for false discovery rate control via Benjamini–Hochberg procedure

Add test

ee211eb

Clean up code

11a84a5

mattbit force-pushed the task/fix-scan-metrics branch from 874fd77 to 11a84a5 Compare June 20, 2023 14:30

mattbit marked this pull request as ready for review June 20, 2023 14:31

mattbit added 3 commits June 20, 2023 16:48

More tests

2d5000d

Fix tests

b85edf7

Make p_value parameter optional in PerformanceIssueInfo

25c6288

mattbit requested a review from andreybavt June 21, 2023 10:19

mattbit and others added 2 commits June 21, 2023 14:23

Merge branch 'task/GSK-1078' into task/fix-scan-metrics

f24e541

Merge branch 'task/GSK-1078' into task/fix-scan-metrics

ce913b2

andreybavt reviewed Jun 21, 2023

View reviewed changes

mattbit added 2 commits June 21, 2023 16:30

Remove unused line

09fa043

Merge branch 'task/fix-scan-metrics' of github.com:Giskard-AI/giskard…

27340f7

… into task/fix-scan-metrics

mattbit requested a review from andreybavt June 21, 2023 14:33

andreybavt merged commit f1f9465 into task/GSK-1078 Jun 21, 2023

mattbit mentioned this pull request Jun 21, 2023

[GSK-1275] Importance of metrics calculated on partial data slice #1193

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[GSK-1275] Importance of metrics calculated on partial data slice #1169

[GSK-1275] Importance of metrics calculated on partial data slice #1169

Uh oh!

mattbit commented Jun 13, 2023

linear bot commented Jun 13, 2023

andreybavt left a comment

Uh oh!

andreybavt Jun 21, 2023

mattbit Jun 21, 2023

sonarqubecloud bot commented Jun 21, 2023

Labels

3 participants

Uh oh!

[GSK-1275] Importance of metrics calculated on partial data slice #1169

[GSK-1275] Importance of metrics calculated on partial data slice #1169

Uh oh!

Conversation

mattbit commented Jun 13, 2023

linear bot commented Jun 13, 2023

andreybavt left a comment

Choose a reason for hiding this comment

Uh oh!

andreybavt Jun 21, 2023

Choose a reason for hiding this comment

mattbit Jun 21, 2023

Choose a reason for hiding this comment

sonarqubecloud bot commented Jun 21, 2023

Labels

3 participants