[GSK-1275] Importance of metrics calculated on partial data slice #1193

mattbit · 2023-06-21T17:37:58Z

Same as #1169 but now on main!

…GSK-1279] - [GSK-1275] Fixes problems with metrics calculated on small samples - [GSK-1279] Experimental support for false discovery rate control via Benjamini–Hochberg procedure

… into task/fix-scan-metrics

linear · 2023-06-21T17:38:05Z

GSK-1275 Importance of metrics calculated on partial data slice

User KD_A on reddit pointed out that

And does the # samples refer to the # samples in the data slice? It should be the denominator of the metric for the slice. For example, if that second row's recall of 0.111 is 1 predicted positive / 9 true positive, it's debatable whether to flag that.

This is right. We may have 1000 samples in our data slice, but to calculate for example the recall we only use the positive samples, which may be just a few samples out of the total, making the detection a false positive.

sonarqubecloud · 2023-06-21T18:02:05Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities
0 Security Hotspots
3 Code Smells

100.0% Coverage
0.0% Duplication

mattbit and others added 10 commits June 20, 2023 15:25

Fix performance metrics calculated on small size samples [GSK-1275] […

2e7f8b4

…GSK-1279] - [GSK-1275] Fixes problems with metrics calculated on small samples - [GSK-1279] Experimental support for false discovery rate control via Benjamini–Hochberg procedure

Add test

ee211eb

Clean up code

11a84a5

More tests

2d5000d

Fix tests

b85edf7

Make p_value parameter optional in PerformanceIssueInfo

25c6288

Merge branch 'task/GSK-1078' into task/fix-scan-metrics

f24e541

Merge branch 'task/GSK-1078' into task/fix-scan-metrics

ce913b2

Remove unused line

09fa043

Merge branch 'task/fix-scan-metrics' of github.com:Giskard-AI/giskard…

27340f7

… into task/fix-scan-metrics

Merge branch 'main' into task/fix-scan-metrics

83a9d8f

mattbit requested a review from andreybavt June 21, 2023 17:39

mattbit merged commit d7047f6 into main Jun 21, 2023

mattbit mentioned this pull request Jun 23, 2023

[GSK-1279] More rigorous evaluation of significance of performance metrics #1162

Open

2 tasks

Hartorn deleted the task/fix-scan-metrics branch September 13, 2023 11:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[GSK-1275] Importance of metrics calculated on partial data slice #1193

[GSK-1275] Importance of metrics calculated on partial data slice #1193

Uh oh!

mattbit commented Jun 21, 2023

linear bot commented Jun 21, 2023

sonarqubecloud bot commented Jun 21, 2023

Labels

3 participants

Uh oh!

[GSK-1275] Importance of metrics calculated on partial data slice #1193

[GSK-1275] Importance of metrics calculated on partial data slice #1193

Uh oh!

Conversation

mattbit commented Jun 21, 2023

linear bot commented Jun 21, 2023

sonarqubecloud bot commented Jun 21, 2023

Labels

3 participants