Andy Ful
From Hard_Configurator Tools
Thread author
Verified
Honorary Member
Top Poster
Developer
Well-known
- Dec 23, 2014
- 8,817
- Content source
- https://github.com/AndyFul/ConfigureDefender
Homemade AV testing (a suggestion).
Post updated / redesigned / shortened 14.02.2025
Simplified and standardized version can be found here:
malwaretips.com
In my posts, I often criticized the homemade malware "tests" (YouTube tests), pointing out that the results obtained are not statistically significant and the testing methodology is incorrect. In this post, I am going to suggest a test outline that removes some important issues.
Short description.
This kind of test is intended for fresh malware samples. It is a kind of competition between a selected AV called AV4 and a collective AV123 (a collection of three top AVs).
On each malware sample, the AV4 can win, lose, and draw with a collective AV123.
AV4 loses whenever it is bypassed by the malware sample and all three top AVs protect against that sample (Collective AV123 Pass).
AV4 wins whenever it protects against the malware sample and at least one of the top three AVs is bypassed by that sample (Collective AV123 Failure).
In other cases, we have a draw between AV4 and AV123.
Test details.
Each partial test should be done against AV123, and AV4 in two hours. The VM images must be saved for each of the four AVs.
To save time, the analysis of possible infections must be done after testing AV1, AV2, AV3, and AV4.
The partial tests with 25 samples can be done with a few days brake.
Example of partial test (25 samples, passed mean that the sample was blocked/detected):
1-10. All AVs passed <--- 10 draws
11. AV1 passed, AV2 passed, AV3 passed (Collective AV123 Pass) ; AV4 failed <---- losses = 1
12-15. All AVs passed <--- 4 draws
16. AV1 passed, AV2 failed, AV3 passed (Collective AV123 Failure) ; AV4 failed <--- draw
17. All AVs passed <--- 1 draw
18. AV1 failed, AV2 failed, AV3 passed (Collective AV123 Failure) ; AV4 passed <---- wins = 1
19. AV1 passed, AV2 failed, AV3 passed (Collective AV123 Failure) ; AV4 passed <---- wins = 1 + 1
20-24. All AVs passed <--- 1 draw
25. AV1 passed, AV2 passed, AV3 passed (Collective AV123 Pass) ; AV4 failed <---- losses = 1 + 1
End of partial test. The result is inconclusive because it is not true that wins < losses. The test must be continued with another pule of 25 samples.
More examples:
https://malwaretips.com/threads/homemade-av-testing-a-suggestion.134815/post-1117642
https://malwaretips.com/threads/homemade-av-testing-a-suggestion.134815/post-1117528
https://malwaretips.com/threads/homemade-av-testing-a-suggestion.134815/post-1117531
https://malwaretips.com/threads/homemade-av-testing-a-suggestion.134815/post-1117613
Post updated / redesigned / shortened 14.02.2025
Simplified and standardized version can be found here:

App Review - Homemade AV testing (a suggestion).
Homemade AV testing (a suggestion). Post updated / redesigned / shortened 14.02.2025 In my posts, I often criticized the homemade malware "tests" (YouTube tests), pointing out that the results obtained are not statistically significant and the testing methodology is incorrect. In this post...

In my posts, I often criticized the homemade malware "tests" (YouTube tests), pointing out that the results obtained are not statistically significant and the testing methodology is incorrect. In this post, I am going to suggest a test outline that removes some important issues.
Short description.
This kind of test is intended for fresh malware samples. It is a kind of competition between a selected AV called AV4 and a collective AV123 (a collection of three top AVs).
On each malware sample, the AV4 can win, lose, and draw with a collective AV123.
AV4 loses whenever it is bypassed by the malware sample and all three top AVs protect against that sample (Collective AV123 Pass).
AV4 wins whenever it protects against the malware sample and at least one of the top three AVs is bypassed by that sample (Collective AV123 Failure).
In other cases, we have a draw between AV4 and AV123.
Test details.
- Take three top AVs (AV1, AV2, AV3 ---> collective AV123), and next the AV4 which is probably not a top AV (for example Norton + Kaspersky + Bitdefender as collective AV123, and additionally Microsoft Defender as AV4).
- Take 25 fresh malware samples to perform a partial test.
- Test those samples against AV123 and AV4.
- Count the number of the AV4 wins and losses.
- End the test if the condition [ wins < losses ] is fulfilled. This will prove with high confidence that AV4 is not a top AV.
- If not [ wins < losses ], perform another partial test with a new pule of 25 samples, as in points 2-5. Do not reset the numbers of wins and losses. Those numbers should reflect wins and losses of all partial tests.
- If still the condition [ wins < losses ] does not apply, continue partial tests but end the full test when [ wins < losses ] or 16 tests are done.
- If the condition [ wins < losses] is not fulfilled after 16 partial tests, the AV4 is most probably a top AV (or close to top AVs).
Each partial test should be done against AV123, and AV4 in two hours. The VM images must be saved for each of the four AVs.
To save time, the analysis of possible infections must be done after testing AV1, AV2, AV3, and AV4.
The partial tests with 25 samples can be done with a few days brake.
Example of partial test (25 samples, passed mean that the sample was blocked/detected):
1-10. All AVs passed <--- 10 draws
11. AV1 passed, AV2 passed, AV3 passed (Collective AV123 Pass) ; AV4 failed <---- losses = 1
12-15. All AVs passed <--- 4 draws
16. AV1 passed, AV2 failed, AV3 passed (Collective AV123 Failure) ; AV4 failed <--- draw
17. All AVs passed <--- 1 draw
18. AV1 failed, AV2 failed, AV3 passed (Collective AV123 Failure) ; AV4 passed <---- wins = 1
19. AV1 passed, AV2 failed, AV3 passed (Collective AV123 Failure) ; AV4 passed <---- wins = 1 + 1
20-24. All AVs passed <--- 1 draw
25. AV1 passed, AV2 passed, AV3 passed (Collective AV123 Pass) ; AV4 failed <---- losses = 1 + 1
End of partial test. The result is inconclusive because it is not true that wins < losses. The test must be continued with another pule of 25 samples.
More examples:
https://malwaretips.com/threads/homemade-av-testing-a-suggestion.134815/post-1117642
https://malwaretips.com/threads/homemade-av-testing-a-suggestion.134815/post-1117528
https://malwaretips.com/threads/homemade-av-testing-a-suggestion.134815/post-1117531
https://malwaretips.com/threads/homemade-av-testing-a-suggestion.134815/post-1117613
Last edited: