MITRE ATT&CK Evaluation 2023 Results Collation/Subjective Ranking

ShenguiTurmi · Sep 27, 2023

Recently, they released their 2023 test, which mimics the attacks of Turla, a Russian country-level APT organization, and is used to examine the detection capabilities of EDR products and the protection capabilities of EPP products. Since they never provide official rankings and comparisons, I've organized and simplified the test results and then provided them in table form for quick reading.

As per MITRE's requirements, they do not do or endorse any third-party-provided report interpretations, so I've only worked to keep the test results as simple to read as possible while keeping the reading difficulty as low as possible by consolidating the test results according to the full individual product reports. As such, I will not say that a particular security software performs better in a particular area, but simply mirror the overall performance in their tests.

This year's test was divided into a total of three categories (Carbon, Snake, and Protection), with 31 security software programs participating, including Cisco and Checkpoint, which withdrew early due to compatibility issues during the test, so the results of 29 security software programs are reflected in the final report.

In the two sections on detection capabilities, I've followed the methodology of their official once-simplified version of the report, and divided the testing simply into Analytic Coverage and Visibility, where Analytic Coverage means that the security software produced technical or tactical detections that mapped to the definitions in ATT&CK or general (but analytically valuable) detections that are not mapped to ATT&CK. Visibility builds on Analytic Coverage but include behavioral logs generated by the security software that cannot be used for separate analysis.

Part I - Carbon (detection)
Scenario:
This scenario follows Turla’s multi-phase approach to implant a watering hole for persistence on a victim’s network as a way to compromise more victims of interest. Turla gains initial access through a spearphishing email, a fake software installer is downloaded onto the victim machine and execution of the EPIC payload takes place. Once persistence and C2 communications are established, a domain controller is discovered, and CARBON-DLL is ingress into victim network. Further lateral movement takes the attackers to a Linux Apache server, PENQUIN is copied to the server and used to install a watering hole.
Result:

Annotation:
Products marked in blue represent the software's statement that certain sub-steps of the test are unsupported, e.g. there may be some security software that does not support the Linux operating system, and these unsupported sub-steps are excluded from the overall percentage.

Part II - Snake (detection)
Scenario:
This scenario continues Turla’s multi-phased, intelligence collection campaign, with the attackers establishing a typo-squatting website to target entities with high value information. Turla targets the victim with a drive-by compromise, Adobe Flash installer bundled with EPIC, which installs on the victim’s network. EPIC communicates to the C2 server via proxy web server with HTTPS requests, persists via process injection, and performs enumeration on the victim’s workstation. SNAKE is then deployed to maintain foothold, elevate privileges and communicates to the C2 via HTTP/SMTP/DNS. Finally, the attackers move laterally to install LightNeuron, enabling Turla to collect and exfiltrate sensitive communications to further mission objectives.
Result:

Annotation:
Products marked in blue represent the software's statement that certain sub-steps of the test are unsupported, e.g. there may be some security software that does not support the Linux operating system, and these unsupported sub-steps are excluded from the overall percentage.

Part III - Protection
Scenario:
Protections reflect the spirit of an assumed breach and defense in depth, exploring a range of ATT&CK behaviors and preventions/remediation procedures that address them. When an adversary activity is blocked, protections enables product owners to explore the next activity. We explore this by developing a clean testing range, executing the adversary emulation plan that is broken down into separate test scenarios, and conducting detection analysis to determine when and if the variant would fail.
Result:

Annotation:
The blue labeled N/A means that the software does not support protection for Linux systems (it may have scanning capability, but no real-time protection, and there is no point in continuing the test). 6 software did not participate in the protection test (may be pure EDR or software vendors do not want to participate), all of their sub-projects are recorded as N/A. The number after √ represents the interception of the sub-steps, can not be categorically said that the earliest interception of the stronger, but can reflect it is more biased towards the binary or behavioral interception.

ShenguiTurmi · Sep 27, 2023

@[correlate]
Cortex is not only 100% Protection and Detection.

Unless my collation is wrong, at least Crowdstrike has this "only" as well.

Shadowra · Sep 27, 2023

Very surprised at Eset

DeepInstinct, on the other hand, I'm quite happy with the results.

[correlate] · Sep 27, 2023

ShenguiTurmi said:
@[correlate]
Cortex is not only 100% Protection and Detection.
Unless my collation is wrong, at least Crowdstrike has this "only" as well.

Thank you Mr:

ShenguiTurmi

To clarify, Crowdstrike is indeed good at detection, but those programs based on detection and learning from malicious samples and the use of signatures do not guarantee effective protection, even if they get a full score of 100%.

ForgottenSeer 103564 · Sep 27, 2023

Assumptions could be squished with further information as most here im assuming are unfamiliar with EDR versions of products listed, panels, settings, default settings out of the box and testing methods set forth in the test as to per settings.

Same old issue different day when it comes to testing facilities. Just my couple cents worth.

Xeno1234 · Sep 27, 2023

Where’s Kaspersky?

ShenguiTurmi · Sep 27, 2023

Xeno1234 said:
Where’s Kaspersky?

They withdrew midway through the 2021 test and have not participated in a test organized by MITRE since.

Jonny Quest · Sep 27, 2023

Ultimate Vision said:
Assumptions could be squished with further information as most here im assuming are unfamiliar with EDR versions of products listed, panels, settings, default settings out of the box and testing methods set forth in the test as to per settings.

Same old issue different day when it comes to testing facilities. Just my couple cents worth.

I'd be one of those. But I was curious about how individual companies faired that I knew about, or that others had tested or were using on this forum.

Xeno1234 · Sep 27, 2023

ShenguiTurmi said:
They withdrew midway through the 2021 test and have not participated in a test organized by MITRE since.

Wonder why

ShenguiTurmi · Sep 27, 2023

Xeno1234 said:
Wonder why

idk

Search

MITRE ATT&CK Evaluation 2023 Results Collation/Subjective Ranking

ShenguiTurmi

Level 3

ShenguiTurmi

Level 3

Shadowra

Level 36

[correlate]

Level 18

ShenguiTurmi

ForgottenSeer 103564

Xeno1234

Level 14

ShenguiTurmi

Level 3

Jonny Quest

Level 21

Xeno1234

Level 14

ShenguiTurmi

Level 3

Similar threads

MITRE ATT&CK Evaluation 2023 Results Collation/Subjective Ranking

Level 3

Level 3

Level 36

Level 18

ShenguiTurmi​

ForgottenSeer 103564

Level 14

Level 3

Level 21

Level 14

Level 3

Similar threads

ShenguiTurmi