Hi everyone, I'm working on a cybersecurity project, and part of it involves comparing files to malware samples to determine if the scanned file is malware.
Therefore I need a database that contains malware hashes to perform this step.
If anyone knows of a site that contains something similar to what I require, I would appreciate it.
Some of these APIs are not free, subscriptions are required as threat intelligence, specially when curated and properly checked for FPs is not free.
Many AV vendors offer APIs as well.
The Bazaar API should be free but it is plagued with false positives.
You implement it via HTTP Post request like this:
In addition, the Sophos Sorel collection contains 20 million samples you can use to train ML models, be advised that you will also need a large number of safe files for false positives control.
Explore Sophos’ innovative AI technologies, combining deep learning, GenAI, and human expertise to deliver unmatched cyber threat protection, through the largest AI-native platform in the industry.
ai.sophos.com
I also found this, that contains more APIs, supposedly open source.
The MalShare Project is a community driven public malware repository that works to provide free access to malware samples and tooling to the infomation security community.
The MalShare Project is a community driven public malware repository that works to provide free access to malware samples and tooling to the infomation security community.
Some of these APIs are not free, subscriptions are required as threat intelligence, specially when curated and properly checked for FPs is not free.
Many AV vendors offer APIs as well.
The Bazaar API should be free but it is plagued with false positives.
You implement it via HTTP Post request like this:
In addition, the Sophos Sorel collection contains 20 million samples you can use to train ML models, be advised that you will also need a large number of safe files for false positives control.
Explore Sophos’ innovative AI technologies, combining deep learning, GenAI, and human expertise to deliver unmatched cyber threat protection, through the largest AI-native platform in the industry.
ai.sophos.com
I also found this, that contains more APIs, supposedly open source.
The Malware BaaZar website contains the information I am looking for. Thank you very much also for the documentation that you shared, it may be useful to me.
That's true. However, the hashes are still visible (and that's what the OP asked for if I am not mistaken).
Also, the VT links are visible as well, and most malware researchers can download malware samples directly from VT.
So VxVault can still be useful.