The datasets shared here are either generated by us in a controlled environment or crawled from websites. In particular, they do not belong to any customers or private users.
The datasets shared here are either generated by us in a controlled environment or crawled from websites. In particular, they do not belong to any customers or private users.
This site has the labeled datasets used in the following work:
This site has the labeled datasets of webpage screenshots and logos as well as implementations of various baselines. Refer the following paper for more details:
This dataset consists of network traffic corresponding to botnet attacks used in the context of IoT attack detection:
This dataset was used in our work on building phishing detection system robust to evasion techniques:
Jehyun Lee, Pingxiao Ye, Ruofan Liu, Dinil Mon Divakaran, and Chan Mun Choon, “Building robust phishing detection system: an empirical analysis,” in NDSS MADWeb (Workshop on Measurements, Attacks, and Defenses for the Web), Feb. 2020.
This dataset was used for device fingerprinting in the following works:
Biswadeep Chakraborty, Dinil Mon Divakaran, Ido Nevat, Gareth W. Peters, and Mohan Gurusamy, “Cost-aware Feature Selection for IoT Device Classification,” IEEE Internet of Things Journal, 2021 [Dataset] [PDF].
Vijayanand Thangavelu, Dinil Mon Divakaran, Rishi Sairam, Suman Sankar Bhunia, and Mohan Gurusamy, “DEFT: A Distributed IoT Fingerprinting Technique,” IEEE Internet of Things Journal, vol. 6, no. 1, pp. 940–952, Feb 2019 [Dataset] [PDF].
Bharat Atul Desai, Dinil Mon Divakaran, Ido Nevat, Gareth W. Peters, and Mohan Gurusamy, “A feature-ranking framework for IoT device classification,” in 11th International Conference on Communication Systems & Networks (COMSNETS 2019), Jan. 2019 [PDF].