This repository includes the artifacts from the ACM Web Conference 2025 (WWW'25) paper entitled "The Poorest Man in Babylon: A Longitudinal Study of Cryptocurrency Investment Scams" by Muhammad Muzammil, Abisheka Pitumpe, Xigao Li, Amir Rahmati, and Nick Nikiforakis (PDF).
data/
contains a JSON file of our dataset of cryptocurrency investment scam websites detected by Crimson.src/
contains the source code for each module in Crimsoncron.py
: This script manages the execution of the (certstream server) by continuously monitoring its status and restarting it if necessary.listen.py
: This script listens to a WebSocket server, processes incoming messages, and publishes them to a RabbitMQ queue.send.py
: This script listens for certificate update messages, filters domain names using a keyword-matching utility, and publishes the filtered domain names to a RabbitMQ queue. It maintains logging for domains that pass or fail the filtering process.recv.py
: This script processes URLs from a RabbitMQ queue and checks for potential cryptocurrency scam websites using Object Character Recognition (OCR).validate.py
: This script processes OCR-extracted text from a CSV file, interacts with a local LLM model via subprocess calls, and classifies text as scam-related or not.authentication_crawling/crawler_script.py
: This script is a web automation crawler that automates sign-up and login processes on various websites, and then further crawls the websites to look for cryptocurrency addresses and other IOCs.
If you use this work, please use this citation:
@inproceedings{muzammil2025crimson,
title = {{The Poorest Man in Babylon: A Longitudinal Study of Cryptocurrency Investment Scams}},
author = {Muhammad Muzammil and Abisheka Pitumpe and Xigao Li and Amir Rahmati and Nick Nikiforakis},
booktitle = {Proceedings of the Web Conference (WWW)},
year = {2025},
}