- removal of userinfo - dots and slashes in path and hostname - spaces succeeding and preceding the URL - common session id variables and their values - ip v6 canonicalization Useful links: https://developers.google.com/safe-browsing/v4/urls-hashing https://github.com/iipc/urlcanon/blob/master/python/urlcanon/canon.py#L530