Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

safer, more foolproof hash: make hash return an opaque value, and take that type as the second argument #57055

Open
nsajko opened this issue Jan 15, 2025 · 1 comment
Labels
breaking This change will break code design Design of APIs or of the language itself hashing speculative Whether the change will be implemented is speculative

Comments

@nsajko
Copy link
Contributor

nsajko commented Jan 15, 2025

The hash doc string says:

only use the output of another hash function as the second argument

It'd be easy to enforce this with dispatch in v2, if hash returned an opaque type and took the same type as the second argument. If UInt isn't accepted as the second argument there's less potential for error and confusion.

Something like this:

"""
    HashValue

Constructed by [`hash`](@ref), opaque.
"""
struct HashValue
    v::UInt
    # there's no constructors and this function is not `public`
    global function new_hash_value(v::UInt)
        new(v)
    end
end

"""
    xor(::HashValue, ::HashValue)::HashValue

Combine two hash values in a symmetric manner.
"""
function xor(l::HashValue, r::HashValue)
    xor(l.v, r.v)
end

"""
    hash(::Any, ::HashValue)::HashValue

...
"""
function hash end

Currently a common practice that would break is seeding hash with UInt constants. This would have to be fixed by changing const seed = 0xdeadbeef to const seed = hash(0xdeadbeef). Perhaps a nullary method should be provided, turning this into const seed = hash()?

@nsajko nsajko added breaking This change will break code design Design of APIs or of the language itself hashing speculative Whether the change will be implemented is speculative labels Jan 15, 2025
@adienes
Copy link
Member

adienes commented Jan 22, 2025

Currently a common practice that would break is seeding hash with UInt constants

what are the problems with this? that's not rhetorical I'm genuinely unaware of such constraints

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
breaking This change will break code design Design of APIs or of the language itself hashing speculative Whether the change will be implemented is speculative
Projects
None yet
Development

No branches or pull requests

2 participants