For some models, such as Terramind, some tensor dtypes are statically set to `torch.float`, which is 32-bit.
An example is in the Terramind backbone, where the positional embeddings are initialized (`terratorch/terratorch/models/backbones/terramind/model/tm_utils.py`, lines 60 to 62 at `986f0a7`):

```python
pos_dim = embed_dim // 4

omega = torch.arange(pos_dim, dtype=torch.float) / pos_dim  # Shape (D/4,)

omega = 1.0 / (temperature**omega)
```
However, it is common for models to be served in float16 or bfloat16 to reduce GPU memory usage and increase inference throughput. This is the case with vLLM, which by default downcasts everything to float16. Loading Terramind with vLLM fails precisely because of this dtype mismatch (float16 vs. float32).
I suggest using `torch.get_default_dtype()` in place of `torch.float` to guarantee that all tensors share the same dtype.
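Applied to the snippet above, the change would look roughly like this. This is a sketch, not a patch against the actual file: the wrapper function `build_omega` and its signature are hypothetical, introduced only to make the fragment self-contained; the three lines inside follow the original snippet.

```python
import torch

def build_omega(embed_dim: int, temperature: float = 10000.0) -> torch.Tensor:
    """Frequency vector for sin-cos positional embeddings.

    Hypothetical wrapper around the original three lines; the only real
    change is replacing the hard-coded torch.float with the globally
    configured default dtype.
    """
    pos_dim = embed_dim // 4
    # torch.get_default_dtype() returns float16/bfloat16 when the caller
    # (e.g. a serving stack such as vLLM) has changed the default dtype,
    # so downstream tensors all agree instead of mixing with float32.
    omega = torch.arange(pos_dim, dtype=torch.get_default_dtype()) / pos_dim  # Shape (D/4,)
    omega = 1.0 / (temperature ** omega)
    return omega
```

With `torch.set_default_dtype(torch.float16)` in effect, the returned tensor is float16 instead of float32, so the embedding no longer clashes with a model that was downcast for serving.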