Additional intrinsic optimizations?

At the moment there's the `llvm_intrinsically_optimized!` macro which, when using the unstable flag, will call an unstable LLVM intrinsic.

However, there's some opportunities for using intrinsics (edit: _hardware_ intrinsics) in _stable_, and even in core, if we wanted to reach for SSE / SSE2 / etc when available (compile time detected).

For example, `libm` defines [sqrt](https://github.com/rust-lang-nursery/libm/blob/master/src/math/sqrt.rs) with a full software implementation, but if people call it in `std` they get either (in debug) [the `sqrtss` instruction with some indirection in between](https://rust.godbolt.org/z/jnTj3b) or (in release) [the `sqrtss` instruction without any indirection](https://rust.godbolt.org/z/peZpOP). Based on this, I think it would be fine to have `libm` _also_ just use the `sqrtss` instruction when available.

Of course this should probably be behind its own feature flag, but I think it would be a reasonable progression to develop in this direction of using stable hardware intrinsics when possible.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Additional intrinsic optimizations? #214

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Additional intrinsic optimizations? #214

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions