documentation for Enzyme Type Trees #2385

KMJ-007 · 2025-05-13T15:03:42Z

No description provided.

ZuseZ4 · 2025-05-14T12:00:29Z

src/autodiff/type-trees.md

+
+## What are Type Trees?
+
+Type trees in Enzyme are a way to represent the types of variables, including their activity (e.g., whether they are active, duplicated, or contain duplicated data) for automatic differentiation. They provide a structured way for Enzyme to understand how to handle different data types during the differentiation process.


Are there any docs or code which shows them including activity?

my bad i saw this and got confused they both are in the same, later when i was going through enzyme codebase i realised they are different, fixing this

ZuseZ4 · 2025-05-14T12:05:59Z

src/autodiff/type-trees.md

+
+Enzyme needs to understand the structure and properties of Rust types to perform automatic differentiation correctly. This is where type trees come in. They provide a detailed map of a type, including pointer indirections and the underlying concrete data types.
+
+The `-enzyme-rust-type` flag in Enzyme helps in interpreting types more accurately in the context of Rust's memory layout and type system.


The flag just tells enzyme to parse (rust) debug "dwarf" information.
A lot of type information is not encoded in such debug metadata, and the flag hasn't been re-evaluated or used in years. It's good to mention it here (with the corrected description), but I wouldn't mention it in the following sections. We should generate typetrees based on Rust types even without debug metadata. But how to use debug metadata is something we'll also discuss in one of the meetings with oli.

got this, the flag was there in most of test files, which were related to rust

ZuseZ4 · 2025-05-14T12:10:31Z

src/autodiff/type-trees.md

+
+Consider a Rust reference to a 32-bit floating-point number, `&f32`.
+
+In LLVM IR, this might be represented, for instance, as an `i8*` (a generic byte pointer) that is then `bitcast` to a `float*`. Consider the following LLVM IR function:


Ah, you took that from rustmutpointer.ll I guess? Unfortunately they are too outdated, I just realized.
Typed ptr were removed (see my comment about this flag being very outdated), so i8* isn't a thing anymore.
Instead, we now have ptr ("opaque pointers") in LLVM.

You can look for the PRs which introduced these tests a few years ago. They should have instructions on how to reproduce them, so you can re-generate newer tests once you have a working setup.

yes,

found the PR, this will be a great reference and understanding of how things were done

EnzymeAD/Enzyme#307

ZuseZ4 · 2025-05-14T12:20:31Z

src/autodiff/type-trees.md

+
+*   **`{ ... }`**: This encloses the set of type information for the variable.
+*   **`[-1]:Pointer`**:
+    *   `[-1]` is an index or path. In this context, `-1` often refers to the base memory location or the immediate value pointed to.


-1 has a slightly different meaning, something along the lines of everything accessible from here, without dereferencing a pointer.
So [f64;32] could be represented as [-1]:Float@double, or as [0]:Float@double, [8]:Float@double, ...
Afaik we usually prefer -1 in such cases since it's shorter, but IIRC there were some gotchas.
@wsmoses can you share the private youtube video with him (if you prefer in a zulip dm), such that he has more information on how to write these docs?

Or e.g. x: *const [f64;32] could be represented as
[[-1]:Pointer, [-1:-1]Float@double], or as [[0]:Pointer, [0:0]:Float@double, [0:8]:Float@double, ...
or as [[0]:Pointer, [0:-1]:Float@double or as [[-1]:Pointer, [-1:0]:Float@double, [-1:8]:Float@double, ...
(I think)

That also all is under the assumption that we won't access e.g. x[34] which is out of bounds of the original array.
I am not 100% sure if there are cases where this could be valid, I think with raw pointers it might be valid to access other elements. E.g. struct { x: [f64;32], y: i32 }. I you derive a raw pointer to x, you might be able to use it to access y (legally), in which case -1:Float@double would be wrong. I'd need a refresher on pointer provenance and other things, hopefully Oli will know more about it. You can add this as an open question at the end.

ZuseZ4 · 2025-05-26T18:25:57Z

ping @wsmoses can you share the private yt video with him, where you talked about TA and other Enzyme interals? Feel free to dm him on zulip, if you don't want to share it publicly. Just so he can better understand why and how to generate metadata.

wsmoses · 2025-06-14T16:15:00Z

I looked and didn't see any private youtube video.

KMJ-007 and others added 2 commits May 13, 2025 20:32

basic type docs for auto diff

733773c

Merge branch 'rust-lang:master' into master

fc47bce

KMJ-007 marked this pull request as draft May 13, 2025 15:03

ZuseZ4 reviewed May 14, 2025

View reviewed changes

ZuseZ4 marked this pull request as ready for review May 14, 2025 12:32

jieyouxu added S-waiting-on-author Status: this PR is waiting for additional action by the OP T-compiler Relevant to compiler team F-autodiff Feature: autodiff labels May 15, 2025

jieyouxu assigned ZuseZ4 May 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

documentation for Enzyme Type Trees #2385

documentation for Enzyme Type Trees #2385

Uh oh!

KMJ-007 commented May 13, 2025

Uh oh!

ZuseZ4 May 14, 2025

Uh oh!

KMJ-007 May 14, 2025

Uh oh!

ZuseZ4 May 14, 2025 •

edited

Loading

Uh oh!

KMJ-007 May 14, 2025

Uh oh!

ZuseZ4 May 14, 2025

Uh oh!

KMJ-007 May 14, 2025

Uh oh!

ZuseZ4 May 14, 2025 •

edited

Loading

Uh oh!

ZuseZ4 May 14, 2025

Uh oh!

ZuseZ4 commented May 26, 2025 •

edited

Loading

Uh oh!

wsmoses commented Jun 14, 2025

Uh oh!

Uh oh!


		## What are Type Trees?

		Type trees in Enzyme are a way to represent the types of variables, including their activity (e.g., whether they are active, duplicated, or contain duplicated data) for automatic differentiation. They provide a structured way for Enzyme to understand how to handle different data types during the differentiation process.


		Enzyme needs to understand the structure and properties of Rust types to perform automatic differentiation correctly. This is where type trees come in. They provide a detailed map of a type, including pointer indirections and the underlying concrete data types.

		The `-enzyme-rust-type` flag in Enzyme helps in interpreting types more accurately in the context of Rust's memory layout and type system.


		Consider a Rust reference to a 32-bit floating-point number, `&f32`.

		In LLVM IR, this might be represented, for instance, as an `i8` (a generic byte pointer) that is then `bitcast` to a `float`. Consider the following LLVM IR function:

documentation for Enzyme Type Trees #2385

Are you sure you want to change the base?

documentation for Enzyme Type Trees #2385

Uh oh!

Conversation

KMJ-007 commented May 13, 2025

Uh oh!

ZuseZ4 May 14, 2025

Choose a reason for hiding this comment

Uh oh!

KMJ-007 May 14, 2025

Choose a reason for hiding this comment

Uh oh!

ZuseZ4 May 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

KMJ-007 May 14, 2025

Choose a reason for hiding this comment

Uh oh!

ZuseZ4 May 14, 2025

Choose a reason for hiding this comment

Uh oh!

KMJ-007 May 14, 2025

Choose a reason for hiding this comment

Uh oh!

ZuseZ4 May 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ZuseZ4 May 14, 2025

Choose a reason for hiding this comment

Uh oh!

ZuseZ4 commented May 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wsmoses commented Jun 14, 2025

Uh oh!

Uh oh!

ZuseZ4 May 14, 2025 •

edited

Loading

ZuseZ4 May 14, 2025 •

edited

Loading

ZuseZ4 commented May 26, 2025 •

edited

Loading