Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handle units of literal constants by means of rules for the empty unit #3257

Open
wants to merge 26 commits into
base: MCP/0027
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
26 commits
Select commit Hold shift + click to select a range
7524b11
Initial commit of ReadMe.md for MCP-0027
henrikt-ma Oct 4, 2022
1309d10
Handle units of literal constants by means of rules for the empty unit
henrikt-ma Oct 4, 2022
2dfb4e8
Clarify that all operands of additive operators need to have the same…
henrikt-ma Oct 5, 2022
0c62f41
Clarify "component references outside functions"
henrikt-ma Oct 5, 2022
203c16e
Fix small mistake in example
henrikt-ma Oct 5, 2022
5d42065
Change "common sense" -> "(standard) dimensional analysis"
henrikt-ma Oct 5, 2022
fc4abde
Add section with examples of "standard dimensional analysis"
henrikt-ma Oct 5, 2022
b86567b
Fix minor typo
henrikt-ma Oct 5, 2022
1f5131c
Put quotes around 'standard dimensional analysis'
henrikt-ma Oct 5, 2022
3291a73
Say "multiplying the operands' units"
henrikt-ma Oct 5, 2022
f639afc
Make use of terminology 'dimensional homogenity'
henrikt-ma Oct 5, 2022
3a3b1f0
Don't present lists as "Examples of...", but add "other" item at the end
henrikt-ma Oct 26, 2022
e411347
Say "reject or approve"
henrikt-ma Oct 26, 2022
f3755a2
Add missing "on"
henrikt-ma Oct 26, 2022
fccda32
Reformulate "In some cases"
henrikt-ma Oct 26, 2022
89092ba
Change 'can be' -> 'is'
henrikt-ma Oct 26, 2022
82bb062
Fix typo in comment
henrikt-ma Oct 26, 2022
99ff3c9
Describe unit propagation and elaborate bottom-up nature of unit deri…
henrikt-ma Oct 26, 2022
13529f7
Also describe unit mappings of transcendental functions to match othe…
henrikt-ma Oct 31, 2022
847e645
Change 'are expected to' -> 'should'
henrikt-ma Nov 1, 2022
ebfc229
Add example illustrating bottom-up nature of unit inference
henrikt-ma Nov 1, 2022
232c4fa
Make stronger statement about when unit inference takes place
henrikt-ma Nov 1, 2022
9aeb15f
Change example on undefined unit to user-defined function instead of …
henrikt-ma Nov 2, 2022
a28b795
Simplify proposal by not omitting rules concerning non-empty units
henrikt-ma Nov 2, 2022
089f554
Move 'Otherwise...' into item list
henrikt-ma Nov 7, 2022
35e4f01
Rule out unit inference for components with empty unit-attribute
henrikt-ma Nov 10, 2022
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
34 changes: 34 additions & 0 deletions RationaleMCP/0027/ReadMe.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,34 @@
# Modelica Change Proposal MCP-0027<br/>Units of Literal Constants
Francesco Casella, Henrik Tidefelt

**(In Development)**

## Summary
The purpose of this MCP is to allow more unit errors to be detected by giving more expressions the unit `"1"` instead of having an undefined unit.
The problem with undefined unit is that it gets in the way of carrying out checking of units (which tools tend to deal with by not checking units at all where this happens).

## Revisions
| Date | Description |
| --- | --- |
| 2022-10-04 | Henrik Tidefelt. Filling this document with initial content. |

## Contributor License Agreement
All authors of this MCP or their organizations have signed the "Modelica Contributor License Agreement".

## Rationale
FIXME

## Backwards Compatibility
As current Modelica doesn't clearly reject some models with non-sensical combination of units, this MCP will break backwards compatibility by turning at least some of these invalid.

## Tool Implementation
None, so far.

### Experience with Prototype
N/A

## Required Patents
To the best of our knowledge, there are no patents that would conflict with the incorporation of this MCP.

## References
(None.)
264 changes: 264 additions & 0 deletions chapters/unitexpressions.tex
Original file line number Diff line number Diff line change
Expand Up @@ -89,3 +89,267 @@ \section{The Syntax of Unit Expressions}\label{the-syntax-of-unit-expressions}

The unit expression \lstinline!"T"! means tesla, but note that the letter \lstinline!T! is also the symbol for the prefix tera which has a multiplier value of 10\textsuperscript{12}.
\end{example}


\section{Unit Checking}\label{unit-checking}

How to verify that units are used in compatible ways is current not fully determined by the specification.
This section gives rules for certain situations, but in general tools should reason according to standard dimensional analysis.


\subsection{Standard Dimensional Analysis}\label{standard-dimensional-analysis}

This section gives an incomplete characterization of ``standard dimensional analysis'', also referred to as just \firstuse{dimensional analysis}.
What is described below applies to unit checking in Modelica -- \emph{dimensional analysis} could have additional meanings in other contexts.
While Modelica has a concept of \willintroduce{empty units} (described in later sections), standard dimensional analysis only deals with non-empty units such as \lstinline!"m"!, \lstinline!"m/s"!, or \lstinline!"1"!.
It consists of two parts:
\begin{itemize}
\item
Unit compatibility requirements.
\item
Rules for deriving the unit of an expression.
\end{itemize}

Unit compatibility requirements that must be met in Modelica:
\begin{itemize}
\item
Dimensional homogeneity: The two sides of an equation or assignment statement must have the same unit.
\item
The expression of a binding equation must have the same unit as the component to which it belongs (special case of dimensional homogeneity).
\item
The two operands of the additive operators \lstinline!+! and \lstinline!-! must have the same unit.
\item
The two connectors in a \lstinline!connect!-equation must agree on all units inside the connectors (follows from dimensional homogeneity and additive operator rule).
\item
In a function call, the unit of an argument expression must match the unit of the corresponding function input component.
\item
Other situations where unit compatibility might seem natural are currently not covered by the specification, but could become additional unit compatibility requirements in the future.
\end{itemize}
The requirements above apply to non-empty as well as empty units, but both \willintroduce{unit propagation} and \willintroduce{unit inference} (described below) need to be considered when verifying the requirements.

% TODO: Replace these examples by giving unit semantics where operators and built-in functions are defined in the specification,
% and just include some links to places where such semantics are given.
Unit derivation in Modelica:
\begin{itemize}
\item
The result of additive operators has the same unit as the operands.
\item
The result of multiplication has a unit obtained by multiplying the operands' units.
\item
Built-in operators such as \lstinline!pre! and \lstinline!previous! preserve units, while \lstinline!der! changes the unit by dividing it by \lstinline!"s"!.
\item
Other expressions are not yet covered by the specification.
\end{itemize}


\subsection{Empty and Undefined Units}\label{empty-and-undefined-units}

In situations where the specification does not prescribe how to determine the unit of an expression, the unit of that expression is said to be \firstuse[undefined unit]{undefined}.
It is then not possible for a tool to reject or approve the equation (or other construct) with support in the specification, and options for the tool include silently not performing unit checking, or applying checks based on dimensional analysis.

The unit of an expression can also be defined as being \firstuse[empty unit]{empty}, see below.
In certain places (see below), expressions with empty unit can be implicitly cast to suitable units.
Copy link

@bilderbuchi bilderbuchi Oct 18, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not a friend of "soft" qualifiers like "certain", "some of", etc. in specification texts. I think it would be better to enumerate all the specific places/sections/cases you mean to include in this statement. Include a "fall-through" case handling the situation if no element of the enumeration fits, if appropriate.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The paragraph is part of the section introduction, so it isn't meant to convey the details. I agree that later, when the actual rule is given, one has to avoid formulations that risk causing unnecessary ambiguity.

Does it work better now, with the reformulated presentation of the corresponding list of cases? (I don't think this list needs a fall-through case, as it should always be possible to add more cases in a backwards compatible way.)

Copy link

@bilderbuchi bilderbuchi Oct 28, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The paragraph is part of the section introduction, so it isn't meant to convey the details.

Should it then be marked non-normative? Otherwise, it's normative, right? (I'm not too firm on the procedural details of this)

reformulated presentation of the corresponding list of cases?

I can't tell which list you mean here?

I don't think this list needs a fall-through case, as it should always be possible to add more cases in a backwards compatible way.)

This is not a contradiction! You can first specify (random example: operators covered by some procedure):

  • Addition: do A
  • Subtraction: do B
  • All others: undefined

and then later refine by adding more cases

  • Addition: do A
  • Subtraction: do B
  • Multiplication: do C
  • Division: do D
  • All others: undefined

The set of the fallthrough/uncovered cases shrinks by adding more cases, but it's always crystal clear which ones are "undefined" (first: everything except +,-, then everything except +,-,*,/)

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The paragraph is part of the section introduction, so it isn't meant to convey the details.

Should it then be marked non-normative? Otherwise, it's normative, right? (I'm not too firm on the procedural details of this)

I'm not sure. It's a matter of style where we have at least decided to not explicitly mark chapter introductions as non-normative even though their nature is non-normative. In the case at hand, I'd say that the (see below) should be sufficient to ensure the reader that there's a specific meaning of certain places to be found by looking at the sections below.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

reformulated presentation of the corresponding list of cases?

I can't tell which list you mean here?

I refer to formulations like this one:

When encountering the empty unit in the following situations, the inferred unit is (uniquely) determined by ensuring that unit compatibility requirements are fulfilled:

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think this list needs a fall-through case, as it should always be possible to add more cases in a backwards compatible way.)

This is not a contradiction! You can first specify (random example: operators covered by some procedure):

The nice thing about this list (When encountering the empty unit in…) is that it doesn't leave any situation undefined. In the other cases, the unit isn't determined by the context, but always becomes "1" as (now) described directly below the list.

When an expression with empty unit is implicitly cast to some unit, that unit is referred to as the \firstuse{inferred unit} of the expression.


\subsection{Unit Propagation}\label{unit-propagation}

The main work of unit checking is performed on the flattened model, with the exception that \lstinline!connect!-equations need to be considered for \firstuse{unit propagation}.

Unit propagation is the act of replacing the empty unit of an instantiated component by some other unit in order to fulfill some of the most obvious unit compatibility requirements.
It is the first step of the unit checking procedure, being independent of both unit derivation and unit inference.
Unit propagation takes place in the following situations:
\begin{itemize}
\item
Simple binding equations (\cref{equation-categories}):
When a component is declared with empty \lstinline!unit!-attribute and has a binding equation with just a component reference on the right-hand side, the unit of the right-hand side replaces the empty unit.
\item
\lstinline!connect!-equations:
Units can be propagated in both directions of a \lstinline!connect!-equation.
\end{itemize}

Notation: The \emph{\lstinline!unit!-attribute of a component} refers to the attribute as given in the model.
The \emph{unit of a component} refers to the component's unit after unit propagation has been carried out.

\begin{nonnormative}
The reason to not propagate units of non-simple binding equations is to avoid bootstrapping problems where unit propagation depends on unit derivation, and unit derivation depends on unit propagation.
If the restrictions on unit propagation would be relaxed in the future, this would be a backwards compatible change as it only means that there would be less need to write out units explicitly.
\end{nonnormative}

\begin{nonnormative}
For unit propagation in \lstinline!connect!-equations, it is recommended to perform the propagation on the connection sets to avoid diagnostics involving two connectors where neither is declared with a non-empty unit.
By considering a connection set, a diagnostic message can omit all the connectors where the unit is empty, and only report examples of connectors in the set with mismatched non-empty units.
\end{nonnormative}

\begin{example}
The following illustrates unit propagation and its limitations:
\begin{lstlisting}[language=modelica]
Real x(unit = "m") = 1.0; // OK: No unit propagation.
Real y = x; // OK: Unit propagation assigns y the unit "m".
Real z = y; // OK: Unit propagation assigns z the unit "m".
Real w = 2 * z; // Error: No unit propagation as 2 * z isn't simple.
\end{lstlisting}
(The rules making the binding equation for \lstinline!x! OK will be given in the sections below.)
\end{example}


\subsection{Bottom-Up Unit Derivation}\label{bottom-up-unit-derivation}

After completed unit propagation, the unit of every expression shall be determined in order to be able to verify unit compatibility requirements.
The derivation is a bottom-up analysis of the expression tree, involving expression-specific rules and a simple form of unit inference.
Separate rules assign units to all expression tree leaves (such as variables and literals).
For a general non-leaf expression $\mathit{op}(e_{1},\, e_{2},\, \ldots, e_{n})$ where $\mathit{op}$ symolizes the kind of expression and the $e_{i}$ represent the immediate children in the expression tree, unit derivation follows these steps:
\begin{enumerate}
\item
Derive the unit of each subexpression $e_{i}$.
The so obtained unit of the expression might be empty as well as non-empty.
\item
If there is a unit derivation rule for $\mathit{op}$ matching the units of the $e_{i}$, apply that rule.
Note that some expressions can handle subexpressions with empty unit, often assigning the empty unit to the entire $\mathit{op}$-expression as well.
\item
Otherwise:
\begin{enumerate}
\def\labelenumii{\alph{enumii}.}
\item
Infer a non-empty unit for each $e_{i}$ that has empty unit, and let $e'_{i}$ denote the subexpressions after unit inference.
How the unit is inferred depends on the kind of expression, and is described in \cref{unit-inference}.
\item
If there is a unit derivation rule for $\mathit{op}(e'_{1},\, e'_{2},\, \ldots, e'_{n})$, apply that rule.
\item
Otherwise, the $\mathit{op}$-expression has a unit error.
\end{enumerate}
\end{enumerate}


\subsection{Unit Inference}\label{unit-inference}

In Modelica unit checking, \firstuse{unit inference} refers to the implicit casting of an expression with empty unit to a corresponding expression with non-empty unit.
An expression having empty unit always gets an inferred non-empty unit when appearing in a context where the empty unit is not allowed.

When encountering the empty unit in the following situations, the inferred unit is (uniquely) determined by ensuring that unit compatibility requirements are fulfilled:
\begin{itemize}
\item
In binding equations and modifications:
\begin{itemize}
\item The entire expression of the binding or modification, provided that the left-hand side component has non-empty \lstinline!unit!-attribute.
\item When the entire expression is an array construction, array concatenation and array range, then apply rules recursively for the direct subexpressions.
\end{itemize}
\item
The entire argument expression in a function call.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Comparing it to #3266 (comment) it seems that it has the same critical flaw.

In general, my issue with some proposals is that there are way too much text and theory, but too little actual practice.

I can see that the other proposal - changed to not apply to addition and subtraction - does detect actual modeling issues; but also that it finds a number of issues that aren't that important and will take time to correct. I don't know if people are actually willing to put in the effort to correct - and I don't know the corresponding result for this proposal.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In general, my issue with some proposals is that there are way too much text and theory, but too little actual practice.

I think it needs to be seen in the light of me looking for a solid ground on which to implement unit checking in System Modeler, where the specification at least marks a starting point for something that can evolve into a set of rules that make unit consistency a model feature that is portable between tools.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In general, my issue with some proposals is that there are way too much text and theory, but too little actual practice.

Can you explain what you mean by that, because to me this proposal is very clear?

Copy link
Collaborator

@qlambert-pro qlambert-pro Nov 4, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I can see that the other proposal - changed to not apply to addition and subtraction - does detect actual modeling issues; but also that it finds a number of issues that aren't that important and will take time to correct. I don't know if people are actually willing to put in the effort to correct - and I don't know the corresponding result for this proposal.

You are just describing the properties of a sound logical system, as opposed to a complete one.
At the end of the day, if MA wants the models produced with modelica to be portable it will have to opine on where unit checking should fall on the sound to complete spectrum. And most people, expects a sounder system when we talk about type checking. This will obviously imply a lack of backward compatibility, but then again the spec does not currently support the rejection of models relying on invalid unit use anyway.

% To keep the design close to the bare minimum, this part is currently excluded:
%\item
% For a translation-time constant with value 0.0:
% \begin{itemize}
% \item Expressions constituting the entire side of an equality or relation.
% \item The right hand side of an assignment.
% \item The direct subexpressions of array construction, array concatenation, and array range.
% \end{itemize}
\item
Otherwise, the inferred unit is \lstinline!"1"!.
\end{itemize}

\begin{example}
Consider unit inference in the binding equation below:
\begin{lstlisting}[language=modelica]
Real y(unit = "m") = 1.5;
\end{lstlisting}
With the pseudo-code form \lstinline!unit($e$, $u$)! representing the expression $e$ having empty unit being cast to unit $u$, the binding equation after unit inference could be expressed explicitly as:
\begin{lstlisting}[language=modelica]
Real y(unit = "m") = unit(1.5, "m"); // Using pseudo-code operator 'unit'.
\end{lstlisting}
Note that unit inference has not changed the empty unit of \lstinline!1.5! itself, but that it has introduced an implicit unit cast \emph{around} \lstinline!1.5! in order to fulfill the unit compatibility requirement.
\end{example}

\begin{example}
Consider the following unit error:
\begin{lstlisting}[language=modelica]
RealInput u = 1.5;
RealOutput y(unit = "m");
equation
connect(u, y);
\end{lstlisting}
Here, the \lstinline!unit!-attribute of \lstinline!u! is empty, but the unit of \lstinline!u! is \lstinline!"m"! after unit propagation.
It follows that the binding equation for \lstinline!u! has a unit error since the inferred unit of \lstinline!1.5! is \lstinline!"1"!.
Note that relaxing the unit inference rules so that \lstinline!1.5! would get inferred unit based on the unit (instead of the \lstinline!unit!-attribute) of \lstinline!u! would be dangerous, as the meaning of \lstinline!1.5! could then change due to unit changes in remote parts of the model.
\end{example}


\subsection{Expressions with Empty Unit}\label{expressions-with-empty-unit}

This section describes conditions under which an expression has empty unit.
Conditions not given here must not be interpreted as definitely not implying empty unit; instead, the unit may be currently undefined for some expressions, allowing the unit to be properly defined in future versions of the specification.

The following expressions are the only ones having empty unit:
\begin{itemize}
\item
\lstinline!Real! literals.
\item
\lstinline!Real! component references where the component's unit (after unit propagation has been carried out) is empty (possibly by not being specified).
\item
\lstinline!Integer! expressions implicitly cast to \lstinline!Real!.
\item
The following compound expressions result in the empty unit if and only if all operands have empty unit:
\begin{itemize}
\item \lstinline!Real!-valued numeric operator expressions.
\item Function calls to \lstinline!Real!-valued built-in functions (see \cref{built-in-mathematical-functions-and-external-built-in-functions}, \cref{modelica:der}, \cref{modelica:pre}, \cref{modelica:abs}, \cref{modelica:sqrt}, \cref{modelica:floor}, etc.).
\end{itemize}
If one of the operands has non-empty unit, all operands must have non-empty unit, meaning that operands with empty unit get inferred unit \lstinline!"1"!.
\item
Function calls to \lstinline!Real!-valued user-defined functions where:
\begin{itemize}
\item All \lstinline!Real! function inputs and outputs have empty declared unit (possibly by not being specified).
\item All \lstinline!Real! argument expressions have empty unit.
\end{itemize}
\end{itemize}

\begin{example}
Consider unit checking of the following binding equation:
\begin{lstlisting}[language=modelica]
Real y(unit = "m") = 1 + 2.5 * 3;
\end{lstlisting}
The unit of the binding equation right-hand side is determined as follows:
\begin{itemize}
\item The \lstinline!Real! literal \lstinline!2.5! has empty unit.
\item The \lstinline!Integer! literals \lstinline!1! and \lstinline!3! are implicitly cast to \lstinline!Real!, and therefore have empty unit.
\item The multiplication \lstinline!2.5 * 3! has empty unit, as both multiplication operands have empty unit.
\item The addition \lstinline!1 + (2.5 * 3)! has empty unit, as both addition operands have empty unit.
\item The entire right-hand side expression gets inferred unit \lstinline!"m"! in order to be compatible with the component's declared \lstinline!unit!-attribute.
\end{itemize}
\end{example}

\begin{example}
Consider unit checking of the following erroneous binding equation for \lstinline!y!:
\begin{lstlisting}[language=modelica]
Real x(unit = "m") = 1.0;
Real y(unit = "m") = (x * x) / 2;
\end{lstlisting}
The unit of the binding equation right-hand side is determined as follows:
\begin{itemize}
\item The unit of \lstinline!x! is \lstinline!"m"!, and dimensional analysis gives that \lstinline!x * x! has unit \lstinline!"m2"!
\item The \lstinline!Integer! literal \lstinline!2! is implicitly cast to \lstinline!Real!, and therefore has empty unit.
\item As the left operand of \lstinline!(x * x) / 2! has non-empty unit, the right operand is implicitly cast to unit \lstinline!"1"!.
\item Dimensional analysis then gives that the unit of \lstinline!(x * x) / 2! is \lstinline!"m2"!, which is an error due to the required unit being \lstinline!"m"!.
\end{itemize}
\end{example}

\begin{example}
Consider the potential consequences of an undefined unit in the following binding equation for \lstinline!y!:
\begin{lstlisting}[language=modelica]
function f
input Real u;
output Real y = u;
end f;
Real x(unit = "m2") = 1.57;
Real y(unit = "m") = f(x);
\end{lstlisting}
To see that the unit of the binding equation right-hand side is undefined, note that the unit of the function call \lstinline!f(x)! is not covered by the specification, and hence has undefined unit.
If a tool wants to proceed according to ``standard dimensional analysis'', alternatives include:
\begin{itemize}
\item
Ingore the unit of the input passed to \lstinline!f!, and assume that the call expression always gets the declared unit of the output, namely the empty unit.
As the unit of \lstinline!f(x)! is assumed empty, it gets inferred unit \lstinline!"m"!.
\item
Analyze \lstinline!f! and conclude that it preserves any unit.
The unit of \lstinline!f(x)! then becomes \lstinline!"m2"!, which is a unit error for the binding equation.
\end{itemize}
\end{example}