Version: Next

Constraints

A constraint is a spatial, temporal, or semantic relation between any two components. In the context of interpreting plexes as a multigraph of relationships over our system (comprised of different components), constraints are the edge-relationships of the graph.

Constraint Types

Conventions

From and To, To and From

Constraints are always across two components. MetriCal will often refer to each of these components in a directional sense using the "from" and "to" specifiers, which reference the UUIDs of the components.

This directional specifier is useful for spatial and temporal constraints, because it allows us to know how the extrinsics from the spatial constraints, or the synchronization data from the temporal constraints can be applied to data to put two observations into the same frame of reference.

Our extrinsics type is essentially a way to describe how to transform points in one coordinate system into another. Anyone who has ever worked with transforms has experienced confusion in convention. In order to cut through the ambiguity of extrinsics, every spatial constraint has a from and to field. Let's dive into how this works.

We can think of an extrinsics transform between components $A$ and $B$ using the following notation:

\Gamma_{B}^{A} \mathrel{:=} \Gamma_{from\ B}^{to\ A}

If we wanted to move a point $p$ from the frame of reference of component $B$ to that of $A$ , we would use the following math:

\vec{p}_{W}^{A} = \Gamma_{B}^{A} \cdot \vec{p}_{W}^{B}

...also read as " $p_A$ equals $p_B$ by transforming to $A$ from $B$ ".

Thus, when constructing a spatial constraint, the reference frame for the extrinsics transform is in the coordinate frame of component $A$ , and would move points from the coordinate frame of component $B$ .

Similar examples can be made for converting timestamps from e.g. $B$ into the same "clock" as $A$ using temporal constraints.

Coordinate Bases in MetriCal

It's common to represent a transform in a certain convention, such as FLU (X-Forward, Y-Left, Z-Up) or RDF (X-Right, Y-Down, Z-Forward). One might then wonder what the default coordinate system is for MetriCal. Short answer: it entirely depends on the data you're working with.

In MetriCal, spatial constraints are designed to transform observations (not components!) from the from frame to the to frame. In the case of a camera-LiDAR extrinsics transform $\Gamma_{L}^{C}$ , the solved extrinsics will move LiDAR observations $p_{L}$ (which may be in FLU) to a camera observations's coordinate frame (which may be in RDF):

p_{C} = \Gamma_{L}^{C} \cdot p_{L}

This makes it simple to move observations from one component to another for sensor fusion tasks. This is what is meant when MetriCal is said to have no "default" component coordinate system; it operates directly on the provided observations!

However, for those users that really need the extrinsics in a specific coordinate basis, MetriCal provides. There are two relevant global options that can be passed to any mode that produces or displays a calibrated plex (Calibrate, Shape, Pretty Print, and Evaluate). These are:

--topic-to-observation-basis (or -z): Assign a topic a specific coordinate basis for its observations, i.e. the data it produces. For cameras, this is usually RDF.
--topic-to-component-basis (or -Z): Assign a topic a specific coordinate basis for its component, i.e. the sensor itself.

If both of these options are set for every component, MetriCal will transform the extrinsics into the proper basis for you! Let's take a look at an example:

metrical calibrate -z ir_cam:RDF -z lidar:FLU -Z *:FLU $DATA $PLEX $OBJ

Here, we're telling MetriCal that the observations from the camera topic are in RDF, while the data from our LiDAR is in FLU. We're also telling MetriCal that we need the output of the entire plex to be in FLU, probably because our system expects that basis natively.

In this case, MetriCal will produce two plexes in the results JSON:

results.plex - The first plex is what is natively produced by the calibration process, with extrinsics in the basis of the observations.
results.changed_basis_plex - The second plex is the same as the first, but with all extrinsics transformed into the desired component bases.

Note that this second plex is for your use only; if you use this in Init or Calibrate mode, MetriCal will probably start a little confused. It needs results.plex for proper operation.

Spatial Constraints

It is common to ask for the spatial relationship or extrinsics between two given components. A Plex incorporates this information in the form of what is called spatial constraints. A spatial constraint can be broken down into:

Field	Type	Description
Extrinsics	An extrinsics object	The extrinsics describing the "To" from "From" transformation.
Covariance	A matrix of floats	The 6×6 covariance of the extrinsics described by this constraint.
From	UUID	The UUID of the component that describes the "From" or base coordinate frame.
To	UUID	The UUID of the component that describes the "To" coordinate frame, which we are transforming into. This can be considered the "origin" of the extrinsics matrix

Spatial Covariance

Spatial covariance is generally presented as a 6×6 matrix relating the variance-covariance of an se3 lie group:

\begin{bmatrix} v_1 & v_2 & v_3 & \omega_1 & \omega_2 &\omega_3 \end{bmatrix}

When traversing for spatial constraints within the Plex, the constraint returned will always contain the extrinsic with the minimum overall covariance. This ensures that users will always get the extrinsic that has the smallest covariance (thus, the highest confidence / precision), even if multiple spatial constraints exist between any two components.

Spatial Constraint Traversal - Python Example

Since all spatial constraints in a plex form an undirected (maybe even fully connected) graph, it can be confusing to figure out how to traverse that graph to find the "best" extrinsics between two components. MetriCal itself provides the Shape mode to help with this (with a few helpful options in Pretty Print), but sometimes it's useful to do your own thing.

To help would-be derivers, the MetriCal Sensor Calibration Utilities repository contains the spatial_constraint_traversal directory, which demonstrates the right way to derive optimal extrinsics straight from the Plex JSON via the magic of python.

Temporal Constraints

Time is a tricky thing in perception, but of crucial importance to get right. We've developed our temporal constraint to be flexible enough to describe many of the most common timing scenarios between components.

Field	Type	Description
Synchronization	A synchronization object	The strategy to achieve known synchronization between these two components in the Plex.
Resolution	float	The resolution to which synchronization should be applied.
From	UUID	The UUID of the component that the synchronization strategy must be applied to.
To	UUID	The UUID of the component whose clock we synchronize into by applying our synchronization strategy (to the `from` component).

The Problem With Clocks

In the world of hardware, measuring time can be a challenge. Two clocks might differ in several different ways; without taking these nuances into account, many higher-level perception tasks can fail.

Let's take the example below: two different clocks, possibly from two different hosts, that might be informing separate components in our plex.

Two clocks out of sync

Temporal constraints can balance these different clocks across a plex in order to make sure time confusion never occurs. It achieves this through Synchronization.

Synchronization

Synchronization describes the following relationship between two clocks:

C_{\text{to}} = (1e9 + \text{skew}) \cdot C_{\text{from}} + \text{offset}

Field	Type	Description
offset	Integer	The epoch offset between two clocks, in units of integer nanoseconds.
skew	Integer	The scale offset between two clocks. Unitless.

Offset

Unless two components are using the same clock, there's a chance that they are offset in time. This means that time t in one clock does not align with time t in the other. Fixing this is rather simple: just shift the time values in the from clock by the offset parameter until their two t times match.

Appplying offset to a clock

Skew

Skew compensates for the difference in the duration of a time increment between two clocks. In other words, a second in one clock might be a different length than a second in another! These differences can be very subtle, but they will result in some unwanted drift.

Applying skew to a from clock's timestamps will match the duration of a second to that of the to clock.

Appplying skew to a clock

Between skew and offset, we have the tools we need to synchronize time between two clocks! Note that components that use the same host clock will need no synchronization; their skew and offset remains 0.0.

Resolution

Resolution helps MetriCal identify observations that are meant to be synchronized between two components.

Say we have two camera components. The first is producing one image every 5 seconds; the second produces a new image every 1.3 seconds. We want to pair up observations from the two separate streams that we know are uniquely synced in time as a one-to-one set.

Our resolution tells the Platform how far from an observation we want to search for a synced pair. In the case of our first camera, we know that one new frame comes every 5 seconds. This means that there's a span of 2.5 seconds on either side of this image that could hold a matching observation from our second camera. So, we set resolution to 2.5 * 1e9 (for nanoseconds).

The Platform will then look for any synced observation candidates in camera two and find the observation that matches most closely in time to the image in camera one.

All that being said, resolution is a concept better shown than told:

Applying resolution to an observation series

If one is confident that two observation streams are in-sync, one may set the resolution to be fairly small. However, given the way some components can behave, it's generally not necessary or recommended.

Semantic Constraints

Semantic constraints are a bit different from spatial and temporal constraints. While spatial and temporal constraints exist to model the physical realities of components in a system, semantic constraints exist to model relationships in the system that don't fall within those boundaries.

Semantic constraints are defined by the following fields:

Field	Type	Description
Components	An array of component UUIDs	A set of unique UUIDs (corresponding to components that exist within the plex) that are grouped under this semantic.
Name	String	The name for these semantics. Usually indicates the purpose or function of the group.
UUID	UUID	The unique identifier for the semantic constraint.

Semantic constraints can label subplexes within a plex, label individual OEM pieces of hardware (e.g. labeling a single Tangram Vision HiFi within a plex), or to group together components that share some common function (e.g. grouping a stereo pair of cameras together).

Some semantic constraints can be used by MetriCal to generate unique kinds of metrics after calibration. These are identified purely through the semantic constraint's name field. For example, MetriCal currently understands the following semantic constraint kinds:

"stereo_pair": This denotes that two cameras are part of a stereo pair. This also tells MetriCal to compute stereo rectification metrics after a calibration is complete.

Conventions​

From and To, To and From​

Coordinate Bases in MetriCal​

Spatial Constraints​

Spatial Covariance​

Temporal Constraints​

The Problem With Clocks​

Synchronization​

Offset​

Skew​

Resolution​

Semantic Constraints​