Note

Go to the end to download the full example code. or to run this example in your browser via Binder

Extend your analysis methods along data dimensions#

Learn how to use the make_broadcastable decorator, to easily cast functions across an entire xarray.DataArray.

Imports#

We will need numpy and xarray to make our custom data for this example, and matplotlib to show what it contains. We will be using the movement.utils.broadcasting module to turn our one-dimensional functions into functions that work across entire DataArray objects.

# For interactive plots: install ipympl with `pip install ipympl` and uncomment
# the following lines in your notebook
# %matplotlib widget
import matplotlib.pyplot as plt
import numpy as np
import xarray as xr

from movement import sample_data
from movement.plots import plot_centroid_trajectory
from movement.utils.broadcasting import make_broadcastable

Load Sample Dataset#

First, we load the SLEAP_three-mice_Aeon_proofread example dataset. For the rest of this example we’ll only need the position data array, so we store it in a separate variable.

ds = sample_data.fetch_dataset("SLEAP_three-mice_Aeon_proofread.analysis.h5")
positions: xr.DataArray = ds.position

The individuals in this dataset follow very similar, arc-like trajectories. To help emphasise what we are doing in this example, we will offset the paths of two of the individuals by a small amount so that the trajectories are more distinct.

positions.loc[:, "y", :, "AEON3B_TP1"] -= 100.0
positions.loc[:, "y", :, "AEON3B_TP2"] += 100.0

fig, ax = plt.subplots(1, 1)
for mouse_name, col in zip(
    positions.individuals.values, ["r", "g", "b"], strict=False
):
    plot_centroid_trajectory(
        positions,
        individual=mouse_name,
        keypoints="centroid",
        ax=ax,
        linestyle="-",
        marker=".",
        s=2,
        linewidth=0.5,
        c=col,
        label=mouse_name,
    )
ax.invert_yaxis()
ax.set_title("Trajectories")
ax.set_xlabel("x (pixels)")
ax.set_ylabel("y (pixels)")
ax.legend()

<matplotlib.legend.Legend object at 0x7f1fb7fa3710>

Motivation#

Suppose that, during our experiment, we have a region of the enclosure that has a slightly wet floor, making it slippery. The individuals must cross this region in order to reach some kind of reward on the other side of the enclosure. We know that the “slippery region” of our enclosure is approximately rectangular in shape, and has its opposite corners at (400, 0) and (600, 2000), where the coordinates are given in pixels. We could then write a function that determines if a given (x, y) position was inside this “slippery region”.

def in_slippery_region(xy_position) -> bool:
    """Return True if xy_position is in the slippery region.

    Return False otherwise.
    xy_position has 2 elements, the (x, y) coordinates respectively.
    """
    # The slippery region is a rectangle with the following bounds
    x_min, y_min = 400.0, 0.0
    x_max, y_max = 600.0, 2000.0

    is_within_bounds_x = x_min <= xy_position[0] <= x_max
    is_within_bounds_y = y_min < xy_position[1] <= y_max
    return is_within_bounds_x and is_within_bounds_y


# We can just check our function with a few sample points
for point in [(0, 100), (450, 700), (550, 1500), (601, 500)]:
    print(f"{point} is in slippery region: {in_slippery_region(point)}")

(0, 100) is in slippery region: False
(450, 700) is in slippery region: True
(550, 1500) is in slippery region: True
(601, 500) is in slippery region: False

Data Generalisation Issues#

The shape of the resulting DataArray is the same as our original DataArray, but without the "space" dimension. Indeed, we have essentially collapsed the "space" dimension, since our in_slippery_region function takes in a 1D data slice (the x, y positions of a single individual’s centroid at a given point in time) and returns a scalar value (True/False). However, the fact that we have to construct a new DataArray after running our function over all space slices in our DataArray is not scalable - our for loop approach relied on knowing how many dimensions our data had (and the size of those dimensions). We don’t have a guarantee that the next DataArray that comes in will have the same structure.

Extending to Class Methods#

make_broadcastable can also be applied to class methods, though it needs to be told that you are doing so via the is_classmethod parameter.

class Rectangle:
    """Represents an observing camera in the experiment."""

    xy_min: tuple[float, float]
    xy_max: tuple[float, float]

    def __init__(self, xy_min=(0.0, 0.0), xy_max=(1.0, 1.0)):
        """Create a new instance."""
        self.xy_min = tuple(xy_min)
        self.xy_max = tuple(xy_max)

    @make_broadcastable(is_classmethod=True, only_broadcastable_along="space")
    def is_inside(self, /, xy_position) -> bool:
        """Whether the position is inside the rectangle."""
        # For the sake of brevity, we won't redefine the entire method here,
        # and will just call our existing function.
        return in_slippery_region_general(
            xy_position, self.xy_min, self.xy_max
        )


slippery_region = Rectangle(xy_min=(400.0, 0.0), xy_max=(600.0, 2000.0))
was_in_region_clsmethod = slippery_region.is_inside(positions)

xr.testing.assert_equal(
    was_in_region_clsmethod, in_slippery_region_broadcasting
)

The broadcastable_method decorator is provided as a helpful alias for make_broadcastable(is_classmethod=True), and otherwise works in the same way (and accepts the same parameters).

class RectangleAlternative:
    """Represents an observing camera in the experiment."""

    xy_min: tuple[float, float]
    xy_max: tuple[float, float]

    def __init__(self, xy_min=(0.0, 0.0), xy_max=(1.0, 1.0)):
        """Create a new instance."""
        self.xy_min = tuple(xy_min)
        self.xy_max = tuple(xy_max)

    @make_broadcastable(is_classmethod=True, only_broadcastable_along="space")
    def is_inside(self, /, xy_position) -> bool:
        """Whether the position is inside the rectangle."""
        # For the sake of brevity, we won't redefine the entire method here,
        # and will just call our existing function.
        return in_slippery_region_general(
            xy_position, self.xy_min, self.xy_max
        )


slippery_region_alt = RectangleAlternative(
    xy_min=(400.0, 0.0), xy_max=(600.0, 2000.0)
)
was_in_region_clsmethod_alt = slippery_region.is_inside(positions)

xr.testing.assert_equal(
    was_in_region_clsmethod_alt, in_slippery_region_broadcasting
)

xr.testing.assert_equal(was_in_region_clsmethod_alt, was_in_region_clsmethod)

In fact, if you look at the Regions of Interest submodule, and in particular the classes inside it, you’ll notice that we use the broadcastable_method decorator ourselves in some of these methods!

Total running time of the script: (0 minutes 1.105 seconds)

Gallery generated by Sphinx-Gallery

Extend your analysis methods along data dimensions#

Imports#

Load Sample Dataset#

Motivation#

Determine if each position was slippery#

Data Generalisation Issues#

Making our Function Broadcastable#

Additional Function Arguments#

Only Broadcast Along Select Dimensions#

Extending to Class Methods#

This Page