Note

Go to the end to download the full example code.

Statistics and interaction structure of a multi-modal dataset¶

A multi-modal dataset can be characterized beyond basic shape information. With iMML you can:

Summarize core properties of each modality (samples, features, completeness).
Quantify how modalities relate to a target via PID (Partial Information Decomposition): Redundancy (shared info), Uniqueness (modality-specific info), and Synergy (info emerging only when modalities are combined).

What you will learn:

How to describe per‑modality completeness and cross‑modality overlap with get_summary.
How to compute redundancy, uniqueness, and synergy (PID) with respect to a target using pid.
How to visualize and interpret PID results.
How PID generalizes when you have more than two modalities.

This tutorial is fully reproducible and uses a small synthetic dataset. You can easily replace the data‑loading section with your own data following the same structure.

# sphinx_gallery_thumbnail_number = 2

# License: BSD 3-Clause License

Step 1: Import required libraries¶

import copy
import numpy as np
import pandas as pd
from sklearn.datasets import make_classification

from imml.statistics import pid
from imml.explore import get_summary
from imml.visualize import plot_pid

Step 3: Summarize the dataset¶

The get_summary function provides a compact overview of the multi‑modal dataset. Below we first make the dataset a bit more complex by introducing some incomplete samples, then show two views: 1) a dataframe aggregated across modalities (one_row=True) and 2) per‑modality counts (one_row=False).

inc_Xs = copy.deepcopy(Xs)
# Introduce block-wise missingness in a few regions for illustration
inc_Xs[0][:20, :] = np.nan
inc_Xs[0][25, 1] = np.nan
inc_Xs[1][18:22, :] = np.nan
inc_Xs[1][-15:, 3] = np.nan

summary = get_summary(Xs=inc_Xs, one_row=True, compute_pct=True, return_df=True)
summary

	Complete samples	Incomplete samples	Observed samples per modality	Missing samples per modality	% Observed samples per modality	% Missing samples per modality
0	28	22	[30, 46]	[20, 4]	[60, 92]	[40, 8]

Per‑modality view:

summary = get_summary(Xs=inc_Xs, modalities=["Modality A", "Modality B"], one_row=False, compute_pct=True, return_df=True)
summary

	Complete samples	Missing samples	Incomplete samples	% Complete samples	% Missing samples	% Incomplete samples
Modality A	29	20	21	58.0	40.0	42.0
Modality B	31	4	19	62.0	8.0	38.0
Total	12	22	38	24.0	44.0	76.0

For quick inspection, we can also plot the per‑modality counts. Here we create a bar chart.

summary.index = summary.index.str.replace(" samples", "")
_ = summary[[c for c in summary.columns if not c.startswith('%')]].plot(
    kind="bar", xlabel="Samples", ylabel="Count", rot=0,
    title="Summary of the multi-modal dataset")

Step 4: Compute PID statistics (Redundancy, Uniqueness, Synergy)¶

Using pid, we quantify the degree of redundancy, uniqueness, and synergy relating input modalities to the target. With two input modalities, pid returns a dictionary with keys: "Redundancy", "Uniqueness1", "Uniqueness2", and "Synergy".

rus = pid(Xs=Xs, y=y, random_state=random_state, normalize=True)
rus  # a dict with keys: Redundancy, Uniqueness1, Uniqueness2, Synergy

{'Redundancy': np.float64(0.2165211867463263), 'Uniqueness1': np.float64(0.7743673072618481), 'Uniqueness2': np.float64(0.0009259835942955852), 'Synergy': np.float64(0.008185522397530105)}

Step 5: Visualize the PID as a Venn-like diagram¶

You can directly pass the rus dict returned by pid to plot_pid. Alternatively, plot_pid can also compute PID internally if you pass Xs and y, which is convenient when you want a one‑liner.

rus = {"Redundancy": 0.2, "Synergy": 0.1, "Uniqueness1": 0.45, "Uniqueness2": 0.25}
fig, ax = plot_pid(rus=rus, modalities=["Modality A", "Modality B"], abb=False)

Step 6: Interpreting PID results¶

Redundancy: Information about the target available in both modalities. High values suggest overlap.
Uniqueness1/2: Modality‑specific information about the target. High values suggest complementarity.
Synergy: Information that emerges only when modalities are combined. High synergy often indicates interactions.

If redundancy is high while uniqueness and synergy are low, this may suggest that the dataset could be more appropriately analyzed using classical unimodal modeling.

Step 7: Working with more than two modalities¶

If you have more than two modalities, PID statistics are computed pairwise; pid returns a list of dictionaries (one per pair). For example, adding a third modality yields three pairwise results.

rus = pid(Xs=Xs + [Xs[0]], y=y, random_state=random_state, normalize=True)
rus

[{'Redundancy': np.float64(0.2165211867463263), 'Uniqueness1': np.float64(0.7743673072618481), 'Uniqueness2': np.float64(0.0009259835942955852), 'Synergy': np.float64(0.008185522397530105)}, {'Redundancy': np.float64(0.9967118426445922), 'Uniqueness1': np.float64(8.484563903507687e-05), 'Uniqueness2': np.float64(8.484563906632952e-05), 'Synergy': np.float64(0.003118466077306442)}, {'Redundancy': np.float64(0.2165211867513762), 'Uniqueness1': np.float64(0.0009259835942656313), 'Uniqueness2': np.float64(0.7743673072590187), 'Synergy': np.float64(0.008185522395339602)}]

Conclusion¶

In this tutorial, we:

Summarized key per‑modality statistics for a multi‑modal dataset.
Quantified redundancy, uniqueness, and synergy with respect to a target using PID.
Visualized and interpreted PID, including the multi‑modality (>2) case.

These insights help you understand complementarity and interactions across modalities, informing model design and feature engineering for downstream multi‑modal learning.

Total running time of the script: (0 minutes 11.718 seconds)

Gallery generated by Sphinx-Gallery