Ordination¶

The ordination module provides various ordination methods commonly used in community ecology.

NMDS (Non-metric Multidimensional Scaling)¶

nuee.ordination.metaMDS(X: ndarray | DataFrame, k: int = 2, distance: str = 'bray', trymax: int = 20, maxit: int = 200, trace: bool = False, autotransform: bool = True, wascores: bool = True, expand: bool = True, random_state: int | None = None, **kwargs) → OrdinationResult[source]¶

Non-metric Multidimensional Scaling with automatic transformation.

This function provides a high-level interface for NMDS ordination, following the conventions of the R vegan package’s metaMDS function. It automatically handles data transformation and uses multiple random starts to find the best ordination solution.

Parameters:

X (np.ndarray or pd.DataFrame) – Community data matrix with samples in rows and species in columns. Values should be non-negative abundances or counts.
k (int, default=2) – Number of dimensions for the ordination. Common choices are 2 or 3.
distance (str, default="bray") – Distance metric to use for calculating dissimilarities. See nuee.vegdist() for available options.
trymax (int, default=20) – Maximum number of random starts to find the best solution. Higher values increase computation time but may find better solutions.
maxit (int, default=200) – Maximum number of iterations for each random start.
trace (bool, default=False) – If True, print progress information including stress values.
autotransform (bool, default=True) – If True, automatically apply square root transformation to abundance data (values > 1) to reduce the influence of dominant species.
wascores (bool, default=True) – If True, calculate weighted average species scores based on site scores and species abundances.
expand (bool, default=True) – If True, expand the result to include additional information.
random_state (int, optional) – Seed used for reproducible random starts. If None, each run is initialised independently.
**kwargs (dict) – Additional parameters passed to the NMDS class.

Returns:

Result object containing:

pointspd.DataFrame
Site (sample) scores in the ordination space
speciespd.DataFrame, optional
Species scores (if wascores=True)
stressfloat
Final stress value (lower is better)
convergedbool
Whether the solution converged

Return type:

OrdinationResult

Notes

NMDS is a rank-based ordination method that attempts to preserve the rank order of dissimilarities between samples. Unlike metric methods like PCA, NMDS makes no assumptions about the linearity of relationships.

Stress values provide a measure of fit: - < 0.05: excellent - 0.05 - 0.10: good - 0.10 - 0.20: acceptable - > 0.20: poor (consider increasing k or using a different method)

The function uses multiple random starts (trymax) because NMDS can get stuck in local optima. The solution with the lowest stress is returned.

Examples

Basic NMDS ordination:

>>> import nuee
>>> species = nuee.datasets.varespec()
>>> nmds = nuee.metaMDS(species, k=2, distance="bray")
>>> print(f"Stress: {nmds.stress:.3f}")
Stress: 0.133

With custom parameters:

>>> nmds = nuee.metaMDS(species, k=3, distance="bray", trymax=50, trace=True)
Applying square root transformation
Applying Wisconsin double standardization
NMDS stress: 0.0869
NMDS converged

Visualize the ordination:

>>> import matplotlib.pyplot as plt
>>> fig = nuee.plot_ordination(nmds, display="sites")
>>> plt.show()

RDA (Redundancy Analysis)¶

Redundancy Analysis (RDA).

RDA is a constrained ordination method that finds linear combinations of explanatory variables that best explain the variation in the response matrix.

Parameters:

X – Response matrix (samples x species)
Y – Explanatory matrix (samples x variables)
Z – Conditioning matrix for partial RDA (optional)
formula – Formula string (e.g., “~ var1 + var2”)
data – DataFrame containing variables for formula
scale – Whether to scale species to unit variance
**kwargs – Additional parameters

Returns:

ConstrainedOrdinationResult with RDA results

Examples

# Simple RDA result = rda(species_data, environmental_data)

# RDA with formula result = rda(species_data, formula=”~ pH + temperature”, data=env_data)

# Partial RDA result = rda(species_data, environmental_data, conditioning_data)

class nuee.ordination.RDA(formula: str | None = None, scale: bool = False, center: bool = True)[source]¶

Bases: OrdinationMethod

Redundancy Analysis for constrained ordination.

This implementation follows the classical algorithm:

Centre/scale the response matrix as requested.
Centre the constraining variables.
Solve the weighted least-squares regression analytically.
Perform SVD on fitted (constrained) and residual (unconstrained) matrices.
Store the full decomposition so downstream tools can reconstruct scaled scores mirroring vegan’s behaviour.

__init__(formula: str | None = None, scale: bool = False, center: bool = True)[source]¶

Parameters:

formula – R-style model formula describing the constraints (optional).
scale – Whether to standardise species to unit variance before analysis.
center – Whether to centre species (response) data column-wise.

Fit RDA to the data.

Parameters:

X – Response matrix (samples x species).
Y – Constraining variables (samples x predictors) or DataFrame used with a formula.
Z – Conditioning matrix for partial RDA.

Return type:

ConstrainedOrdinationResult

CCA (Canonical Correspondence Analysis)¶

Canonical Correspondence Analysis (or CA when X is None).

Parameters:

Y – Species data matrix (sites x species).
X – Environmental data matrix (sites x variables) or DataFrame for formula evaluation. When None and no formula is given, an unconstrained Correspondence Analysis (CA) is performed.
formula – R-style formula string referencing columns in X.

Return type:

ConstrainedOrdinationResult (CCA) or OrdinationResult (CA).

class nuee.ordination.CCA(formula: str | None = None)[source]¶

Bases: OrdinationMethod

Canonical Correspondence Analysis.

__init__(formula: str | None = None)[source]¶

fit(Y: ndarray | DataFrame, X: ndarray | DataFrame | None = None, **kwargs) → ConstrainedOrdinationResult | OrdinationResult[source]¶: Fit CCA (or CA when X is None) to the data.

PCA (Principal Component Analysis)¶

nuee.ordination.pca(X: ndarray | DataFrame, n_components: int | None = None, scale: bool = True, center: bool = True, **kwargs) → OrdinationResult[source]¶

Principal Component Analysis.

Parameters:

X – Data matrix (samples x variables).
n_components – Number of components to keep.
scale – Whether to scale variables (unit variance) prior to analysis.
center – Whether to subtract column means prior to analysis.

Return type:

OrdinationResult with PCA results.

class nuee.ordination.PCA(n_components: int | None = None, scale: bool = True, center: bool = True)[source]¶

Bases: OrdinationMethod

Principal Component Analysis for ordination.

__init__(n_components: int | None = None, scale: bool = True, center: bool = True)[source]¶

fit(X: ndarray | DataFrame, **kwargs) → OrdinationResult[source]¶: Fit PCA to the data using an explicit SVD.

Environmental Fitting¶

Fit environmental vectors to an ordination configuration.

Parameters:

ordination – OrdinationResult providing site scores.
env – Environmental data matrix (matching site order).
permutations – Number of permutations used for significance testing.
scaling – Scaling option passed to ordination.get_scores.
choices – Optional 1-based axis indices to include; defaults to all axes.
random_state – Seed for the permutation generator.

Returns:

Dictionary mirroring vegan’s envfit output layout with a vectors entry containing scores, r, r², and p-values.

Return type:

dict

Procrustes Analysis¶

nuee.ordination.procrustes(X: ndarray | DataFrame, Y: ndarray | DataFrame, scale: bool = True) → dict[source]¶

Procrustes analysis to compare two ordinations.

Parameters:

X – First ordination matrix
Y – Second ordination matrix
scale – Whether to scale configurations

Returns:

Dictionary with Procrustes results

Variable Selection¶

nuee.ordination.ordistep(ordination: OrdinationResult, env: ndarray | DataFrame, direction: str = 'both', **kwargs) → dict[source]¶

Stepwise model selection for ordination.

Parameters:

ordination – OrdinationResult object
env – Environmental data matrix
direction – Direction of selection (“forward”, “backward”, “both”)
**kwargs – Additional parameters

Returns:

Dictionary with selection results

Base Classes¶

Bases: object

Base class for ordination results.

points¶: Sample coordinates in ordination space

species¶: Species coordinates (if applicable)

eigenvalues¶: Eigenvalues for ordination axes

stress¶: Stress value (for NMDS)

converged¶: Whether the ordination converged

nobj¶: Number of objects (samples)

ndim¶: Number of dimensions

call¶: Dictionary containing method call parameters

__init__(points: ndarray | None, species: ndarray | None = None, eigenvalues: ndarray | None = None, stress: float | None = None, converged: bool = True, call: Dict[str, Any] | None = None, site_u: ndarray | None = None, species_v: ndarray | None = None, singular_values: ndarray | None = None, site_u_unconstrained: ndarray | None = None, species_v_unconstrained: ndarray | None = None, singular_values_unconstrained: ndarray | None = None, row_weights: ndarray | None = None, column_weights: ndarray | None = None, scaling_backend: Callable[[int], Tuple[ndarray | None, ndarray | None]] | None = None)[source]¶

property proportion_explained: ndarray¶: Proportion of variance/inertia explained by each axis.

property cumulative_proportion: ndarray¶: Cumulative proportion of variance/inertia explained.

biplot(scaling: int | str | None = None, **kwargs)[source]¶

Create a biplot.

For PCA the species loadings are shown as arrows from the origin. For constrained ordination (RDA / CCA), environmental variables are shown as arrows while species are shown as points.

Parameters:

scaling – Scaling mode for site/species scores
**kwargs – Additional plotting arguments

Returns:

matplotlib Figure object

get_scores(display: str = 'sites', scaling: int | str | None = None) → ndarray | Tuple[ndarray, ndarray][source]¶

Retrieve (optionally scaled) ordination scores.

Parameters:

display – “sites”, “species”, or “both”
scaling – None, 1/2/3 or “sites”/”species”/”symmetric”

Returns:

Requested scores as numpy arrays.

scores(display: str = 'sites', scaling: int | str | None = None) → ndarray | Tuple[ndarray, ndarray]¶

Retrieve (optionally scaled) ordination scores.

Parameters:

display – “sites”, “species”, or “both”
scaling – None, 1/2/3 or “sites”/”species”/”symmetric”

Returns:

Requested scores as numpy arrays.

class nuee.ordination.OrdinationMethod[source]¶

Bases: ABC

Abstract base class for ordination methods.

abstractmethod fit(X: ndarray | DataFrame, **kwargs) → OrdinationResult[source]¶

Fit the ordination method to the data.

Parameters:

X – Community data matrix (samples x species)
**kwargs – Method-specific parameters

Returns:

OrdinationResult object

Ordination¶

NMDS (Non-metric Multidimensional Scaling)¶

RDA (Redundancy Analysis)¶

CCA (Canonical Correspondence Analysis)¶

PCA (Principal Component Analysis)¶

Environmental Fitting¶

Procrustes Analysis¶

Variable Selection¶

Base Classes¶

nuee

Navigation

Related Topics