Utilities

CalibrateEmulateSample.Utilities.CanonicalCorrelationType
struct CanonicalCorrelation{VV1, VV2, VV3, FT, VV4} <: CalibrateEmulateSample.Utilities.PairedDataContainerProcessor

Uses both input and output data to learn a subspace of maximal correlation between inputs and outputs. The subspace for a pair (X, Y) will be of size minimum(rank(X), rank(Y)), computed using an SVD-based method. See e.g., https://numerical.recipes/whp/notes/CanonCorrBySVD.pdf

Preferred construction is with the canonical_correlation method.

Fields

  • data_mean::Any: storage for the input or output data mean

  • encoder_mat::Any: the encoding matrix of input or output canonical correlations

  • decoder_mat::Any: the decoding matrix of input or output canonical correlations

  • retain_var::Any: the fraction of variance to be retained after truncating singular values (1 implies no truncation)

  • apply_to::Any: Stores whether this is an input or output encoder (vector with string "in" or "out")

source
CalibrateEmulateSample.Utilities.DecorrelatorType
struct Decorrelator{VV1, VV2, VV3, FT, NT<:NamedTuple, AS<:AbstractString} <: CalibrateEmulateSample.Utilities.DataContainerProcessor

Decorrelate the data via taking an SVD decomposition and projecting onto the singular-vectors.

Preferred construction is with one of the following methods:

For decorrelate_structure_mat: The SVD is taken over a structure matrix (e.g., prior_cov for inputs, obs_noise_cov for outputs). The structure matrix will become exactly I after processing.

For decorrelate_sample_cov: The SVD is taken over the estimated covariance of the data. The data samples will have a Normal(0,I) distribution after processing.

For decorrelate(;decorrelate_with="combined") (default): The SVD is taken to be the sum of structure matrix and estimated covariance. This may be more robust to ill-specification of structure matrix, or poor estimation of the sample covariance.

Depending on the size of the matrix, we use different SVD options:

  • Small matrix (dim < 3000): use LinearAlgebra.svd(Matrix)
  • Large matrix (dim > 3000): if retain_var = 1.0, use LowRankApprox.psvd(LinearMap; psvd_kwargs...); if retain_var < 1.0, use TSVD.tsvd(LinearMap)

Fields

  • data_mean::Any: storage for the data mean

  • encoder_mat::Any: the matrix used to perform encoding

  • decoder_mat::Any: the inverse of the matrix used to perform encoding

  • retain_var::Any: the fraction of variance to be retained after truncating singular values (1 implies no truncation)

  • n_totvar_samples::Int64: when retain_var < 1, number of samples to estimate the total variance. Larger values reduce the error in approximation at the cost of additional matrix-vector products.

  • max_rank::Int64: maximum dimension of subspace for retain_var < 1. The search may become expensive at large ranks, and therefore can be cut-off in this way

  • psvd_kwargs::NamedTuple: when retain_var = 1, the psvd algorithm from LowRankApprox.jl is used to decorrelate the space. here, kwargs can be passed in as a NamedTuple

  • decorrelate_with::AbstractString: Switch to choose what form of matrix to use to decorrelate the data

  • structure_mat_name::Union{Nothing, Symbol}: When given, use the structure matrix by this name if decorrelate_with uses structure matrices. When nothing, try to use the only present structure matrix instead.

source
CalibrateEmulateSample.Utilities.ElementwiseScalerType
struct ElementwiseScaler{T, VV<:(AbstractVector), VV2<:(AbstractVector), VV3<:(AbstractVector), VV4<:(AbstractVector), VV5<:(AbstractVector)} <: CalibrateEmulateSample.Utilities.DataContainerProcessor

The ElementwiseScaler{T} will create an encoding of the data_container via elementwise affine transformations.

Different methods T build different transformations (e.g., MinMaxScaling, QuartileScaling, ZScoreScaling), and the method is accessed with get_type.

source
CalibrateEmulateSample.Utilities.LikelihoodInformedType
mutable struct LikelihoodInformed{VV1<:(AbstractVector), VV2<:(AbstractVector), VV3<:(AbstractVector), VV4<:(AbstractVector), FT<:Real} <: CalibrateEmulateSample.Utilities.PairedDataContainerProcessor

Uses both input and output data to learn a subspace that allows for a reduced posterior which is close to the full posterior.

Preferred construction is with the likelihood_informed method.

Fields

  • encoder_mat::AbstractVector

  • decoder_mat::AbstractVector

  • data_mean::AbstractVector

  • retain_info::Real

  • apply_to::Union{Nothing, AbstractString}

  • iters::AbstractVector

  • grad_type::Symbol

source
CalibrateEmulateSample.Utilities.NoiseInjectorType
struct NoiseInjector{MM1<:(AbstractMatrix), MM2<:(AbstractMatrix), MM3<:(AbstractMatrix), VV<:(AbstractVector), NorMM<:Union{Nothing, AbstractMatrix}, FT<:Real}

Structure used to store precomputed quantities for decode_and_add_noise(...); built with create_noise_injector(...)

  • K::AbstractMatrix: Gain Matrix from encoded to decoded space

  • enc_m::AbstractMatrix: encoded prior mean

  • m::AbstractMatrix: prior mean

  • L::Union{Nothing, AbstractMatrix}: cholesky factor of encoded prior covariance

  • scaling::Real: scaling factor for the noise (a value < 1.0 may be needed for robustness if samples will be run in a physical model)

  • use_noise::Bool: whether to use the noise injection or not

  • encoder_schedule::AbstractVector: the encoding that was used to construct this object

source
CalibrateEmulateSample.Utilities.canonical_correlationMethod
canonical_correlation(
;
    retain_var
) -> CalibrateEmulateSample.Utilities.CanonicalCorrelation{Vector{Any}, Vector{Any}, Vector{Any}, Float64, Vector{AbstractString}}

Constructs the CanonicalCorrelation struct. Can optionally provide the keyword

  • retain_var[=1.0]: to project onto the leading singular vectors (of the input-output product) such that retain_var variance is retained.
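As an illustration of the SVD-based construction, a standalone sketch using only LinearAlgebra (not the package's internal implementation): canonical correlations of a sample pair (X, Y) can be obtained from the SVD of the product of orthonormal bases of the two sample spaces.

```julia
using LinearAlgebra, Random

rng = MersenneTwister(42)
n = 200
X = randn(rng, 3, n)                    # inputs; columns are samples
Y = vcat(X[1:2, :], randn(rng, 2, n))   # outputs sharing two directions with X

Xc = X .- sum(X, dims = 2) ./ n         # center each dimension
Yc = Y .- sum(Y, dims = 2) ./ n

Qx = Matrix(qr(Xc').Q)                  # orthonormal basis of the input sample space
Qy = Matrix(qr(Yc').Q)                  # orthonormal basis of the output sample space
canon_corrs = svd(Qx' * Qy).S           # canonical correlations, length min(rank(X), rank(Y))
```

Here the two shared directions yield canonical correlations of 1, and all correlations lie in [0, 1].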
source
CalibrateEmulateSample.Utilities.create_compact_linear_mapMethod
create_compact_linear_map(
    A;
    svd_dim_max,
    psvd_or_tsvd,
    tsvd_max_rank,
    psvd_kwargs
) -> LinearMaps.FunctionMap{Float64, CalibrateEmulateSample.Utilities.var"#14#20"{Vector{Any}, Vector{Any}, Vector{Any}, Vector{Any}, Vector{Any}}, CalibrateEmulateSample.Utilities.var"#16#22"{Vector{Any}, Vector{Any}, Vector{Any}, Vector{Any}, Vector{Any}}}

Produces a linear map of type LinearMap that evaluates the stacked actions of the structure matrix in compact form, by calling, say, linear_map.f(x) or linear_map.fc(x) to obtain Ax or A'x. This particular type can be used by packages like TSVD.jl or IterativeSolvers.jl for further computations.

This compact map constructs the following form of the Linear map f:

  1. get compact form svd-plus-d form "USVt + D" of the blocks
  2. create the f via stacking A.U * A.S * A.Vt * xblock + A.D * xblock for (A,xblock) in (As, x)

kwargs:

When computing the svd internally from an abstract matrix

  • svd_dim_max=3000: this switches to an approximate svd approach when applying to covariance matrices above dimension 3000
  • psvd_or_tsvd="psvd": use psvd or tsvd for approximating svd for large matrices
  • tsvd_max_rank=50: when using tsvd, what max rank to use. high rank = higher accuracy
  • psvd_kwargs=(; rtol=1e-2): when using psvd, what kwargs to pass. lower rtol = higher accuracy

Recommended (quick & inaccurate -> slow & more accurate):

  • very large matrices - start with tsvd with very low rank, and increase
  • mid-size matrices - psvd with very high rtol, and decrease
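The block-stacked action in step 2 can be sketched in plain Julia. This is a toy illustration with dense square blocks standing in for the compact SVD-plus-D blocks, not the package implementation:

```julia
# apply a block-diagonal action A = diag(A1, A2) to x without forming A;
# assumes square blocks, so input and output slices coincide
A1 = [1.0 0.0; 0.0 2.0]
A2 = fill(3.0, 1, 1)
blocks = (A1, A2)

function stacked_apply(blocks, x)
    out = similar(x)
    i = 1
    for A in blocks
        j = i + size(A, 2) - 1
        out[i:j] = A * x[i:j]   # each block acts on its own slice of x
        i = j + 1
    end
    return out
end

stacked_apply(blocks, [1.0, 1.0, 1.0])  # [1.0, 2.0, 3.0]
```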
source
CalibrateEmulateSample.Utilities.create_encoder_scheduleMethod
create_encoder_schedule(
    schedule_in::AbstractVector
) -> Vector{Any}

Create a flattened encoder schedule from the user's proposed schedule of the form:

enc_schedule = [
    (DataProcessor1(...), "in"), 
    (DataProcessor2(...), "out"), 
    (PairedDataProcessor3(...), "in"), 
    (DataProcessor4(...), "in_and_out"), 
]

This function creates an equivalent encoder schedule that is also machine readable. E.g.,

enc_schedule = [
    (DataProcessor1(...), "in"), 
    (DataProcessor2(...), "out"), 
    (PairedDataProcessor3(...),"in"), 
    (DataProcessor4(...), "in"),
    (DataProcessor4(...), "out"), 
]

The decoder schedule is a reversed copy of the encoder schedule (with processors copied).

source
CalibrateEmulateSample.Utilities.create_noise_injectorMethod
create_noise_injector(
    encoder_schedule::AbstractVector,
    prior::EnsembleKalmanProcesses.ParameterDistributions.ParameterDistribution,
    noise_injector_threshold::Real,
    noise_injector_scaling::Real
) -> Union{Nothing, CalibrateEmulateSample.Utilities.NoiseInjector}

Returns either a NoiseInjector object that stores precomputed quantities used in decode_and_add_noise(...), or returns nothing. The conditions to return nothing:

  1. If the encoder is effectively lossless, as determined by its variance loss not exceeding the threshold noise_injector_threshold
  2. If the encoder_schedule is empty

One can additionally scale the injected samples with noise_injector_scaling

source
CalibrateEmulateSample.Utilities.decode_and_add_noiseMethod
decode_and_add_noise(
    encoder_schedule::AbstractVector,
    samples::AbstractMatrix,
    prior::EnsembleKalmanProcesses.ParameterDistributions.ParameterDistribution,
    noise_injector_threshold::Real,
    noise_injector_scaling::Real
) -> Any

Lift back the encoded samples into the full space. Similar to using decode_data, except that this additionally injects noise from the prior when the encoding is determined to be sufficiently lossy (total lost variance < keyword noise_injector_threshold). This is done in a way that preserves any known correlations between reduced and null-space directions, which is important for posterior reconstruction.

The quantification of correlation depends on Gaussian assumptions, and therefore is approximate.

source
CalibrateEmulateSample.Utilities.decode_dataMethod
decode_data(
    encoder_schedule::AbstractVector,
    data::Union{EnsembleKalmanProcesses.DataContainers.DataContainer, AbstractMatrix, AbstractVector},
    in_or_out::AbstractString
) -> Any

Decode the new data (a DataContainer, or matrix where data are columns, or vector viewed as one column) representing inputs ("in") or outputs ("out"), with the stored and initialized encoder schedule. Always internally calls CES.Utilities.decode_with_schedule

source
CalibrateEmulateSample.Utilities.decode_structure_matrixMethod
decode_structure_matrix(
    encoder_schedule::AbstractVector,
    structure_mat,
    in_or_out::AbstractString
) -> Any

Decode a new structure matrix in the input space ("in") or output space ("out"), with the stored and initialized encoder schedule. Always internally calls CES.Utilities.decode_with_schedule. If the structure matrix is a LinearMap, then the decoded structure matrix remains a LinearMap

source
CalibrateEmulateSample.Utilities.decode_with_scheduleMethod
decode_with_schedule(
    encoder_schedule::AbstractVector,
    data_container::EnsembleKalmanProcesses.DataContainers.DataContainer,
    in_or_out::AbstractString
) -> EnsembleKalmanProcesses.DataContainers.DataContainer

Takes in an already initialized encoder schedule, and decodes a DataContainer; the in_or_out string indicates whether the data is input ("in") or output ("out") data (and is thus decoded differently)

source
CalibrateEmulateSample.Utilities.decode_with_scheduleMethod
decode_with_schedule(
    encoder_schedule::AbstractVector,
    structure_matrix::Union{LinearAlgebra.UniformScaling, LinearMaps.LinearMap, AbstractMatrix, AbstractVector},
    in_or_out::AbstractString
) -> Any

Takes in an already initialized encoder schedule, and decodes a structure matrix; the in_or_out string indicates whether the structure matrix is for input ("in") or output ("out") space (and is thus decoded differently)

source
CalibrateEmulateSample.Utilities.decode_with_scheduleMethod
decode_with_schedule(
    encoder_schedule::AbstractVector,
    io_pairs::EnsembleKalmanProcesses.DataContainers.PairedDataContainer,
    input_structure_mat::Union{LinearAlgebra.UniformScaling, LinearMaps.LinearMap, AbstractMatrix, AbstractVector},
    output_structure_mat::Union{LinearAlgebra.UniformScaling, LinearMaps.LinearMap, AbstractMatrix, AbstractVector}
) -> Tuple{EnsembleKalmanProcesses.DataContainers.PairedDataContainer, Any, Any}

Takes in an already initialized encoder schedule, and decodes a PairedDataContainer along with its input and output structure matrices (inputs and outputs are decoded differently)

source
CalibrateEmulateSample.Utilities.decorrelateMethod
decorrelate(
;
    retain_var,
    decorrelate_with,
    structure_mat_name,
    n_totvar_samples,
    max_rank,
    psvd_kwargs
) -> CalibrateEmulateSample.Utilities.Decorrelator{Vector{Any}, Vector{Any}, Vector{Any}, Float64, @NamedTuple{rtol::Float64}, String}

Constructs the Decorrelator struct. Users can add optional keyword arguments:

  • retain_var[=1.0]: to project onto the leading singular vectors such that retain_var variance is retained
  • decorrelate_with [="combined"]: which matrix provides the subspace directions; options are "structure_mat", "sample_cov", or "combined" (see Decorrelator)
  • n_totvar_samples[=500]: when retain_var < 1, number of samples to estimate the total variance for performing truncation.
  • max_rank[=100]: for retain_var < 1, the maximum dimension of subspace when using the tsvd algorithm from TSVD.jl.
  • psvd_kwargs [= (; rtol = 1e-3)]: for retain_var = 1, the psvd algorithm from LowRankApprox.jl is used to decorrelate the space. kwargs can be passed in as a NamedTuple
source
CalibrateEmulateSample.Utilities.decorrelate_sample_covMethod
decorrelate_sample_cov(
;
    retain_var,
    n_totvar_samples,
    max_rank,
    psvd_kwargs
) -> CalibrateEmulateSample.Utilities.Decorrelator{Vector{Any}, Vector{Any}, Vector{Any}, Float64, @NamedTuple{rtol::Float64}, String}

Constructs the Decorrelator struct, setting decorrelate_with = "sample_cov". Encoding data with this will ensure that the distribution of data samples after encoding is Normal(0,I). One can additionally add keywords:

  • retain_var[=1.0]: to project onto the leading singular vectors such that retain_var variance is retained
  • n_totvar_samples[=500]: when retain_var < 1, number of samples to estimate the total variance for performing truncation.
  • max_rank[=100]: for retain_var < 1, the maximum dimension of subspace when using the tsvd algorithm from TSVD.jl.
  • psvd_kwargs [= (; rtol = 1e-3)]: for retain_var = 1, the psvd algorithm from LowRankApprox.jl is used to decorrelate the space. kwargs can be passed in as a NamedTuple
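The effect of sample-covariance decorrelation can be sketched with stdlib tools alone (an illustration of the whitening idea, not the package's truncation-aware implementation):

```julia
using LinearAlgebra, Statistics, Random

rng = MersenneTwister(0)
data = cholesky([2.0 0.5; 0.5 1.0]).L * randn(rng, 2, 2000)  # correlated samples (columns)

μ = mean(data, dims = 2)
F = svd(cov(data, dims = 2))                 # SVD of the sample covariance
encoder = Diagonal(1 ./ sqrt.(F.S)) * F.U'   # whitening matrix
decoder = F.U * Diagonal(sqrt.(F.S))         # its inverse

encoded = encoder * (data .- μ)              # sample covariance of encoded data ≈ I
```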
source
CalibrateEmulateSample.Utilities.decorrelate_structure_matMethod
decorrelate_structure_mat(
;
    retain_var,
    structure_mat_name,
    n_totvar_samples,
    max_rank,
    psvd_kwargs
) -> CalibrateEmulateSample.Utilities.Decorrelator{Vector{Any}, Vector{Any}, Vector{Any}, Float64, @NamedTuple{rtol::Float64}, String}

Constructs the Decorrelator struct, setting decorrelate_with = "structure_mat". This encoding will transform a provided structure matrix into I. One can additionally add keywords:

  • retain_var[=1.0]: to project onto the leading singular vectors such that retain_var variance is retained
  • n_totvar_samples[=500]: when retain_var < 1, number of samples to estimate the total variance for performing truncation.
  • max_rank[=100]: for retain_var < 1, the maximum dimension of subspace when using the tsvd algorithm from TSVD.jl.
  • psvd_kwargs [= (; rtol = 1e-3)]: for retain_var = 1, the psvd algorithm from LowRankApprox.jl is used to decorrelate the space. kwargs can be passed in as a NamedTuple
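A minimal plain-LinearAlgebra sketch (not the package implementation) of how an encoder built from a structure matrix maps that matrix to the identity:

```julia
using LinearAlgebra

Γ = [4.0 1.0; 1.0 2.0]                 # stand-in for, e.g., an obs_noise_cov structure matrix
F = svd(Γ)
E = Diagonal(1 ./ sqrt.(F.S)) * F.U'   # encoder built from the structure matrix
E * Γ * E'                             # ≈ I after encoding
```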
source
CalibrateEmulateSample.Utilities.encode_dataMethod
encode_data(
    encoder_schedule::AbstractVector,
    data::Union{EnsembleKalmanProcesses.DataContainers.DataContainer, AbstractMatrix, AbstractVector},
    in_or_out::AbstractString
) -> Any

Encode the new data (a DataContainer, or matrix where data are columns, or vector viewed as one column) representing inputs ("in") or outputs ("out"), with the stored and initialized encoder schedule. Always internally calls CES.Utilities.encode_with_schedule

source
CalibrateEmulateSample.Utilities.encode_structure_matrixMethod
encode_structure_matrix(
    encoder_schedule::AbstractVector,
    structure_mat,
    in_or_out::AbstractString
) -> Any

Encode a new structure matrix in the input space ("in") or output space ("out"), with the stored and initialized encoder schedule. Always internally calls CES.Utilities.encode_with_schedule. If the structure matrix is a LinearMap, then the encoded structure matrix remains a LinearMap

source
CalibrateEmulateSample.Utilities.encode_with_scheduleMethod
encode_with_schedule(
    encoder_schedule::AbstractVector,
    data_container::EnsembleKalmanProcesses.DataContainers.DataContainer,
    in_or_out::AbstractString
) -> EnsembleKalmanProcesses.DataContainers.DataContainer

Takes in an already initialized encoder schedule, and encodes a DataContainer; the in_or_out string indicates whether the data is input ("in") or output ("out") data (and is thus encoded differently)

source
CalibrateEmulateSample.Utilities.encode_with_scheduleMethod
encode_with_schedule(
    encoder_schedule::AbstractVector,
    structure_matrix::Union{LinearAlgebra.UniformScaling, LinearMaps.LinearMap, AbstractMatrix, AbstractVector},
    in_or_out::AbstractString
) -> Any

Takes in an already initialized encoder schedule, and encodes a structure matrix; the in_or_out string indicates whether the structure matrix is for input ("in") or output ("out") space (and is thus encoded differently)

source
CalibrateEmulateSample.Utilities.encoder_kwargs_fromMethod
encoder_kwargs_from(
    obs::EnsembleKalmanProcesses.Observation
) -> NamedTuple{(:obs_noise_cov, :observation), <:Tuple{Any, Any}}

Extracts the relevant encoder kwargs from the observation as a NamedTuple. Contains,

  • :obs_noise_cov as (unbuilt) noise covariance
  • :observation as obs vector

Commonly called from encoder_kwargs_from(ekp, prior)

source
CalibrateEmulateSample.Utilities.encoder_kwargs_fromMethod
encoder_kwargs_from(
    os::EnsembleKalmanProcesses.ObservationSeries
) -> NamedTuple{(:obs_noise_cov, :observation), <:Tuple{Any, Any}}

Extracts the relevant encoder kwargs from the ObservationSeries as a NamedTuple. Assumes the same noise covariance for all observation vectors. Contains,

  • :obs_noise_cov as (unbuilt) noise covariance of FIRST observation
  • :observation as obs vector from all observations

Commonly called from encoder_kwargs_from(ekp, prior)

source
CalibrateEmulateSample.Utilities.encoder_kwargs_fromMethod
encoder_kwargs_from(
    prior::EnsembleKalmanProcesses.ParameterDistributions.ParameterDistribution
) -> NamedTuple{(:prior_cov,), <:Tuple{Any}}

Extracts the relevant encoder kwargs from the ParameterDistribution prior. Contains,

  • :prior_cov as prior covariance

Commonly called from encoder_kwargs_from(ekp, prior)

source
CalibrateEmulateSample.Utilities.encoder_kwargs_fromMethod
encoder_kwargs_from(
    ekp::EnsembleKalmanProcesses.EnsembleKalmanProcess,
    prior::EnsembleKalmanProcesses.ParameterDistributions.ParameterDistribution;
    observation_series,
    samples_in,
    samples_out,
    dt,
    final_samples_out
) -> NamedTuple

Extracts the relevant encoder kwargs from the ekp object and prior distribution, returned as a NamedTuple that is passed to an Emulator or ForwardMapWrapper in the keyword argument encoder_kwargs. One can override the constructed kwargs by providing keywords.

kwargs:

  • Common overriding kwarg: final_samples_out. As ekp stores one more input than output, by default we truncate to the penultimate ekp iteration (where input-output pairs exist). However, one can provide an additional final output, paired with g = forward_map_ensemble(get_ϕ_final(ekp)), via final_samples_out = g

  • Other overriding kwargs: observation_series, samples_in, samples_out, dt

source
CalibrateEmulateSample.Utilities.encoder_kwargs_fromMethod
encoder_kwargs_from(
    samples_in::AbstractVector,
    samples_out::AbstractVector,
    dt::AbstractVector
) -> NamedTuple{(:input_structure_vecs, :output_structure_vecs), <:Tuple{Dict, Dict}}

Extracts the relevant encoder kwargs from a vector triple (samples_in, samples_out, dt). Samples describe an ordered sequence of distributions in input and output space, each indexed with a temperature, or algorithm time, dt.

Contains

  • :input_structure_vecs: Dict with fields :dt (Vec{Float}), :samples_in (Vec{Matrix})
  • :output_structure_vecs: Dict with fields :dt (Vec{Float}), :samples_out (Vec{Matrix})

Commonly called from encoder_kwargs_from(ekp, prior)

source
CalibrateEmulateSample.Utilities.get_decoder_from_scheduleMethod
get_decoder_from_schedule(
    encoder_schedule::AbstractVector
) -> Dict

Affine decodings can be represented as Dx + b. This function returns D,b for the input and output encoders in a Dict indexed by "in" and "out". D will be represented as a LinearMap object (can apply D = Matrix(D) to rebuild).
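To illustrate the D, b representation with hypothetical 2-d values (plain Julia, not the package's LinearMap machinery): decoding inverts an affine encoding Ex + b.

```julia
using LinearAlgebra

E = [2.0 0.0; 1.0 1.0]      # hypothetical encoder matrix
b_enc = [0.5, -1.0]         # hypothetical encoder shift
D = inv(E)                  # decoder matrix
b_dec = -D * b_enc          # decoder shift

x = [3.0, 4.0]
y = E * x .+ b_enc          # encode: Ex + b
x_back = D * y .+ b_dec     # decode: Dy + b, recovers x
```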

source
CalibrateEmulateSample.Utilities.get_decoder_from_scheduleMethod
get_decoder_from_schedule(encoder_schedule::AbstractString)

Affine decodings can be represented as Dx + b. This function returns D,b. D will be represented as a LinearMap object (can apply D = Matrix(D) to rebuild).

  • in_or_out: should be either "in" or "out", to retrieve either the input or output encoder
source
CalibrateEmulateSample.Utilities.get_encoded_dimMethod
get_encoded_dim(encoder_schedule::AbstractVector) -> Dict

Gets the dimension of the encoded space, returned as a Dict with keys "in", "out". Provides nothing values if the encoder schedule is empty or uninitialized

source
CalibrateEmulateSample.Utilities.get_encoded_dimMethod
get_encoded_dim(encoder_schedule::AbstractString)

Gets the dimension of the encoded space, for input (providing "in") or output (providing "out"); provides nothing if the encoder schedule is empty or uninitialized

source
CalibrateEmulateSample.Utilities.get_encoder_from_scheduleMethod
get_encoder_from_schedule(
    encoder_schedule::AbstractVector
) -> Dict

Affine encodings can be represented as Ex + b. This function returns E,b for the input and output encoders in a Dict indexed by "in" and "out". E will be represented as a LinearMap object (can apply E = Matrix(E) to rebuild).

source
CalibrateEmulateSample.Utilities.get_encoder_from_scheduleMethod
get_encoder_from_schedule(encoder_schedule::AbstractString)

Affine encodings can be represented as Ex + b. This function returns E,b. E will be represented as a LinearMap object (can apply E = Matrix(E) to rebuild).

  • in_or_out: should be either "in" or "out", to retrieve either the input or output encoder
source
CalibrateEmulateSample.Utilities.get_training_pointsMethod
get_training_points(
    ekp::EnsembleKalmanProcesses.EnsembleKalmanProcess{FT, IT, P},
    train_iterations::Union{AbstractVector{IT}, IT} where IT;
    g_final
) -> EnsembleKalmanProcesses.DataContainers.PairedDataContainer

Extract and flatten the training points needed to train an Emulator, returned as a PairedDataContainer.

  • ekp - EnsembleKalmanProcess holding the parameters and the data that were produced during the Ensemble Kalman (EK) process.
  • train_iterations - a number of iterations (extracting 1:train_iterations), or indices (e.g. train_iterations = 3:2:9), for EKP iterations to extract.
  • g_final[=nothing] - EKP will typically store one extra input data iteration. If desired, the user can add output data for this final iteration directly with g_final. It should be of type <: AbstractMatrix, sized consistently with return values from get_g(ekp, 1).
source
CalibrateEmulateSample.Utilities.initialize_and_encode_with_schedule!Method
initialize_and_encode_with_schedule!(
    encoder_schedule::AbstractVector,
    io_pairs::EnsembleKalmanProcesses.DataContainers.PairedDataContainer;
    input_structure_mats,
    output_structure_mats,
    input_structure_vecs,
    output_structure_vecs,
    prior_cov,
    obs_noise_cov,
    observation,
    samples_in,
    samples_out
) -> Tuple{EnsembleKalmanProcesses.DataContainers.PairedDataContainer, Dict{Symbol, Union{LinearAlgebra.UniformScaling, LinearMaps.LinearMap, AbstractMatrix, AbstractVector}}, Dict{Symbol, Union{LinearAlgebra.UniformScaling, LinearMaps.LinearMap, AbstractMatrix, AbstractVector}}, Dict{Symbol, Union{AbstractMatrix, AbstractVector}}, Dict{Symbol, Union{AbstractMatrix, AbstractVector}}}

Takes in the created encoder schedule (see create_encoder_schedule), initializes it, and encodes the paired data container and structure matrices with it.

source
CalibrateEmulateSample.Utilities.initialize_processor!Method
initialize_processor!(
    cc::CalibrateEmulateSample.Utilities.CanonicalCorrelation,
    in_data::AbstractMatrix,
    out_data::AbstractMatrix,
    input_structure_matrices,
    output_structure_matrices,
    input_structure_vectors,
    output_structure_vectors,
    apply_to::AbstractString
) -> Any

Computes and populates the data_mean, encoder_mat, decoder_mat and apply_to fields for the CanonicalCorrelation

source
CalibrateEmulateSample.Utilities.initialize_processor!Method
initialize_processor!(
    dd::CalibrateEmulateSample.Utilities.Decorrelator,
    data::AbstractMatrix,
    structure_matrices::Dict{Symbol, SM<:Union{LinearAlgebra.UniformScaling, LinearMaps.LinearMap, AbstractMatrix, AbstractVector}},
    _::Dict{Symbol, SV<:Union{AbstractMatrix, AbstractVector}}
) -> Any

Computes and populates the data_mean and encoder_mat and decoder_mat fields for the Decorrelator

source
CalibrateEmulateSample.Utilities.isequal_linearMethod
isequal_linear(
    A::LinearMaps.LinearMap,
    B::LinearMaps.LinearMap;
    tol,
    n_eval,
    rng,
    up_to_sign
) -> Bool

Tests equality for a LinearMap on a standard basis of the input space. Note that this operation requires a matrix multiply per input dimension so can be expensive.

Kwargs:

  • n_eval (=nothing): the number of basis vectors to compare against (randomly selected without replacement if n_eval < size(A,1))
  • tol (=2*eps()): the tolerance for equality on evaluation per entry
  • rng (=Random.default_rng()): when provided, and n_eval < size(A,1), a random subset of the basis is compared, using this rng
  • up_to_sign (=false): only assess equality up to a sign-error (sufficient for e.g. encoder/decoder matrices)
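The basis-comparison idea can be sketched for two matrix-free operators in plain Julia (illustrative only; the package method operates on LinearMap objects):

```julia
using LinearAlgebra

apply_A = x -> [2x[1], 3x[2]]            # operator known only through its action
apply_B = x -> [2.0 0.0; 0.0 3.0] * x    # a second operator to compare against
n = 2
basis = Matrix(1.0I, n, n)               # standard basis of the input space
equal_on_basis = all(
    maximum(abs.(apply_A(basis[:, j]) .- apply_B(basis[:, j]))) <= 2eps() for j in 1:n
)
```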
source
CalibrateEmulateSample.Utilities.likelihood_informedMethod
likelihood_informed(
;
    retain_info,
    iters,
    grad_type
) -> CalibrateEmulateSample.Utilities.LikelihoodInformed{Vector{Any}, Vector{Any}, Vector{Any}, Vector{Int64}, Int64}

Constructs the LikelihoodInformed struct. Keywords:

  • retain_info: the method will attempt to limit the KL divergence of the true posterior from the reduced posterior to a value proportional to (1 - retain_info). Choose retain_info close to 1 to get a good approximation in a large subspace, and reduce it to get a worse approximation in a smaller subspace.
  • iters[= [1]]: the likelihood-informed data processor requires samples from the distribution ∝ π_prior(x) π_likelihood(y | x)^α with α ∈ [0, 1]. Here, iters indicates the structure vector iterations to use, as sampled from these distributions. For how to pass in these samples, see the use_data_as_samples parameter.
  • grad_type[= :linreg]: how the gradient of the forward model at the samples will be approximated. Choose from :linreg (global linear regression) and :localsl (localized statistical linearization; see [Wacker, 2025]).
source
CalibrateEmulateSample.Utilities.minmax_scaleMethod
minmax_scale(

) -> CalibrateEmulateSample.Utilities.ElementwiseScaler{CalibrateEmulateSample.Utilities.MinMaxScaling, Vector{Float64}, Vector, Vector, Vector, Vector}

Constructs ElementwiseScaler{MinMaxScaling} processor. As part of an encoder schedule, this will apply the transform $\frac{x - \min(x)}{\max(x) - \min(x)}$ to each data dimension.
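A plain-Julia sketch of the per-dimension transform (illustrative toy data, not the processor internals):

```julia
data = [1.0 3.0 5.0; 10.0 20.0 30.0]   # rows are dimensions, columns are samples
lo = minimum(data, dims = 2)
hi = maximum(data, dims = 2)
scaled = (data .- lo) ./ (hi .- lo)    # each row now spans [0, 1]
```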

source
CalibrateEmulateSample.Utilities.norm_linear_mapMethod
norm_linear_map(A::LinearMaps.LinearMap; ...) -> Any
norm_linear_map(
    A::LinearMaps.LinearMap,
    p::Real;
    n_eval,
    rng
) -> Any

Approximately computes the norm of a LinearMap object. For Amap associated with matrix A, norm_linear_map(Amap,p)≈norm(A,p). Can be aliased as norm()

kwargs

  • n_eval(=nothing): number of mat-vec products to apply in the approximation (larger is more accurate). default performs size(map,2) products
  • rng(=Random.default_rng()): random number generator
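The idea of recovering a matrix norm from mat-vec products alone can be sketched as follows (an exact Frobenius norm from a full basis sweep; illustrative only, not the package's randomized estimator):

```julia
using LinearAlgebra, Random

rng = MersenneTwister(1)
A = randn(rng, 20, 20)
apply_A = x -> A * x    # the matrix is only available via mat-vec products

# ||A||_F^2 = Σ_j ||A e_j||^2, accumulated over the standard basis
n = 20
frob2 = sum(sum(abs2, apply_A(Matrix(1.0I, n, n)[:, j])) for j in 1:n)
sqrt(frob2)             # ≈ norm(A), the Frobenius norm, from n products
```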
source
CalibrateEmulateSample.Utilities.quartile_scaleMethod
quartile_scale(

) -> CalibrateEmulateSample.Utilities.ElementwiseScaler{CalibrateEmulateSample.Utilities.QuartileScaling, Vector{Float64}, Vector, Vector, Vector, Vector}

Constructs ElementwiseScaler{QuartileScaling} processor. As part of an encoder schedule, it will apply the transform $\frac{x - Q2(x)}{Q3(x) - Q1(x)}$ to each data dimension. Also known as "robust scaling".
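A plain-Julia sketch of the quartile transform on one data dimension (illustrative toy data, not the processor internals):

```julia
using Statistics

x = [1.0, 2.0, 3.0, 4.0, 100.0]             # one data dimension with an outlier
q1, q2, q3 = quantile(x, [0.25, 0.5, 0.75])
scaled = (x .- q2) ./ (q3 - q1)             # centering and spread are robust to the outlier
```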

source
CalibrateEmulateSample.Utilities.zscore_scaleMethod
zscore_scale(

) -> CalibrateEmulateSample.Utilities.ElementwiseScaler{CalibrateEmulateSample.Utilities.ZScoreScaling, Vector{Float64}, Vector, Vector, Vector, Vector}

Constructs ElementwiseScaler{ZScoreScaling} processor. As part of an encoder schedule, this will apply the transform $\frac{x-\mu}{\sigma}$, (where $x\sim N(\mu,\sigma)$), to each data dimension. For multivariate standardization, see Decorrelator
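A plain-Julia sketch of the z-score transform on one data dimension (illustrative toy data, not the processor internals):

```julia
using Statistics

x = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
z = (x .- mean(x)) ./ std(x)    # zero mean, unit (sample) standard deviation
```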

source