nerfbaselines.metrics¶
- nerfbaselines.metrics.dmpix_ssim(a: ndarray, b: ndarray, *, max_val: float = 1.0, kernel_size: int = 11, sigma: float = 1.5, k1: float = 0.01, k2: float = 0.03, return_map: bool = False, filter_fn: Callable[[ndarray], ndarray] | None = None) ndarray [source]¶
Computes the structural similarity index (SSIM) between image pairs.
This function is based on the standard SSIM implementation from: Z. Wang, A. C. Bovik, H. R. Sheikh and E. P. Simoncelli, “Image quality assessment: from error visibility to structural similarity”, in IEEE Transactions on Image Processing, vol. 13, no. 4, pp. 600-612, 2004.
This function was modeled after tf.image.ssim, and should produce comparable output.
Note: the true SSIM is only defined on grayscale. This function does not perform any colorspace transform. If the input is in a color space, then it will compute the average SSIM.
NOTE: This function exactly matches dm_pix.ssim
- Parameters:
a – First image (or set of images).
b – Second image (or set of images).
max_val – The maximum magnitude that a or b can have.
kernel_size – Window size (>= 1). Image dims must be at least this small.
sigma – The bandwidth of the Gaussian used for filtering (> 0.).
k1 – One of the SSIM dampening parameters (> 0.).
k2 – One of the SSIM dampening parameters (> 0.).
return_map – If True, will cause the per-pixel SSIM “map” to be returned.
precision – The numerical precision to use when performing convolution.
- Returns:
Each image’s mean SSIM, or a tensor of individual values if return_map.
- nerfbaselines.metrics.lpips(a: ndarray, b: ndarray) ndarray | float32 [source]¶
Compute Learned Perceptual Image Patch Similarity (the lower the better). :param a: Tensor of prediction images [B…, H, W, C]. :param b: Tensor of target images [B…, H, W, C].
- Returns:
Tensor of LPIPS values for each image [B…].
- nerfbaselines.metrics.lpips_alex(a: ndarray, b: ndarray) ndarray | float32 [source]¶
Compute Learned Perceptual Image Patch Similarity (the lower the better). :param a: Tensor of prediction images [B…, H, W, C]. :param b: Tensor of target images [B…, H, W, C].
- Returns:
Tensor of LPIPS values for each image [B…].
- nerfbaselines.metrics.lpips_vgg(a: ndarray, b: ndarray) ndarray | float32 [source]¶
Compute Learned Perceptual Image Patch Similarity (the lower the better). :param a: Tensor of prediction images [B…, H, W, C]. :param b: Tensor of target images [B…, H, W, C].
- Returns:
Tensor of LPIPS values for each image [B…].
- nerfbaselines.metrics.mae(a: ndarray, b: ndarray) ndarray | float32 [source]¶
Compute Mean Absolute Error (the lower the better). :param a: Tensor of prediction images [B, H, W, C]. :param b: Tensor of target images [B, H, W, C].
- Returns:
Tensor of mean absolute error values for each image [B].
- nerfbaselines.metrics.mse(a: ndarray, b: ndarray) ndarray | float32 [source]¶
Compute Mean Squared Error (the lower the better). :param a: Tensor of prediction images [B, H, W, C]. :param b: Tensor of target images [B, H, W, C].
- Returns:
Tensor of mean squared error values for each image [B].
- nerfbaselines.metrics.psnr(a: ndarray | float32 | float64, b: ndarray | None = None) ndarray | float32 | float64 [source]¶
Compute Peak Signal to Noise Ratio (the higher the better). It can reuse computed MSE values if b is None. :param a: Tensor of prediction images [B, H, W, C] or a tensor of MSE values [B] (b must be None in that case). :param b: Tensor of target images [B, H, W, C] or None (if a are MSE values).
- Returns:
Tensor of PSNR values for each image [B].
- nerfbaselines.metrics.ssim(a: ndarray, b: ndarray) ndarray | float32 [source]¶
Compute Structural Similarity Index Measure (the higher the better). :param a: Tensor of prediction images [B, H, W, C]. :param b: Tensor of target images [B, H, W, C].
- Returns:
Tensor of mean SSIM values for each image [B].
- nerfbaselines.metrics.torchmetrics_ssim(a: ndarray, b: ndarray, *, gaussian_kernel: bool = True, sigma: float | Sequence[float] = 1.5, kernel_size: int | Sequence[int] = 11, data_range: float | Tuple[float, float] | None = None, k1: float = 0.01, k2: float = 0.03) ndarray | Tuple[ndarray, ndarray] [source]¶
Compute Structural Similarity Index Measure.
NOTE: this metric exactly matches torchmetrics.ssim
- Parameters:
preds – estimated image
target – ground truth image
gaussian_kernel – If true (default), a gaussian kernel is used, if false a uniform kernel is used
sigma – Standard deviation of the gaussian kernel, anisotropic kernels are possible. Ignored if a uniform kernel is used
kernel_size – the size of the uniform kernel, anisotropic kernels are possible. Ignored if a Gaussian kernel is used
data_range – Range of the image. If
None
, it is determined from the image (max - min)k1 – Parameter of SSIM.
k2 – Parameter of SSIM.