Module: transform <a class="headerlink" href="#module-skimage.transform" title="Permalink to this heading">¶

iradon_sart¶

skimage.transform.iradon_sart(radon_image, theta=None, image=None, projection_shifts=None, clip=None, relaxation=0.15, dtype=None)[source]¶

Inverse radon transform.

Reconstruct an image from the radon transform, using a single iteration of the Simultaneous Algebraic Reconstruction Technique (SART) algorithm.

Parameters:

radon_image2D array: Image containing radon transform (sinogram). Each column of the image corresponds to a projection along a different angle. The tomography rotation axis should lie at the pixel index radon_image.shape[0] // 2 along the 0th dimension of radon_image.
theta1D array, optional: Reconstruction angles (in degrees). Default: m angles evenly spaced between 0 and 180 (if the shape of radon_image is (N, M)).
image2D array, optional: Image containing an initial reconstruction estimate. Shape of this array should be (radon_image.shape[0], radon_image.shape[0]). The default is an array of zeros.
projection_shifts1D array, optional: Shift the projections contained in radon_image (the sinogram) by this many pixels before reconstructing the image. The i’th value defines the shift of the i’th column of radon_image.
cliplength-2 sequence of floats, optional: Force all values in the reconstructed tomogram to lie in the range [clip[0], clip[1]]
relaxationfloat, optional: Relaxation parameter for the update step. A higher value can improve the convergence rate, but one runs the risk of instabilities. Values close to or higher than 1 are not recommended.
dtypedtype, optional: Output data type, must be floating point. By default, if input data type is not float, input is cast to double, otherwise dtype is set to input data type.

Returns:

reconstructedndarray: Reconstructed image. The rotation axis will be located in the pixel with indices (reconstructed.shape[0] // 2, reconstructed.shape[1] // 2).

Notes

Algebraic Reconstruction Techniques are based on formulating the tomography reconstruction problem as a set of linear equations. Along each ray, the projected value is the sum of all the values of the cross section along the ray. A typical feature of SART (and a few other variants of algebraic techniques) is that it samples the cross section at equidistant points along the ray, using linear interpolation between the pixel values of the cross section. The resulting set of linear equations are then solved using a slightly modified Kaczmarz method.

When using SART, a single iteration is usually sufficient to obtain a good reconstruction. Further iterations will tend to enhance high-frequency information, but will also often increase the noise.

References

[1]

AC Kak, M Slaney, “Principles of Computerized Tomographic Imaging”, IEEE Press 1988.

[2]

AH Andersen, AC Kak, “Simultaneous algebraic reconstruction technique (SART): a superior implementation of the ART algorithm”, Ultrasonic Imaging 6 pp 81–94 (1984)

[3]

S Kaczmarz, “Angenäherte auflösung von systemen linearer gleichungen”, Bulletin International de l’Academie Polonaise des Sciences et des Lettres 35 pp 355–357 (1937)

[4]

Kohler, T. “A projection access scheme for iterative reconstruction based on the golden section.” Nuclear Science Symposium Conference Record, 2004 IEEE. Vol. 6. IEEE, 2004.

[5]

Kaczmarz’ method, Wikipedia, https://en.wikipedia.org/wiki/Kaczmarz_method

Examples using `skimage.transform.iradon_sart`¶

Straight line Hough transform

matrix_transform¶

skimage.transform.matrix_transform(coords, matrix)[source]¶

Apply 2D matrix transform.

Parameters:

coords(N, 2) array_like: x, y coordinates to transform
matrix(3, 3) array_like: Homogeneous transformation matrix.

Returns:

coords(N, 2) array: Transformed coordinates.

order_angles_golden_ratio¶

skimage.transform.order_angles_golden_ratio(theta)[source]¶

Order angles to reduce the amount of correlated information in subsequent projections.

Parameters:

theta1D array of floats: Projection angles in degrees. Duplicate angles are not allowed.

Returns:

indices_generatorgenerator yielding unsigned integers: The returned generator yields indices into theta such that theta[indices] gives the approximate golden ratio ordering of the projections. In total, len(theta) indices are yielded. All non-negative integers < len(theta) are yielded exactly once.

Notes

The method used here is that of the golden ratio introduced by T. Kohler.

References

[1]

Kohler, T. “A projection access scheme for iterative reconstruction based on the golden section.” Nuclear Science Symposium Conference Record, 2004 IEEE. Vol. 6. IEEE, 2004.

[2]

Winkelmann, Stefanie, et al. “An optimal radial profile order based on the Golden Ratio for time-resolved MRI.” Medical Imaging, IEEE Transactions on 26.1 (2007): 68-76.

probabilistic_hough_line¶

skimage.transform.probabilistic_hough_line(image, threshold=10, line_length=50, line_gap=10, theta=None, seed=None)[source]¶

Return lines from a progressive probabilistic line Hough transform.

Parameters:

image(M, N) ndarray: Input image with nonzero values representing edges.
thresholdint, optional: Threshold
line_lengthint, optional: Minimum accepted length of detected lines. Increase the parameter to extract longer lines.
line_gapint, optional: Maximum gap between pixels to still form a line. Increase the parameter to merge broken lines more aggressively.
theta1D ndarray, dtype=double, optional: Angles at which to compute the transform, in radians. Defaults to a vector of 180 angles evenly spaced in the range [-pi/2, pi/2).
seedint, optional: Seed to initialize the random number generator.

Returns:

lineslist: List of lines identified, lines in format ((x0, y0), (x1, y1)), indicating line start and end.

References

[1]

C. Galamhos, J. Matas and J. Kittler, “Progressive probabilistic Hough transform for line detection”, in IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1999.

Examples using `skimage.transform.probabilistic_hough_line`¶

pyramid_expand¶

skimage.transform.pyramid_expand(image, upscale=2, sigma=None, order=1, mode='reflect', cval=0, preserve_range=False, *, channel_axis=None)[source]¶

Upsample and then smooth image.

Parameters:

imagendarray: Input image.
upscalefloat, optional: Upscale factor.
sigmafloat, optional: Sigma for Gaussian filter. Default is 2 * upscale / 6.0 which corresponds to a filter mask twice the size of the scale factor that covers more than 99% of the Gaussian distribution.
orderint, optional: Order of splines used in interpolation of upsampling. See skimage.transform.warp for detail.
mode{‘reflect’, ‘constant’, ‘edge’, ‘symmetric’, ‘wrap’}, optional: The mode parameter determines how the array borders are handled, where cval is the value when mode is equal to ‘constant’.
cvalfloat, optional: Value to fill past edges of input if mode is ‘constant’.
preserve_rangebool, optional: Whether to keep the original range of values. Otherwise, the input image is converted according to the conventions of img_as_float. Also see https://scikit-image.org/docs/dev/user_guide/data_types.html
channel_axisint or None, optional: If None, the image is assumed to be a grayscale (single channel) image. Otherwise, this parameter indicates which axis of the array corresponds to channels.

New in version 0.19: channel_axis was added in 0.19.

Returns:

outarray: Upsampled and smoothed float image.

References

[1]

pyramid_gaussian¶

skimage.transform.pyramid_gaussian(image, max_layer=-1, downscale=2, sigma=None, order=1, mode='reflect', cval=0, preserve_range=False, *, channel_axis=None)[source]¶

Yield images of the Gaussian pyramid formed by the input image.

Recursively applies the pyramid_reduce function to the image, and yields the downscaled images.

Note that the first image of the pyramid will be the original, unscaled image. The total number of images is max_layer + 1. In case all layers are computed, the last image is either a one-pixel image or the image where the reduction does not change its shape.

Parameters:

imagendarray: Input image.
max_layerint, optional: Number of layers for the pyramid. 0th layer is the original image. Default is -1 which builds all possible layers.
downscalefloat, optional: Downscale factor.
sigmafloat, optional: Sigma for Gaussian filter. Default is 2 * downscale / 6.0 which corresponds to a filter mask twice the size of the scale factor that covers more than 99% of the Gaussian distribution.
orderint, optional: Order of splines used in interpolation of downsampling. See skimage.transform.warp for detail.
mode{‘reflect’, ‘constant’, ‘edge’, ‘symmetric’, ‘wrap’}, optional: The mode parameter determines how the array borders are handled, where cval is the value when mode is equal to ‘constant’.
cvalfloat, optional: Value to fill past edges of input if mode is ‘constant’.
preserve_rangebool, optional: Whether to keep the original range of values. Otherwise, the input image is converted according to the conventions of img_as_float. Also see https://scikit-image.org/docs/dev/user_guide/data_types.html
channel_axisint or None, optional: If None, the image is assumed to be a grayscale (single channel) image. Otherwise, this parameter indicates which axis of the array corresponds to channels.

New in version 0.19: channel_axis was added in 0.19.

Returns:

pyramidgenerator: Generator yielding pyramid layers as float images.

References

[1]

Examples using `skimage.transform.pyramid_gaussian`¶

Build image pyramids

pyramid_laplacian¶

skimage.transform.pyramid_laplacian(image, max_layer=-1, downscale=2, sigma=None, order=1, mode='reflect', cval=0, preserve_range=False, *, channel_axis=None)[source]¶

Yield images of the laplacian pyramid formed by the input image.

Each layer contains the difference between the downsampled and the downsampled, smoothed image:

layer = resize(prev_layer) - smooth(resize(prev_layer))

Note that the first image of the pyramid will be the difference between the original, unscaled image and its smoothed version. The total number of images is max_layer + 1. In case all layers are computed, the last image is either a one-pixel image or the image where the reduction does not change its shape.

Parameters:

imagendarray: Input image.
max_layerint, optional: Number of layers for the pyramid. 0th layer is the original image. Default is -1 which builds all possible layers.
downscalefloat, optional: Downscale factor.
sigmafloat, optional: Sigma for Gaussian filter. Default is 2 * downscale / 6.0 which corresponds to a filter mask twice the size of the scale factor that covers more than 99% of the Gaussian distribution.
orderint, optional: Order of splines used in interpolation of downsampling. See skimage.transform.warp for detail.
mode{‘reflect’, ‘constant’, ‘edge’, ‘symmetric’, ‘wrap’}, optional: The mode parameter determines how the array borders are handled, where cval is the value when mode is equal to ‘constant’.
cvalfloat, optional: Value to fill past edges of input if mode is ‘constant’.
preserve_rangebool, optional: Whether to keep the original range of values. Otherwise, the input image is converted according to the conventions of img_as_float. Also see https://scikit-image.org/docs/dev/user_guide/data_types.html
channel_axisint or None, optional: If None, the image is assumed to be a grayscale (single channel) image. Otherwise, this parameter indicates which axis of the array corresponds to channels.

New in version 0.19: channel_axis was added in 0.19.

Returns:

pyramidgenerator: Generator yielding pyramid layers as float images.

References

[1]

http://sepwww.stanford.edu/data/media/public/sep/morgan/texturematch/paper_html/node3.html

[2]

pyramid_reduce¶

skimage.transform.pyramid_reduce(image, downscale=2, sigma=None, order=1, mode='reflect', cval=0, preserve_range=False, *, channel_axis=None)[source]¶

Smooth and then downsample image.

Parameters:

imagendarray: Input image.
downscalefloat, optional: Downscale factor.
sigmafloat, optional: Sigma for Gaussian filter. Default is 2 * downscale / 6.0 which corresponds to a filter mask twice the size of the scale factor that covers more than 99% of the Gaussian distribution.
orderint, optional: Order of splines used in interpolation of downsampling. See skimage.transform.warp for detail.
mode{‘reflect’, ‘constant’, ‘edge’, ‘symmetric’, ‘wrap’}, optional: The mode parameter determines how the array borders are handled, where cval is the value when mode is equal to ‘constant’.
cvalfloat, optional: Value to fill past edges of input if mode is ‘constant’.
preserve_rangebool, optional: Whether to keep the original range of values. Otherwise, the input image is converted according to the conventions of img_as_float. Also see https://scikit-image.org/docs/dev/user_guide/data_types.html
channel_axisint or None, optional: If None, the image is assumed to be a grayscale (single channel) image. Otherwise, this parameter indicates which axis of the array corresponds to channels.

New in version 0.19: channel_axis was added in 0.19.

Returns:

outarray: Smoothed and downsampled float image.

References

[1]

radon¶

skimage.transform.radon(image, theta=None, circle=True, *, preserve_range=False)[source]¶

Calculates the radon transform of an image given specified projection angles.

Parameters:

imagearray_like: Input image. The rotation axis will be located in the pixel with indices (image.shape[0] // 2, image.shape[1] // 2).
thetaarray_like, optional: Projection angles (in degrees). If None, the value is set to np.arange(180).
circleboolean, optional: Assume image is zero outside the inscribed circle, making the width of each projection (the first dimension of the sinogram) equal to min(image.shape).
preserve_rangebool, optional: Whether to keep the original range of values. Otherwise, the input image is converted according to the conventions of img_as_float. Also see https://scikit-image.org/docs/dev/user_guide/data_types.html

Returns:

radon_imagendarray: Radon transform (sinogram). The tomography rotation axis will lie at the pixel index radon_image.shape[0] // 2 along the 0th dimension of radon_image.

Notes

Based on code of Justin K. Romberg (https://www.clear.rice.edu/elec431/projects96/DSP/bpanalysis.html)

References

[1]

AC Kak, M Slaney, “Principles of Computerized Tomographic Imaging”, IEEE Press 1988.

[2]

Examples using `skimage.transform.radon`¶

Interpolation: Edge Modes

rescale¶

skimage.transform.rescale(image, scale, order=None, mode='reflect', cval=0, clip=True, preserve_range=False, anti_aliasing=None, anti_aliasing_sigma=None, *, channel_axis=None)[source]¶

Scale image by a certain factor.

Performs interpolation to up-scale or down-scale N-dimensional images. Note that anti-aliasing should be enabled when down-sizing images to avoid aliasing artifacts. For down-sampling with an integer factor also see skimage.transform.downscale_local_mean.

Parameters:

imagendarray: Input image.
scale{float, tuple of floats}: Scale factors. Separate scale factors can be defined as (rows, cols[, …][, dim]).

Returns:

scaledndarray: Scaled version of the input.

Other Parameters:

orderint, optional: The order of the spline interpolation, default is 0 if image.dtype is bool and 1 otherwise. The order has to be in the range 0-5. See skimage.transform.warp for detail.
mode{‘constant’, ‘edge’, ‘symmetric’, ‘reflect’, ‘wrap’}, optional: Points outside the boundaries of the input are filled according to the given mode. Modes match the behaviour of numpy.pad.
cvalfloat, optional: Used in conjunction with mode ‘constant’, the value outside the image boundaries.
clipbool, optional: Whether to clip the output to the range of values of the input image. This is enabled by default, since higher order interpolation may produce values outside the given input range.
preserve_rangebool, optional: Whether to keep the original range of values. Otherwise, the input image is converted according to the conventions of img_as_float. Also see https://scikit-image.org/docs/dev/user_guide/data_types.html
anti_aliasingbool, optional: Whether to apply a Gaussian filter to smooth the image prior to down-scaling. It is crucial to filter when down-sampling the image to avoid aliasing artifacts. If input image data type is bool, no anti-aliasing is applied.
anti_aliasing_sigma{float, tuple of floats}, optional: Standard deviation for Gaussian filtering to avoid aliasing artifacts. By default, this value is chosen as (s - 1) / 2 where s is the down-scaling factor.
channel_axisint or None, optional: If None, the image is assumed to be a grayscale (single channel) image. Otherwise, this parameter indicates which axis of the array corresponds to channels.

New in version 0.19: channel_axis was added in 0.19.

Notes

Modes ‘reflect’ and ‘symmetric’ are similar, but differ in whether the edge pixels are duplicated during the reflection. As an example, if an array has values [0, 1, 2] and was padded to the right by four values using symmetric, the result would be [0, 1, 2, 2, 1, 0, 0], while for reflect it would be [0, 1, 2, 1, 0, 1, 2].

Examples

>>> from skimage import data
>>> from skimage.transform import rescale
>>> image = data.camera()
>>> rescale(image, 0.1).shape
(51, 51)
>>> rescale(image, 0.5).shape
(256, 256)

Examples using `skimage.transform.rescale`¶

Rescale, resize, and downscale

Using Polar and Log-Polar Transformations for Registration

resize¶

skimage.transform.resize(image, output_shape, order=None, mode='reflect', cval=0, clip=True, preserve_range=False, anti_aliasing=None, anti_aliasing_sigma=None)[source]¶

Resize image to match a certain size.

Performs interpolation to up-size or down-size N-dimensional images. Note that anti-aliasing should be enabled when down-sizing images to avoid aliasing artifacts. For downsampling with an integer factor also see skimage.transform.downscale_local_mean.

Parameters:

imagendarray: Input image.
output_shapeiterable: Size of the generated output image (rows, cols[, …][, dim]). If dim is not provided, the number of channels is preserved. In case the number of input channels does not equal the number of output channels a n-dimensional interpolation is applied.

Returns:

resizedndarray: Resized version of the input.

Other Parameters:

orderint, optional: The order of the spline interpolation, default is 0 if image.dtype is bool and 1 otherwise. The order has to be in the range 0-5. See skimage.transform.warp for detail.
mode{‘constant’, ‘edge’, ‘symmetric’, ‘reflect’, ‘wrap’}, optional: Points outside the boundaries of the input are filled according to the given mode. Modes match the behaviour of numpy.pad.
cvalfloat, optional: Used in conjunction with mode ‘constant’, the value outside the image boundaries.
clipbool, optional: Whether to clip the output to the range of values of the input image. This is enabled by default, since higher order interpolation may produce values outside the given input range.
preserve_rangebool, optional: Whether to keep the original range of values. Otherwise, the input image is converted according to the conventions of img_as_float. Also see https://scikit-image.org/docs/dev/user_guide/data_types.html
anti_aliasingbool, optional: Whether to apply a Gaussian filter to smooth the image prior to downsampling. It is crucial to filter when downsampling the image to avoid aliasing artifacts. If not specified, it is set to True when downsampling an image whose data type is not bool. It is also set to False when using nearest neighbor interpolation (order == 0) with integer input data type.
anti_aliasing_sigma{float, tuple of floats}, optional: Standard deviation for Gaussian filtering used when anti-aliasing. By default, this value is chosen as (s - 1) / 2 where s is the downsampling factor, where s > 1. For the up-size case, s < 1, no anti-aliasing is performed prior to rescaling.

Notes

Examples

>>> from skimage import data
>>> from skimage.transform import resize
>>> image = data.camera()
>>> resize(image, (100, 100)).shape
(100, 100)

Examples using `skimage.transform.resize`¶

Interpolation: Edge Modes

Rescale, resize, and downscale

resize_local_mean¶

skimage.transform.resize_local_mean(image, output_shape, grid_mode=True, preserve_range=False, *, channel_axis=None)[source]¶

Resize an array with the local mean / bilinear scaling.

Parameters:

imagendarray

Input image. If this is a multichannel image, the axis corresponding to channels should be specified using channel_axis

output_shapeiterable

Size of the generated output image. When channel_axis is not None, the channel_axis should either be omitted from output_shape or the output_shape[channel_axis] must match image.shape[channel_axis]. If the length of output_shape exceeds image.ndim, additional singleton dimensions will be appended to the input image as needed.

grid_modebool, optional

Defines image pixels position: if True, pixels are assumed to be at grid intersections, otherwise at cell centers. As a consequence, for example, a 1d signal of length 5 is considered to have length 4 when grid_mode is False, but length 5 when grid_mode is True. See the following visual illustration:

| pixel 1 | pixel 2 | pixel 3 | pixel 4 | pixel 5 |
     |<-------------------------------------->|
                        vs.
|<----------------------------------------------->|

The starting point of the arrow in the diagram above corresponds to coordinate location 0 in each mode.

preserve_rangebool, optional

Whether to keep the original range of values. Otherwise, the input image is converted according to the conventions of img_as_float. Also see https://scikit-image.org/docs/dev/user_guide/data_types.html

Returns:

resizedndarray: Resized version of the input.

rotate¶

skimage.transform.rotate(image, angle, resize=False, center=None, order=None, mode='constant', cval=0, clip=True, preserve_range=False)[source]¶

Rotate image by a certain angle around its center.

Parameters:

imagendarray: Input image.
anglefloat: Rotation angle in degrees in counter-clockwise direction.
resizebool, optional: Determine whether the shape of the output image will be automatically calculated, so the complete rotated image exactly fits. Default is False.
centeriterable of length 2: The rotation center. If center=None, the image is rotated around its center, i.e. center=(cols / 2 - 0.5, rows / 2 - 0.5). Please note that this parameter is (cols, rows), contrary to normal skimage ordering.

Returns:

rotatedndarray: Rotated version of the input.

Other Parameters:

orderint, optional: The order of the spline interpolation, default is 0 if image.dtype is bool and 1 otherwise. The order has to be in the range 0-5. See skimage.transform.warp for detail.
mode{‘constant’, ‘edge’, ‘symmetric’, ‘reflect’, ‘wrap’}, optional: Points outside the boundaries of the input are filled according to the given mode. Modes match the behaviour of numpy.pad.
cvalfloat, optional: Used in conjunction with mode ‘constant’, the value outside the image boundaries.
clipbool, optional: Whether to clip the output to the range of values of the input image. This is enabled by default, since higher order interpolation may produce values outside the given input range.
preserve_rangebool, optional: Whether to keep the original range of values. Otherwise, the input image is converted according to the conventions of img_as_float. Also see https://scikit-image.org/docs/dev/user_guide/data_types.html

Notes

Examples

>>> from skimage import data
>>> from skimage.transform import rotate
>>> image = data.camera()
>>> rotate(image, 2).shape
(512, 512)
>>> rotate(image, 2, resize=True).shape
(530, 530)
>>> rotate(image, 90, resize=True).shape
(512, 512)

Examples using `skimage.transform.rotate`¶

Using Polar and Log-Polar Transformations for Registration

Local Binary Pattern for texture classification

Sliding window histogram

Measure perimeters with different estimators

Measure region properties

Visual image comparison

swirl¶

skimage.transform.swirl(image, center=None, strength=1, radius=100, rotation=0, output_shape=None, order=None, mode='reflect', cval=0, clip=True, preserve_range=False)[source]¶

Perform a swirl transformation.

Parameters:

imagendarray: Input image.
center(column, row) tuple or (2,) ndarray, optional: Center coordinate of transformation.
strengthfloat, optional: The amount of swirling applied.
radiusfloat, optional: The extent of the swirl in pixels. The effect dies out rapidly beyond radius.
rotationfloat, optional: Additional rotation applied to the image.

Returns:

swirledndarray: Swirled version of the input.

Other Parameters:

output_shapetuple (rows, cols), optional: Shape of the output image generated. By default the shape of the input image is preserved.
orderint, optional: The order of the spline interpolation, default is 0 if image.dtype is bool and 1 otherwise. The order has to be in the range 0-5. See skimage.transform.warp for detail.
mode{‘constant’, ‘edge’, ‘symmetric’, ‘reflect’, ‘wrap’}, optional: Points outside the boundaries of the input are filled according to the given mode, with ‘constant’ used as the default. Modes match the behaviour of numpy.pad.
cvalfloat, optional: Used in conjunction with mode ‘constant’, the value outside the image boundaries.
clipbool, optional: Whether to clip the output to the range of values of the input image. This is enabled by default, since higher order interpolation may produce values outside the given input range.
preserve_rangebool, optional: Whether to keep the original range of values. Otherwise, the input image is converted according to the conventions of img_as_float. Also see https://scikit-image.org/docs/dev/user_guide/data_types.html

Examples using `skimage.transform.swirl`¶

Swirl

warp¶

skimage.transform.warp(image, inverse_map, map_args={}, output_shape=None, order=None, mode='constant', cval=0.0, clip=True, preserve_range=False)[source]¶

Warp an image according to a given coordinate transformation.

Parameters:

imagendarray

Input image.

inverse_maptransformation object, callable cr = f(cr, **kwargs), or ndarray

Inverse coordinate map, which transforms coordinates in the output images into their corresponding coordinates in the input image.

There are a number of different options to define this map, depending on the dimensionality of the input image. A 2-D image can have 2 dimensions for gray-scale images, or 3 dimensions with color information.

For 2-D images, you can directly pass a transformation object, e.g. skimage.transform.SimilarityTransform, or its inverse.

For 2-D images, you can pass a (3, 3) homogeneous transformation matrix, e.g. skimage.transform.SimilarityTransform.params.

For 2-D images, a function that transforms a (M, 2) array of (col, row) coordinates in the output image to their corresponding coordinates in the input image. Extra parameters to the function can be specified through map_args.

For N-D images, you can directly pass an array of coordinates. The first dimension specifies the coordinates in the input image, while the subsequent dimensions determine the position in the output image. E.g. in case of 2-D images, you need to pass an array of shape (2, rows, cols), where rows and cols determine the shape of the output image, and the first dimension contains the (row, col) coordinate in the input image. See scipy.ndimage.map_coordinates for further documentation.

Note, that a (3, 3) matrix is interpreted as a homogeneous transformation matrix, so you cannot interpolate values from a 3-D input, if the output is of shape (3,).

See example section for usage.

map_argsdict, optional

Keyword arguments passed to inverse_map.

output_shapetuple (rows, cols), optional

Shape of the output image generated. By default the shape of the input image is preserved. Note that, even for multi-band images, only rows and columns need to be specified.

orderint, optional

The order of interpolation. The order has to be in the range 0-5:

0: Nearest-neighbor
1: Bi-linear (default)
2: Bi-quadratic
3: Bi-cubic
4: Bi-quartic
5: Bi-quintic

Default is 0 if image.dtype is bool and 1 otherwise.

mode{‘constant’, ‘edge’, ‘symmetric’, ‘reflect’, ‘wrap’}, optional

Points outside the boundaries of the input are filled according to the given mode. Modes match the behaviour of numpy.pad.

cvalfloat, optional

Used in conjunction with mode ‘constant’, the value outside the image boundaries.

clipbool, optional

Whether to clip the output to the range of values of the input image. This is enabled by default, since higher order interpolation may produce values outside the given input range.

preserve_rangebool, optional

Returns:

warpeddouble ndarray: The warped input image.

Notes

The input image is converted to a double image.
In case of a SimilarityTransform, AffineTransform and ProjectiveTransform and order in [0, 3] this function uses the underlying transformation matrix to warp the image with a much faster routine.

Examples

>>> from skimage.transform import warp
>>> from skimage import data
>>> image = data.camera()

The following image warps are all equal but differ substantially in execution time. The image is shifted to the bottom.

Use a geometric transform to warp an image (fast):

>>> from skimage.transform import SimilarityTransform
>>> tform = SimilarityTransform(translation=(0, -10))
>>> warped = warp(image, tform)

Use a callable (slow):

>>> def shift_down(xy):
...     xy[:, 1] -= 10
...     return xy
>>> warped = warp(image, shift_down)

Use a transformation matrix to warp an image (fast):

>>> matrix = np.array([[1, 0, 0], [0, 1, -10], [0, 0, 1]])
>>> warped = warp(image, matrix)
>>> from skimage.transform import ProjectiveTransform
>>> warped = warp(image, ProjectiveTransform(matrix=matrix))

You can also use the inverse of a geometric transformation (fast):

>>> warped = warp(image, tform.inverse)

For N-D images you can pass a coordinate array, that specifies the coordinates in the input image for every element in the output image. E.g. if you want to rescale a 3-D cube, you can do:

>>> cube_shape = np.array([30, 30, 30])
>>> rng = np.random.default_rng()
>>> cube = rng.random(cube_shape)

Setup the coordinate array, that defines the scaling:

>>> scale = 0.1
>>> output_shape = (scale * cube_shape).astype(int)
>>> coords0, coords1, coords2 = np.mgrid[:output_shape[0],
...                    :output_shape[1], :output_shape[2]]
>>> coords = np.array([coords0, coords1, coords2])

Assume that the cube contains spatial data, where the first array element center is at coordinate (0.5, 0.5, 0.5) in real space, i.e. we have to account for this extra offset when scaling the image:

>>> coords = (coords + 0.5) / scale - 0.5
>>> warped = warp(cube, coords)

Examples using `skimage.transform.warp`¶

Swirl

Piecewise Affine Transformation

Robust matching using RANSAC

Registration using optical flow

CENSURE feature detector

Corner detection

Using Polar and Log-Polar Transformations for Registration

warp_coords¶

skimage.transform.warp_coords(coord_map, shape, dtype=<class 'numpy.float64'>)[source]¶

Build the source coordinates for the output of a 2-D image warp.

Parameters:

coord_mapcallable like GeometricTransform.inverse: Return input coordinates for given output coordinates. Coordinates are in the shape (P, 2), where P is the number of coordinates and each element is a (row, col) pair.
shapetuple: Shape of output image (rows, cols[, bands]).
dtypenp.dtype or string: dtype for return value (sane choices: float32 or float64).

Returns:

coords(ndim, rows, cols[, bands]) array of dtype dtype: Coordinates for scipy.ndimage.map_coordinates, that will yield an image of shape (orows, ocols, bands) by drawing from source points according to the coord_transform_fn.

Notes

This is a lower-level routine that produces the source coordinates for 2-D images used by warp().

It is provided separately from warp to give additional flexibility to users who would like, for example, to re-use a particular coordinate mapping, to use specific dtypes at various points along the the image-warping process, or to implement different post-processing logic than warp performs after the call to ndi.map_coordinates.

Examples

Produce a coordinate map that shifts an image up and to the right:

>>> from skimage import data
>>> from scipy.ndimage import map_coordinates
>>>
>>> def shift_up10_left20(xy):
...     return xy - np.array([-20, 10])[None, :]
>>>
>>> image = data.astronaut().astype(np.float32)
>>> coords = warp_coords(shift_up10_left20, image.shape)
>>> warped_image = map_coordinates(image, coords)

warp_polar¶

skimage.transform.warp_polar(image, center=None, *, radius=None, output_shape=None, scaling='linear', channel_axis=None, **kwargs)[source]¶

Remap image to polar or log-polar coordinates space.

Parameters:

imagendarray: Input image. Only 2-D arrays are accepted by default. 3-D arrays are accepted if a channel_axis is specified.
centertuple (row, col), optional: Point in image that represents the center of the transformation (i.e., the origin in cartesian space). Values can be of type float. If no value is given, the center is assumed to be the center point of the image.
radiusfloat, optional: Radius of the circle that bounds the area to be transformed.
output_shapetuple (row, col), optional
scaling{‘linear’, ‘log’}, optional: Specify whether the image warp is polar or log-polar. Defaults to ‘linear’.
channel_axisint or None, optional: If None, the image is assumed to be a grayscale (single channel) image. Otherwise, this parameter indicates which axis of the array corresponds to channels.

New in version 0.19: channel_axis was added in 0.19.
**kwargskeyword arguments: Passed to transform.warp.

Returns:

warpedndarray: The polar or log-polar warped image.

Examples

Perform a basic polar warp on a grayscale image:

>>> from skimage import data
>>> from skimage.transform import warp_polar
>>> image = data.checkerboard()
>>> warped = warp_polar(image)

Perform a log-polar warp on a grayscale image:

>>> warped = warp_polar(image, scaling='log')

Perform a log-polar warp on a grayscale image while specifying center, radius, and output shape:

>>> warped = warp_polar(image, (100,100), radius=100,
...                     output_shape=image.shape, scaling='log')

Perform a log-polar warp on a color image:

>>> image = data.astronaut()
>>> warped = warp_polar(image, scaling='log', channel_axis=-1)

Examples using `skimage.transform.warp_polar`¶

`AffineTransform`¶

class skimage.transform.AffineTransform(matrix=None, scale=None, rotation=None, shear=None, translation=None, *, dimensionality=2)[source]¶

Bases: ProjectiveTransform

Affine transformation.

Has the following form:

X = a0*x + a1*y + a2 =
  = sx*x*cos(rotation) - sy*y*sin(rotation + shear) + a2

Y = b0*x + b1*y + b2 =
  = sx*x*sin(rotation) + sy*y*cos(rotation + shear) + b2

where sx and sy are scale factors in the x and y directions, and the homogeneous transformation matrix is:

[[a0  a1  a2]
 [b0  b1  b2]
 [0   0    1]]

In 2D, the transformation parameters can be given as the homogeneous transformation matrix, above, or as the implicit parameters, scale, rotation, shear, and translation in x (a2) and y (b2). For 3D and higher, only the matrix form is allowed.

In narrower transforms, such as the Euclidean (only rotation and translation) or Similarity (rotation, translation, and a global scale factor) transforms, it is possible to specify 3D transforms using implicit parameters also.

Parameters:

matrix(D+1, D+1) array_like, optional: Homogeneous transformation matrix. If this matrix is provided, it is an error to provide any of scale, rotation, shear, or translation.
scale{s as float or (sx, sy) as array, list or tuple}, optional: Scale factor(s). If a single value, it will be assigned to both sx and sy. Only available for 2D.

New in version 0.17: Added support for supplying a single scalar value.
rotationfloat, optional: Rotation angle in counter-clockwise direction as radians. Only available for 2D.
shearfloat, optional: Shear angle in counter-clockwise direction as radians. Only available for 2D.
translation(tx, ty) as array, list or tuple, optional: Translation parameters. Only available for 2D.
dimensionalityint, optional: The dimensionality of the transform. This is not used if any other parameters are provided.

Raises:

ValueError: If both matrix and any of the other parameters are provided.

Attributes:

params(D+1, D+1) array: Homogeneous transformation matrix.

__init__(matrix=None, scale=None, rotation=None, shear=None, translation=None, *, dimensionality=2)[source]¶

property rotation¶

property scale¶

property shear¶

property translation¶

Examples using `skimage.transform.AffineTransform`¶

Robust matching using RANSAC

CENSURE feature detector

Corner detection

https://en.wikipedia.org/wiki/Rotation_matrix#In_three_dimensions

`EssentialMatrixTransform`¶

class skimage.transform.EssentialMatrixTransform(rotation=None, translation=None, matrix=None, *, dimensionality=2)[source]¶

Bases: FundamentalMatrixTransform

Essential matrix transformation.

The essential matrix relates corresponding points between a pair of calibrated images. The matrix transforms normalized, homogeneous image points in one image to epipolar lines in the other image.

The essential matrix is only defined for a pair of moving images capturing a non-planar scene. In the case of pure rotation or planar scenes, the homography describes the geometric relation between two images (ProjectiveTransform). If the intrinsic calibration of the images is unknown, the fundamental matrix describes the projective relation between the two images (FundamentalMatrixTransform).

Parameters:

rotation(3, 3) array_like, optional: Rotation matrix of the relative camera motion.
translation(3, 1) array_like, optional: Translation vector of the relative camera motion. The vector must have unit length.
matrix(3, 3) array_like, optional: Essential matrix.

References

[1]

Hartley, Richard, and Andrew Zisserman. Multiple view geometry in computer vision. Cambridge university press, 2003.

Attributes:

params(3, 3) array: Essential matrix.

__init__(rotation=None, translation=None, matrix=None, *, dimensionality=2)[source]¶

estimate(src, dst)[source]¶

Estimate essential matrix using 8-point algorithm.

The 8-point algorithm requires at least 8 corresponding point pairs for a well-conditioned solution, otherwise the over-determined solution is estimated.

Parameters:

src(N, 2) array_like: Source coordinates.
dst(N, 2) array_like: Destination coordinates.

Returns:

successbool: True, if model estimation succeeds.

`EuclideanTransform`¶

class skimage.transform.EuclideanTransform(matrix=None, rotation=None, translation=None, *, dimensionality=2)[source]¶

Bases: ProjectiveTransform

Euclidean transformation, also known as a rigid transform.

Has the following form:

X = a0 * x - b0 * y + a1 =
  = x * cos(rotation) - y * sin(rotation) + a1

Y = b0 * x + a0 * y + b1 =
  = x * sin(rotation) + y * cos(rotation) + b1

where the homogeneous transformation matrix is:

[[a0  b0  a1]
 [b0  a0  b1]
 [0   0    1]]

The Euclidean transformation is a rigid transformation with rotation and translation parameters. The similarity transformation extends the Euclidean transformation with a single scaling factor.

Parameters:

matrix(D+1, D+1) array_like, optional: Homogeneous transformation matrix.
rotationfloat or sequence of float, optional: Rotation angle in counter-clockwise direction as radians. If given as a vector, it is interpreted as Euler rotation angles [1]. Only 2D (single rotation) and 3D (Euler rotations) values are supported. For higher dimensions, you must provide or estimate the transformation matrix.
translationsequence of float, length D, optional: Translation parameters for each axis.
dimensionalityint, optional: The dimensionality of the transform.

References

[1]

Attributes:

params(D+1, D+1) array: Homogeneous transformation matrix.

__init__(matrix=None, rotation=None, translation=None, *, dimensionality=2)[source]¶

estimate(src, dst)[source]¶

Estimate the transformation from a set of corresponding points.

You can determine the over-, well- and under-determined parameters with the total least-squares method.

Number of source and destination coordinates must match.

Parameters:

src(N, 2) array_like: Source coordinates.
dst(N, 2) array_like: Destination coordinates.

Returns:

successbool: True, if model estimation succeeds.

property rotation¶

property translation¶

Examples using `skimage.transform.EuclideanTransform`¶

Fundamental matrix estimation

`FundamentalMatrixTransform`¶

class skimage.transform.FundamentalMatrixTransform(matrix=None, *, dimensionality=2)[source]¶

Bases: GeometricTransform

Fundamental matrix transformation.

The fundamental matrix relates corresponding points between a pair of uncalibrated images. The matrix transforms homogeneous image points in one image to epipolar lines in the other image.

The fundamental matrix is only defined for a pair of moving images. In the case of pure rotation or planar scenes, the homography describes the geometric relation between two images (ProjectiveTransform). If the intrinsic calibration of the images is known, the essential matrix describes the metric relation between the two images (EssentialMatrixTransform).

Parameters:

matrix(3, 3) array_like, optional: Fundamental matrix.

References

[1]

Hartley, Richard, and Andrew Zisserman. Multiple view geometry in computer vision. Cambridge university press, 2003.

Attributes:

params(3, 3) array: Fundamental matrix.

__init__(matrix=None, *, dimensionality=2)[source]¶

estimate(src, dst)[source]¶

Estimate fundamental matrix using 8-point algorithm.

The 8-point algorithm requires at least 8 corresponding point pairs for a well-conditioned solution, otherwise the over-determined solution is estimated.

Parameters:

src(N, 2) array_like: Source coordinates.
dst(N, 2) array_like: Destination coordinates.

Returns:

successbool: True, if model estimation succeeds.

inverse(coords)[source]¶

Apply inverse transformation.

Parameters:

coords(N, 2) array_like: Destination coordinates.

Returns:

coords(N, 3) array: Epipolar lines in the source image.

residuals(src, dst)[source]¶

Compute the Sampson distance.

The Sampson distance is the first approximation to the geometric error.

Parameters:

src(N, 2) array: Source coordinates.
dst(N, 2) array: Destination coordinates.

Returns:

residuals(N, ) array: Sampson distance.

Examples using `skimage.transform.FundamentalMatrixTransform`¶

`PiecewiseAffineTransform`¶

class skimage.transform.PiecewiseAffineTransform[source]¶

Bases: GeometricTransform

Piecewise affine transformation.

Control points are used to define the mapping. The transform is based on a Delaunay triangulation of the points to form a mesh. Each triangle is used to find a local affine transform.

Attributes:

affineslist of AffineTransform objects: Affine transformations for each triangle in the mesh.
inverse_affineslist of AffineTransform objects: Inverse affine transformations for each triangle in the mesh.

__init__()[source]¶

estimate(src, dst)[source]¶

Estimate the transformation from a set of corresponding points.

Number of source and destination coordinates must match.

Parameters:

src(N, D) array_like: Source coordinates.
dst(N, D) array_like: Destination coordinates.

Returns:

successbool: True, if all pieces of the model are successfully estimated.

inverse(coords)[source]¶

Apply inverse transformation.

Coordinates outside of the mesh will be set to - 1.

Parameters:

coords(N, D) array_like: Source coordinates.

Returns:

coords(N, D) array: Transformed coordinates.

Examples using `skimage.transform.PiecewiseAffineTransform`¶

Piecewise Affine Transformation

`PolynomialTransform`¶

class skimage.transform.PolynomialTransform(params=None, *, dimensionality=2)[source]¶

Bases: GeometricTransform

2D polynomial transformation.

Has the following form:

X = sum[j=0:order]( sum[i=0:j]( a_ji * x**(j - i) * y**i ))
Y = sum[j=0:order]( sum[i=0:j]( b_ji * x**(j - i) * y**i ))

Parameters:

params(2, N) array_like, optional: Polynomial coefficients where N * 2 = (order + 1) * (order + 2). So, a_ji is defined in params[0, :] and b_ji in params[1, :].

Attributes:

params(2, N) array: Polynomial coefficients where N * 2 = (order + 1) * (order + 2). So, a_ji is defined in params[0, :] and b_ji in params[1, :].

__init__(params=None, *, dimensionality=2)[source]¶

estimate(src, dst, order=2, weights=None)[source]¶

Estimate the transformation from a set of corresponding points.

You can determine the over-, well- and under-determined parameters with the total least-squares method.

Number of source and destination coordinates must match.

The transformation is defined as:

X = sum[j=0:order]( sum[i=0:j]( a_ji * x**(j - i) * y**i ))
Y = sum[j=0:order]( sum[i=0:j]( b_ji * x**(j - i) * y**i ))

These equations can be transformed to the following form:

0 = sum[j=0:order]( sum[i=0:j]( a_ji * x**(j - i) * y**i )) - X
0 = sum[j=0:order]( sum[i=0:j]( b_ji * x**(j - i) * y**i )) - Y

which exist for each set of corresponding points, so we have a set of N * 2 equations. The coefficients appear linearly so we can write A x = 0, where:

A   = [[1 x y x**2 x*y y**2 ... 0 ...             0 -X]
       [0 ...                 0 1 x y x**2 x*y y**2 -Y]
        ...
        ...
      ]
x.T = [a00 a10 a11 a20 a21 a22 ... ann
       b00 b10 b11 b20 b21 b22 ... bnn c3]

In case of total least-squares the solution of this homogeneous system of equations is the right singular vector of A which corresponds to the smallest singular value normed by the coefficient c3.

Weights can be applied to each pair of corresponding points to indicate, particularly in an overdetermined system, if point pairs have higher or lower confidence or uncertainties associated with them. From the matrix treatment of least squares problems, these weight values are normalised, square-rooted, then built into a diagonal matrix, by which A is multiplied.

Parameters:

src(N, 2) array_like: Source coordinates.
dst(N, 2) array_like: Destination coordinates.
orderint, optional: Polynomial order (number of coefficients is order + 1).
weights(N,) array_like, optional: Relative weight values for each pair of points.

Returns:

successbool: True, if model estimation succeeds.

inverse(coords)[source]¶

Apply inverse transformation.

Parameters:

coords(N, 2) array_like: Destination coordinates.

Returns:

coords(N, 2) array: Source coordinates.

`ProjectiveTransform`¶

class skimage.transform.ProjectiveTransform(matrix=None, *, dimensionality=2)[source]¶

Bases: GeometricTransform

Projective transformation.

Apply a projective transformation (homography) on coordinates.

For each homogeneous coordinate \(\mathbf{x} = [x, y, 1]^T\), its target position is calculated by multiplying with the given matrix, \(H\), to give \(H \mathbf{x}\):

[[a0 a1 a2]
 [b0 b1 b2]
 [c0 c1 1 ]].

E.g., to rotate by theta degrees clockwise, the matrix should be:

[[cos(theta) -sin(theta) 0]
 [sin(theta)  cos(theta) 0]
 [0            0         1]]

or, to translate x by 10 and y by 20:

[[1 0 10]
 [0 1 20]
 [0 0 1 ]].

Parameters:

matrix(D+1, D+1) array_like, optional: Homogeneous transformation matrix.
dimensionalityint, optional: The number of dimensions of the transform. This is ignored if matrix is not None.

Attributes:

params(D+1, D+1) array: Homogeneous transformation matrix.

__init__(matrix=None, *, dimensionality=2)[source]¶

property dimensionality¶: The dimensionality of the transformation.

estimate(src, dst, weights=None)[source]¶

Estimate the transformation from a set of corresponding points.

You can determine the over-, well- and under-determined parameters with the total least-squares method.

Number of source and destination coordinates must match.

The transformation is defined as:

X = (a0*x + a1*y + a2) / (c0*x + c1*y + 1)
Y = (b0*x + b1*y + b2) / (c0*x + c1*y + 1)

These equations can be transformed to the following form:

0 = a0*x + a1*y + a2 - c0*x*X - c1*y*X - X
0 = b0*x + b1*y + b2 - c0*x*Y - c1*y*Y - Y

which exist for each set of corresponding points, so we have a set of N * 2 equations. The coefficients appear linearly so we can write A x = 0, where:

A   = [[x y 1 0 0 0 -x*X -y*X -X]
       [0 0 0 x y 1 -x*Y -y*Y -Y]
        ...
        ...
      ]
x.T = [a0 a1 a2 b0 b1 b2 c0 c1 c3]

In case of total least-squares the solution of this homogeneous system of equations is the right singular vector of A which corresponds to the smallest singular value normed by the coefficient c3.

In case of the affine transformation the coefficients c0 and c1 are 0. Thus the system of equations is:

A   = [[x y 1 0 0 0 -X]
       [0 0 0 x y 1 -Y]
        ...
        ...
      ]
x.T = [a0 a1 a2 b0 b1 b2 c3]

Parameters:

src(N, 2) array_like: Source coordinates.
dst(N, 2) array_like: Destination coordinates.
weights(N,) array_like, optional: Relative weight values for each pair of points.

Returns:

successbool: True, if model estimation succeeds.

inverse(coords)[source]¶

Apply inverse transformation.

Parameters:

coords(N, D) array_like: Destination coordinates.

Returns:

coords_out(N, D) array: Source coordinates.

Examples using `skimage.transform.ProjectiveTransform`¶

Robust matching using RANSAC

CENSURE feature detector

Corner detection

`SimilarityTransform`¶

class skimage.transform.SimilarityTransform(matrix=None, scale=None, rotation=None, translation=None, *, dimensionality=2)[source]¶

Bases: EuclideanTransform

Similarity transformation.

Has the following form in 2D:

X = a0 * x - b0 * y + a1 =
  = s * x * cos(rotation) - s * y * sin(rotation) + a1

Y = b0 * x + a0 * y + b1 =
  = s * x * sin(rotation) + s * y * cos(rotation) + b1

where s is a scale factor and the homogeneous transformation matrix is:

[[a0  b0  a1]
 [b0  a0  b1]
 [0   0    1]]

The similarity transformation extends the Euclidean transformation with a single scaling factor in addition to the rotation and translation parameters.

Parameters:

matrix(dim+1, dim+1) array_like, optional: Homogeneous transformation matrix.
scalefloat, optional: Scale factor. Implemented only for 2D and 3D.
rotationfloat, optional: Rotation angle in counter-clockwise direction as radians. Implemented only for 2D and 3D. For 3D, this is given in ZYX Euler angles.
translation(dim,) array_like, optional: x, y[, z] translation parameters. Implemented only for 2D and 3D.

Attributes:

params(dim+1, dim+1) array: Homogeneous transformation matrix.

__init__(matrix=None, scale=None, rotation=None, translation=None, *, dimensionality=2)[source]¶

estimate(src, dst)[source]¶

Estimate the transformation from a set of corresponding points.

You can determine the over-, well- and under-determined parameters with the total least-squares method.

Number of source and destination coordinates must match.

Parameters:

src(N, 2) array_like: Source coordinates.
dst(N, 2) array_like: Destination coordinates.

Returns:

successbool: True, if model estimation succeeds.

property scale¶

Examples using `skimage.transform.SimilarityTransform`¶