SeparableFunctions.jl
SeparableFunctions.jl Interface
Separable Functions are multidimensional functions that can be written as a broadcast (outer product) of functions of individual dimensions, e.g. f(x,y) = f(x).*f(y)
. In a more general way, this also applies to additions ramp(x,y, slope) = ramp(x, slope_x) .+ ramp(y,slope_y)
. This package provides an easy-to-use performant interfact to represent such functions. The performance gains can be quite massive, particulary if the single function evaluation is expensive. E.g. a complex-valued phase ramp exp.(1im.*(k_x.*x .+ k_y.*y)
is a separable product, which allows the complex exponential to be calculated only for the one-dimensional vectors rather than an N-dimensional array.
In this package you will find generic tools to represent your own separable function as well as a number of predefined functions such a gaussian_sep
, rr2_sep
, ramp_sep
, box_sep
, sinc_sep
, exp_ikx_sep
. There are corresponding conveniance _col
versions for collected arrays, but it is recommended to directly use the _sep
versions. There are also versions (_lz
) based on LanzyArrays.jl
but since they are currently not performing that well, they are currently not exported. To generate your own versions of separable functions you can use calculate_broadcasted
.
Additionally this package also provides a convenient interface (calc_radial_symm
) to quickly calculate (calc_radial_symm
, calc_radial_symm!
) radial functions. A fast calculation is achieved by calculating the values only for a fraction of the N-dimensional array and using effective copy operations exploiting symmetry. Also here example implementations, which either collect _col
or modify existing array !_col
,are provided. Currently implemented examples are propagator_col
, phase_kz_col
.
In a more general sense:
SeparableFunctions
— ModuleCalculates multidimensional functions faster by exploiting their separability. Often a function involves an operation such as a (complex) exponential which by itself is computationally relatively heavy. Yet a number of multidimenstional functions are separable, which means that they can be written as a product (or sum) of single-dimensional functions. A good example of a separable function is a Gaussian function, which can be written as a product of a purely X-dependend Gaussian with a purely Y-dependend Gaussian.
In this package, multidimensional functions are computed by first calculating their single-dimensional values and then creating the final multidimensional result by an outer product. Since multiplications and the broadcasting mechanism of Julia are fast compared to the evaluation of the function at each multidimensional position, the final result is calculated faster. The typical speedup can be an order of magnitude.
The package offers a general way of calculating separable functions as well as a LazyArrays
version of that function which can then be used inside other expressions. The non-lazy version should currently also work with CUDA.jl
, however the LazyArrays
version does not. To nevertheless use separable expressions in CUDA.jl
, you can reside to externally applying the broadcast operator to the separable expression (see the gaussian_sep
example below).
The package further offers a number of predifined separable function implementations such as gaussian_col()
collects a multidimensional array of a multidimensional Gaussian via a fast seperable implementation, gaussian_lz()
yields a lazy representation via LazyArrays
and sep = gaussian_sep()
yields an iterable of separable pre-oriented vectors which can easily be mutually combined via .*(sep...)
.
Another noteworthy example is the complex plain wave as represented by the respective function exp_ikx_col()
, exp_ikx_lz()
, exp_ikx_sep()
.
All separable functions share a common interface with Standard non-named arguments
- the first optional argument being the type of the result array. Examples are
Array{Float64}
,CuArray{Float32}
and the default depends on the function but uses a 32-bit result type where applicable. - the next argument is the
size
of the result data supplied as a tuple. - optionally a further argument can specify the
position
of zero of the array with respect tooffset
as given below. This allows for convenient N-dimensional shifting of the functions.
Named arguments:
offset
: is a optional named argument specifying center of the array. By default the Fourier center,size().÷2 .+ 1
is chosen.scale
: multiplies the axes with these factors. This can be interpreted as apixelsize
.
Some functions have additional named arguments. E.g. gaussian
has the additional named argument sigma
specifying the width of the gaussian, even though this can also be controlled via scale
. Note that this nomenclature is in large parts identical to the package IndexFunArrays.jl
.
In general arguments can be supplied as single scalar values or vectors. If a scalar value is supplied, it will automatically be replicated as a vector. E.g. sigma=10.0
for a 2-dimensional array will be interpreted as sigma=(10.0, 10.0)
.
Generic Ways to Create Seperable and Radial Functions
SeparableFunctions.calculate_broadcasted
— Functioncalculate_broadcasted([::Type{TA},] fct, sz::NTuple{N, Int}, args...; dims=1:N, pos=zero(real(eltype(DefaultArrType))), offset=sz.÷2 .+1, scale=one(real(eltype(DefaultArrType))), operation = *, kwargs...) where {TA, N}
returns an instantiated broadcasted separable array, which essentially behaves almost like an array yet uses broadcasting. Test revealed maximal speed improvements for this version. Yet, a problem is that reduce operations with specified dimensions cause an error. However this can be avoided by calling collect
.
Arguments:
fct
: The separable function, with a number of arguments corresponding tolength(args)
, each corresponding to a vector of separable inputs of length N. The first argument of this function is a Tuple corresponding the centered indices.sz
: The size of the N-dimensional array to createargs
...: a list of arguments, each being an N-dimensional vectordims
: a vector[]
of valid dimensions. Only these dimension will be calculated but they are oriented in ND.offset
: position of the center from which the position is measuredscale
: defines the pixel size as vector or scalar. Default: 1.0.operation
: the separable operation connecting the separable dimensions
julia> fct = (r, sz, pos, sigma)-> exp(-(r-pos)^2/(2*sigma^2))
julia> my_gaussian = calculate_broadcasted(fct, (6,5), (0.1,0.2), (0.5,1.0))
Base.Broadcast.Broadcasted{Base.Broadcast.DefaultArrayStyle{2}}(*,
(Float32[4.4963495f-9; 0.00014774836; … ; 0.1978987; 0.0007318024;;],
Float32[0.088921614 0.48675224 … 0.726149 0.1978987]))
julia> collect(my_gaussian)
6×5 Matrix{Float32}:
3.99823f-10 2.18861f-9 4.40732f-9 3.26502f-9 8.89822f-10
1.3138f-5 7.19168f-5 0.000144823 0.000107287 2.92392f-5
0.00790705 0.0432828 0.0871608 0.0645703 0.0175975
0.0871608 0.477114 0.960789 0.71177 0.19398
0.0175975 0.0963276 0.19398 0.143704 0.0391639
6.50731f-5 0.000356206 0.000717312 0.000531398 0.000144823
SeparableFunctions.calculate_separables
— Functioncalculate_separables([::Type{AT},] fct, sz::NTuple{N, Int}, args...; dims = 1:N, pos=zero(real(eltype(AT))), offset=sz.÷2 .+1, scale=one(real(eltype(AT))), kwargs...) where {AT, N}
creates a list of one-dimensional vectors, which can be combined to yield a separable array. In a way this can be seen as a half-way Lazy operation. The (potentially heavy) work of calculating the one-dimensional functions is done now but the memory-heavy calculation of the array is done later. This function is used in separable_view
and separable_create
.
#Arguments
AT
: optional type signfying the array result type. You can for example useCuArray{Float32}
usingCUDA
to create the views on the GPU.fct
: the function to calculate for each axis index (no need for broadcasting!) of this iterable of seperable axes. Note that the first arguments offct
have to be the index of this coordinate and the size of this axis. Any furtherargs
andnargs
can follow. Often the second argument is not used but it still needs to be present.sz
: the size of the result array (when appying the one-D axes)args
: further arguments which are passed over to the functionfct
.dims
: a vector[]
of valid dimensions. Only these dimension will be calculated but they are oriented in ND.pos
: a position shifting the indices passed tofct
in relationship to theoffset
.offset
: specifying the center (zero-position) of the result array in one-based coordinates. The default corresponds to the Fourier-center.scale
: multiplies the index before passing it tofct
#Example:
julia> fct = (r, sz, sigma)-> exp(-r^2/(2*sigma^2))
julia> gauss_sep = calculate_separables(fct, (6,5), (0.5,1.0), pos = (0.1,0.2))
2-element Vector{Array{Float32}}:
[4.4963495f-9; 0.00014774836; … ; 0.1978987; 0.0007318024;;]
[0.088921614 0.48675224 … 0.726149 0.1978987]
julia> my_gaussian = .*(gauss_sep...) # this is how to broadcast it
6×5 Matrix{Float32}:
3.99823f-10 2.18861f-9 4.40732f-9 3.26502f-9 8.89822f-10
1.3138f-5 7.19168f-5 0.000144823 0.000107287 2.92392f-5
0.00790705 0.0432828 0.0871608 0.0645703 0.0175975
0.0871608 0.477114 0.960789 0.71177 0.19398
0.0175975 0.0963276 0.19398 0.143704 0.0391639
6.50731f-5 0.000356206 0.000717312 0.000531398 0.000144823
SeparableFunctions.separable_view
— Functionseparable_view{N}(fct, sz, args...; pos=zero(real(eltype(AT))), offset = sz.÷2 .+1, scale = one(real(eltype(AT))), operation = .*)
creates an array view of an N-dimensional separable function. Note that this view consumes much less memory than a full allocation of the collected result. Note also that an N-dimensional calculation expression may be much slower than this view reprentation of a product of N one-dimensional arrays. See the example below.
Arguments:
fct
: The separable function, with a number of arguments corresponding tolength(args)
, each corresponding to a vector of separable inputs of length N. The first argument of this function is a Tuple corresponding the centered indices.sz
: The size of the N-dimensional array to createargs
...: a list of arguments, each being an N-dimensional vectordims
: a vector[]
of valid dimensions. Only these dimension will be calculated but they are oriented in ND.offset
: position of the center from which the position is measuredscale
: defines the pixel size as vector or scalar. Default: 1.0.operation
: the separable operation connecting the separable dimensions
Example:
julia> fct = (r, sz, pos, sigma)-> exp(-(r-pos)^2/(2*sigma^2))
julia> my_gaussian = separable_view(fct, (6,5), (0.1,0.2), (0.5,1.0))
(6×1 Matrix{Float32}) .* (1×5 Matrix{Float32}):
3.99823e-10 2.18861e-9 4.40732e-9 3.26502e-9 8.89822e-10
1.3138e-5 7.19168e-5 0.000144823 0.000107287 2.92392e-5
0.00790705 0.0432828 0.0871609 0.0645703 0.0175975
0.0871609 0.477114 0.960789 0.71177 0.19398
0.0175975 0.0963276 0.19398 0.143704 0.0391639
6.50731e-5 0.000356206 0.000717312 0.000531398 0.000144823
SeparableFunctions.separable_create
— Functionseparable_create([::Type{TA},] fct, sz::NTuple{N, Int}, args...; pos=zero(real(eltype(AT))), offset = sz.÷2 .+1, scale = one(real(eltype(AT))), operation = *, kwargs...) where {TA, N}
creates an array view of an N-dimensional separable function including memory allocation and collection. See the example below.
Arguments:
fct
: The separable function, with a number of arguments corresponding tolength(args)
, each corresponding to a vector of separable inputs of length N. The first argument of this function is a Tuple corresponding the centered indices.sz
: The size of the N-dimensional array to createargs
...: a list of arguments, each being an N-dimensional vectordims
: a vector[]
of valid dimensions. Only these dimension will be calculated but they are oriented in ND.offset
: position of the center from which the position is measuredscale
: defines the pixel size as vector or scalar. Default: 1.0.operation
: the separable operation connecting the separable dimensions
Example:
julia> fct = (r, sz, sigma)-> exp(-r^2/(2*sigma^2))
julia> my_gaussian = separable_create(fct, (6,5), (0.5,1.0); pos=(0.1,0.2))
6×5 Matrix{Float32}:
3.99823f-10 2.18861f-9 4.40732f-9 3.26502f-9 8.89822f-10
1.3138f-5 7.19168f-5 0.000144823 0.000107287 2.92392f-5
0.00790705 0.0432828 0.0871608 0.0645703 0.0175975
0.0871608 0.477114 0.960789 0.71177 0.19398
0.0175975 0.0963276 0.19398 0.143704 0.0391639
6.50731f-5 0.000356206 0.000717312 0.000531398 0.000144823
SeparableFunctions.copy_corners!
— Functioncopy_corners!(arr::AbstractArray{T,N}) where {T,N}
replicates the first N-dimensional quadrant my several mirror operations over the entire array. The overwrites the entire array with the (mirrored) content of the first quadrant!
Arguments
arr
: The array in which the copy operations are performedspeedup_last_dim=true
: iftrue
a one-dimensional assignment trick is used for the last non-singleton dimension.