Description

Classes: jwst.adaptive_trace_model.adaptive_trace_model_step.AdaptiveTraceModelStep
Alias: adaptive_trace_model

Overview

The adaptive_trace_model step models the spectral trace in a 2D spectral image with a set of univariate basis spline fits to the spatial profile, along the dispersion axis. Optionally, the step may also use the trace model to oversample the input data by a specified factor.

This step is intended in part to address spatial undersampling effects in NIRSpec IFU and MIRI MRS spectra extracted from rectified cubes. This “resampling noise” manifests as low-frequency oscillations in spectra extracted from apertures smaller than the observational point spread function (PSF). Interpolating the data onto a higher-resolution grid before building a rectified spectral cube can mitigate these spectral artifacts.

This step is currently available for NIRSpec IFU and MIRI MRS exposures only. It is incorporated into the calwebb_spec2 and calwebb_spec3 pipelines, prior to the pixel_replace and cube_build steps, but may also be run as a standalone step.

Upon successful completion of this step, the status keyword S_TRCMDL is set to “COMPLETE”, and the trace model image is stored in the output datamodel’s trace_model attribute (FITS extension TRACEMODEL). If oversampling was performed, the input flux, error, and variance images are replaced with interpolated data sampled onto a new pixel grid. The data quality (DQ), wavelength, and regions images are also oversampled, but any additional images (e.g. pathloss corrections) are not propagated to the output datamodel.

Algorithm

The adaptive spline modeling used by this step depends on two assumptions: first, that the PSF changes only slowly with wavelength, and second, that unresolved point sources have centroids that remain fixed in celestial coordinates at all wavelengths. These assumptions make it possible to model the spatial profile at each wavelength by fitting the spatial data within a wavelength window around each dispersion element. The data are fit with a cubic basis spline function, with enough knots to sample the spatial profile evenly without introducing overfitting artifacts.

Model the Trace

Modeling is performed one spectral region (slice or slit) at a time. For IFU data, each slice is separately modeled as follows:

Determine whether the slice has sufficient signal to justify fitting spline models. By default, if the mean value for the slice does not exceed the overall mean for the image by at least 10 sigma, the slice is ignored and no modeling is performed.
Compute spatial (along-slice) coordinates for every pixel in the slice.
Make a normalized image by dividing by the sum over each dispersion element (column) in the slice.
For each column in the slice:
1. Select normalized data within a range of nearby wavelengths.
2. Fit the normalized data by spatial coordinate with a cubic basis spline function.
3. Reject any data points more than 2.5 sigma from the spline model and re-fit the remaining data, for a maximum of 3 iterations.
4. Evaluate the model at the input coordinates for the column and determine a scaling factor to reproduce the observed values, from the weighted mean ratio of the original fluxes to the normalized spline model.

The set of spline models and scale factors for each wavelength in each slice constitutes the adaptive trace model for the spectral image.
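The per-column fit described above can be illustrated with a minimal sketch. It is not the step's implementation: it assumes SciPy's `make_lsq_spline` as the fitting routine, uses synthetic data in place of normalized pixels pooled from nearby wavelengths, and introduces hypothetical names (`fit_clipped_spline`, `scale_factor`, `n_interior_knots`). The weighting used for the scale factor (weights equal to the model) is likewise an assumption, since the weights are not specified here.

```python
import numpy as np
from scipy.interpolate import make_lsq_spline


def fit_clipped_spline(x, y, n_interior_knots=8, sigma=2.5, max_iter=3):
    """Fit a cubic B-spline to (x, y), rejecting outliers iteratively.

    x must be strictly increasing. n_interior_knots is a hypothetical
    tuning value, not a pipeline default.
    """
    k = 3  # cubic
    interior = np.linspace(x[0], x[-1], n_interior_knots + 2)[1:-1]
    # make_lsq_spline needs the boundary knots repeated k + 1 times.
    knots = np.r_[(x[0],) * (k + 1), interior, (x[-1],) * (k + 1)]

    keep = np.isfinite(y)
    spline = make_lsq_spline(x[keep], y[keep], knots, k=k)
    for _ in range(max_iter):
        resid = y - spline(x)
        limit = sigma * np.std(resid[keep])
        new_keep = keep & (np.abs(resid) <= limit)
        if new_keep.sum() == keep.sum():
            break  # nothing more to reject
        keep = new_keep
        spline = make_lsq_spline(x[keep], y[keep], knots, k=k)
    return spline, keep


def scale_factor(flux, model):
    """Weighted mean of flux / model, weighted by the model (an assumption)."""
    good = np.isfinite(flux) & (model > 0)
    return np.average(flux[good] / model[good], weights=model[good])


# Synthetic stand-in for normalized data pooled from nearby columns.
rng = np.random.default_rng(0)
x = np.sort(rng.uniform(-5.0, 5.0, 300))  # spatial coordinates
y = np.exp(-0.5 * (x / 1.2) ** 2) + rng.normal(0.0, 0.01, x.size)
y[25::50] += 1.0  # six artificial outliers

spline, keep = fit_clipped_spline(x, y)
# Pretend the observed fluxes are 37 times the normalized data.
scale = scale_factor(37.0 * y[keep], spline(x[keep]))
```

In this sketch the knot vector is fixed from the full coordinate range, so rejecting points never moves the knots between iterations.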

The spline modeling assumptions are generally only appropriate for compact sources, so an additional check is made to determine regions for which the model is likely to be accurate. The slope of the model flux is computed for each column pixel as the absolute difference between the normalized spline model at that pixel and its immediate neighbor. Slope values higher than a threshold value (step parameter slope_limit) indicate a compact source region. The trace model will be evaluated for these regions, with some padding for nearby pixels; it will not be evaluated in other regions.
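As a rough illustration of this slope test, the sketch below flags compact-source pixels in a single column of a normalized model. The function name, the `pad` width, and the `slope_limit` value used in the example are hypothetical; the amount of padding the step actually applies is not stated here.

```python
import numpy as np


def compact_source_mask(model_column, slope_limit, pad=2):
    """Flag pixels where the normalized model changes steeply.

    The slope is the absolute difference between each pixel and its
    next neighbor; the last pixel has no neighbor and gets slope 0.
    """
    slope = np.abs(np.diff(model_column, append=model_column[-1]))
    steep = slope > slope_limit
    # Grow the steep region by `pad` pixels on each side.
    kernel = np.ones(2 * pad + 1, dtype=int)
    mask = np.convolve(steep.astype(int), kernel, mode="same") > 0
    return slope, mask


# Example: a normalized Gaussian profile centered in a 30-pixel column.
pix = np.arange(30)
model = np.exp(-0.5 * ((pix - 15) / 2.0) ** 2)
model /= model.sum()

slope, mask = compact_source_mask(model, slope_limit=0.01)
```

Only the pixels around the profile core are selected; the flat wings fall below the threshold and would be left unevaluated.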

If no oversampling is desired (i.e. the oversample parameter is set to 1.0), then the trace model is evaluated at every input pixel in a compact source region to create a wavelength-dependent spatial profile. This image is stored in the output datamodel, in the trace_model attribute. Regions for which a spline model could not be computed, or which did not meet the compact source criteria, are set to NaN in the image. The step then returns without further changes to the input datamodel. The rest of the algorithm description, below, applies only to oversampling.

Oversample the Flux

If oversampling is desired, the step will create new data arrays. The spatial dimension is scaled by the oversampling factor; the spectral dimension remains the same. Each slice is again processed separately, by interpolating the spectral flux onto the new grid as follows:

Compute spatial coordinates for each pixel in the oversampled grid.
For each column in the slice:
1. Compute a linear interpolation of the data onto the new oversampled coordinates (\(f_{linear}\)).
2. If a spline fit is available for this column, evaluate the spline model at the original coordinates for the column.
3. Construct the residual between the evaluated spline fit and the original data, and linearly interpolate it onto the oversampled coordinates (\(f_{residual}\)).
4. Compute the slope of each column pixel as the absolute difference between the normalized spline model at that pixel and its immediate neighbor.
5. Evaluate the spline model at the oversampled coordinates (\(f_{spline}\)).
Construct the oversampled slice flux (f) from a piecewise model:

\(f = f_{spline} + f_{residual}\), where the slope is greater than a threshold value

and

\(f = f_{linear}\), otherwise.

This method results in an interpolated flux image that uses the spline models for any bright, compact sources and a linear interpolation for faint, diffuse regions. The residual image added into the spline model accounts for any local structures that are not well modeled by the spline profile.
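The piecewise construction can be sketched for one column as follows. This is an illustration under stated assumptions, not the step's code: the `profile` callable is an analytic stand-in for the fitted normalized spline, the oversampled pixel-center convention is assumed, and the slope is computed on the original grid and mapped to the nearest oversampled pixel. All function and variable names are hypothetical.

```python
import numpy as np


def oversample_column(y, flux, profile, scale, y_over, slope_limit):
    """Return the piecewise oversampled flux and the compact-source mask."""
    model = scale * profile(y)
    f_linear = np.interp(y_over, y, flux)
    f_residual = np.interp(y_over, y, flux - model)
    f_spline = scale * profile(y_over)

    # Slope of the normalized model on the original grid.
    norm = profile(y)
    slope = np.abs(np.diff(norm, append=norm[-1]))
    # Assumption: each oversampled pixel inherits the slope of the
    # nearest original pixel.
    nearest = np.abs(y_over[:, None] - y[None, :]).argmin(axis=1)
    compact = slope[nearest] > slope_limit

    return np.where(compact, f_spline + f_residual, f_linear), compact


def profile(t):
    """Stand-in for a fitted normalized spline: a unit-area Gaussian."""
    return np.exp(-0.5 * ((t - 10.0) / 1.5) ** 2) / (1.5 * np.sqrt(2 * np.pi))


factor = 2
y = np.arange(21, dtype=float)
# Assumed convention: oversampled pixel centers subdivide each input pixel.
y_over = (np.arange(y.size * factor) + 0.5) / factor - 0.5

scale = 100.0
flux = scale * profile(y) + 0.5  # point source on a flat background
f, compact = oversample_column(y, flux, profile, scale, y_over, 0.01)
```

In this example the flat background is not part of the profile model, so it is carried entirely by the interpolated residual term, while the peak retains the curvature that a purely linear interpolation would flatten.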

Note that the interpolation process may provide output values for some pixels corresponding to data with NaN values in the input. If the region is modeled by a valid spline interpolation, the missing values are extrapolated and replaced with real values from the spline model plus residual flux. These values will be marked in the DQ plane with a FLUX_ESTIMATED flag (see below).

Optionally, if the psf_optimal step parameter is set to True, the fit threshold and slope limit are ignored, so that spline models are created and used for all pixels, and the residual image is not added into the oversampled flux. This option is only appropriate for simple, isolated point sources. When used, it can significantly improve the signal-to-noise ratio (SNR) of extracted spectra, at the cost of ignoring non-PSF structures.

Alternatively, crowded fields with multiple stars may benefit from setting the fit_threshold and slope_limit parameters to zero, to ensure proper modeling of both bright and faint stars.

Alongside the oversampled flux image, the set of spline models evaluated at all compact source coordinates (\(f_{spline}\), above, where the slope condition is met) is saved to the output model in the trace_model attribute, as a record of the wavelength-dependent spatial profile for the oversampled data.

Propagate DQ, Error, and Variance

To match the oversampled flux image, the error and variance arrays in the datamodel are linearly interpolated onto the oversampled grid. The DQ array is oversampled as well, with a nearest-pixel interpolation. It is then updated with a FLUX_ESTIMATED flag for any pixels that were NaN in the input but were replaced with real values by the spline modeling.
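The DQ handling can be illustrated for a single column with the sketch below. The flag value is a hypothetical placeholder rather than the pipeline's actual FLUX_ESTIMATED bit, and the function name and coordinate convention are assumptions made for the example.

```python
import numpy as np

# Hypothetical placeholder bit; consult the pipeline's DQ flag table
# for the real FLUX_ESTIMATED value.
FLUX_ESTIMATED = 1 << 7


def oversample_dq(y, dq, flux_in, y_over, flux_out):
    """Nearest-pixel DQ interpolation plus a flag for filled-in NaNs."""
    nearest = np.abs(y_over[:, None] - y[None, :]).argmin(axis=1)
    dq_over = dq[nearest].copy()
    # Input was NaN, but the oversampled flux has a real value there.
    estimated = np.isnan(flux_in)[nearest] & np.isfinite(flux_out)
    dq_over[estimated] |= FLUX_ESTIMATED
    return dq_over


y = np.arange(4, dtype=float)
y_over = (np.arange(8) + 0.5) / 2 - 0.5  # assumed factor-2 pixel centers
dq = np.array([0, 1, 0, 4], dtype=np.uint32)
flux_in = np.array([1.0, np.nan, 3.0, 4.0])
flux_out = np.full(8, 2.0)  # pretend the spline model filled every pixel

dq_over = oversample_dq(y, dq, flux_in, y_over, flux_out)
```

Existing flags on the input pixel are preserved; the new bit is combined with them rather than replacing them.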

After oversampling, error and variance arrays are inflated by a factor dependent on the oversampling ratio to account for the introduced covariance. This factor is intended to produce resampled cubes with the same SNR for all oversampling factors, to first order. The value of the factor (X) was empirically determined from tests on a line-free region of a stellar spectrum, and is calculated for oversample factor N as:

\[X = 0.23 N + 0.77\]

The oversampled error image is multiplied by X; variance images are multiplied by X².
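As a short worked example of this scaling, the sketch below applies the factor to small error and variance arrays. The function names are hypothetical and the input values are arbitrary.

```python
import numpy as np


def inflation_factor(oversample):
    """Empirical inflation factor X = 0.23 N + 0.77."""
    return 0.23 * oversample + 0.77


def inflate(err, var, oversample):
    """Scale errors by X and variances by X squared."""
    x = inflation_factor(oversample)
    return err * x, var * x**2


err = np.array([0.1, 0.2])
var = err**2
err_inflated, var_inflated = inflate(err, var, oversample=2)
```

For N = 1 the factor is 1, so data that are not oversampled are unchanged. For N = 2 the factor is 1.23, so errors grow by 23% and variances by a factor of about 1.51.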

Note that the inflated error arrays do not accurately reflect the per-pixel errors on the oversampled flux, but rather are intended to produce approximately correct errors after further resampling. The oversampled product is primarily intended to be an intermediate format, prior to building a rectified spectral cube.

Update the WCS

Finally, for oversampled data, the WCS object for the exposure must be updated to include a transform from the oversampled pixel coordinates to the original detector coordinates. The transform is stored in a frame called “coordinates”, prior to the “detector” frame. WCS operations following oversampling should use “coordinates” as the input frame. For example, to retrieve the world coordinates for pixel x, y in the oversampled image, these operations are equivalent:

ra, dec, lam = oversampled_model.meta.wcs(x, y)

and:

oversampled_transform = oversampled_model.meta.wcs.get_transform('coordinates', 'world')
ra, dec, lam = oversampled_transform(x, y)

To retrieve the transform from original detector pixels to world coordinates instead, use the “detector” frame:

detector_pixel_transform = oversampled_model.meta.wcs.get_transform('detector', 'world')

References

The adaptive trace model algorithm is based on work by D. Law, “Mitigating Pixel Phase Artifacts for the JWST IFU Spectrometers with Adaptive Trace Modeling” (in prep).