Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HLA-1352: Description of JSON files #1901

Merged
Show file tree
Hide file tree
Changes from 6 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
16 changes: 15 additions & 1 deletion doc/ADRs.md
Original file line number Diff line number Diff line change
Expand Up @@ -25,4 +25,18 @@ The second column of a poller file is now reserved (for WFPC2) for passing the a

## Consequences

Caution must be taken is using variations of the poller file while processing WFPC2 data.
Caution must be taken is using variations of the poller file while processing WFPC2 data.

# The Use of the TEAL Interface 11/14/23

## Context

The code can be run using the interactive GUI TEAL, however, it is hard to maintain. TEAL is also used in the code to load the json parameter files.

## Decision

In order to make the code more easily maintainable, we will no longer support the use of the GUI for running tasks, however, TEAL will still be used in the background to load the parameter files as there is no current alternative for parsing that data.

## Consequences

TEAL will need to be included in drizzlepac until a replacement for parsing the json files can be found.
64 changes: 50 additions & 14 deletions doc/source/mast_data_products/hap-parameters.rst
Original file line number Diff line number Diff line change
Expand Up @@ -3,10 +3,46 @@ HAP Parameters

.. _hap-parameters:

In the high-level svm_parameters or mvm_parameters directories, the instrument/detector-dependent configuration index files (e.g., acs_sbc_index.json) provide a map to the configuration files used for the major HAP processing steps. The svm_parameter configuration files are also used for pipeline products.


For the astrodrizzle step, there are different files which should be employed depending upon the number of exposures int the visit.


Detector Configuration Files
****************************

acs_hrc_index.json

.. code-block:: python

{
"alignment": {
"all": "acs/hrc/acs_hrc_alignment_all.json"
},
"astrodrizzle": {
"any_n1": "acs/hrc/acs_hrc_astrodrizzle_any_n1.json",
"acs_hrc_any_n2": "acs/hrc/acs_hrc_astrodrizzle_any_n2.json",
"acs_hrc_any_n4": "acs/hrc/acs_hrc_astrodrizzle_any_n4.json",
"acs_hrc_any_n6": "acs/hrc/acs_hrc_astrodrizzle_any_n6.json",
"filter_basic": "any/any_astrodrizzle_filter_hap_basic.json",
"single_basic": "any/any_astrodrizzle_single_hap_basic.json",
"total_basic": "acs/hrc/acs_hrc_astrodrizzle_any_total.json"
},
"catalog generation": {
"all": "acs/hrc/acs_hrc_catalog_generation_all.json"
},
"quality control": {
"all": "acs/hrc/acs_hrc_quality_control_all.json"
}
}

The different files for "astrodrizzle" are for the case of 1 exposure ("any_n1"), 2 exposures (acs_hrc_any_n2), etc. Each json file with an associated task above (e.g. astrodrizzle) will have the parameters for that task. These files will be used to initialize variables that are required in the processing of the data and, in the case of "astrodrizzle", are separated into different steps.

s-goldman marked this conversation as resolved.
Show resolved Hide resolved
Shown below are the parameters that are used by the Hubble Advanced Products. We include the parameter, the default value for WFC3 processing, and a description of that parameter.

.. run_hap_processing
.. identified in json files.
.. identified in json files.
Copy link
Collaborator

@mdlpstsci mdlpstsci Nov 14, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It is a bit confusing below as there are sections of the configuration file, wfc3_ir_astrodrizzle_any_n1.json which I though were skipped, but they are just out of order. This needs to be corrected to avoid confusion.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've reorganized the sections.


Copy link
Collaborator

@mdlpstsci mdlpstsci Nov 11, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The discussion here seems out of context. If you are describing the contents of a configuration file, provide the name of the specific file, and then the contents. Note each numbered section maps to a corresponding section of the NameOfConfigurationFile. Perhaps this would be more obvious to me if I were looking at the formatted output. Having said this, I will look at this with a phased approach in that more and clarifying information can added later once the basics are described.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sounds good. I've added the example filename to the docs.


Initialize HAP
Expand Down Expand Up @@ -153,7 +189,7 @@ driz_sep_bits: int (*default="16"*)
Integer sum of all the DQ bit values from the input image's DQ array that should be considered "good" when building the weighting mask. This can also be used to reset pixels to good if they had been flagged as cosmic rays during a previous run of ``AstroDrizzle``, by adding the value 4096 for ACS and WFPC2 data. Please see the section on Selecting the ``Bits`` Parameter for a more detailed discussion.

driz_sep_kernel: str (*default="turbo"*)
Used for the initial separate drizzling operation only, this parameter specifies the form of the kernel function used to distribute flux onto the separate output images. The current options are: "square", "point", "turbo", "gaussian", and "lanczos3". The latter two options ("gaussian" and "lanczos3") are not guaranteed to conserve flux, but may produce reasonable results; understand the effects of these kernels before using them. A former option "tophat" has been removed as it was found to produce poor results. See adrizzle.help for more details.
Used for the initial separate drizzling operation only, this parameter specifies the form of the kernel function used to distribute flux onto the separate output images. The current options are: "square", "point", "turbo", "gaussian", and "lanczos3". The latter two options ("gaussian" and "lanczos3") are not guaranteed to conserve flux, but may produce reasonable results; understand the effects of these kernels before using them. A former option "tophat" has been removed as it was found to produce poor results. See adrizzle.help for more details.

driz_sep_wt_scl: float (*default=exposure time (from image header)*)
This parameter specifies the weighting factor for input image. If ``driz_sep_wt_scl``\ =\ ``exptime``, then the scaling value will be set equal to the exposure time found in the image header. The use of the default value is recommended for producing optimal behavior for most scenarious. It is possible to set ``wt_scl``\ =\ "expsq" for weighting by the square of the exposure time, which is optimal for read-noise dominated images.
Expand Down Expand Up @@ -266,7 +302,7 @@ driz_cr: bool (*default=False*)

driz_cr_snr: str (*default="5.0 4.0"*)
Driz_cr.SNR parameter*

driz_cr_grow: int (*default=1*)
Driz_cr_grow parameter

Expand Down Expand Up @@ -356,7 +392,7 @@ MAX_FIT_RMS: int (*default=20*)
Not currently in use.

MAX_SOURCES_PER_CHIP: int (*default=250*)
Not currently in use.
Not currently in use.


run_align (*primarily in align.py*)
Expand All @@ -378,19 +414,19 @@ MIN_CATALOG_THRESHOLD: int (*default=3*)
Not currently in use.

MIN_OBSERVABLE_THRESHOLD: int (*default=10*)
Not currently in use. The minimum number of observed sources required to continue fitting. If below this threshold, the code will return a status=1 and try with another catalog.
Not currently in use. The minimum number of observed sources required to continue fitting. If below this threshold, the code will return a status=1 and try with another catalog.

MAX_FIT_LIMIT: int (*default=150*)
Not currently in use.

mosaic_catalog_list: list of strings (*default=["GAIAeDR3", "GSC242", "2MASS"]*)
List of available catalogs for aligning for both pipeline and SVM products. The code will go through each catalog in this order.
List of available catalogs for aligning for both pipeline and SVM products. The code will go through each catalog in this order.

mosaic_fit_list: list of strings (*default=["match_relative_fit", "match_2dhist_fit", "match_default_fit"]*)
List of available fit algorithms for aligning for both pipeline and SVM products; match_default_fit relative alignment without using 2dhist and different throusholds (see json configuration files).
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fix spelling: throusholds

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed.


mosaic_fitgeom_list: dict (*default={"rshift": 10, "rscale": 10, "general": 6}*)
The different fit geometries tried in alignment as well as their minobj value which specifies the number of matched sources required for a successful fit. For pipeline products, the fitgeometry value is ignored and defaults to a fit geometry of ``rscale``. The fitgeom for the pipeline products is specified as a default in *align_utils.perform_fit*. The value for minobj specified here, however, is used for the pipeline products.
The different fit geometries tried in alignment as well as their minobj value which specifies the number of matched sources required for a successful fit. For pipeline products, the fitgeometry value is ignored and defaults to a fit geometry of ``rscale``. The fitgeom for the pipeline products is specified as a default in *align_utils.perform_fit*. The value for minobj specified here, however, is used for the pipeline products.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Change "minobj" to minimum number of objects. Fix this description regarding the type of products and the minimum number of objects.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed.


fit quality categories
""""""""""""""""""""""
Expand All @@ -410,7 +446,7 @@ generate_source_catalogs (*primarily in align_utils.py*)
""""""""""""""""""""""""""""""""""""""""""""""""""""""""
Copy link
Collaborator

@mdlpstsci mdlpstsci Nov 14, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The "fit quality categories" title seems out of place here.

Add a note so the user is reminded...The generate_source_catalogs section is used for alignment purposes only. This has nothing to do with the output HAP catalog products.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've added them as a note now. I agree that it's not the ideal place, but I couldn't find a better place. With a "note", it seems to flow better now.


box_size: int (*default=13*)
The size of the box used for calculating the 2D Background of the catalog "white light" image along each axis in pixels. In nothing is specified, a default of BKG_BOX_SIZE=27 is used.
The size of the box used for calculating the 2D Background of the catalog "white light" image along each axis in pixels. In nothing is specified, a default of BKG_BOX_SIZE=27 is used.

win_size: int (*default=3*)
The size of the 2D filter to apply to the background of the catalog "white light" image. If nothing is specified, a default of BKG_FILTER_SIZE=3 is used.
Expand Down Expand Up @@ -440,7 +476,7 @@ classify: bool (*default=false*)
Not currently in use as cosmic rays are being removed before segmentation. Specifies whether or not to apply photutils classification algorithm when evaluating each of the identified segments (sources) from the chip.

threshold: float (*default=-1.1*)
Value from the image which serves as the limit for determining sources. If None, compute a default value of (background+5*rms(background)). If threshold < 0.0, use absolute value as scaling factor for default value. If nothing is specified a default value of None is used to compute the background.
Value from the image which serves as the limit for determining sources. If None, compute a default value of (background+5*rms(background)). If threshold < 0.0, use absolute value as scaling factor for default value. If nothing is specified a default value of None is used to compute the background.



Expand All @@ -459,7 +495,7 @@ perform_fit (*primarily external in tweakwcs.matchutils.XYXYMatch*)
For match_relative_fit, match_default_fit, and match_2dhist_fit, the following parameters are used:

fitgeom": "rscale",
As used above, this is ignored for pipeline products.
As used above, this is ignored for pipeline products.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Since you are discussing three categories at once for brevity, I would not quote the "default" values as they are not the same for all categories.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

default values removed.


searchrad: float (*default=125*)
The search radius for a match (in units of the tangent plane).
Expand All @@ -478,19 +514,19 @@ determine_fit_quality
"""""""""""""""""""""

MIN_CATALOG_THRESHOLD": int (*default=3*)
The minimum number of catalog sources required to continue fitting. If below this threshold, the code will return a fit_quality=5 and try with another catalog.
The minimum number of catalog sources required to continue fitting. If below this threshold, the code will return a fit_quality=5 and try with another catalog.

MIN_OBSERVABLE_THRESHOLD": int (*default=4*)
If the number of observed sources is below this threshold, the code ends alignment and defers to an *a priori* solution.

MIN_CROSS_MATCHES": int (*default=3*)
Not currently in use.
Not currently in use.

MIN_FIT_MATCHES": int (*default=4*)
Not currently in use.

MAX_FIT_RMS": float (*default=20*)
Not currently in use. Maximum RMS value for a fit to be considered good. Currently a warning is printed but nothing is done with this parameter.
Not currently in use. Maximum RMS value for a fit to be considered good. Currently a warning is printed but nothing is done with this parameter.

MAX_FIT_LIMIT": int (*default=150*)
The maximum allowable RMS value for a fit to be considered good. If not, the fit is considered compromised.
Expand All @@ -502,4 +538,4 @@ MAS_TO_ARCSEC: float (*default=1000*)
Conversion factor from milliarcseconds to arcseconds.

GOOD_FIT_QUALITY_VALUES: int (*default=[1, 2, 3, 4]*)
The fit_quality (see above) flag values that are allowable for a successful fit.
The fit_quality (see above) flag values that are allowable for a successful fit.
Loading