This document outlines the current plan for exchanging and archiving data collected during CRYSTAL-FACE. The key elements are included, but there are some details which have not yet been completely defined, so these pages will be updated as new information becomes available. If you have questions, concerns or problems with any aspect of the data exchange then please send a note to mailto:gaines@cloud1.arc.nasa.gov.
The primary objectives are to encourage a timely exchange of data during and after CRYSTAL-FACE by providing facilities and predefined mechanisms by which participants will have access to those data. The emphasis, both during and after the mission, is to exchange those data which other participants want to use. All data and image files should include well-defined variables so they can be readily compared with other datasets. They should also clearly indicate the source (instrument, model, etc.) of the data. The term data is used loosely to include both measured and derived quantities, so theory teams with products which will aid in the interpretation of measured quantities are also encouraged to participate in the data exchange. The following subsections outline the three types of data discussed in this document.
The Earth Science Project Office (ESPO) archive will be the only archive provided by the project during CRYSTAL-FACE, and it is expected that most instrument teams will submit their preliminary data to this archive during the mission. It will serve as the main clearing-house for preliminary data during the mission, and as data are recalibrated and files updated, it will become the final archive for those files. For most in-situ instruments this will contain the complete datasets they distribute to the community. For most remote sensing instruments this will contain a reduced resolution subset of their complete datasets, and final versions of their high resolution datasets will be stored as binary files in a different archive (see 1.3). Please limit the size of your files stored in the ESPO archive to less than 20 MegaBytes (MB); that is not a hard limit, just a guideline. See 2.1 for more details.
These are conspicuously displayed on web pages shortly after the data are collected. Some of these may also be submitted to one of the archives, but if not then they will only exist for the duration of the web site which hosts them. Instrument teams that plan to produce quick-look plots should also archive data files from which the plots can be regenerated. The archives are described in 1.1 and 1.3. The Project Office will, with conditions, provide resources for displaying quick-look plots; see 2.2.1 for details. Alternatively, your team may have a web site tailored to displaying its data plots, so if it prefers to utilize that then review the criteria listed in 2.2.2 and respond accordingly.
Some data products will be too large (>20 MB) to exchange via the ESPO archive, and those will more efficiently be exchanged in a portable binary file format. The project is considering options for providing an archive for final versions of these large binary data files and associated image files, and those plans will be included in 2.3.1 once the details have been finalized. Your team may already use or maintain an existing archive for its large data products, and if that archive satisfies the conditions in 2.3.2 then it can be used as an alternative to the archive the project plans to provide for large files. Those teams which produce large data products are encouraged to also contribute reduced resolution data and/or image files, as in 1.1 and 1.2, for other investigators to use until the high resolution products are archived. Requests for these high resolution data products before they are archived will be handled at the discretion of the PI providing the data.
The Earth Science Project Office (ESPO) maintains an online archive at Ames Research Center which can be accessed at http://espoarchive.nasa.gov/archive/, or via ftp and telnet logins to the several archive accounts on espoarchive.nasa.gov. The archive is primarily used for the exchange of ASCII data files from several NASA missions, but it can also accommodate a few types of standard image files. The usual procedure is to password protect access to data from each mission until those data are deemed public, and at that time the archive is opened to everyone via the web or anonymous ftp. You can use your web browser to draw simple plots of the archived data, check for newly archived files, and browse the archived files via its ftp capabilities.
The name of the CRYSTAL-FACE archive account is crystalf, and the login password for that account has been sent to all team leaders. Send a note to mailto:gaines@cloud1.arc.nasa.gov if you have not received the login password or if you are having trouble accessing that archive account. All investigators will share this account to submit/retrieve files to/from the archive. You can re-submit files as often as you like, with older versions being replaced by the new versions.
During the mission there will be a mirror of the CRYSTAL-FACE archive on the LAN in Key West, cf46.crystalface.nasa.gov (198.116.48.46). The archive procedures on both machines are the same. Files can be submitted/retrieved to/from either archive, but to reduce traffic on the WAN to the Key West LAN you are encouraged to use the archive on that LAN if your computer is on that LAN, and to use the archive at Ames if your computer is not on the Key West LAN.
If you will be submitting files to the archive then it is recommended that you login to the crystalf account and review the ~/docs/README.TXT file. It contains a tutorial on the basic archive procedures, describes the information you need to convey to the archivist before you submit files to the archive, and describes other useful files in the ~/docs/ directory.
All data and image files in the ESPO archives use a standardized naming convention. A complete description of that convention can be found at http://espoarchive.nasa.gov/archive/, and anyone planning to submit files to the archive should review that document. Here we will just reiterate some issues of general interest.
All file names include a date, and some may include a time, and those dates and times are always expressed in UTC. To facilitate inter-comparisons of data collected from a particular aircraft, the following conventions apply to data files pertaining to a particular aircraft flight:
To locate a particular file in the ESPO archive, you need to know its name and the directory in which it resides. The best way to find a file is from the data and image file catalogs, located in the ~/docs/ directory of the crystalf archive account. You can obtain those catalogs by directing your web browser to http://espoarchive.nasa.gov/archive/ and selecting the Download Files (FTP) link. Then proceed to the ~/docs/ directory and you will find:
Each of those file catalogs contains (or will contain) four columns which list the: file name code; subdirectory; point of contact and content description for each type of data or image file in the archive. The data file subdirectories are relative to the ~/data/ directory and the image file subdirectories are relative to the ~/images/ directory.
The file name codes are constructed from the constant parts of the file names, so for the above examples, the corresponding file name codes are:
There are nine different ASCII data file formats used in the ESPO archive. Each includes a file header, followed by the data records. The file headers have fields allocated for specific types of information, and those should be used to clearly define the originator of the file, the source of data and the units used for each variable. A complete description of the data file formats can be found at http://espoarchive.nasa.gov/archive/.
It is useful to keep in mind the following points when you design your data files:
Images can be archived in any of the following standard image file formats: GIF, JPEG, PNG, PDF. It is important to include enough annotations in the image so that by viewing the image one can determine its subject, and readily compare it with other datasets. The annotations should also indicate the source of the data depicted in the image.
As mentioned in 1.2, you can use either the web site provided by the project or your own site to display your quick-look plots. The following sections define the conditions for using those sites.
The project will provide a web site for displaying your quick-look plots, subject to the following conditions:
Your team may use its own web site for displaying its quick-look plots, subject to the following conditions:
Details will be provided once they have been defined.
As mentioned in 1.3, your team may already use or maintain an existing archive for its large data products, and you can use that to archive your large data products from CRYSTAL-FACE if the following conditions are satisfied: