9407
Comment:
|
9549
|
Deletions are marked like this. | Additions are marked like this. |
Line 124: | Line 124: |
= JSOC Development Projects = == Exports == * [wiki:AjaxJsocConnect AJAX Style Web Access for JSOC] == Remote DRMS/SUMS - netDRMS == |
Welcome to the Wiki for JSOC software development
For general information including the basic setup to get started see the [wiki:JsocUsersGuide Users Guide].
News
JSOC 3.2 released -- Sep 15, 2006 See [http://jsoc.stanford.edu/release-3.2.notes release notes]
JSOC 3.1 released -- Aug 24, 2006 See [http://jsoc.stanford.edu/release-3.1.notes release notes]
JSOC 3.0 released -- Aug 14, 2006. See [http://jsoc.stanford.edu/release-3.0.notes release notes]
JSOC 2.3 released -- Aug 10, 2006. See [http://jsoc.stanford.edu/release-2.3.notes release notes]
JSOC 2.2 released -- May 30, 2006. See [http://jsoc.stanford.edu/release-2.2.notes release notes]
JSOC 2.1 released -- May 24, 2006. See [http://jsoc.stanford.edu/release-2.1.notes release notes]
JSOC 2.0 released -- February 03, 2006. See [http://jsoc.stanford.edu/release-2.0.notes release notes]
JSOC 1.0 released -- October 19, 2005. See [http://jsoc.stanford.edu/release-1.0.notes release notes]
DRMS Data Series
[http://jsoc.stanford.edu/doc/DRMSreport_html/index.html DRMS (html)] -- DRMS Overview[http://hmi.stanford.edu/development/JSOC_Documents/DRMS_V10.pdf (as pdf)] - old doc, contents being verified and included in these pages.
[wiki:DrmsNames DRMS Names Summary] -- DRMS Dataset Names and Queries BNF summary
[http://hmi.stanford.edu/doc/JSOC/DRMS_dataset_names.pdf DRMS Dataset Names] -- Full description (pdf)
[wiki:DrmsSeriesNames DRMS Series Names] -- Data Series Reserved Names
- Data Storage, Archive, On-Line Retention, EtC.
[wiki:SumsDataModel SUMS Data Storage] -- SUMS Basic Data Storage Concepts
[wiki:SumsArchiveTimes SUMS Archive and Retention Table] -- Meaning and implications of the values
- [wiki:Jsd .jsd] -- JSOC Series Definition Files
- Data series utilities to be run from a user's shell
[wiki:DrmsCreateSeriesCmd create_series] -- Create entries and tables for a new series in the DRMS database
[wiki:DrmsDescribeSeriesCmd describe_series] -- Prints a verbose description of the
named series and its current highest record number on stdout.
[wiki:DrmsDeleteSeriesCmd delete_series] -- Removes a series and all its associated entries from DRMS.
JSOC Sessions, Pipelines, and Modules (''Oh my!'')
JSOC programs that use DRMS to operate on DataSeries are called "modules". Modules are run in "sessions". HMI and AIA major processing tasks are accomplished in "pipelines" consisting of one or more sessions. Pipelines are started by "PUI" (Pipeline User Interface) usually by the JSOC production team. Pipelines may also be initiated by users requesting [wiki:DataSet DataSets] via the web or by team members running locally or remotely. A DataSet is a collection of records selected by a query. In essence a dataset name is simply the query that describes it.
A DRMS Session is the basic unit of computing that interracts with DRMS and SUMS. At the start of a session the user connects to the DRMS database. During the session the user runs one or more modules which read or create [wiki:DataRecord DataRecords] in DataSeries. Access to the actual data stored in SUMS is accomplished within a module via the DRMS API. At the end of a session, SUMS is notified to save any new records online and/or on tape, or to delete records marked temporary to the session.
Actually using the JSOC DRMS requires running a program or module. By "program" we mean a normal shell command and by "module" we mean a program built to run within a DRMS session and communication to a drms_server. There are four types of programs/modules:
Modules - Most programs that do the work of the user of JSOC are what we call "modules". On the outside modules look like programs. They must run in a DRMS session. If they are built with the normal jsoc_main program they will use an existing session if they are run from a Session Provider or will start their own use-once session if they are called stand-alone from the shell.
Utility programs like [wiki:DrmsCreateSeriesCmd create_series] and [wiki:DrmsDescribeSeriesCmd describe_series] which are usually used to manage the existence of dataseries, not to use dataseries. These programs talk directly to the database.
Session Providers like [wiki:DrmsRunCmd drms_run] or later the [wiki:JsocPui Pipeline User Interface] start DRMS sessions and execute a script file. They can also be used to execute a single instance of a module.
[wiki:DrmsServerCmd drms_server] which connects connects to the database and serves sessions. Most users will not need to start drms_server explicitly.
The benefit of running programs as "modules" will hopefully become apparent when we start running complex pipelines using hundreds of processors.
Setting up Your Own DRMS
Information for developers outside the JSOC who wish to construct an independent data archive that can work in cooperation with the JSOC and other archives (or completely independently) can be found on the [wiki:DRMSSetup Setting up Your Own DRMS] page.
General Information
DRMS Man Pages
All the JSOC "man" pages are now maintained with [http://www.stack.nl/~dimitri/doxygen/index.html doxygen], a semi-automatic documentation tool. They are available in html form via:
[http://jsoc.stanford.edu/doxygen_html/ Drms Developer's Documentation]BR
All programs and functions should have entries in the doxygen generated pages but some do not yet. Old documentation still exists for these:
[wiki:DrmsServerCmd drms_server],
[wiki:DrmsQueryCmd drms_query],
[wiki:DrmsCreateSeriesCmd create_series],
[wiki:DrmsDescribeSeriesCmd describe_series],
[wiki:DrmsDeleteSeriesCmd delete_series]
Note that all modules built with jsoc_main share a basic set of flags and command line keywords. See module.1 in man1.
We will once again soon have unix-style man pages at [http://jsoc.stanford.edu/man/index.html man1], but not yet:
Limits
There are limits ...
- memory limits on number of records in the cache (512Meg / (2.5*record size) ). While this may seem like a lot, for datasets with a lot of keywords (e.g. mdi_vw_V_06h) it can be a real limit to the number of records that can be open at a time. For the vw_V example it means that DRMS_QUERY_MEM should be set to at lest 2500 (yes 2.5 gig) to open 100 days of one-minute data. Modules expecting to need tens of thousands of records opened should arrange to do the work in blocks with drms_close_records used to empty the record cache to free memory.
- length of names of series, keywords, etc. (64 chars)
- length of comma sep list of prime key names (1024 chars)
- length of descriptions of series, keywords, links, and segments (254 chars)
- length of string values of keywords (dont know)
- number of keywords in a record (dont know)
- number of records in a series (no fixed limit)
- length of segment filename (255 chars)
- length of path (511 chars)
Log Files - Processing meta-data
There are log files. Stdout and Stderr are captured in files as well as shown during processing (depending on module and -v flag). These are all put into a SUMS directory and indexed in DRMS by session ID. The session ID is stored in each record so the log files can be retrieved if/when needed. Unless otherwise specified, the default retention time for log files is the maximum retention time of all SUs processed in the current session. The log files are archived if any one of the SUs in the current session is to be archived.
- drms_server logs --
- The default is no logging. When the logging option is turned on (-L), stdout and stderr are redirected to files in SU directory.
- module logs --
- The default is no logging. When the logging option is turned on (-L), stdout and stderr are tee-ed to files in SU directory.
Software Development - Building Modules
JSOC Software Tree
[wiki:FileStruct Software File Tree] -- Organization of files and cvs tree.
[http://jsoc.stanford.edu/cvs/ GUI for JSOC CVS] -- GUI access to DRMS and SUMS software
[http://jsoc.stanford.edu/cvs/*checkout*/jsoc/src/base/libdrms/drms_types.h?rev=HEAD&content-type=text/plain drms_types.h] -- DRMS types and structure defintions.
Making a JSOC/DRMS Module
[wiki:DrmsModule DRMS Module] -- DRMS Module Structure and Overview
[wiki:DrmsApi DRMS API] -- DRMS Data Types and Structures and API
[wiki:DrmsMakeModule DRMS Module Compilation] -- Running 'make' for modules
Making a JSOC/DRMS Library
[wiki:JsocLibrary JSOC LIbrary] -- Creating and using a JSOC library
Notes on JSOC Makefile
[wiki:JsocMakefileBackground Background/Reference]
[wiki:JsocMakefileAdd How to add a sub directory]
Development Notes
[wiki:DevNotes Development Notes Page]
Database Administration
- [wiki:PgDBAdmin PostgreSQL]
JSOC Backup and Restore
- [wiki:JSOC Backup/Restore Notes Page]
SUM API
DSDS-Data Access from JSOC
[http://hmi.stanford.edu/development/JSOC_Documents/JSOC_DSDS_Interface_Specification.pdf Interface Specification (.pdf)]
JSOC Development Projects
Exports
[wiki:AjaxJsocConnect AJAX Style Web Access for JSOC]