This introdution covers:
-Dchadothis is used to get Artemis to look for the database. The address of the database (hostname, port and name) can be conveniently included as follows:
-Dchado="hostname:port/test?username"So that these details are already completed in the popup login pane.
-Djdbc.drivers=org.postgresqlthis is used to define the JDBC postgres driver .
-Dibatisuse the iBATIS Data Mapper
./art -Dchado="localhost:2996/test?tjc" -Dibatis \ -Djdbc.drivers=org.postgresql.Driver
A sequence can be opened in Artemis from the command line (without going
through the database manager). This is done by supplying a command line argument
with the organism and chromosome (or source feature):
Pfalciparum:Pf3D7_09and optionally a range can be included to just display features within it:
Pfalciparum:Pf3D7_09:92000..112000this could be used in combination with the -Doffset=base flag (e.g. -Doffset=10000) to open Artemis at a particular section of a sequence
To reduce the number of transactions to the database, all of the sequence is read into Artemis. This includes most of the feature qualifiers. There are some qualifiers (ortho/paralog and similarity qualifiers) that lazily load their data as and when it is needed, i.e. when opened for viewing in the gene builder. This lazy loading improves the performance of reading data from the database for sequences with a large number of features.
The SQL statements can be seen in the Artemis
Log Viewer window:
This is mainly useful for debugging and tracking problems with reading
from and writing to the database. Artemis uses
log4j to produce logging
and the configuration file for this is in the file 'etc/log4j.properties'.
The names (in red) are the internal database uniquenames. These names are automatically generated by the gene builder from an ID provided by the user. N.B. in our data model UTRs are represented as distinct from exons.
A gene builder for a selected gene feature can be opened from the 'Edit' menu by selecting the 'Selected Feature in Editor' option or simply using the 'E' shortcut key.
The Artemis Gene Builder
There are two distinct parts to the gene builder window. The top part shows the gene hierarchy and structure. The bottom part shows the annotation associated with one of the constituent features. These two parts of the gene builder are described below.
Structural changes can be carried out in the graphical view. The feature ends can be dragged to adjust their coordinates. On right clicking on this area there is a popup menu for adding and deleting features in the gene model.
Editing the Gene Model In the Gene Builder
Additional transcripts can be added from here. The checkbox to the right of the above CDS is used to hide and show the associated CDS in the Artemis feature display. This can make structural edits clearer for multiple transcripts.
Properties section
This section in the gene builder provides a form for adding and deleting GO, controlled curation, product, Riley class annotation. CV terms are added by clicking the 'ADD' button. When adding a term to a feature the user is prompted for the CV name and then keyword. The term to be added is then selected from a drop down list of terms containing the word or phrase. To further assist in finding the CV term from the list, typing in the text will start to autocomplete and scroll to the first matching term.
CV section
GO terms are selected from molecular_function, biological_process or cellular_component CV's.
Products are stored in Chado as a CV (i.e. in cvterm in a cv named 'genedb_products').
Other generic controlled curations can be found by Artemis and shown if their CV name in Chado is prefixed with 'CC_' (e.g. CC_controlledcuration, CC_workshop). These then appear in a drop down list when adding CV terms to a feature.
Adding new terms to the database can also be done from this section. In the drop down selection of CV's there is an 'Add term...' option. This opens an input panel for new terms.
Adding a new CV term
The ortholog/paralog tables provide links for opening the gene editor or an Artemis window for each entry. The 'VIEW' button opens a separate Artemis displaying the gene ortholog or paralog and the surrounding features.
In addition similarity qualifiers can be added here from matches to blast and fasta searches carried out in Artemis. These are added from the Artemis Object Editor.
Edit->Selected Feature(s)->Merge
The annotation and names from the segment first selected are maintained and the CDS features from the second gene model are added to the first selected gene model. The second gene model is deleted automatically.
To unmerge (split) the gene model into two gene models consecutive segments in the CDS are selected. This is done by clicking on the first segment and then pressing SHIFT and clicking on the second segment. Then use the menu option:
Edit->Selected Feature(s)->Unmerge
On unmerging the annotation and synonyms are maintained in both gene models. The second gene model component features are given a new internal ID (uniquename) based on the original and prefixed with DUP1-.
Commit Button
There is also an option under the 'File' menu to 'Commit To Database'. Note in ACT there is no commit button and the 'Commit To Database' menu option is used to write back to the database.
If there is an error during the commit then Artemis will provide the option to force commit. This means it will commit what it can. Naturally this can be potentially problematic. Therefore, committing back to the database frequently is encouraged. Any errors are reported in the log viewer.
Artemis records the time a features was last modified (timelastmodified). Before changing a feature it will check this time stamp against the database record of the timelastmodified. If the corresponding feature in the database has changed by another user it will ask whether to continue with the commit process.