The Graphical Query Language:
A GHMM-based tool for querying and clustering Gene-Expression time-course data

(c) 2003-2004 Alexander Schliep

Tutorial GQLQuery

Overview

GQLQuery is a graphical interface for querying time course data given a HMM model. The interface is divided in two main parts: the Model panel in the left, where you can see and modify the model's parameters and the Query panel, where information of the queried time courses is displayed.

Getting Started

Run GQLQuery with the following command (or by double-clicking at the executable binary files):

 python GQLQuery.py 

After running GQLQuery, open the file yspo_2fold.txt with the Menu File -> Open Data Set. You will see all available time courses plotted in the Query panel. Next, open the sample model yspo_upregulated.smo in the Menu File -> Open Query. The Model panel is updated to reflect the loaded model parameters (see Documentation for file formats).

Making a Query

In our example the model has two steps (or states), with probabilities emissions around 0.0 for the first step and around 3.0 for the second step. By changing the similarity rank, using the corresponding slider in the Query panel, only the best fitting time-courses are displayed in the Graph (in this case, up-regulated genes).
The user can also modify the loaded model parameters. Try changing the emission probability of step 2, by changing the slide of step 2 down. If the new emission is negative, only down-regulated genes will be displayed (given that you have set a stringent similarity rank threshold).
Going back to the loaded model, you could also try increasing the duration parameter of step 1, favoring late up-regulated time courses.
New steps can also be included, by clicking in the "Append Step" button in the lower part of the Model panel. For example, add a new step, and make it emit around zero, and (optionally), to sharpen its emissions, lower its variance slide. Now, genes with the behavior non-regulated, up-regulated and non-regulated will be displayed. By adding or deleting new steps, you can try to explore more complex or simple patterns in the data.
GQLQuery also displays information about the queried genes. By pointing with the mouse cursor over one time course, its identifier or any annotation provided in the data file can be viewed in the lower part of the Query panel. Furthermore, the color bar shown below the plot area, displays in what order the steps of the query are used to generate the respective time-course in the given model (the colors of the bars match with the corresponding step colors).

Saving your Results

After all analysis is done, all results can be saved. Through the Menu File the user can: save the current model, save the current plot in a eps format or create a data file containing only the queried time-courses.