※ User Guide:


Frequently Asked Questions:

1. Q: How to use GPS 5.0 software?

A: You can find the latest version of GPS 5.0 at http://gps.biocuckoo.cn/download.php. Then download and install the GPS 5.0 software to your computer.Currently, GPS 5.0 is implemented in JAVA and could be installed on a computer with Windows/Linux/Unix/Mac OS . And we also wrote a manual for users which included in the installation package.

 

2. Q: What's the difference between simple prediction and comprehensive prediction?

A: The only difference between simple prediction and comprehensive prediction is that the simple prediction didn't provide annoations of surface accessbility and secondary structure. The annoations of surface accessbility and secondary structure were provided by NetSurfP ver. 1.1 [PMID: 19646261], which needs long-time computation. So, in the simple prediction, the surface accessbility and secondary structure are not visulized.

 

3. Q: How to read the GPS 5.0 results?

A: Here we use the human protein Beclin-1 as the example. After clicking "Submit", the prediction results of AKT-catalyzed sites with medium threshold are shown as follows:


<1>. The table of the GPS 5.0 results (Page 1)

ID: The name/id of the protein sequence that you input to predict.

Position: The position of the site which is predicted to be phosphorylated.

Code: The residue which is predicted to be phosphorylated.

Kinase: The regulatory kinase which is predicted to phosphorylate the site.

Peptide: The predicted phosphopeptide with 7 amino acids upstream and 7 amino acids downstream around the modified residue.

Score: The value calculated by GPS algorithm to evaluate the potential of phosphorylation. The higher the value, the more potential the residue is phosphorylated.

Cutoff: The cutoff value under the threshold. Different threshold means different precision, sensitivity and specificity.

Source: Whether this phosphorylation site validated by experiment, "Exp." means YES, while "Pred." means NO.

Logo: The sequence logo of this phosphopeptide.


<2>. The visualization of simple prediction

Part 1: The visualization for protein disordered region predicted by IUPred [PMID: 15955779]. Cutoff = 0.5, if score of prediction > cutoff, the residue is considered in disordered region.

Part 2:
Up: The visualization for the positional distribution of the predicted site in protein sequence.
Down left: The distribution of S/T/Y sites in kinase families.
Down middle: The distribution of S/T/Y sites.
Down Right: The distribution of S/T/Y sites in disordered region.


<3>. The visualization of comprehensive prediction

Part1:
Top: The surface accessbility of amino acids and the protein disordered region were predicted by NetSurfP ver. 1.1 (PMID: 19646261) and IUPred (PMID: 15955779), respectively. The cutoff of disordered region prediction = 0.5, if score of prediction > cutoff, the residue is considered in disordered region. The cutoff of surface accessbility prediction = 0.25, if score of prediction > cutoff, the residue is considered as surface exposed residue. Bottom: The positions of the predicted phosphorylation sites were visualized in the protein sequence together with the secondary structure predicted by NetSurfP ver. 1.1 (PMID: 19646261).

Part 2 :
Left: The distribution of S/T/Y sites in kinase families.
Middle left: The distribution of S/T/Y sites.
Middle right: The distribution of S/T/Y sites in secondary structure.
Right: The distribution of S/T/Y sites in disordered region.

 

4. Q: Is GPS 5.0 accurate?

A: Yes, but not all. Prediction of kinase-specific phosphorylation sites is a greatly difficult problem. If the training data is enough, the prediction is satisfying and accurate. But for many protein kinases, the training data set are very limited, to make the performance lower. For kinase-specific prediction, no algorithm or approach could reach the best performances for all of the protein kinases. However, by comparison, the prediction performances of GPS are better or at least comparable with previous tools. And also, we will updated the GPS routinely to make it more accurate and powerful.

 

5. Q: How to choose the cut-off values and the thresholds?

A: Firstly, we calculated the theoretically maximal false positive rate (FPR) for each PK cluster. The three thresholds of GPS 5.0 were decided based on calculated FPRs.For serine/threonine kinases, the high, medium and low thresholds were established with FPRs of 2%, 6% and 10%. And for tyrosine kinases, the high, medium and low thresholds were selected with FPRs of 4%, 9% and 15%. in substrates.

 

6. Q: What's the meaning of False Positive Rate (FPR)?

A: The false positive rate (FPR) is the proportion of negative sites that are erroneously predicted as positive hits. Given a data set containing all of non-phosphorylation sites, the real FPR could be easily computed. However, precise calculation of FPR is unavailable due to lack of a "gold-standard" negative data set. Here we developed a simple and fast method to construct the near-negative data set and estimate the theoretically maximal FPRs. Firstly, we calculated the distributions of amino acids composition in six organisms, including S. cerevisiae, S. pombe, C. elegans , D. melanogaster, M. musculus, and H. sapiens. Then we randomly generated 10,000 PSP(30,30) peptides to construct a near-negative data set based on the real frequencies of twenty amino acids in eukaryotic proteomes. Although there were a few sites to be real hits, its proportion would be very small. The process was repeated twenty times and the average FPR was calculated by GPS 5.0 as the theoretically maximal FPR. Also, the negative sites could be randomly retrieved from eukaryotic proteomes. And the results from both methods are very similar.

 

7. Q: I was trying to install the software on macbook pro but my installer says the file is damaged. How can I properly install the software in Mac OS?

A: By default, Mac OS 10.8 only allows users to install applications from 'verified sources'. In effect, this means that users are unable to install most applications downloaded from the internet. You can follow the directions below to prevent this error message from appearing.
(1) Open the Preferences. This can be done by either clicking on the System Preferences icon in the Dock or by going to Apple Menu > System Preferences.
(2) Open the Security & Privacy pane by clicking Security & Privacy.
(3) Make sure that the General section of the the Security & Privacy pane is selected. Click the icon labeled Click the lock to prevent further changes.
(4) Enter your username and password into the prompt that appears and click Unlock.
(5) Under the section labeled Allow applications downloaded from, select Anywhere. On the prompt that appears, click Allow From Anywhere.
(6) Exit System Preferences by clicking the red button in the upper left of the window. You should now be able to install applications downloaded from the internet.

 

8. Q: I have a few questions which are not listed above, how can I contact the authors of GPS 5.0?

A: Please contact the major author: Dr. Yu Xue for details.

 

 

9. Q: I was trying to install the software in Mac OS but my installer says the file is damaged. How can I properly install the software in Mac OS?

A: By default, Mac OS 10.8 or later only allows users to install applications from 'verified sources'. In effect, this means that users are unable to install most applications downloaded from the internet. You can follow the directions below to prevent this error message from appearing.

(1) Open the Preferences. This can be done by either clicking on the System Preferences icon in the Dock or by going to Apple Menu > System Preferences.
(2) Open the Security & Privacy pane by clicking Security & Privacy.
(3) Make sure that the General section of the the Security & Privacy pane is selected. Click the icon labeled Click the lock to prevent further changes.
(4) Enter your username and password into the prompt that appears and click Unlock.
(5) Under the section labeled Allow applications downloaded from, select Anywhere. On the prompt that appears, click Allow From Anywhere.
(6) Exit System Preferences by clicking the red button in the upper left of the window. You should now be able to install applications downloaded from the internet.