| ||||||||||||||||||
Depending on how many programs you select to use, this server can take several minutes to run. It also depends on how many residues there are in the protein you submit. SFCheck, in particular, takes a couple of minutes to run itself, even with a medium sized structure.
The following table is provided to indicate typical lengths of time to run the 5 programs which require
only the PDB: PROCHECK,WHAT_CHECK,ERRAT,VERIFY_3D,PROVE
| Time required (updated July 12, 2006) | |||||
|---|---|---|---|---|---|
| # of residues | ERRAT alone | ERRAT + PROVE + Verify_3D | PROCHECK + WHATCHECK | ALL 5 | |
| 1GCN | |||||
| 1SDF | |||||
| 3LZM | |||||
| 3SEB | |||||
| 1EZM | |||||
| 1CEM | |||||
| 3TMK | |||||
SAVES is a server for analyzing protein structures for validity and assessing how correct they are. It utilizes 6 programs for doing this:
Structure Factor data is also analyzed using the program:Each of these programs produces large amounts of data that may or may not be useful in determining if any problems exist with a protein model. However, one must look at all of the output information in order to find where any problems are being reported. A useful application would be to have all of the output from all of the programs displayed on a single page. In this way a user could quickly exam all of the information produced from these programs and easily see where problems may exist.
This server is designed to run all of the programs and present the user with all of the data on a single page. Users do not have to run all validation programs, however. Users are presented with a page where they can select which programs to run. Or, a user can push the button on the front corresponding to the validation program that they want to run and run that one alone.
The evaluations the server makes of the outputs of each program are labeled by a simple
3 color scheme:
If the output or a section of the output satisfies the conditions the server has for that validation program, then the resulting page will show green boxes. If the validation program returns an error state, then the box will be red. Otherwise, the box will be orange indicating a warning.
The Structure Factor file should be in the .mtz (CCP4) format only. In the future we hope to allow users to submit CNS formatted files.
| SFCHECK | ||
|---|---|---|
For each test, a green rating is given if > 95% of the positions pass the test. |
||
| yellow rating if 90 - 95% of the positions pass the test. | ||
| Red rating if fewer than 90% of the positions pass the test. | ||
| PROCHECK | ||
| One of the many files produced from running PROCHECK is the summary file: file.sum. The first column in this file for each line indicates 1 of 3 things (as described in the file itself): Nothing or Blank | ||
| + | ||
| * | ||
| WHAT_CHECK | ||
| The output file produced from running WHAT_CHECK is divided up into sections. Each section is identified by 1 of 3 words: NOTE | ||
| WARNING | ||
| ERROR | ||
| ERRAT | ||
| The data plot produced from running ERRAT contains a percentage of the sequence that is below the 95% limit for each chain in the input structure. This percentage is reported and no color evaluation is made by SAVES. | ||
| VERIFY_3D | ||
| Using the averaged data points produced for each amino acid in the sequence, the number of times the value is greater than 0.2 is converted into the percentage of the sequence that has positions with values > 0.2. If this percentage is < 20% | ||
| 20 - 45% | ||
| >= 45% | ||
| PROVE | ||
| In the section delimiter identified with "**OUTLIERS", the percent of the number of buried outliers is given. If this number is < 1.0% | ||
| 1-5% | ||
| > 5% | ||
In order to run SFCHECK properly, the columns of data need to be indicated from the input file as
to which ones contain data for F, SIGF, and FREE.
The column types are identified by SAVES, however, a user's input file can have more than one column
of the same type, thus requiring the user to indicate which ones are to be used by SFCHECK. SAVES
will color code the columns that it thinks are the correct ones, but the user will still have to
select the columns to be used. This is the only program that requires user input to run properly.
Only files in the CCP4 mtz format are accepted.
Run SAVES now
| Laskoswki RA, MacArthur MW, Moss DS & Thorton JM. (1993). PROCHECK: a program to check the stereochemical quality of protein structures. J. Appl. Cryst. 26, 283-291. |
| Hooft, RWW, Vriend G, Sander C, Abola EE. (1996). Errors in protein structures. 381, 272-272. |
| Colovos C, Yeates TO. (1993). Verification of protein structures: patterns of nonbonded atomic interactions. Protein Sci. 2, 1511-1519 |
| Luthy R, Bowie JU, Eisenberg D. (1992). Assessment of protein models with three-dimensional profiles. Nature 356, 83-85. |
| Pontius J, Richelle J, Wodak SJ. (1996). Deviations from standard atomic volumes as a quality measure for protein crystal structures. J. Mol. Biol. 264, 121-136. |