 
PFRMAT TS 
TARGET T0052 
AUTHOR 5529-3140-9255 
REMARK Prediction team BENNER-COHEN has two group leaders, 
REMARK but we will consistently use Steven Benner's predictor 
REMARK number to avoid confusion concerning a second team 
REMARK connected with Fred Cohen (called Cohen, Fred). 
REMARK (Fred Cohen's number as a group leader is 6140-7890-6093). 
REMARK Prediction team members: D.L.Gerloff, G.Cannarozzi, 
REMARK M.Joachimiak, M.Cueto, F.E.Cohen & S.A.Benner. 
METHOD Important comment: We submit T0052 as a "modeling 
METHOD exercise" as the lack of homologous sequences in the 
METHOD databases precluded the use of the systematic methods 
METHOD developed by members of our team. In particular, the 
METHOD prediction is NOT based on the SAINT secondary structure 
METHOD prediction system (Benner et al., U.of Florida). 
METHOD To illustrate this fact, we submit coordinates only. 
METHOD 
METHOD Methodology used in this prediction for T0052 (CN-V): 
METHOD The prediction process was mostly manual and based on 
METHOD different, sometimes contradicting, clues found through  
METHOD sequence analysis and in the literature, as described 
METHOD below.  
METHOD Firstly, we inspected the automated secondary structure 
METHOD predictions available to us and potentially useful even 
METHOD if used in the absence of homologous sequence information 
METHOD (PHD and J.M.Chandonia, personal communication). The in- 
METHOD consistencies between the outputs by different methods 
METHOD prompted us to disregard the predicted secondary structure 
METHOD in this case! Instead, we attempted to conclude from the 
METHOD position of the DISULFIDE BRIDGES and the indications in  
METHOD the sequence on the number and approximate position of  
METHOD SEGMENTS OF POSSIBLE SECONDARY STRUCTURE. On these grounds,  
METHOD we favor a model with a total of up to 7 short segments, or half- 
METHOD segments, probably separated only by a bend in the protein chain  
METHOD (in contrast to a sharp turn). Thus the 7 short segments are  
METHOD likely to be found in approximately five stretches of secondary 
METHOD structure. Further, like others before us, we found a putative  
METHOD INTERNAL REPEAT in the target sequence (#1-50, 51-101).  
METHOD Interestingly, this piece of information was IN CONFLICT 
METHOD WITH A POSSIBLE, DISTANT HOMOLOGY TO 1PMD that was found through 
METHOD "second frame" homology searches. 
METHOD With this information, wea ttempted to perform an EXHAUSTIVE 
METHOD COMBINATORIAL ANALYSIS of the possible folding topology of T0052, 
METHOD assuming that the repeated domains are superimposable and contain 
METHOD predominately (exclusively) beta-sheet structure. 
METHOD However, the analysis using a similar scheme as we applied 
METHOD quite successfully in CASP1 on synaptotagmin (Proteins 
METHOD 1995, 22:299-310), FAILED TO REDUCE THE NUMBER OF POSSIBLE 
METHOD FOLDING TOPOLOGIES to the desired level for submission at 
METHOD at CASP3. This is because we could not find sufficiently 
METHOD strong and reliable "filters", for example a prediction 
METHOD of tight turns vs. wide turns/connections as is often 
METHOD possible with the help of multiple sequence information. 
METHOD 
METHOD Therefore, and due to time pressure, we submit our model 
METHOD as an "educated guess", illustrating that we decided to 
METHOD rely on the postulated internal repeat rather than on the 
METHOD the conflicting secondary structures. Further, we disre- 
METHOD garded the output of the UCLA-DOE fold recognition server 
METHOD programs as no significant matches were returned.  
METHOD Instead, the packing constraints for the putative 50 res. 
METHOD domains seem to FAVOR AN ALL-BETA FOLD PREDICTION. 
METHOD The possible homology to 1pmd (163-263) was, again,  
METHOD conflicting with an internally repeated structure made of 
METHOD two lobes. Intriguingly, however, we are able to propose 
METHOD a model predicting partial structural similarity of T0052 
METHOD with a small domain in 1prc (photosythetic react. center) 
METHOD which, if duplicated as putatively in T0052, has some 
METHOD resemblance in shape with the domain of interest in 1pmd. 
METHOD As the crystal structure 1pmd is at low resolution and 
METHOD undoubtedly faulty, this could be an indication for how 
METHOD to correct 1pmd, if correct. However, according to our 
METHOD combinatorial analysis (see above), there are several 
METHOD other topologies that would meet all the "filtering" 
METHOD constraints we could bring forward. For example, a  
METHOD model made of two symmetry-equivalent SH3-like domains 
METHOD would also be conceivable. 
METHOD 
METHOD Both of the lobes of our model are constructed partly 
METHOD onto the topology of 1prc (res. 155h-196h). The remaining 
METHOD parts of the model, and the possible ROTATIONAL SYMMETRY 
METHOD between the two lobes was modeled manually, ab initio.  
METHOD (coordinates generated using SYBYL, Tripos). The ab initio 
METHOD fragments are mostly to accommodate the necessary changes 
METHOD in the structure for adopting a two-lobe structure with 
METHOD a shared part of the core. As such, the overall folding 
METHOD arrangement (topology) is more important in our prediction  
METHOD than the detail conformation of the polypeptide  
METHOD chain of the shortfragments, in the case of T0052! 
METHOD Backbone and C-beta coordinate are submitted, but the 
METHOD model is to be considered at the resolution equivalent 
METHOD to not more than a wire model... 
MODEL 1 
REMARK     T0052 Cyanovirin-N, Nostoc ellipsosporum 
PARENT N/A               
ATOM    319  N   ASP    44       4.541   6.220   4.059  1.00 10.00               
ATOM    320  CA  ASP    44       3.450   6.427   5.008  1.00 10.00               
ATOM    321  C   ASP    44       2.378   7.353   4.500  1.00 10.00               
ATOM    322  O   ASP    44       1.215   7.048   4.712  1.00 10.00               
ATOM    323  CB  ASP    44       3.889   6.850   6.427  1.00 10.00               
ATOM    327  N   GLY    45       2.723   8.470   3.825  1.00 10.00               
ATOM    328  CA  GLY    45       1.669   9.341   3.330  1.00 10.00               
ATOM    329  C   GLY    45       1.531   9.264   1.835  1.00 10.00               
ATOM    330  O   GLY    45       1.334  10.302   1.223  1.00 10.00               
ATOM    331  N   SER    46       1.631   8.054   1.239  1.00 10.00               
ATOM    332  CA  SER    46       1.543   7.949  -0.209  1.00 10.00               
ATOM    333  C   SER    46       2.809   8.418  -0.875  1.00 10.00               
ATOM    334  O   SER    46       3.457   9.315  -0.360  1.00 10.00               
ATOM    335  CB  SER    46       0.283   8.613  -0.799  1.00 10.00               
ATOM    337  N   LEU    47       3.167   7.804  -2.016  1.00 10.00               
ATOM    338  CA  LEU    47       4.430   8.172  -2.628  1.00 10.00               
ATOM    339  C   LEU    47       4.235   8.682  -4.029  1.00 10.00               
ATOM    340  O   LEU    47       3.227   8.386  -4.651  1.00 10.00               
ATOM    341  CB  LEU    47       5.406   6.985  -2.457  1.00 10.00               
ATOM    345  N   LYS    48       5.214   9.461  -4.526  1.00 10.00               
ATOM    346  CA  LYS    48       5.115   9.905  -5.903  1.00 10.00               
ATOM    347  C   LYS    48       6.334   9.503  -6.684  1.00 10.00               
ATOM    348  O   LYS    48       7.153  10.333  -7.048  1.00 10.00               
ATOM    349  CB  LYS    48       4.746  11.398  -6.009  1.00 10.00               
ATOM    354  N   TRP    49       6.436   8.191  -6.960  1.00 10.00               
ATOM    355  CA  TRP    49       7.508   7.755  -7.831  1.00 10.00               
ATOM    356  C   TRP    49       6.950   6.921  -8.946  1.00 10.00               
ATOM    357  O   TRP    49       6.382   5.873  -8.686  1.00 10.00               
ATOM    358  CB  TRP    49       8.623   6.993  -7.091  1.00 10.00               
ATOM    368  N   GLN    50       7.125   7.394 -10.196  1.00 10.00               
ATOM    369  CA  GLN    50       6.623   6.598 -11.301  1.00 10.00               
ATOM    370  C   GLN    50       7.364   5.295 -11.403  1.00 10.00               
ATOM    371  O   GLN    50       8.583   5.309 -11.453  1.00 10.00               
ATOM    372  CB  GLN    50       6.510   7.306 -12.667  1.00 10.00               
ATOM    377  N   PRO    51       6.627   4.165 -11.426  1.00 10.00               
ATOM    378  CA  PRO    51       7.266   2.867 -11.493  1.00 10.00               
ATOM    379  C   PRO    51       7.424   2.287 -10.113  1.00 10.00               
ATOM    380  O   PRO    51       7.821   1.139  -9.993  1.00 10.00               
ATOM    381  CB  PRO    51       6.140   2.071 -12.181  1.00 10.00               
ATOM    384  N   SER    52       7.112   3.078  -9.065  1.00 10.00               
ATOM    385  CA  SER    52       7.192   2.561  -7.708  1.00 10.00               
ATOM    386  C   SER    52       6.129   1.519  -7.525  1.00 10.00               
ATOM    387  O   SER    52       4.970   1.872  -7.383  1.00 10.00               
ATOM    388  CB  SER    52       8.574   1.974  -7.347  1.00 10.00               
ATOM    390  N   ASN    53       6.553   0.238  -7.527  1.00 10.00               
ATOM    391  CA  ASN    53       5.631  -0.843  -7.230  1.00 10.00               
ATOM    392  C   ASN    53       5.294  -0.704  -5.773  1.00 10.00               
ATOM    393  O   ASN    53       5.836  -1.389  -4.920  1.00 10.00               
ATOM    394  CB  ASN    53       6.325  -2.191  -7.520  1.00 10.00               
ATOM    398  N   PHE    54       4.372   0.237  -5.509  1.00 10.00               
ATOM    399  CA  PHE    54       4.060   0.563  -4.136  1.00 10.00               
ATOM    400  C   PHE    54       2.600   0.876  -4.015  1.00 10.00               
ATOM    401  O   PHE    54       1.891   1.028  -4.998  1.00 10.00               
ATOM    402  CB  PHE    54       4.837   1.832  -3.742  1.00 10.00               
ATOM    409  N   ILE    55       2.167   0.967  -2.748  1.00 10.00               
ATOM    410  CA  ILE    55       0.781   1.312  -2.488  1.00 10.00               
ATOM    411  C   ILE    55       0.458   2.777  -2.585  1.00 10.00               
ATOM    412  O   ILE    55       0.822   3.505  -1.675  1.00 10.00               
ATOM    413  CB  ILE    55      -0.314   0.342  -2.972  1.00 10.00               
ATOM    417  N   GLU    56      -0.247   3.231  -3.642  1.00 10.00               
ATOM    418  CA  GLU    56      -0.743   4.595  -3.592  1.00 10.00               
ATOM    419  C   GLU    56      -1.924   4.667  -2.660  1.00 10.00               
ATOM    420  O   GLU    56      -2.803   3.822  -2.731  1.00 10.00               
ATOM    421  CB  GLU    56      -0.974   5.283  -4.955  1.00 10.00               
ATOM    426  N   THR    57      -1.931   5.676  -1.768  1.00 10.00               
ATOM    427  CA  THR    57      -3.023   5.794  -0.825  1.00 10.00               
ATOM    428  C   THR    57      -3.837   7.029  -1.106  1.00 10.00               
ATOM    429  O   THR    57      -3.281   8.059  -1.452  1.00 10.00               
ATOM    430  CB  THR    57      -2.600   5.574   0.639  1.00 10.00               
ATOM    433  N   CYS    58      -5.174   6.899  -1.001  1.00 10.00               
ATOM    434  CA  CYS    58      -6.040   7.924  -1.565  1.00 10.00               
ATOM    435  C   CYS    58      -7.258   8.319  -0.766  1.00 10.00               
ATOM    436  O   CYS    58      -7.435   7.853   0.348  1.00 10.00               
ATOM    437  CB  CYS    58      -6.487   7.359  -2.929  1.00 10.00               
ATOM    439  N   ARG    59      -8.113   9.169  -1.390  1.00 10.00               
ATOM    440  CA  ARG    59      -9.417   9.502  -0.819  1.00 10.00               
ATOM    441  C   ARG    59     -10.382   8.384  -1.129  1.00 10.00               
ATOM    442  O   ARG    59     -10.935   7.806  -0.209  1.00 10.00               
ATOM    443  CB  ARG    59     -10.057  10.742  -1.493  1.00 10.00               
TER 
END 
