SWISS-MODEL Homology Modelling Report

Model Building Report

This document lists the results for the homology modelling project "Wuhan_protease" submitted to SWISS-MODEL workspace on Jan. 25, 2020, 4:11 a.m..The submitted primary amino acid sequence is given in Table T1.

If you use any results in your research, please cite the relevant publications:

Results

The SWISS-MODEL template library (SMTL version 2020-01-23, PDB release 2020-01-17) was searched with BLAST (Camacho et al.) and HHBlits (Remmert et al.) for evolutionary related structures matching the target sequence in Table T1. For details on the template search, see Materials and Methods. Overall 328 templates were found (Table T2).

Models

The following model was built (see Materials and Methods "Model Building"):

Model #01

File Built with Oligo-State Ligands GMQE QMEAN
PDB ProMod3 2.0.0 homo-dimer (matching prediction)
2 x DTZ: ZINC(II)HYDROGENSULFIDE;
0.99 0.15
Template Seq Identity Oligo-state QSQE Found by Method Resolution Seq Similarity Range Coverage Description
2z9j.1.A 96.08 homo-dimer 0.91 HHblits X-ray 1.95Å 0.61 1 - 302 1.00 3C-like proteinase

Included Ligands

Ligand Description
2 x DTZ
ZINC(II)HYDROGENSULFIDE

Excluded ligands

Ligand Name.Number Reason for Exclusion Description
DMS.1 Not biologically relevant.
DIMETHYL SULFOXIDE
DMS.2 Not biologically relevant.
DIMETHYL SULFOXIDE
DMS.3 Not biologically relevant.
DIMETHYL SULFOXIDE
DMS.4 Not biologically relevant.
DIMETHYL SULFOXIDE

Target    SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGH
2z9j.1.A SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIRKSNHSFLVQAGNVQLRVIGH

Target SMQNCVLKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSCGSVGFNIDYDCVSFC
2z9j.1.A SMQNCLLRLKVDTSNPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCVSFC

Target YMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYE
2z9j.1.A YMHHMELPTGVHAGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYE

Target PLTQDHVDILGPLSAQTGIAVLDMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQ
2z9j.1.A PLTQDHVDILGPLSAQTGIAVLDMCAALKELLQNGMNGRTILGSTILEDEFTPFDVVRQCSGVTFQ


Target SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGH
2z9j.1.B SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIRKSNHSFLVQAGNVQLRVIGH

Target SMQNCVLKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSCGSVGFNIDYDCVSFC
2z9j.1.B SMQNCLLRLKVDTSNPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCVSFC

Target YMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYE
2z9j.1.B YMHHMELPTGVHAGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYE

Target PLTQDHVDILGPLSAQTGIAVLDMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQ
2z9j.1.B PLTQDHVDILGPLSAQTGIAVLDMCAALKELLQNGMNGRTILGSTILEDEFTPFDVVRQCSGVTFQ




Materials and Methods

Template Search

Template search with BLAST and HHBlits has been performed against the SWISS-MODEL template library (SMTL, last update: 2020-01-23, last included PDB release: 2020-01-17).

The target sequence was searched with BLAST against the primary amino acid sequence contained in the SMTL. A total of 119 templates were found.

An initial HHblits profile has been built using the procedure outlined in (Remmert et al.), followed by 1 iteration of HHblits against NR20. The obtained profile has then be searched against all profiles of the SMTL. A total of 273 templates were found.

Model Building

Models are built based on the target-template alignment using ProMod3. Coordinates which are conserved between the target and the template are copied from the template to the model. Insertions and deletions are remodelled using a fragment library. Side chains are then rebuilt. Finally, the geometry of the resulting model is regularized by using a force field. In case loop modelling with ProMod3 fails, an alternative model is built with PROMOD-II (Guex et al.).

Model Quality Estimation

The global and per-residue model quality has been assessed using the QMEAN scoring function (Benkert et al.) . For improved performance, weights of the individual QMEAN terms have been trained specifically for SWISS-MODEL.

Ligand Modelling

Ligands present in the template structure are transferred by homology to the model when the following criteria are met: (a) The ligands are annotated as biologically relevant in the template library, (b) the ligand is in contact with the model, (c) the ligand is not clashing with the protein, (d) the residues in contact with the ligand are conserved between the target and the template. If any of these four criteria is not satisfied, a certain ligand will not be included in the model. The model summary includes information on why and which ligand has not been included.

Oligomeric State Conservation

The quaternary structure annotation of the template is used to model the target sequence in its oligomeric form. The method (Bertoni et al.) is based on a supervised machine learning algorithm, Support Vector Machines (SVM), which combines interface conservation, structural clustering, and other template features to provide a quaternary structure quality estimate (QSQE). The QSQE score is a number between 0 and 1, reflecting the expected accuracy of the interchain contacts for a model built based a given alignment and template. Higher numbers indicate higher reliability. This complements the GMQE score which estimates the accuracy of the tertiary structure of the resulting model.

References

Table T1:

Primary amino acid sequence for which templates were searched and models were built.

SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLKLKVDTANPKTPK
YKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTI
TVNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQC
SGVTFQ

Table T2:

Template Seq Identity Oligo-state QSQE Found by Method Resolution Seq Similarity Coverage Description
2z9j.1.B 96.08 homo-dimer 0.91 HHblits X-ray 1.95Å 0.61 1.00 3C-like proteinase
3vb3.1.A 96.08 homo-dimer 0.91 HHblits X-ray 2.20Å 0.61 1.00 3C-like proteinase
2z9j.1.A 96.08 homo-dimer 0.91 HHblits X-ray 1.95Å 0.61 1.00 3C-like proteinase
1uk3.1.B 96.08 homo-dimer 0.88 HHblits X-ray 2.40Å 0.61 1.00 3C-like proteinase
1uk3.1.A 96.08 homo-dimer 0.88 HHblits X-ray 2.40Å 0.61 1.00 3C-like proteinase
2a5i.1.A 96.08 homo-dimer 0.86 HHblits X-ray 1.88Å 0.61 1.00 3C-like peptidase
1uj1.1.B 96.08 homo-dimer 0.86 HHblits X-ray 1.90Å 0.61 1.00 3C-like proteinase
1z1i.1.A 96.08 monomer - HHblits X-ray 2.80Å 0.61 1.00 3C-like proteinase

The table above shows the top 8 filtered templates. A further 264 templates were found which were considered to be less suitable for modelling than the filtered list.
3tgn.1.B, 6jij.2.A, 3f9h.1.B, 3f9e.1.A, 6oip.1.A, 2zu2.1.A, 2p6s.1.F, 3v7d.2.A, 2w29.1.A, 1ves.1.A, 2ma3.1.A, 2q6d.1.B, 5wkj.1.A, 2f5c.1.A, 2v79.1.A, 6mak.1.A, 5j73.1.B, 2heo.1.B, 2heo.1.A, 5j73.1.A, 2p6s.1.B, 2hyf.3.A, 2hyf.3.B, 2hyf.3.C, 3l2o.1.A, 5nh0.1.A, 5nh0.1.B, 5nh0.1.C, 2amd.1.A, 2amd.1.B, 1ub5.2.A, 1wi9.1.A, 2liz.1.A, 4dnc.1.A, 3aw1.1.A, 3m3v.1.B, 3aw1.1.B, 5j73.2.B, 4yoi.1.A, 6oin.1.A, 5zqg.1.A, 5zqg.1.B, 5zu1.1.A, 5hyw.1.B, 2p5v.1.C, 2p5v.1.A, 2ev6.1.B, 3m3s.1.A, 2ivm.1.B, 2acj.1.F, 2a5k.1.A, 2alv.1.A, 4mds.1.A, 5v4b.1.A, 2w24.1.B, 4tww.1.B, 2v79.1.B, 2ivm.1.A, 4wmd.1.B, 2dbb.1.B, 2b0l.2.A, 2amp.1.A, 3mks.2.A, 4egz.1.B, 4egz.1.A, 5ibk.2.B, 5hyo.1.B, 1lvo.2.A, 1lvo.2.B, 5hyo.1.A, 5j9q.2.A, 6fv1.3.A, 2bx4.1.A, 3d62.1.A, 2vn2.1.B, 4rsp.2.A, 1lvo.1.B, 1lvo.1.A, 1mjb.1.A, 2yx4.1.A, 3c6n.1.A, 2yna.1.A, 1nex.1.A, 2cg4.1.A, 4f49.1.A, 3toa.1.A, 1ves.2.A, 2acj.1.E, 2acj.1.D, 3m3v.1.A, 6brp.2.B, 5j8f.1.A, 2qiq.1.A, 5hzg.2.C, 2qc2.1.A, 5d4r.2.A, 2e7x.1.A, 2qc2.1.B, 2ev6.1.A, 2op9.1.A, 2w29.1.B, 3v7d.1.A, 2op9.1.B, 4ylu.1.A, 3tgn.1.A, 4ylu.1.B, 2w24.1.A, 4xfq.1.B, 5c3n.1.B, 4xfq.1.A, 2giv.1.A, 4hi3.1.A, 3to6.1.A, 4hi3.1.B, 4yo9.1.B, 2efo.1.A, 4zuh.1.A, 4zuh.1.B, 2qcy.1.A, 6fv2.1.A, 1rvf.1.E, 2c3s.1.A, 2p6t.1.H, 3ebn.2.A, 2p6t.1.E, 2p6t.1.G, 3tlo.1.B, 3tlo.1.A, 5gk9.1.A, 3mks.3.A, 2vn2.1.A, 1fqv.1.B, 2b0l.1.A, 2duc.1.A, 3fzd.1.A, 4yxy.1.A, 1lvo.3.B, 4wmf.1.C, 2pn6.1.A, 2vj1.1.B, 2vj1.1.A, 6maj.1.A, 2xvc.1.A, 3f9g.1.B, 2cfx.1.A, 3f9g.1.A, 2hyf.3.D, 3ea7.1.B, 2p1n.3.A, 3f6o.1.B, 5j9u.1.A, 2p6s.1.D, 2rvc.1.A, 1p9s.1.A, 3c6p.1.A, 6ct2.1.A, 2y0m.1.A, 2b0l.1.B, 5gwz.1.A, 1ub9.1.A, 4egy.1.A, 3ea9.1.A, 3f6o.1.A, 3iwm.1.H, 3iwm.1.E, 2hyg.1.A, 3iwm.1.G, 3iwm.1.F, 5hzg.1.C, 4czd.1.B, 5b6o.1.A, 6bro.2.B, 5b6o.1.B, 5eu8.1.A, 2l54.1.A, 2bx3.1.B, 6brp.1.B, 2dbb.1.A, 4wme.1.A, 3i4p.1.A, 2pwx.1.A, 3qah.1.A, 2q6d.1.A, 3e91.1.A, 1v4r.1.A, 5wci.1.A, 3sna.1.A, 2efn.1.A, 4yog.1.B, 5j73.2.A, 2cyy.1.A, 3o3k.1.A, 2qz8.1.A, 5zu1.1.C, 3udw.1.A, 4wmd.1.A, 5c3n.1.A, 4u0y.1.D, 4u0y.1.A, 2pq8.1.A, 3to9.1.A, 4u0y.1.B, 3ebn.1.A, 2vn2.2.A, 5d4s.2.B, 5ibk.1.A, 2gxb.1.B, 3ogl.1.A, 2gxb.1.A, 1z1j.1.A, 5j8c.1.A, 1z1j.1.B, 2p1q.1.A, 6ba2.1.A, 3tob.1.A, 2q6g.1.B, 2q6g.1.A, 5hyw.2.B, 2f5d.1.B, 3f9h.1.A, 5gwy.1.B, 2f5e.1.A, 3d23.1.A, 3d23.1.C, 2gtb.1.A, 2acj.1.C, 2e7w.1.A, 4zro.1.B, 4zro.1.C, 5vzu.1.A, 4zro.1.A, 1mja.1.A, 2p5v.1.D, 4zro.1.D, 3m3t.1.A, 1ub5.1.A, 6brq.1.B, 3ea8.1.A, 1z67.1.A, 2ovr.1.A, 1sfx.1.B, 1sfx.1.A, 1j75.1.A, 1qwg.1.A, 6p7v.1.B, 3c6o.1.A, 2q6d.2.A, 2efp.1.A, 1q2w.1.B, 2k7x.1.A, 6brq.2.B, 5kvr.1.A, 2l4a.1.A, 1mj9.1.A, 5zu1.1.B, 4czd.2.B, 5j9t.1.A, 1q2w.1.A, 4kmf.1.A, 5zu1.1.D, 5gwy.1.A, 6jij.1.A, 2q6f.1.B, 2gt8.1.A