SWISS-MODEL Homology Modelling Report

Model Building Report

This document lists the results for the homology modelling project "Wuhan_N Protein" submitted to SWISS-MODEL workspace on Jan. 22, 2020, 3:33 p.m..The submitted primary amino acid sequence is given in Table T1.

If you use any results in your research, please cite the relevant publications:

Results

The SWISS-MODEL template library (SMTL version 2020-01-15, PDB release 2020-01-10) was searched with BLAST (Camacho et al.) and HHBlits (Remmert et al.) for evolutionary related structures matching the target sequence in Table T1. For details on the template search, see Materials and Methods. Overall 75 templates were found (Table T2).

Models

The following models were built (see Materials and Methods "Model Building"):

Model #01

File Built with Oligo-State Ligands GMQE QMEAN
PDB ProMod3 2.0.0 monomer
None
0.22 -6.60
Template Seq Identity Oligo-state QSQE Found by Method Resolution Seq Similarity Range Coverage Description
1ssk.1.A 92.37 monomer 0.00 BLAST NMR - 0.61 44 - 174 0.31 Nucleocapsid protein

The template contained no ligands.


Target    MSDNGPQNQRNAPRITFGGPSDSTGSNQNGERSGARSKQRRPQGLPNNTASWFTALTQHGKEDLKFPRGQGVPINTNSSP
1ssk.1.A -------------------------------------------GLPNNTASWFTALTQHGKEELRFPRGQGVPINTNSGP

Target DDQIGYYRRATRRIRGGDGKMKDLSPRWYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIGTRNPANNAAIVLQ
1ssk.1.A DDQIGYYRRATRRVRGGDGKMKELSPRWYFYYLGTGPEASLPYGANKEGIVWVATEGALNTPKDHIGTRNPNNNAATVLQ

Target LPQGTTLPKGFYAEGSRGGSQASSRSSSRSRNSSRNSTPGSSRGTSPARMAGNGGDAALALLLLDRLNQLESKMSGKGQQ
1ssk.1.A LPQGTTLPKGFYAE------------------------------------------------------------------

Target QQGQTVTKKSAAEASKKPRQKRTATKAYNVTQAFGRRGPEQTQGNFGDQELIRQGTDYKHWPQIAQFAPSASAFFGMSRI
1ssk.1.A --------------------------------------------------------------------------------

Target GMEVTPSGTWLTYTGAIKLDDKDPNFKDQVILLNKHIDAYKTFPPTEPKKDKKKKADETQALPQRQKKQQTVTLLPAADL
1ssk.1.A --------------------------------------------------------------------------------

Target DDFSKQLQQSMSSADSTQA
1ssk.1.A -------------------




Model #02

File Built with Oligo-State Ligands GMQE QMEAN
PDB ProMod3 2.0.0 monomer (matching prediction)
None
0.17 -2.02
Template Seq Identity Oligo-state QSQE Found by Method Resolution Seq Similarity Range Coverage Description
2jw8.1.A 95.76 homo-dimer 0.40 HHblits NMR - 0.60 247 - 364 0.28 Nucleocapsid protein

The template contained no ligands.


Target    MSDNGPQNQRNAPRITFGGPSDSTGSNQNGERSGARSKQRRPQGLPNNTASWFTALTQHGKEDLKFPRGQGVPINTNSSP
2jw8.1.A --------------------------------------------------------------------------------

Target DDQIGYYRRATRRIRGGDGKMKDLSPRWYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIGTRNPANNAAIVLQ
2jw8.1.A --------------------------------------------------------------------------------

Target LPQGTTLPKGFYAEGSRGGSQASSRSSSRSRNSSRNSTPGSSRGTSPARMAGNGGDAALALLLLDRLNQLESKMSGKGQQ
2jw8.1.A --------------------------------------------------------------------------------

Target QQGQTVTKKSAAEASKKPRQKRTATKAYNVTQAFGRRGPEQTQGNFGDQELIRQGTDYKHWPQIAQFAPSASAFFGMSRI
2jw8.1.A ------TKKSAAEASKKPRQKRTATKQYNVTQAFGRRGPEQTQGNFGDQDLIRQGTDYKHWPQIAQFAPSASAFFGMSRI

Target GMEVTPSGTWLTYTGAIKLDDKDPNFKDQVILLNKHIDAYKTFPPTEPKKDKKKKADETQALPQRQKKQQTVTLLPAADL
2jw8.1.A GMEVTPSGTWLTYHGAIKLDDKDPQFKDNVILLNKHIDAYKTFP------------------------------------

Target DDFSKQLQQSMSSADSTQA
2jw8.1.A -------------------




Materials and Methods

Template Search

Template search with BLAST and HHBlits has been performed against the SWISS-MODEL template library (SMTL, last update: 2020-01-15, last included PDB release: 2020-01-10).

The target sequence was searched with BLAST against the primary amino acid sequence contained in the SMTL. A total of 33 templates were found.

An initial HHblits profile has been built using the procedure outlined in (Remmert et al.), followed by 1 iteration of HHblits against NR20. The obtained profile has then be searched against all profiles of the SMTL. A total of 47 templates were found.

Model Building

Models are built based on the target-template alignment using ProMod3. Coordinates which are conserved between the target and the template are copied from the template to the model. Insertions and deletions are remodelled using a fragment library. Side chains are then rebuilt. Finally, the geometry of the resulting model is regularized by using a force field. In case loop modelling with ProMod3 fails, an alternative model is built with PROMOD-II (Guex et al.).

Model Quality Estimation

The global and per-residue model quality has been assessed using the QMEAN scoring function (Benkert et al.) . For improved performance, weights of the individual QMEAN terms have been trained specifically for SWISS-MODEL.

Ligand Modelling

Ligands present in the template structure are transferred by homology to the model when the following criteria are met: (a) The ligands are annotated as biologically relevant in the template library, (b) the ligand is in contact with the model, (c) the ligand is not clashing with the protein, (d) the residues in contact with the ligand are conserved between the target and the template. If any of these four criteria is not satisfied, a certain ligand will not be included in the model. The model summary includes information on why and which ligand has not been included.

Oligomeric State Conservation

The quaternary structure annotation of the template is used to model the target sequence in its oligomeric form. The method (Bertoni et al.) is based on a supervised machine learning algorithm, Support Vector Machines (SVM), which combines interface conservation, structural clustering, and other template features to provide a quaternary structure quality estimate (QSQE). The QSQE score is a number between 0 and 1, reflecting the expected accuracy of the interchain contacts for a model built based a given alignment and template. Higher numbers indicate higher reliability. This complements the GMQE score which estimates the accuracy of the tertiary structure of the resulting model.

References

Table T1:

Primary amino acid sequence for which templates were searched and models were built.

MSDNGPQNQRNAPRITFGGPSDSTGSNQNGERSGARSKQRRPQGLPNNTASWFTALTQHGKEDLKFPRGQGVPINTNSSPDDQIGYYRRATRRIRGGDGK
MKDLSPRWYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIGTRNPANNAAIVLQLPQGTTLPKGFYAEGSRGGSQASSRSSSRSRNSSRNSTPG
SSRGTSPARMAGNGGDAALALLLLDRLNQLESKMSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKAYNVTQAFGRRGPEQTQGNFGDQELIRQGTDYKH
WPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYTGAIKLDDKDPNFKDQVILLNKHIDAYKTFPPTEPKKDKKKKADETQALPQRQKKQQTVTLLPAADL
DDFSKQLQQSMSSADSTQA

Table T2:

Template Seq Identity Oligo-state QSQE Found by Method Resolution Seq Similarity Coverage Description
1ssk.1.A 92.37 monomer - BLAST NMR NA 0.61 0.31 Nucleocapsid protein
4ud1.1.A 55.41 monomer - BLAST X-ray 2.48Å 0.47 0.37 N PROTEIN
1ssk.1.A 92.70 monomer - HHblits NMR NA 0.61 0.33 Nucleocapsid protein
4ud1.1.A 53.50 monomer - HHblits X-ray 2.48Å 0.46 0.37 N PROTEIN
2ofz.1.A 92.06 monomer - HHblits X-ray 1.17Å 0.61 0.30 Nucleocapsid protein
4kxj.1.A 44.27 monomer - HHblits X-ray 2.65Å 0.42 0.31 Nucleoprotein
2gec.1.B 39.53 homo-dimer 0.12 HHblits X-ray 1.30Å 0.39 0.31 Nucleocapsid protein
2gec.1.A 39.53 homo-dimer 0.12 HHblits X-ray 1.30Å 0.39 0.31 Nucleocapsid protein
4j3k.1.A 44.27 monomer - HHblits X-ray 2.00Å 0.42 0.31 Nucleoprotein
2bxx.1.A 40.32 monomer - HHblits X-ray 1.85Å 0.40 0.30 NUCLEOCAPSID PROTEIN
4lmc.1.A 43.85 monomer - HHblits X-ray 1.74Å 0.42 0.31 Nucleoprotein
4kxj.1.A 47.97 monomer - BLAST X-ray 2.65Å 0.44 0.29 Nucleoprotein
4lm9.1.A 43.85 monomer - HHblits X-ray 1.60Å 0.42 0.31 Nucleoprotein
2jw8.1.A 95.76 homo-dimer 0.40 HHblits NMR NA 0.60 0.28 Nucleocapsid protein
2jw8.1.B 95.76 homo-dimer 0.40 HHblits NMR NA 0.60 0.28 Nucleocapsid protein
4j3k.1.A 47.97 monomer - BLAST X-ray 2.00Å 0.44 0.29 Nucleoprotein
4lm9.1.A 47.97 monomer - BLAST X-ray 1.60Å 0.44 0.29 Nucleoprotein
4lmc.1.A 47.97 monomer - BLAST X-ray 1.74Å 0.44 0.29 Nucleoprotein
2c86.1.A 40.32 monomer - HHblits X-ray 3.00Å 0.40 0.30 NUCLEOCAPSID PROTEIN
2bxx.1.A 40.68 monomer - BLAST X-ray 1.85Å 0.40 0.28 NUCLEOCAPSID PROTEIN
2gec.1.B 40.65 homo-dimer 0.12 BLAST X-ray 1.30Å 0.40 0.29 Nucleocapsid protein
2gec.1.A 40.65 homo-dimer 0.12 BLAST X-ray 1.30Å 0.40 0.29 Nucleocapsid protein
2cjr.1.A 95.76 homo-dimer 0.50 HHblits X-ray 2.50Å 0.60 0.28 NUCLEOCAPSID PROTEIN
2cjr.2.B 95.76 homo-dimer 0.50 HHblits X-ray 2.50Å 0.60 0.28 NUCLEOCAPSID PROTEIN
2cjr.1.B 95.76 homo-dimer 0.50 HHblits X-ray 2.50Å 0.60 0.28 NUCLEOCAPSID PROTEIN
2cjr.2.A 95.76 homo-dimer 0.50 HHblits X-ray 2.50Å 0.60 0.28 NUCLEOCAPSID PROTEIN
2btl.1.A 38.71 monomer - HHblits X-ray 1.95Å 0.40 0.30 INFECTIOUS BRONCHITIS VIRUS NUCLEOCAPSID PROTEIN
2btl.2.A 38.71 monomer - HHblits X-ray 1.95Å 0.40 0.30 INFECTIOUS BRONCHITIS VIRUS NUCLEOCAPSID PROTEIN
2cjr.3.B 95.76 homo-dimer 0.43 HHblits X-ray 2.50Å 0.60 0.28 NUCLEOCAPSID PROTEIN
2c86.1.A 40.68 monomer - BLAST X-ray 3.00Å 0.40 0.28 NUCLEOCAPSID PROTEIN
2btl.2.A 38.98 monomer - BLAST X-ray 1.95Å 0.40 0.28 INFECTIOUS BRONCHITIS VIRUS NUCLEOCAPSID PROTEIN
2btl.1.A 38.98 monomer - BLAST X-ray 1.95Å 0.40 0.28 INFECTIOUS BRONCHITIS VIRUS NUCLEOCAPSID PROTEIN
2cjr.3.A 95.76 homo-dimer 0.43 HHblits X-ray 2.50Å 0.60 0.28 NUCLEOCAPSID PROTEIN

The table above shows the top 33 filtered templates. A further 30 templates were found which were considered to be less suitable for modelling than the filtered list.
1je5.1.B, 1je5.1.A, 2cjr.1.A, 2gib.1.A, 5wsg.1.J, 6j6n.1.J, 6g13.1.A, 6g13.1.B, 3hd4.1.A, 5mq0.1.H, 6bk8.1.O, 2jw8.1.A, 2jw8.1.B, 2cjr.4.A, 2ge8.2.B, 2cjr.4.B, 6exn.1.H, 2cjr.2.B, 2cjr.2.A, 5epw.1.A, 2cjr.1.B, 2ge8.1.B, 2ge8.1.A, 2ca1.1.B, 2ca1.1.A, 2cjr.3.B, 2cjr.3.A, 5gm6.1.X, 2ge7.1.A, 5n4k.1.A