Accurate prediction of protein dihedral angles through conditional random field

Shesheng ZHANG; Shengping JIN; Bin XUE

doi:10.1007/s11515-013-1261-3

Front. Biol. ›› 2013, Vol. 8 ›› Issue (3) :353 -361. DOI: 10.1007/s11515-013-1261-3

RESEARCH ARTICLE

Accurate prediction of protein dihedral angles through conditional random field

Author information +

History +

PDF (355KB)

Abstract

Identifying local conformational changes induced by subtle differences on amino acid sequences is critical in exploring the functional variations of the proteins. In this study, we designed a computational scheme to predict the dihedral angle variations for different amino acid sequences by using conditional random field. This computational tool achieved an accuracy of 87% and 84% in 10-fold cross validation in a large data set for ϕ and ψ, respectively. The prediction accuracies of ϕ and ψ are positively correlated to each other for most of the 20 types of amino acids. Helical amino acids can achieve higher prediction accuracy in general, while amino acids in beet sheet have higher accuracy at specific angular regions. The prediction accuracy of ϕ is negatively correlated with amino acid flexibility represented by Vihinen Index. The prediction accuracy of ϕ can also be negatively correlated with angle distribution dispersion.

Keywords

conditional random field / flexibility / angle distribution dispersion

Cite this article

Download citation ▾

Shesheng ZHANG, Shengping JIN, Bin XUE. Accurate prediction of protein dihedral angles through conditional random field. Front. Biol., 2013, 8(3): 353-361 DOI:10.1007/s11515-013-1261-3

登录浏览全文

4963

注册一个新账户忘记密码

Introduction

Subtle variations on amino acid sequences have been extensively observed in the studies on protein molecular biology. From the genetic point of view, mutation on nucleotides in coding region will result in a different codon. In the case of mis-sense mutation, this new codon will code for a different amino acid. Substitution of amino acids with similar properties may not severely affect the local and global structures of the protein and hence the function of the protein. However, substation to an amino acid with completely different physicochemical properties may disrupt the local and global 3-dimensional structures and eventually the function of the protein. Although it could be deleterious, amino acid substitution can also bring in novel functions to proteins and protein interaction networks (Ellegren, 2008; Tokuriki and Tawfik, 2009). That is the reason why the substitutions composed of the most magnificent pictures of competition and cooperation in the development of biological world.

Studying the influence of sequence variations on three-dimensional structures is still a formidable task. Traditional experimental methods for characterizing protein structures are X-ray and NMR. Although being very accurate, these two methods are both time and cost consuming, in addition to many critical technical issues in preparing protein samples. Another powerful experimental technique to study the local variance of amino acid on function is alanine-scanning mutagenesis (Cunningham and Wells, 1989; Ashkenazi et al., 1990; Gibbs and Zoller, 1991). This technique applies systematic alanine substitution to test various functional interactions. Since alanine has only backbone and C_β atoms, amino acid side-chain over C_β atoms will be removed upon the substitution. Through systematic substitutions and measurement of functionality, the functional epitopes and interaction sites can be identified. Nonetheless, alanine-scanning is laborious. All the substituted proteins have to be constructed, expressed, and re-folded in the experiments (Morrison and Weiss, 2001).

Hence, it is necessary to develop other efficient methods. Computational techniques are thus introduced into protein structure biology and are playing more and more important roles. Many computational tools are developed to predict three-dimensional structures, such as homology modeling, fold recognition, and ab initio modeling. Another category of computational methods are designed for various structural properties, such as secondary structure (Green et al., 2009), dihedral angles (Helles and Fonseca, 2009), accessible surface area (Chang et al., 2008), contact map (Xue et al., 2009), structural classes (Ahmadi et al., 2012), etc. The prediction of various structural properties is able to provide useful information on three-dimensional structure, active site, and function. The methods for the prediction of structural properties are mainly composed of various machine learning techniques. In all these machine learning methods, a consecutive segment of sequence is normally applied as the input. This unique scheme ensures that the influence of neighboring residues is taken into consideration properly. Besides, in all machine learning methods, only sequence related information was required for the prediction; none of the local structural information was needed in advance.

Here in this manuscript, we applied linear-chain Conditional Random Field (CRF) method to predict backbone dihedral angles. CRF is a very powerful undirected graphical technique similar to simple Hidden Markov Models (HMM). However, while HMM has very specific feature functions using constant probabilities, CRF can have any number and formality of feature functions. These characteristics make CRF very flexible and fitful for many purposes. Recently, CRF has been successfully applied in protein fold recognition (Liu et al., 2006), conformation sampling (Zhao et al., 2008), identification of phosphorylation site (Dang et al., 2008), and prediction of intrinsic disorder (Wang and Sauer, 2008). CRF can also be combined with artificial neural networks to sample protein conformation ensembles (Zhao et al., 2010). Therefore, we applied CRF function in a large data set of proteins to predict protein dihedral angles in this work. The 10-fold cross-validated prediction accuracy reached above 80%.

Method

Data set

Due to inadequate number of structures for single amino acid substitution in PDB, we came up with a compromised method to build up the data set. Since a sliding-window of amino acids will be used as inputs, we may only collect a group of non-homologous proteins as the data set. The proteins applied in this study are extracted from protein sequence culling server PISCES (Wang and Dunbrack, 2003). By choosing X-ray structure with resolution higher than 3 Ǻ and sequence identity lower than 25%, 2531 sequences were finally selected. This set of proteins has length variance from 60 to 1244 residues, with the average length of 232 residues.

Protein backbone representation

The protein backbone structure can be represented by

Ω = {(R k, Ψ k) | k = 1, 2, ..., N}

. Here, Ω represents a specific backbone conformation; R_k is the amino acid type of k-th residue; Ψ_k = (ϕ_k, ψ_k) is the dihedral angles of the k-th residue. Dihedral angles have initial values from -180 to 180 degrees. N is the total number of residues in the protein.

In the following analysis, while keeping the original values of ϕ, all the values of ψ were shifted by -50 degrees (Xue et al., 2008). Upon the angle-shifting, the populations of ψ near two extremes were reduced to zero and all the values of ψ have a very nice two-peak distribution. This angle-shifting technique improved the prediction accuracy of protein dihedral angles remarkably (Xue et al., 2008). After the angle-shifting, both ϕ and shifted ψ were further split into 18 bins with each bin corresponding to an angle difference of 20 degree. Finally, each ϕ_k and ψ_k has only a bin value from 1 to 18. We also applied 72 bins in the prediction. However, the results from 72-bin based prediction were only used in the distribution analysis.

Two-peak prediction

As shown by recent studies, both ϕ and shifted ψ angles have two-peak distribution (Xue et al., 2008). This observation leads to an interesting application: predict on which peak the dihedral angles are. The two-peak based prediction can provide high accurate information on the rough range of the dihedral angles. The rough range of dihedral angle is very helpful not only for understanding the structure, but also for refining the real value predictions (Faraggi et al., 2009). For these purposes, a two-peak analysis upon the amino acid substitution was also included in this paper.

Prediction evaluation

The prediction results were evaluated by Q10% for 18-bin based prediction, and 2-state true positive rate for two-peak prediction. In 18-bin based prediction, each angle is assigned a predicted value same as the middle value of the bin. After this assignment, Q10% is calculated as the true positive rate when predicted value is within 36 degrees of experimental value.

Conditional random field

Our main objective in this study is to predict the value of dihedral angles for a substituted residue under the condition of knowing the conformation of neighboring residues. For this purpose, the conditional probability in the CRF is defined as (Wallach, 2004; Sutton and McCallum, - Lafferty et al., ) (Faraggi et al., 2009):

P (s / o) = Π i exp ⁡ [Σ k μ k f k (o, s i, i)] / Z 0 .

Here, P(BoldItalic/BoldItalic) is the conditional probability. In which, “BoldItalic” is the states of angles of the predicted residues; “BoldItalic” is the input over a sliding window of 5 residues, including: the amino acid type of predicted residue, amino acid types of two neighboring residues at both side of predicted residue, and corresponding (ϕ, ψ) values of these four neighboring residues. f_k is one of the K feature functions. In this study, K is the product of number of residue types and number of angle bins. µ_k is the weight parameter of k-th feature function. Z₀ is the normalization factor defined by:

Z 0 = Σ s Π i exp ⁡ [Σ k μ k f k (o, s i, i)] .

The weight parameters of this CRF model can be obtained by maximizing the conditional log-likelihood BoldItalic on the training data set:

L = ln ⁡ P (s / o) = Σ i Σ k μ k f k (o, s i, i) - ln ⁡ Z 0 - Σ k μ k 2 / σ 2 .

In which the entire third item was introduced to prevent over-fitting and σ² = 50 was applied. This equation is convex and hence a global optimum can be expected. The equation was finally solved by a slightly modified Powell method for function maximization.

Angle distribution dispersion

To quantify the angle distribution, we calculated the angle distribution dispersion D= Σ_kabs(A_k - A_max)P_k. Here, A_k and P_k are the middle value of k-th bin and percentage of angles distributed in the k-th bin. A_max is the middle value of the bin with the highest percentage of angles. Σ_k is the summation over all the bins with percentage higher than the threshold value. In this study, the threshold value is taken as 1%. “abs” indicates the absolute value. To take consideration of angular periodicity, values of abs(A_k - A_max) larger than 180 will be subtracted by 180.

Results and discussion

Table 1 shows the prediction accuracy of ϕ and ψ for 20 types of amino acids under two different prediction schemes: 18-bin based and two-peak based predictions. The results of comparison among them were further illustrated in Fig. 1. Clearly, several trends can be observed in the figure: (1) the prediction accuracy under two-peak scheme is always better than that under 18-bin based scheme. The averaged increment on ϕ and ψ are 10% and 6% as shown in Table 1. Although this result is not a surprise, it may be very useful in designing new prediction strategies. (2) Prediction accuracies of ϕ are always higher than that of ψ for almost all amino acid residues. The only exception is Glysine. Under both prediction schemes, the prediction accuracy of ψ is 3% higher than that of ϕ for Glysine. (3) There are outliers in the distribution of prediction accuracy. The prediction accuracy of ψ for Glycine and Asparagine are remarkably lower than others. In the prediction of ϕ, Aspatic Acid in addition to Glysine and Asparagine also has an obvious lower prediction accuracy than others. Meanwhile, the prediction of ϕ for Proline goes to another extreme. Under both schemes, the prediction result for Proline is almost perfect. (4) For aliphatic amino acids (Isoleusine, valine, and Leusine), the prediction accuracies are highly consistent between two schemes and are the highest among all 20 amino acids.

To understand the reasons why there is so much divergence in the prediction accuracy among amino acids, we presented further analysis in Fig. 2. The first factor in our consideration is amino acid composition. Biased distribution of samples will influence the results of various machine-learning methods very frequently. However, as indicated by all three panels in Fig. 2, there is no direct connection between prediction accuracy and amino acid composition. Actually, six amino acids have consistently lower prediction accuracy on both ϕ and ψ. These amino acids are: glysine; asparagine; aspatic acid; serine; histidine; and cysteine. In these six amino acids, only histidine and cysteine have much lower composition than the averaged value of 5%. The other four amino acids have the similar or higher compositions than the average. As a comparison, tryptophan and methionine have similar composition as histidine and cysteine but much higher prediction accuracy.

In Fig. 2A and 2B, we further compared the relation between prediction accuracy and amino acid flexibility for both ϕ and ψ. Here, Vihinen Index is applied to indicate the amino acid flexibility (Vihinen et al., 1994). In general, low flexibility amino acids have higher prediction accuracy, while high flexibility amino acids have lower prediction accuracy. Nonetheless, histidine and cysteine are abnormal examples in low flexibility amino acids. Lysine and glutamic acid are strangers in high flexibility amino acids. Exclusion of these four amino acids could result in a very nice correlation between prediction accuracy and flexibility. Proline is also a peculiar amino acid. The prediction of ϕ for Proline is almost completely correct in Fig. 2A. However, the accuracy of ψ for Proline is in line with the expectation that accuracy is correlated to Vihinen Index as shown in Fig. 2B.

We also presented the correlation on prediction accuracy between ϕ and ψ in Fig. 2C. Apparently, there are good correlations among all the amino acids, except Proline, aspatic acid, asparagine, and glysine. These four residues seem to have more divergence from the diagonal line in Fig. 2C. Part of the reasons could be the low prediction accuracies in either ϕ or ψ. From Fig. 2C, we can further identify several groups of residues in respect of their prediction accuracies on both ϕ and ψ: (1) Group I has the highest prediction accuracy and has three aliphatic amino acids: isoleusine, valine, and leusine; (2) Group II has higher accuracy and has ten residues (methionine; glutamic acid; alanine; tryptophan; phenylalanine; tryrosine; glutamine; arginine; lysine; threonine); (3) Group III has three residues: cysteine; histidine; and serine; (4) Group IV has four residues: proline; aspatic acid; asparagine; and glysine.

Since the distribution of ψ has an obvious two peaks with a separation of about 170 degrees (Xue et al., 2008), it is interesting to check if the prediction accuracy at two peaks is similar or not. Meanwhile, the distribution of ϕ has two extremely unbalanced peaks with the angular distance of 120 degrees. The second peak is much smaller than the first one. It is also necessary to examine the influence of biased distribution on the prediction accuracy. The results were shown in Table 2 and Fig. 3. In general, no matter for peak I or peak II, two-peak prediction scheme will have higher accuracy than 18-bin based prediction scheme. In peak I region, the overall improvement for ϕ and ψ is 2% and 3%, respectively. In peak II regions, the improvement for ϕ and ψ is 19% and 8%, accordingly. In addition, 18-bin based scheme has higher prediction accuracy in peak I range than in peak II region. The accuracy for ϕ and ψ was improved by 16% and 1%, accordingly. However, two-peak based scheme give more credits on peak II region. The improvement on ϕ and ψ was 0.5% and 4%, respectively.

We also analyzed various relations on prediction accuracy between different schemes and peaks in Fig. 3. As shown in panels A and C, the prediction accuracy of ϕ between peak I and II can be correlated to each other except several outliers in both 18-bin and two-peak based schemes. However, it is difficult to find such a correlation in panels B and D. Meanwhile, two-peak based scheme had more accurate prediction than 18-bin based scheme.

For the prediction of ϕ, almost all the prediction accuracies were improved in two-peak based scheme compared to that in 18-bin based scheme. In panel A, the accuracy for proline is almost 100% in both peak I and II. Three aliphatic residues have the accuracy of more than 95% in peak I and around 90% in peak II. Another ten residues have an accuracy of around 95% at peak I and diversified range of accuracy from 65% to 85% at peak II. Histidine and Aspatic Acid are 90% accurate on peak I and 70% accurate on peak II. Asparagine has 84% and 70% on peak I and II, respectively. Glysine achieved also 84% on peak I, but only 45% on peak II. After applying the two-peak based scheme as shown in panel C, the accuracy on peak I for every amino acid was increased by several percent. For example, asparagine and glysine are reaching 86%; histidine and aspatic acid achieved 93%; and all other residues are well above 95%. In the meantime, tremendous improvement was achieved in the prediction of peak II. The accuracy of glysine was improved to 65%; asparagine is almost 95%; hisditine and aspartic acid are over 95%; Other residues are approaching 100% in this binary prediction.

For the prediction of ψ, we can also observe the similar improvement by applying two-peak based scheme. Originally in panel B, the amino acids can be roughly divided into four clusters by their accuracies on peak I and II: proline has 75% on peak I and 90% on peak II; glysine, asparagine and aspartic acid have less than 80% on peak I and only 60% on peak II; serine, threonine, cysteine, and histidine have leass than 80% on peak I but over 80% on peak II; other residues have over 85% on peak I and over 80% on peak II. In panel D which is from two-peak based scheme, although four clusters can still be observed, the overall accuracies were improved significantly except Proline.

Table 3 shows the analysis of prediction accuracy on various secondary structures. Generally, helical residues have higher accuracy than residues in beta-sheet on both ϕ and ψ. And the accuracy for residues in beta-sheet is higher than in coils. After splitting amino acids into peak I and II, residues in peak I keep the same trend while residues in peak II are different. In peak II regions, beta-sheet residues have the highest accuracy for ϕ, followed by helical residues and then coiled residues. For the prediction of ψ in peak II regions, although residues in beta-sheet still have the highest accuracy, the accuracy of coiled residues exceeds that of helical residues. This observation is in accordance with the knowledge that residues in various secondary structures have different distribution on their dihedral angles as shown by their Ramanchandran plot.

Figure 4 presents the distribution of predicted ϕ and ψ angles for 20 types of amino acids. The original distribution of ϕ calculated from the data set statistics has a large peak and a tinny peak. The larger peak has a head and a shoulder separated by ~60 degrees. The distribution between the locations of head and shoulder is quite flat. Here in Fig. 4A, although the predicted ϕ still has a separation of about 60 degrees between its two major peaks, the flat distribution between them is missing. In addition, Proline has only one major peak at around -70 degree. Aspartic Acid has a combined peak of head and shoulder. Glysine has three peaks with the major one at ~90 degree and far away from other amino acids. The original distribution of ψ from data set has also two peaks separated by 170 degree. In the distribution of predicted ψ, the location and separation of two peaks are maintained very well. Anyhow, we can see the shrunk separation for Asparagine and Aspartic Acid. Glysine also has three peaks with only one overlapped with other amino acids. The other two locate at -50 and 110 degree.

Figure 5 is the analysis on angle distribution dispersion. A nice correlation on angle distribution dispersion between ϕ and ψ can be observed for most amino acids in Fig. 5A. The outliers are: glysine; cysteine; tryptophan; leusine; aspartic acid. The first two amino acids have larger dispersion on ϕ, while the other three have larger dispersion on ψ. proline, Glutamic acid, valine, glutamine, alalnine, and isoleusine have smaller dispersion on both ϕ and ψ. Furthermore, we presented the dispersion for both ϕ and ψ as a function of amino acid flexibility in Fig. 5B. Clearly, there is an obvious negative correlation for ϕ and a marginal negative correlation for ψ. By taking into consideration that prediction accuracy of ϕ is also negatively correlated with Vihinen Index, we presented the relation between prediction accuracy and Vihinen Index for ϕ and ψ in Fig. 5C and 5D, accordingly. Obviously, the accuracy of ϕ can be linearly correlated with angle distribution dispersion.

Conclusion

We designed a computational model by applying conditional random field to predict dihedral angles. The overall accuracies by dividing dihedral angles into 18 bins are 86% for ϕ and 84% for ψ. In the case of two-peak prediction, the accuracies are improved to 96% for ϕ and 89% for ψ. Apparently, these high-accuracy predictions are very useful in understanding the structural variations of proteins.

In addition to the division of both dihedral angles into 18 equal bins, we also did the prediction by clustering angles into two peaks. By applying the two-peak based scheme, the prediction accuracy was further improved. The accuracy under two-peak scheme is positively correlated with the accuracy under 18-bin based scheme. Between two peaks, the prediction accuracy on ϕ showed correlations, but not for ψ. This is an interesting observation. As shown by previous study on the prediction of protein dihedral angles, the division of two peaks will be helpful in improving the overall prediction accuracy (Faraggi et al., 2009). Hence, the same strategy can be applied in the future on the study for amino acid substitution.

To understand the sources of the prediction error, we further analyzed the accuracies of 20 types of amino acids, as well as their relations with amino acid flexibility, composition, location, secondary structures, and angular distribution. As indicated by the analysis, glysine has the way low accuracy compared to other amino acids. Helical residues will usually have higher accuracy than other amino acids in other secondary structures. However, in specific angular regions, beta-sheet amino acids will have higher accuracy. The accuracy was also influenced by amino acid flexibility, but not amino acid composition.

Glycine is a very peculiar amino acid. From structural point of view, glycine has no side chains. Hence, glysine can be very flexible on its dihedral angles. This was reflected by several points in our study. First, glycine has a very large value of angle distribution dispersion on both dihedral angles. The dispersion of ϕ is almost twice larger than others, while the dispersion on ψ is the second largest among 20 amino acids. Second, the prediction accuracy for ψ of glycine is higher than that of ϕ. This is in contrary to all other amino acids. These properties make it very difficult to prediction dihedral angles of glycine. The similar situation is also observed for asparagine.

We further applied a quantity called angle distribution dispersion to describe the width of distribution for predicted angles. Other than above mentioned 18-bin and two-peak based schemes, we tested 72-bin based prediction and applied the results in distribution analysis. The angle distribution dispersion for ϕ is negatively correlated with amino acid flexibility. The prediction accuracy of ϕ can also be negatively correlated to the scale of angle distribution dispersion.

Compliance with ethics guidelines

Shesheng Zha, Shengping Jin, and Bin Xue declare that they have no conflict of interest.This article does not contain any studies with human or animal subjects by any of the authors.

References

Publishing order | Descend order by publishing year | Descend order by cited within

[1]	Ahmadi Adl A, Nowzari-Dalini A, Xue B, Uversky V N, Qian X (2012). Accurate prediction of protein structural classes using functional domains and predicted secondary structure sequences. J Biomol Struct Dyn, 29(6): 623–633

[2]	Ashkenazi A, Presta L G, Marsters S A, Camerato T R, Rosenthal K A, Fendly B M, Capon D J (1990). Mapping the CD4 binding site for human immunodeficiency virus by alanine-scanning mutagenesis. Proc Natl Acad Sci USA, 87(18): 7150–7154

[3]	Chang D T, Huang H Y, Syu Y T, Wu C P (2008). Real value prediction of protein solvent accessibility using enhanced PSSM features. BMC Bioinformatics, 9(Suppl 12): S12

[4]	Cunningham B C, Wells J A (1989). High-resolution epitope mapping of hGH-receptor interactions by alanine-scanning mutagenesis. Science, 244(4908): 1081–1085

[5]	Dang T H, Van Leemput K, Verschoren A, Laukens K (2008). Prediction of kinase-specific phosphorylation sites using conditional random fields. Bioinformatics, 24(24): 2857–2864

[6]	Ellegren H (2008). Comparative genomics and the study of evolution by natural selection. Mol Ecol, 17(21): 4586–4596

[7]	Faraggi E, Xue B, Zhou Y (2009). Improving the prediction accuracy of residue solvent accessibility and real-value backbone torsion angles of proteins by guided-learning through a two-layer neural network. Proteins, 74(4): 847–856

[8]	Faraggi E, Yang Y, Zhang S, Zhou Y (2009). Predicting continuous local structure and the effect of its substitution for secondary structure in fragment-free protein structure prediction. Structure, 17(11): 1515–1527

[9]	Gibbs C S, Zoller M J (1991). Identification of electrostatic interactions that determine the phosphorylation site specificity of the cAMP-dependent protein kinase. Biochemistry, 30(22): 5329–5334

[10]	Green J R, Korenberg M J, Aboul-Magd M O (2009). PCI-SS: MISO dynamic nonlinear protein secondary structure prediction. BMC Bioinformatics, 10(1): 222

[11]	Helles G, Fonseca R (2009). Predicting dihedral angle probability distributions for protein coil residues from primary sequence using neural networks. BMC Bioinformatics, 10(1): 338

[12]	Lafferty J, McCallum A, Pereira F(2001) Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. Proceedings of the 18th International Conference on Machine Learning 2001 (ICML 2001), pages 282–289.

[13]	Liu Y, Carbonell J, Weigele P, Gopalakrishnan V (2006). Protein fold recognition using segmentation conditional random fields (SCRFs). J Comput Biol, 13(2): 394–406

[14]	Morrison K L, Weiss G A (2001). Combinatorial alanine-scanning. Curr Opin Chem Biol, 5(3): 302–307

[15]	Sutton C, McCallum A (2007)An Introduction to Conditional Random Fields for Relational Learning. In: Getoor L, Taskar B, ed. Introduction to Statistical Relational Learning. MIT Press

[16]	Tokuriki N, Tawfik D S (2009). Stability effects of mutations and protein evolvability. Curr Opin Struct Biol, 19(5): 596–604

[17]	Vihinen M, Torkkila E, Riikonen P (1994). Accuracy of protein flexibility predictions. Proteins, 19(2): 141–149

[18]	Wallach H (2004). Conditional Random Fields: An Introduction. University of Pennsylvania CIS Technical Report MS-CIS-04-21

[19]	Wang G, Dunbrack R L Jr (2003). PISCES: a protein sequence culling server. Bioinformatics, 19(12): 1589–1591

[20]	Wang L, Sauer U H (2008). OnD-CRF: predicting order and disorder in proteins using [corrected] conditional random fields. Bioinformatics, 24(11): 1401–1402

[21]	Xue B, Dor O, Faraggi E, Zhou Y (2008). Real-value prediction of backbone torsion angles. Proteins, 72(1): 427–433

[22]	Xue B, Faraggi E, Zhou Y (2009). Predicting residue-residue contact maps by a two-layer, integrated neural-network method. Proteins, 76(1): 176–183

[23]	Zhao F, Li S, Sterner B W, Xu J (2008). Discriminative learning for protein conformation sampling. Proteins, 73(1): 228–240

[24]	Zhao F, Peng J, Xu J (2010). Fragment-free approach to protein folding using conditional neural fields. Bioinformatics, 26(12): i310–i317

RIGHTS & PERMISSIONS

Higher Education Press and Springer-Verlag Berlin Heidelberg

PDF (355KB)

1163

Accesses

Citation

Detail

Sections

Recommended

About the journal

Aims & scope

Description

Editorial board

Abstracting / indexing

Cover gallery

Contact us

Browse

Just accepted

Online first

Latest issue

All volumes and issues

Collections

Featured articles

Most accessed

Most cited