Структура и функционирование белков. Применение методов биоинформатики - Джон Ригден 2014

Распознавание фолда
Литература

Altschul SF, Madden TL, Schaffer AA, et al. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25:3389-3402

Bateman A and Finn RD (2007) SCOOP: a simple method for identification of novel protein super- family relationships. Bioinformatics 23:809-814

Bennett-Lovsey RM, Herbert AD, Sternberg MJ, et al. (2008) Exploring the extremes of sequence/structure space with ensemble fold recognition in the program Phyre. Proteins. 70:611-625

Berman HM, Westbrook J, Feng Z, et al. (2000) The protein data bank. Nucleic Acids Res 28:235- 242

Bowie JU, Lüthy R, Eisenberg D (1991) A method to identify protein sequences that fold into a known three-dimensional structure. Science 253:164-170

Bradford JR, Westhead DR (2005) Improved prediction of protein-protein binding sites using a support vector machines approach. Bioinformatics 21:1487-1494

Bryant SH (1996) Evaluation of threading specificity and accuracy. Proteins 26(2): 172-185

Busuttil S, Abela J, and Pace GJ (2004) Support vector machines with profile-based kernels for remote protein homology detection. Genome Inform Ser Workshop Genome Inform 15:191—200

Chivian D, Baker D (2006) Homology modeling using parametric alignment ensemble generation with consensus and energy-based model selection. Nucleic Acids Res 34:el 12

Copley RR, Bork P (2000) Homology among (beta/alpha)(8) barrels: implications for the evolution of metabolic pathways. J Mol Biol 303:627-641

Dodson G Wlodawer A (1998) Catalytic triads and their relatives. Trends Biochem Sci 23:347-352

Elofsson A (2002) A study on protein sequence alignment quality. Proteins 46:330-339 Fisher D (2003) 3D-SHOTGUN: a novel, cooperative, fold-recognition meta-predictor. Proteins 51:434—441

Garg A, Bhasin M, Raghava GP (2005) Support vector machine-based method for subcellular localization of human proteins using amino acid compositions, their order and similarity search. J Biol Chem 280:14427-14432

Ginalski К, Elofsson А, Fischer D, et al. (2003) 3D-Jury: a simple approach to improve protein structure predictions. Bioinformatics 19:1015-1018

Heger A, Mallick S, Wilton C, et al. (2008) The global trace graph, a novel paradigm for searching protein sequence databases. Bioinformatics 23:2361-2367

Hou Y, Hsu W, Lee ML, et al. (2003) Efficient remote homology detection using local structure. Bioinformatics 19:2294-2301.

Jaakkola T, Diekhans M, Haussier D (2000) A discriminative framework for detecting remote protein homologies. J Comput Biol 7:95-114

Jain AK, Duin RPW, Mao JC (2000) Statistical pattem recognition: A review. IEEE Trans Pattem Anal 22:4-37

Jaroszewski L, Li W, Godzik A (2002) In search for more accurate alignments in the twilight zone. Prot Sci 11:1702-1713

Jones DT (1999a) Protein secondary structure prediction based on position-specific scoring matrices. J Mol Biol 292:195-202.

Jones DT (1999b) GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences. J Mol Biol 287:797-815

Jones DT, Taylor WR, Thornton JM (1992) A new approach to protein fold recognition. Nature 358:86-89

Kabsch W, Sander C (1983) Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22:2577-2637

Kelley LA, MacCallum RM, Sternberg MJ (2000) Enhanced genome annotation using structural profiles in the program 3D-PSSM. J Mol Biol 299:499-520

Kim H, Park H (2003) Prediction of protein relative solvent accessibility with support vector machines and long-range interaction 3D local descriptor. Proteins 54:557-562

Kumar M, Bhasin M, Natt NK, et al. (2005) BhairPred: prediction of beta-hairpins in a protein from multiple alignment information using ANN and SVM techniques. Nucleic Acids Res 33 (Web Server issue): 154-159

Kuncheva LI, Whitaker CJ (2003) Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy. Mach Learn 51:181—207

Lathrop RH (1999) An anytime local-to-global optimization algorithm for protein threading in theta (m2n2) space. J Comput Biol 6(3-4):405-418

Lathrop RH, Smith TF (1996) Global optimum protein threading with gapped alignment and empirical pair potentials. J Mol Biol 255:641-665

Leslie C, Eskin E, Noble WS (2002) The spectrum kernel: a string kernel for SVM protein classification. Рас Symp Biocomput 564-575

Leslie CS, Eskin E, Cohen A, et al. (2004) Mismatch string kernels for discriminative protein classification. Bioinformatics 20:467-476

Liao L, Noble WS (2003) Combining pairwise sequence similarity and support vector machines for detecting remote protein evolutionary and structural relationships. J Comput Biol 10:857-868 Madej T, Gilbrat J-F, Bryant SH (1995) Threading a database of protein cores. Proteins 23:356-369

Marsden RL, Lee D, Maibaum M, et al. (2006) Comprehensive genome analysis of 203 genomes provides structural genomics with new insights into protein family space. Nucleic Acids Res 34:1066-1080

McGuffin LJ (2008) The ModFOLD server for the quality assessment of protein structural models. Bioinformatics 24:586-587

Miyazawa S, Jemigan RL (1996) Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. J Mol Biol 256(3):623-644

Moult J, Fidelis К, Kryshtafovych A, et al. (2007) Critical assessment of methods of protein structure prediction - Round VII. Proteins 69 S8:3-9

Murzin AG Brenner SE, Hubbard T, et al. (1995) SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol 247:536-540

Nguyen MN, Rajapakse JC (2003) Multi-class support vector machines for protein secondary structure prediction. Genome Inform Ser Workshop Genome Inform 14:218-227

Ohlson T, Wallner B, Elofsson A (2004) Profile-profile methods provide improved fold-recognition: a study of different profile-profile alignment methods. Proteins 57:188-197

Park J, Teichmann SA, Hubbard T, et al. (1997) Intermediate sequences increase the detection of homology between sequences. J Mol Biol 273:349-354

Pearson WR (1998) Empirical statistical estimates for sequence similarity searches. J Mol Biol 276:71-84

Ponting CP, Russell RB (2000) Identification of distant homologues of fibroblast growth factors suggests a common ancestor for all beta-trefoil proteins. J Mol Biol 302:1041-1047

Prasad JC, Vajda S, Camacho CJ (2004) Consensus alignment server for reliable comparative modeling with distant templates. Nucleic Acids Res 32:W50-W54

Richmond TJ (1984) Solvent accessible surface area and excluded volume in proteins. Analytical equations for overlapping spheres and implications for the hydrophobic effect. J Mol Biol 178:63-89

Rychlewski L, Jaroszewski L, Li W, Godzik A (2000) Comparison of sequence profiles. Strategies for structural predictions using sequence information. Protein Sci 9:232-241

Science Editorial (2005) So much more to know. Science 309:78-102

Seringhaus M, Gerstein M (2007) Chemistry Nobel rich in structure. Science 315:40-41

Sippl MJ (1990) Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. J Mol Biol 213:859-883

Skolnick J, Kihara D (2000) Defrosting the frozen approximation: PROSPECTOR - a new approach to threading. Proteins 42:319-331

Soeding J (2005) Protein homology detection by HMM-HMM comparison. Bioinformatics 21:951— 960

Tanaka S, Scheraga HA (1976) Medium- and long-range interaction parameters between amino acids for predicting three-dimensional structures of proteins. Macromolecules 9:945-950 Tang CL, Xie L, Koh IY, et al. (2003) On the role of structural information in remote homology detection and sequence alignment: new methods using hybrid sequence profiles. J Mol Biol 334:1043-1062

Tress ML, Jones D, Valencia A (2003) Predicting reliable regions in protein alignments from sequence profiles. J Mol Biol 330:705-718

Venclovas C, Margelevicius M (2005) Comparative modeling in CASP6 using consensus approach to template selection, sequence-structure alignment, and structure assessment. Proteins(Suppl 7):99-105

Wallner B, Elofsson A (2005) Pcons5: combining consensus, structural evaluation and fold recognition scores. Bioinformatics 21:4248-4254

Wallner B, Elofsson A (2006) Dentification of correct regions in protein models using structural, alignment, and consensus information. Prot Sei 15:900-913

Westhead DR, Collura VP, Eldridge MD, et al. (1995) Protein fold recognition by threading: comparison of algorithms and analysis of results. Protein Eng 8:1197-1204

Weston J, Elisseeff A, Zhou D, et al. (2004) Protein ranking: from local to global structure in the protein similarity network. PNAS 101:6559-6563

Xia Y, Levitt M (2000) Extracting knowledge-based energy functions from protein structures by error rate minimization. Comparison of methods using lattice model. J Chem Phys 113:9318— 9330

Xu J, Li M, Kim D, et al. (2003) RAPTOR: optimal protein threading by linear programming. J Bioinform Comput Biol 1:95—117

Xu Y, Xu D, Uberbacher EC (1998) An efficient computational method for globally optimal threading. J Comput Biol 5:597-614

Zachariah MA, Crooks GE, Holbrook SR, Brenner SE (2005) A generalized affine gap model significantly improves protein sequence alignment accuracy. Proteins 58:329-338

Zhang Y (2007) Template-based modeling and free modeling by I-TASSER in CASP7. Proteins(Suppl 8): 108-117

Zhang Y, Skolnick J (2005) The protein structure prediction problem could be solved using the current PDB library. Proc Natl Acad Sci USA 102:1029-1034

Zhou H, Zhou Y (2005) Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments. Proteins 58:321-328