====== Protein structuring ====== Genes set the order that amino acids (the chemical building blocks of proteins) appear in the proteins which they code for. But, working from the gene, the form which the protein's 3-D structure will take cannot as yet be predicted. The extremely complex shapes in which the protein 'folds' has a profound effect on the properties it has within an organism. The so-called [[https://en.wikipedia.org/wiki/Protein_structure_prediction|'protein folding problem']] first described more than half a century ago, means that pharmaceutical and bio-informatic researchers (for example) are faced with very extensive problems when trying to design new medicines and enzymes - and also in understanding how currently known examples actually work. >We have little experimental knowledge of protein-folding energy landscapes. >[...] >We cannot consistently predict the structures of proteins to high accuracy. We do not have a quantitative microscopic understanding of the folding routes or transition states for arbitrary amino acid sequences. We cannot predict a protein’s propensity to aggregate, which is important for aging and folding diseases. We do not have algorithms that accurately give the binding affinities of drugs and small molecules to proteins. We do not understand why a cellular proteome does not precipitate, because of the high density inside a cell. We know little about how folding diseases happen, or how to intervene."\\ \\ //Source :// [[https://www.researchgate.net/profile/Ken_Dill/publication/233770794_The_Protein-Folding_Problem_50_Years_On/links/00b7d51a7648358726000000.pdf|The protein-folding problem, 50 years on]] in //Science//, vol. 338, 2012 Of the genes which are known to code for the generation of proteins, around 20% produce proteins which have functions that are as yet unknown. Oddly, this 20% figure seems to remain fairly constant for most of the organisms which have been studied. From the simplest yeast up to and including humans. See: [[https://royalsocietypublishing.org/doi/10.1098/rsob.180241|The Royal Society Open Biology, 2019]] === Computational folding predictions === Researchers using the [[https://deepmind.com/blog/article/alphafold-a-solution-to-a-50-year-old-grand-challenge-in-biology|DeepMind]] neural-network computing system have shown that it's possible to predict the folded protein shape with good accuracy - though it's currently not able to predict protein shapes from any arbitrary string of amino acids - the system relies instead on previous, experimentally-discovered, models of protein folds. See : [[https://www.newscientist.com/article/2330866-deepminds-protein-folding-ai-cracks-biologys-biggest-problem/|New Scientist]] , July 2022. There is also a lack of understanding about how artificial [[content:mathematics:neural_networks|neural networks]] actually operate, and therefore how the results were achieved. >We have shown that the deep distance prediction neural network achieves high accuracy, but we would like to understand how the network arrives at its distance predictions and—in particular—to understand how the inputs to the model affect the final prediction.\\ \\ Source : [[https://www.nature.com/articles/s41586-019-1923-7.epdf?author_access_token=Z_KaZKDqtKzbE7Wd5HtwI9RgN0jAjWel9jnR3ZoTv0MCcgAwHMgRx9mvLjNQdB2TlQQaa7l420UCtGo8vYQ39gg8lFWR9mAZtvsN_1PrccXfIbc6e-tGSgazNL_XdtQzn1PHfy21qdcxV7Pw-k3htw%3D%3D|Nature]] Jan. 2020 ---- Also see [[content:life_sciences:life_itself:protein_knotting|Protein Knotting]] and [[content:life_sciences:life_itself:disordered_proteins]] ~~stars>3/5~~