Do we know all the proteins (and their primary structure!)?
Moderators: honeev, Leonid, amiradm, BioTeam
Do we know all the proteins (and their primary structure!)?
Again, one of those 'basic' questions that I realise I simply don't know the answer to!
Do we know all the proteins in living organisms? Or even in humans? Or are there still unknown proteins whose existence we haven't realised yet?
If we do know all the proteins in the (human) proteosome, do we know their primary structure?
I am sort of assuming we do, because we now have the human genome, and we now know (don't we?) just how many protein-coding genes there are (don't we??)
Perhaps I should say I think we know all the polypeptides rather than the proteins (allowing for unknown secondary/tertiary/quaterary structures of the proteins to which those pp chains contribute)?
Or am I being naively optimistic?!
Many thanks for any light shed - Regards, Julie.
Do we know all the proteins in living organisms? Or even in humans? Or are there still unknown proteins whose existence we haven't realised yet?
If we do know all the proteins in the (human) proteosome, do we know their primary structure?
I am sort of assuming we do, because we now have the human genome, and we now know (don't we?) just how many protein-coding genes there are (don't we??)
Perhaps I should say I think we know all the polypeptides rather than the proteins (allowing for unknown secondary/tertiary/quaterary structures of the proteins to which those pp chains contribute)?
Or am I being naively optimistic?!
Many thanks for any light shed - Regards, Julie.
-
- Death Adder
- Posts: 65
- Joined: Mon Apr 18, 2011 11:35 am
- Location: University of Cambridge, UK
I think it would be safe to say that we know the structure of every human protein, but as far as conformation and function of each one, as well as variability, we cannot be sure. To discover this sort of thing you don't just need the peptide sequence, but to observe it in action. For many things this requires a high concentration of that particular protein (so ion channels in the electric organ of the torpedo fish for example) and some are just too rare to get this.
Some properties can only be observed through crystellising proteins, and this is a very difficult process with transmembrane proteins. This is why making drugs for certain receptors isn't a simple task, because we can't know exactly what size and shape these drugs need to be.
I suspect for non-mapped genomes there are going to be a lot of proteins we still don't know. There's just so much variation available!
Some properties can only be observed through crystellising proteins, and this is a very difficult process with transmembrane proteins. This is why making drugs for certain receptors isn't a simple task, because we can't know exactly what size and shape these drugs need to be.
I suspect for non-mapped genomes there are going to be a lot of proteins we still don't know. There's just so much variation available!
by knowing protein, we must know it's primary structure (or we can also know, there is some enzyme catalyzing some activity, of which we do not know the gene, truth. But nowadays we have rather opposite problem).
We have predicted genes and thus predicted proteins in human, but that doesn't guarantee, that we don't have some false positive or false negative
And the difference between polypeptide and protein is only in size, not in known level of structure
We have predicted genes and thus predicted proteins in human, but that doesn't guarantee, that we don't have some false positive or false negative

And the difference between polypeptide and protein is only in size, not in known level of structure

http://www.biolib.cz/en/main/
Cis or trans? That's what matters.
Cis or trans? That's what matters.
-
- Garter
- Posts: 3
- Joined: Thu Apr 28, 2011 9:30 am
We do not know every protein sequence. The genes do not code to the post traductionnal modifications. As exemple, on the collagen sequence, we have a repeat of proline but in the real molecule, some aa are hydroxyproline.
In addition, some protein need to loose a peptide to be active or to modify their activity. So these protein have not the same sequence and primary structure.
In addition, some protein need to loose a peptide to be active or to modify their activity. So these protein have not the same sequence and primary structure.
Re: Do we know all the proteins (and their primary structure
Firstly, I think it is impossible that we have known all the proteins structure.
Polypeptides are chains of amino acids. Proteins are made up of one or more polypeptide molecules.The amino acids are linked covalently by peptide bonds. The graphic on the right shows how three amino acids are linked by peptide bonds into a tripeptide.
One end of every polypeptide, called the amino terminal or N-terminal, has a free amino group. The other end, with its free carboxyl group, is called the carboxyl terminal or C-terminal.
The sequence of amino acids in a polypeptide is dictated by the codons in the messenger RNA (mRNA) molecules from which the polypeptide was translated. The sequence of codons in the mRNA was, in turn, dictated by the sequence of codons in the DNA from which the mRNA was transcribed.
The schematic below shows the N-terminal at the upper left and the C-terminal at the lower right.
Here is a exampal about natural polypeptides and synthetic polypeptides, which also
proof that we do not know all the polypeptides. I found some more info about the custom protein synthesis that might be helpful.

Polypeptides are chains of amino acids. Proteins are made up of one or more polypeptide molecules.The amino acids are linked covalently by peptide bonds. The graphic on the right shows how three amino acids are linked by peptide bonds into a tripeptide.
One end of every polypeptide, called the amino terminal or N-terminal, has a free amino group. The other end, with its free carboxyl group, is called the carboxyl terminal or C-terminal.
The sequence of amino acids in a polypeptide is dictated by the codons in the messenger RNA (mRNA) molecules from which the polypeptide was translated. The sequence of codons in the mRNA was, in turn, dictated by the sequence of codons in the DNA from which the mRNA was transcribed.
The schematic below shows the N-terminal at the upper left and the C-terminal at the lower right.
Here is a exampal about natural polypeptides and synthetic polypeptides, which also
proof that we do not know all the polypeptides. I found some more info about the custom protein synthesis that might be helpful.

http://www.biologicscorp.com/
Re:
stephroche wrote:We do not know every protein sequence. The genes do not code to the post traductionnal modifications. As exemple, on the collagen sequence, we have a repeat of proline but in the real molecule, some aa are hydroxyproline.
In addition, some protein need to loose a peptide to be active or to modify their activity. So these protein have not the same sequence and primary structure.
Agree, maybe we could know all the proteins in future, but not now. Even we know the genomics, we do not know which of them are encode proteins totally, and proteins need modification after translation. So there are no directly relationship between "we know the genomics" and "we know all the proteins".
Andrea Fortina
Genius only means hard-working all one's life.
CreativeBiomart, Recombinant protein expert
Genius only means hard-working all one's life.
CreativeBiomart, Recombinant protein expert
Who is online
Users browsing this forum: No registered users and 12 guests