Abstract:
How many different proteins can be produced from a single spliced transcript? Genome annotation projects usually do not consider the coding potential of altORFs. However, many altProts have been shown to carry out essential functions in various organisms. In addition to the existence of protein-coding potential in all the three reading frames, spliced eukaryotic transcripts may undergo programmed single or multiple ribosomal frameshifting events. Depending on whether a protein is produced by one or several such events, this novel protein is called either a chimeric protein or a mosaic protein, respectively. Proteins produced via single ribosomal frameshifting events have been known in viruses for a long time, and more recently, they have also been found in higher eukaryotes. In contrast, mosaic proteins so far are elusive, with only one example found in viruses. Detection of altORFs can help identify these unusual proteins because altORFs may act as building blocks for chimeric proteins and mosaic proteins. This way of extracting and combining genetic information from different reading frames may significantly increase proteome diversity, thus promoting organisms' flexibility and adaptability to various environmental conditions. This project aims to identify altProts based on the conservation evidence or detection by mass spectrometry (MS) analysis and to find proteins produced via single and multiple ribosomal frameshifting events to demonstrate the existence of mosaic translation. Our study in Medicago truncatula, a well- established model legume, detected 715 translated altProts and 146 chimeric proteins. Two transcripts support the existence of mosaic proteins and mosaic translation, which has never been detected in non-viral organisms before. In addition, we have found evidence for many thousands of conserved altProts. This work pioneers a new field of proteomics and is of immense value for plant biologists and specialists interested in translation. It also paves a way towards the major shift in current understanding of proteome complexity and diversity.