|
||||
|
||||
Identification of Human Proteins Using Linguist’s ToolsJ. Chakrabartiaffiliation not provided to SSRN Sujoy Chattopadhyayaffiliation not provided to SSRN A. Somaffiliation not provided to SSRN Debaprasad BandyopadhyayIndian Statistical Institute March 4, 2001 Indian Journal of Biochemistry and Biophysics, Vol. 38, pp. 124-127, February and April, 2001 Abstract: The symbolic sequences of the exons that make human proteins are subjected to methods of statistical linguistics. The ideas developed for the natural languages by G.K. Zipf, when applied to these sequences, show a significant promise. In Particular, we argue, The Zipf's exponent differentiate, and hence, identifies disparate human sequences. The codons, 64 in number, are distributed over the coding part of DNA sequences. Metaphorically speaking, the sequences almost resembles linguistic structure. The distribution function is the plot of frequency versus rank of the codons. These distributions are characterized by parameters that are almost universal, i.e., gene independent. Authors present the theory to calculate the universal (gene independent) with the help of linguistic theory. The part that is gene specific, however has undetermined overlaps and fluctuations.
Number of Pages in PDF File: 5 Keywords: Codon, Proteins, Syntax Accepted Paper SeriesDate posted: March 6, 2012Suggested CitationContact Information
|
|
|||||||||||||||||||
© 2013 Social Science Electronic Publishing, Inc. All Rights Reserved.
FAQ
Terms of Use
Privacy Policy
Copyright
This page was processed by apollo1 in 0.703 seconds