In the Beginning Was Information

Home > Other > In the Beginning Was Information > Page 8
In the Beginning Was Information Page 8

by Werner Gitt


  – According to Theorem 6, a coding system is required for compiling information, and this system should be able to identify uniquely all the relevant amino acids by means of a standard set of symbols which must remain constant.

  – As required by Theorems 14, 17, and 19, for any piece of information, this information should involve precisely defined semantics, pragmatics, and apobetics.

  – There must be a physical carrier able to store all the required information in the smallest possible space, according to Theorem 24.

  The names of the 20 amino acids occurring in living beings and their internationally accepted three-letter abbreviations are listed in Figure 16 (e.g., Ala for alanine). It is noteworthy that exactly this code with four different letters is employed; these four letters are arranged in "words" of three letters each to uniquely identify an amino acid. Our next endeavor is to determine whether this system is optimal or not.

  Figure 16: The 20 amino acids which are present in living systems, given in alphabetic order, together with their international three-letter abbreviations. The code combinations (triplets) which give rise to the relevant acid are indicated in the right-hand column.

  The storage medium is the DNA molecule (deoxyribonucleic acid), which resembles a double helix as illustrated in Figure 17. A DNA fiber is only about two millionths of a millimeter thick, so that it is barely visible with an electron microscope. The chemical letters A, G, T, and C are located on this information tape, and the amount of information is so immense in the case of human DNA that it would stretch from the North Pole to the equator if it was typed on paper, using standard letter sizes. The DNA is structured in such a way that it can be replicated every time a cell divides in two. Each of the two daughter cells must have identically the same genetic information after the division and copying processes. This replication is so precise, that it can be compared to 280 clerks copying the entire Bible sequentially each one from the previous one, with, at most, one single letter being transposed erroneously in the entire copying process.

  Figure 17: The way in which genetic information is stored. At the left, the "chemical paper" is shown in the form of a long sugar-phosphate chain with the four chemical letters, A, T, C, and G. The actual structure and dimensions of a DNA molecule can be seen at the top.

  When a DNA string is replicated, the double strand is unwound, and at the same time a complementary strand is constructed on each separate one, so that, eventually, there are two new double strands identical to the original one. As can be seen in Figure 17, A is complementary to T, and C to G.

  One cell division lasts from 20 to 80 minutes, and during this time the entire molecular library, equivalent to one thousand books, is copied correctly.

  Figure 18: The chemical formula of insulin. The A chain consists of 21 amino acids and the B chain is comprised of 30 amino acids. Three of the 20 amino acids present in living organisms, are absent (Asp, Met, Try), two occur six times (Cys, Leu), one five times (Glu), three occur four times (Gly, Tyr, Val), etc. The two chains are linked by two disulphide bridges. Insulin is an essential hormone, its main function being to maintain the normal sugar content of the blood at 3.9 to 6.4 mmol/l (70–115 mg/dl).

  6.2 The Genetic Code

  We now discuss the question of devising a suitable coding system. For instance, how many different letters are required and how long should the words be for optimal performance? If a certain coding system has been adopted, it should be strictly adhered to (theorem 8, par 4.2), since it must be in tune with extremely complex translation and implementation processes. The table in Figure 19 comprises only the most interesting 25 fields, but it can be extended indefinitely downward and to the right. Each field represents a specific method of encoding, for example, if n = 3 and L = 4, we have a ternary code with 3 different letters. In that case, a word for identifying an amino acid would have a length of L = 4, meaning that quartets of 4 letters represent one word. If we now want to select the best code, the following requirements should be met:

  – The storage space in a cell must be a minimum, so that the code should economize on the required material. The more letters required for each amino acid, the more material is required, as well as more storage space.

  – The copying mechanism described above requires n to be an even number. The replication of each of the two strands of DNA into complementary strands thus needs an alphabet having an even number of letters. For the purpose of limiting copying errors during the very many replication events, some redundance must be provided for (see appendix A 1.4).

  – The longer the employed alphabet, the more complex the implementing mechanisms have to be. It would also require more material for storage, and the incidence of copying errors would increase.

  Figure 19: The theoretical possibility of constructing a code consisting of words of equal length. Every field (block) represents a definite coding system as indicated by the number of different letters n, and the word length L.

  In each field of Figure 19, the number of possible combinations for the different words appears in the top left corner. The 20 amino acids require at least 20 different possibilities and, according to Shannon’s theory, the required information content of each amino acid could be calculated as follows: For 20 amino acids, the average information content would be iA ≡ i W ≡ ld 20 = log 20/log 2 = 4.32 bits per amino acid (ld is the logarithm with base 2).

  If four letters (quartets) are represented in binary code (n = 2), then (4 letters per word)x(1 bit per letter) = 4 bits per word, which is less than the required 4.32 bits per word. This limit is indicated by the hatched boundary in Figure 19. The six fields adjacent to this line, numbered 1 to 6, are the best candidates. All other fields lying further to the right could also be considered, but they would require too much material for storage. So we only have to consider the six numbered cases.

  It is, in principle, possible to use quintets of binary codes, resulting in an average of 5 bits per word, but the replication process requires an even number of symbols. We can thus exclude ternary code (n = 3) and quinary code (n = 5). The next candidate is binary code (No. 2), but it needs too much storage material in relation to No. 4 (a quaternary code using triplets), five symbols versus three implies a surplus of 67%. At this stage, we have only two remaining candidates out of the large number of possibilities, namely No. 4 and No. 6. And our choice falls on No. 4, which is a combination of triplets from a quaternary code having four different letters. Although No. 4 has the disadvantage of requiring 50% more material than No. 6, it has advantages which more than compensate for this disadvantage, namely:

  – With six different symbols, the recognition and translation requirements become disproportionately much more complex than with four letters, and thus requires much more material for these purposes.

  – In the case of No. 4, the information content of a word is 6 bits per word, as against 5.17 bits per word for No. 6. The resulting redundancy is thus greater, and this ensures greater accuracy for the transfer of information.

  Conclusion: The coding system used for living beings is optimal from an engineering standpoint. This fact strengthens the argument that it was a case of purposeful design rather than fortuitous chance.

  6.3 The Origin of Biological Information

  We find a unique coding system and a definite syntax in every genome.[16] The coding system is composed of four chemical symbols for the letters of the defined alphabet, and the syntax entails triplets representing certain amino acids. The genetic syntax system also uses structural units like expressors, repressors, and operators, and thus extends far beyond these two aspects (4 symbols and triplet words). It is not yet fully understood. It is known that the information in a cell goes through a cyclic process (Figure 20), but the semantics of this process is not (yet) understood in the case of human beings. The locations of many functions of chromosomes or genes are known, but we do not yet understand the genetic language. Because semantics is involved, it means that pragmatics also have t
o be fulfilled. The semantics are invariant, as can be seen in the similarity (not identity!) of uni-ovular twins. If one carefully considers living organisms in their entirety as well as in selected detail, the purposefulness is unmistakable. The apobetics aspect is thus obvious for anybody to see; this includes the observation that information never originates by chance, but is always conceived purposefully.

  Figure 20: A simplified representation of the cyclic information-controlled process occurring in living cells. The translation is based on pragmatics, but it is involved in the cyclic process of semantic information, since the DNA synthesis can only take place under enzymatic catalysis. This sketch clearly illustrates that such a cyclic process must have been complete right from the start, and could not have originated in a continuous process. The structure of this example of a complex information transfer system also corresponds to Figure 24.

  The substitutionary function of information is also satisfied (see Definition D5 in chapter 5), since the triplets in the DNA molecule represent those amino acids that will be synthesized at a later stage for incorporation into proteins (the amino acids themselves are not present). We can now establish an important theorem:

  Theorem 25: Biological information is not an exceptional kind of information, but it differs from other systems in that it has a very high storage density and that it obviously employs extremely ingenious concepts.

  In accordance with the theorems formulated in chapters 3 to 5, in particular the impossibility theorems at the end of chapter 4, it is clear that the information present in living organisms requires an intelligent source. Man could not have been this source, so the only remaining possibility is that there must have been a Creator. We can now formulate the following theorems:

  Theorem 26: The information present in living beings must have had a mental source.

  A corollary of Theorem 26 is:

  Theorem 27: Any model for the origin of life (and of information) based solely on physical and/or chemical processes, is inherently false.

  In their school textbook, R. Junker and S. Scherer establish a basic type that must have been "ready-made" [J3]. This result, which requires the information content of living beings to be complete right from the beginning, is biologically sound. The derived theorems about the nature of information fit this model.

  6.4 Materialistic Representations and Models of the Origin of Biological Information

  The question "How did life originate?" which interests us all, is inseparably linked to the question "Where did the information come from?" Since the findings of James D. Watson (*1928) and Francis H.C. Crick (*1916), it was increasingly realized by contemporary researchers that the information residing in the cells is of crucial importance for the existence of life. Anybody who wants to make meaningful statements about the origin of life would be forced to explain how the information originated. All evolutionary views are fundamentally unable to answer this crucial question.

  The philosophy that life and its origin are purely material phenomena currently dominates the biological sciences. Following are the words of some authors who support this view.

  Jean-Baptiste de Lamarck (1744–1829), a French zoologist and philosopher, wrote, "Life is nothing but a physical phenomenon. All life features originate in mechanical, physical, and chemical processes which are based on the properties of organic matter itself" (Philosophie Zoologique, Paris, 1809, Vol. 1).

  The German microbiologist R.W. Kaplan holds a similar materialistic view [K1]: "Life is effected by the different parts of a system which work together in a certain way…. Life can be completely explained in terms of the properties of these parts and their inevitable interactions…. The origin of life can be explained in terms of hypotheses describing fully the sequence of events since the origin of protobionts, and the fact that all these events could be deduced from physical, chemical, and other laws which are valid for material systems."

  Manfred Eigen (*1927), a Nobel laureate of Göttingen, discusses questions about life from the molecular biology view, with as point of departure the unwarranted postulate that natural laws controlled the origin of life. In his work on the self-organization of matter [E1], he uses an impressive array of formulas, but does not rise above the level of statistical information. This voluminous work is thus useless and does not answer any questions about the origin of information and of life. He writes in [E2, p 55], "Information arises from non-information." This statement is nothing but a confession of materialism, and it fails the tests required by reality.

  Franz M. Wuketits defines the target readership of his book [W8] as follows: "…not only biologists and theoretical scientists, but in equal measure scientists and philosophers, and everybody who is interested in the adventures of contemporary science." He then presents a so-called "evolutionary theoretical science," claiming to initiate a new Copernican revolution. Up to the present time, great scientific results were obtained by means of observation, measuring, and weighing, as was done for example by Copernicus, Galilei, Newton, Einstein, Born, and Planck. In his system, Wuketits follows the backward route: His point of departure is to assume that evolution is true, so that all natural phenomena have to be interpreted through these spectacles. He writes in the introduction of his book [W8, p. 11–12]:

  The fundamental truth of biological evolution is accepted beforehand, yes, we assume in advance that the principle of evolution is universally valid, that it is just as valid in the pre-organic domain as in the organic, and that it can be extended to the spheres of psychology, sociology, and culture. If we accept that the evolutionary view also holds for the human mind and cognition, then evolutionary ideas can also be applied to the analysis of those phenomena which are usually regarded as belonging to theoretical science. As a result this view then becomes relatively more important in the evaluation of the progress of scientific research. We thus arrive at an evolutionary theory of science, a theory of human knowledge which relates to an evolutionary establishment of itself.

  If such statements were based on a sufficient body of facts, then one might perhaps agree with the conclusions, but the reverse process was followed: All phenomena of nature are placed under the all-encompassing evolutionary umbrella. Scientists who submit themselves to such a mental corset and support it uncritically, degrade themselves to mere vassals of a materialistic philosophy. Science should, however, only be subservient to the truth, and not to pre-programmed folly. Evolutionary theory bans any mention of a planning Spirit as a purposeful First Cause in natural systems, and endeavors to imprison all sciences in the straightjacket called the "self-organization of matter." Wuketits supports evolutionary theory with a near ideological fervor, and accuses everybody of fable mongering who claims to be scientific and speak of "planning spirits" or of a "designer" in nature. He wishes to ban thoughts of "finality" and of "final and purposeful causes" from science and from the domain of all serious schools of thought.

  An appreciable fraction of all scientists who concern themselves with cosmological questions and with questions of origins, support the evolutionary view, to such an extent that the well-known American bio-informaticist Hubert P. Jockey [J1] bemoans the fact that the literature in this area is blandly and totally supportive. He writes in the Journal of Theoretical Biology [vol. 91, 1981, p. 13]:

  Since science does not have the faintest idea how life on earth originated…. it would only be honest to confess this to other scientists, to grantors, and to the public at large. Prominent scientists speaking ex cathedra, should refrain from polarizing the minds of students and young productive scientists with statements that are based solely on beliefs.

  The doctrine of evolution is definitely not a viable scientific leitmotiv (guiding principle); even the well-known theoreticist Karl Popper [H1], once characterized it as a "metaphysical research program." This assertion is just as noteworthy as it is honest, because Popper himself supports evolution.

  We now discuss some theoretical models which suggest that information can origin
ate in matter.

  Cumulative selection (Latin cumulare = gather): Richard Dawkins, a British neo-Darwinist, revives the historical example of the typewriter-thrumming monkeys (see appendix A1.5) and replaces them with "computer monkeys." As shown in Figure 21, he begins with a random sequence of 28 letters [D2 p. 66–67] and seeks to demonstrate how a predetermined phrase selected from Shakespeare, "Methinks it is like a weasel," can be derived through mutation and selection. The random initial sequence with the required number of letters is copied repeatedly, allowing for random copying errors (representing mutations). The computer program checks all the "daughter" sentences and selects that one which most resembles the target sentence. The process is subsequently repeated for the resulting "winning sentences," until eventually, after 43 "generations," the goal is reached.

  There is a spate of new Jesus books which constantly present strange new and false ideas contrary to the New Testament. Prof. Klaus Berger of the Heidelberg School of Theology remarked (1994): "Please buy and read such a book, then you will realize what degree of gullibility is ascribed to you." With equal zeal, Dawkins publishes his easily detectable fallacies about the way information originates. It is therefore necessary to discuss his representation fully so that you, the reader, can see what feeble-mindedness is ascribed to you.

 

‹ Prev