Information Contents Of Number Theoretic Functions

by

Huen Y.K.

CAHRC, P.O.Box 1003, Singapore 911101
http://web.singnet.com.sg/~huens/
email: huens@mbox3.singnet.com.sg

(A short communication - 1st release: 29/5/97. Update: 29/5,30/5)

Abstract

Measurements are important activities in sciences and commerce and these can only be performed accurately with reference to high precision standards. Therefore when the information content of an algebraic equation or its program is to be measured, one needs a reference standard. In this paper, the global natural number sequence Nat(z) is defined as the information reference standard by which other number theoretic functions are measured. Since most number theoretic sequences have sequence algebraic formulations but this cannot be said of conventional number theoretic formulations, measurements will be done in the sequence algebraic domain. Conversions from the algebraic to the sequence algebraic domain is possible by mapping using the canonical generating function called CGF(z) [6] although this is only used for class 3 sequences. The unit of information content is defined in "godch" where one godch equals 7 ascii characters. Measurements of most well known number theoretic sequences fall within expectations but real insights are obtained from the few exceptions which defy intuitive predictions.

1. Introduction

The work in this paper is not connected with information theory as practised by crypto-analysts [1] or random theorem proofing as described by Chaitin [3]. It is more closely connected with the practice of measurements. Every proven number theoretic function carry a finite amount of information. With the discovery of sequence algebra since 1994 [7], sequence formulations for predicting all primitive number sequences and most derived secondary sequences have been developed [5 to 9]. The aims of of this paper are twofold, viz., to determine implementation procedure for setting up information content calibration and to seek explanations to some unusual results obtained from the calibrations.

a) Implementations

(i) To define Nat(z) = z/(z-1) as the reference standard for information contents of number theoretic functions.

(ii) To calibrate the information contents of well known number theoretic sequences and to interprete the results obtained.

b) Enquiries

(i) To discover whether the null sequence Zero(z) and the random sequence Random(z) have any information content and whether these are measurable.

(ii) To find explanations on why Odd(z) has lower information content than Even(z).

(iii) to find explanations on why Prime(z) has higher information content than Comp(z).

(iv) Is there a plausible relation between equation lengths and information contents of number theoretic functions?

(v) To discover the principle governing the increase of information contents of number theoretic sequences based on results reported in table 1.

2. Classificattions of Sequence Algebraic Expressions

Much real progress has been made in number theory since 1637 but there were also glaring lack of progress in some quarters. The proof of Fermat's Last Theorem by Wiles comes from a discipline outside conventional number theory. Goldbach's conjecture still remains unsolved. Most number theoretic predictions are not written as explicit formulations. For example there are many primality tests but there is no explicit formula for the prime sequence. Neither is there one for the composite sequence. The nearest one could host a number theoretic formula maybe as an iterative algorithm but strictly this is not an explicit formula. Examples of some iterative sequences written in Maple V R 3 syntax are shown below. There is no way these sequences could be translated into closed explicit forms.

Natural numbers: from i = -1 to 10 do n = i + 1; od
...............................................Result:{0,1,2,3,4,5,6,7,8,9,10,11}
Even numbers: from i = 0 to 10 by 2 do n = 2*i; od
...............................................Result:{0,2,4,6,8,10,12,14,16,18,20}
Odd numbers: from i = 0 to 10 by 2 do n = 2*i + 1; od
...............................................Result:{1,3,5,7,9,11,13,15,17,19,21}

In sequence algebra, number sequences are written down holistically and are always explicit. There are three classes of sequence algebraic formulations given in order of complexity as shown below. Sequences which do not have sequence algebraic formulations are temporarily relegated to class 4. The method of counting ascii characters are demonstrated in table 1.

Class 1: Primitive number sequences

In this class, the power index is a fixed value and only arithmetic operators are used.

Nat(z) = z / ( z - 1 )
.............1 2 3 4 5 6 7 = 7 ascii characters.

Even(z) = z ^ 2 / ( z ^ 2 - 1 )
................1 2 3 4 5 6 7 8 9 10 11 = 11 ascii characters.

Odd(z) = z * Even(z) = z / ( z ^ 2 - 1 )
................1 2 (11) 1 2 3 4 5 6 7 8 9 = 9 ascii characters.

There are two formulations for Odd(z) but the second one is shorter and is adopted. Note that by convention we do not include Even( z) as an intrinsic arithmetic function which would have reduced the counts for Even(z) to 6 as shown below:

0 * Even ( z )
1 2 3 4 5 6 = 6 ascii characters

Class 2: Composite number sequences (not to be confused with Comp(z)).

In this class, the Normc ( ) operator has to be introduced which reduces numerator coefficients to unities. This is necessary because sequence products causes the numerator coefficients to increase above unities which will hamper further sequence manipulations.[8,9]

Comp(z) = 1 / ( z ^ i * ( z ^ i - 1 ) )
................1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 = 15 ascii characters.

Prime(z) = Nat ( z ) - Normc ( Comp ( z ) ) - 1 - 1 / z
....................(7) 1 2 (15) 3 4 5 6 7 8 9 = 31 ascii characters.

Oddcomp(z) = Normc ( Comp(z) ) - Even(z)
...........................1 2 (15) 3 4 (11) = 30 ascii characters.

Class 3: Canonical Generating Functions or CGF(z)

This class covers nonlinear sequence algebraic formulations. Both Normc( ) and logical operators are needed in the formulations. A separate paper on CFG(z) [6] can be found in the author's URL site: http://web.singnet.com.sg/~huens/ .

f(i)s are algebraic expressions which will be separately counted.

CGF(z) = 1 / ( z ^ f(i) - 1 ) - 1 / ( z ^ f(i) * ( z ^ f(i) - 1 ) )
................1 2 3 4 5 (k) 6 7 8 9 10 11 12 13 14 (k) 15 16 17 18 (k) 19 20 21 22
.........................................................................................................................= 22+3*k

This is the generalised sequence formulation which is required where intervals in CGF(z) become nonuniform due to the nonlinear f(i) expression. Examples are found in Mersenne and Fermat number sequences. In this class, mixed mode arithmetics is required. Although there is no proof, no formulations using purely arithmetic operations have been found. It is most probably impossible to write down explicit formulations using arithmetic operators alone.

Mersenne Numbers And Mersenne Primes:
f(i) := ( 2 ^ i - 1 )
..........1 2 3 4 5 6 7

Mersennenumb(z):= 1 / ( z ^ f(i) - 1) - 1 / ( z ^ f(i) * ( z ^ f(i) - 1 ) ) ; = 23 +3*7 = 44 ascii characters.

Mersenneprime:=
(AND Normc ( Mersennenumb(z) ) Prime(z) )
1 2 3 4 (44) 5 (31) 6 = 6 + 44 + 31 = 81 ascii characters.

Fermat Numbers And Fermat Primes:
f(i) := ( 2 ^ ( 2 ^ i ) - 1 )
..........1 2 3 4 5 6 7 8 9 10 11 = 11 ascii characters.

Fermatnumb(z):= 1 /(z^f(i) - 1) - 1/(z^f(i)*(z^f(i)-1)); = 23 + 3*11 = 56 ascii characters.

Fermatprime(z):= (AND Normc(Fermatnumb(z)) Prime(z)) = 6 + 56 + 31 = 93 ascii characters.

Perfect Number

f(i)= i / ( z ^ i * ( z ^ i - 1 ) - i / z ^ i
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 == 23 + 3*20 = 83 ascii characters.

Twin Primes

(OR (AND Prime(z)[ z ^ 2 * Prime(z) ] ) ( AND Prime(z) [ z ^ ( - 2 ) * Prime(z) ] )
1 2 3 4 (31) 5 6 7 8 9 (31) 10 11 12 13 (31) 14 15 16 17 18 19 20 21 (31) 32 33 34
..............................................................................= 23 + 4* 31 = 147 ascii characters.

3. Table Of Ascii Counts

Using the method of counting ascii characters, we summarise the length counts in both ascii units and Godch units. Since Nat(z) has 7 ascii characters, it is defined to have a unit of one Godch, i.e. one Godch equals 7 ascii characters. Godch is the acronym derived from Godel and Chaitin.[2,4]

Table 1 - Calibrations Of Information Contents Of Well Known Number Sequences


...sequences........................ascii units...................godch units

Class 1:

Nat(z) as the standard..........7 ascii characters...........1

Odd(z)..........z/(z^2-1)........9...................................1.285714286

Even(z).........z^2/(z^2-1)....11..................................1.571428571

Zero(z) ........0*Nat(z).........9...................................1.285714286


Classs 2:

Comp(z) 1/(z^i*(z^i-1)).................15.........................2.142857143

Prime(z)=Nat(z)-Comp(z)-1-1/z....36.........................5.142857143

Oddcomp(z)=Even(z)-Comp(z).....31.........................4.428571429


Class 3:

Mersennenumber(z).....................44 .........................6.285714286

1/(z^(2^i-1)-1)-1/(z^(2^i)*(z^(2^i-1)-1));

Fermatnumber(z)..........................56...........................8.
1/(z^(2^(2^i))-1)-1/(z^(2^(2^i))*(z^(2^(2^i))-1));

Perfectnumber(z)..........................83..........................11.85714286
f(i)= i/(z^i*(z^i-1)-i/z^i

Mersenneprime(z)........................81..........................11.57142857

Fermatprime(z).............................93..........................13.28571429

Twinprime(z)

(OR (AND Prime(z) [z^2*Prime(z)]) (AND Prime(z) [z^(-2)*Prime(z)])
.....................................................147........................21.


Class 4 (unclassified)

Random(z) no formulation..............?................................?


4. Interpretations Of Results

Class 1:

Why should Even(z) have more information content than Odd(z)?

Explanations: These are not proofs, only conjectures. The world is unsymmetrical. It reaches the present state via symmetry breaking. We have left and right handedness. The two lobes of our brain are different. In the natural number system, there is no symmetry between Even(z) and Odd(z) since the former has a zero but the latter has not. This affects the sequence properties profoundly. Just see what happens when both Even(z) and Odd(z) are squared as shown in equations (2) and (3):

...................................................1.......1......1.........1
...........................Even(z) := 1 + ---- + ---- + ---- + O(----)
......................................................2.....4.......6.........8
...................................................z......z.......z.........z
...................................................1.......1......1.........1
........................... Odd(z) :=1/z+ ---- + ---- + ---- + O(----)
......................................................3.....5.......7.........9
...................................................z......z.......z.........z
expand(Even(z)^2);

.........1...........1...........1............1..........1.........1...........1
1 +2 ---- +3 ---- +4 ---- +3 ---- +2 ---- + ---- + O(----).......(2).
............2.........4..........6.............8...........10...........12
..........z.........z...........z.............z............z............z
expand(Odd(z)^2);

..........1............1............1..........1..........1.........1..........1
..... 1---- +2 ---- +3 ---- +4 ---- +3 ---- +2 ---- +----- O(----)........(3).
.............2.........4..........6...........8..........10..........12.........14
...........z.........z..........z...........z...........z...........z............z
. There is no algebraic proof on why Odd(z) should have less information content than Even(z) but the following observations are relevant. Comparing equation (2) with equation (3), one notes that Even(z)^2 retains the first 1/z^o term whereas in Odd(z)^2 this term is suppressed and the sequence is shifted by one unit interval to the right. As the squaring continues to infinity, the order of Even(z)^n will remain the same but the order of Odd(z)^n approaches asymptotically to Zero(z). Even(z) is said to be an order invariant sequence but Odd(z) is not. Both Even(z) and Odd(z) have more information content than Nat(z). Even Zero(z) has more information than Nat(z). It appears that as the number of finite terms decreases, the information content increases. Why should Zero(z) have any information? In sequence algebra 0 is complement to 1 and it is not nothingness. The absence of integers in a number sequence is itself an information just like 1 and 0 in binary strings contribute to information.

Class 2:

It is probably reasonable to expect that Prime(z) and Oddcomp(z) to have more information than Comp(z). There are much fewer primes than composites. From previous observations in class 1, as the number of terms decreases, the information content increases. The difference between Prime(z) and Oddcomp(z) is probably insignificant. Most probably the number of primes is about the same as the number of Oddcomp(z).

Class 3:

Information contents of class 3 sequences increase rapidly as these are usually mixed mode expressions involving functions of more than one primitive sequences. The sequence with the most information in the list is Twinprime(z). The number of terms in these sequences are even fewer than those found in class 2. This again supports the observations that the fewer the number of terms, the more the information contents. Note that this relation is nonlinear. Otherwise one would expect Zero(z) to have infinite information content but it does not. Using Nat(z) and Zero(z) as a guide, the number sequence with the greatest information content must lie somewhere between the two extremes. Right now we do not know what it is but it is most probably Random(z) which is unclassifiable since there is no sequence formulation for it.

Class 4 (unclassified)

At this moment there is only one sequence in this category and that is Random(z). In information theory, random(z) is considered to have maximum entropy[1]. If randome(z) has n bits, then the formula description of it cannot be less than n-bits [2,3]. We do not have a formulation for Random(z). This must be a nonrepeating sequence and all random terms must have equal probability. Therefore you cannot derive it from Prime(z) or Comp(z) even though past workers assumed these to be random. Since these two sequences can be predicted deterministically and globally, these cannot be random. Another way of describing a Random(z) is to used a CGF with f(i) = Random generator. At least this method allows a way of measuring the information content of this sequence but there are many variations of pseudo-random generators which will affect the ascii counts. The measurements become very subjective. Right now, we assume that Random(z) to have the highest information content but we do not know its exact value. To evaluate Random(z) via CGF(z) we need an explicit expression for a random number generator. This is not available.

So far the results obtained seem to fall within expectations with few surprises. If an explicit compact sequence formulation can be found for Random(z), it would imply that the information content of this sequence is not infinite. Measurements must be done with explicit formulations. The method based on Nat(z) as a standard seems to give reasonable predictions.

Information contents do increase with decreasing number of finite terms appearing in the sequence. Information contents increase rapidly with nonlinear intervals contributed by f(i). If this is not the case then we would expect Nat(z) to have the greatest amount of information but it has the lowest information content. Prime(z) to have more information than Comp(z) seems reasonable since it is quite easy to derive the Comp(z) formula but not the Prime(z) formula.

5. Conclusions

The adoption of Nat(z) as a reference standard gives the opportunities to calibrate the information contents of some well known number theoretic sequences. No rigorous proof are found to prove why information contents are ordered in the way shown in table 1 but it seems that this is directly linked to the decreasing number of terms in the sequences. In other words, the principle that information contents increase with decreasing number of finite terms in the sequence is probably true although this is nonlinear. This information might be contradictory to what is known in information theory but here we are talking about holistic number sequences. In information transmission one never uses a message with infinite length as a reference standard but in holistic number theory we do. Until an explicit formulation is found for a random number generator, the information content of a random sequence cannot be computed. This finding is not reported in past literatures.

4. References

1. Schneier Bruce: Applied Cryptography, Protocols, Algorithms, and Source Code in C, chapter 11, pp 233 to 239, Wiley (2nd ed.), 1996

2. Chaitin G.J.: Godel's Theorem and Information, Interantional Journal of Theoretical Physics 22 (1982), pp 941-954.

3. Chaitin G.J. : How to Run Algorithmic Information theory on a Computer, Complexity 2:1 (1996), pp.?-?.

4. Silver L.C. : From Symbolic Logic ... To Mathematical Logic, WCB Publishers, 1994, Chapter 11 Godel's Theorems, pp321 to 360.

5. Huen Y.K.: A Simple Introduction To Sequence Algebra, a free downloadable paper available from the author's URL site: http://web.singnet.com.sg/~huens/,

6. Huen Y.K.: The Canonical Generting Function or CGF(z) - a Swiss-knife function. URL site: http://web.singnet.com.sg/~huens/,

7. Huen Y.K.: A Matrix Map for Prime and Non-prime Numbers, INT. J. Math. Educ. Sci. Technol., 1994, VOL. 25, NO.6, pp 913-920.

8. Huen Y.K.: Some Interesing Properties Of The Natural Number System, Int. J. Math. Educ. Sci. Technol., 1996, VOL.27, NO. 5, 685-691.

9. Huen Y.K.: Visual algebra and its applications, INT. J. Math. Educ. Sci. Technol.,1996, VOL.??, NO.?, ???-??? (In the press as proof paper mes 100421).

=======================END OF PAPER=====================