As pi never repeats itself, that also means that every piece of conceivable info...

mkl · 2024-07-15T10:36:07 1721039767

> As pi never repeats itself, that also means that every piece of conceivable information (music, movies, texts) is in there, encoded.

This is true for normal numbers [1], but is definitely not true for all non-repeating (irrational) numbers. Pi has not been proven to be normal. There are many non-repeating numbers that are not normal, for example 0.101001000100001...

Storing the index into pi for a file would usually take something like as much space as just storing the file, and storing or calculating enough digits to use that index would be impossible with the technology of today (or even probably the next century).

[1] https://en.wikipedia.org/wiki/Normal_number

tombert · 2024-07-15T15:07:08 1721056028

It's conjectured to be normal isn't it? I know it hasn't been proven yet, and I cannot seem to find where I read this, but I thought there was at least statistical evidence indicating that it's probably normal.

adgjlsfhk1 · 2024-07-15T17:33:22 1721064802

100% of real numbers are normal, so that's pretty strong statistical evidence

hn_throwaway_99 · 2024-07-15T18:12:30 1721067150

What? No they're not, e.g. no rational numbers are normal, and they are real.

GraphEnthusiast · 2024-07-15T18:47:08 1721069228

The rational numbers make up "zero percent" of the real numbers. It's a little hard to properly explain without assuming a degree in math, since the proper way to treat this requires measure theoretic probability (formally, the rationals have measure zero in the reals for the "standard" measure).

The short version is that the size of the reals is a "bigger infinity" than the size of the rationals, so they effectively have 'zero weight'.

Reference (very technical): https://math.stackexchange.com/questions/508217/showing-that...

hn_throwaway_99 · 2024-07-15T20:57:30 1721077050

But then the original implication, "100% of real numbers are normal, so that's pretty strong statistical evidence", still doesn't make any sense, as it's essentially using "100%" to imply "strong statistical evidence" that the rationals don't exist, which obviously doesn't follow.

mhink · 2024-07-15T22:36:16 1721082976

I got the impression that the comment was a bit tongue-in-cheek.

The joke lies in the fact that saying "100% of real numbers" isn't *technically* the same thing as saying "all real numbers", because there's not really a good way to define a meaning for "100%" that lets you exclude rational numbers (or any other countable subset of the reals) and get something other than 100%.

staunton · 2024-07-15T22:29:42 1721082582

> still doesn't make any sense

Right. I'm pretty sure actually that it was a joke...

adgjlsfhk1 · 2024-07-16T01:44:19 1721094259

it was about half a joke. statistical evidence doesn't really exist for the type of problem since polynomialy computable numbers are countably infinite so you can't define a uniform distribution over then

NooneAtAll3 · 2024-07-15T10:29:09 1721039349

> As pi never repeats itself, that also means that every piece of conceivable information (music, movies, texts) is in there, encoded.

may I interest you in the difference between *irrational* numbers and *normal* numbers?

look at https://en.wikipedia.org/wiki/Liouville_number - no repeats, but minuscule "contained information"

constantcrying · 2024-07-15T15:24:59 1721057099

>As pi never repeats itself, that also means that every piece of conceivable information (music, movies, texts) is in there, encoded.

It is somewhat shocking that again and again this logical fallacy comes up. Why do people think that this is true? It doesn't even sound true.

mywittyname · 2024-07-15T18:19:58 1721067598

The thinking is inspired by the Infinite Monkeys Theorem. Which does have an easy-to-understand mathematical proof (and the criticisms of said proof are more difficult to grasp).

hkhanna · 2024-07-15T15:33:51 1721057631

Isn't it a property of infinity? If pi goes on infinitely without repeating itself, every possible combination of numbers appears somewhere in pi.

It's sort of like the idea that if the universe is infinitely big and mass and energy are randomly distributed throughout the universe, then an exact copy of you on an exact copy of Earth is out there somewhere.

This property of infinity has always fascinated me, so I'm very curious for where the logical fallacy might be.

n2d4 · 2024-07-15T15:40:39 1721058039

Not necessarily. The number 1.01001000100001000001... never repeats itself, yet most other numbers can never be found in it.

A number that contains all other numbers infinitely many times (uniformly) would be called normal, but no one has managed to prove this for pi yet. In fact, no one even managed to prove that pi doesn't contain only 0s and 1s like the above after the X-th digit.

andrewla · 2024-07-15T21:17:26 1721078246

More trivially, there are an infinite number of even numbers, and they do not repeat, yet they do not contain a single odd number.

constantcrying · 2024-07-15T15:42:39 1721058159

>Isn't it a property of infinity? If pi goes on infinitely without repeating itself, every possible combination of numbers appears somewhere in pi.

No. Example: 0.1011011101111011111... does never repeat, yet there is no 2 in there, neither is there 00 in there.

onion2k · 2024-07-15T16:02:23 1721059343

The fact you can't encode arbitrary data in a structured-but-irrational number doesn't mean you can't encode data in a 'random' irrational number.

The question is really 'Does every series of numbers of arbitrary finite length appear in pi?' I can't answer that because I'm not a mathematician, but I also can't dismiss it, because I'm not a mathematician. It sounds like a fair question to me.

constantcrying · 2024-07-15T16:14:42 1721060082

>I can't answer that because I'm not a mathematician

So what? Mathematicians can't answer it either. It is an open question and because it is an open question claiming it is or isn't true makes no sense.

>The fact you can't encode arbitrary data in a structured-but-irrational number doesn't mean you can't encode data in a 'random' irrational number.

You can not encode data in a random number. If it is random you can not encode data in it, because it is random. I am not sure what you are saying here.

I demonstrated that numbers where the digits go on forever and never repeat exist, which don't contain every single possible substring of digits. Therefore we know that pi can either be such or a number or it is not, the answer to that is not known. Definitely it is not a property of pi being infinitely long and never repeating.

onion2k · 2024-07-15T20:10:41 1721074241

You can not encode data in a random number

That's why I put random in quotes. Pi is not a random number. You can encode data in it eg find a place that matches your data and give people the offset. That's not very helpful for most things though.

fragmede · 2024-07-15T16:48:38 1721062118

just index on the number of ones. Ex 0.10110 there are two ones in a row, so reference those two ones to be the number two. For 00, flip it and refer to the same pair of ones.

constantcrying · 2024-07-15T17:00:48 1721062848

That is totally missing the point. Of course for every number there is an encoding that contains all pieces of information.

That obviously applies to 0.00... = 0 as well, it contains 0, then 00, then 000 and so on. So every number and therefore every piece of information is contained in 0 as well, given the right encoding. Obviously if you can choose the encoding after choosing the number all number "contain" all information. That is very uninteresting though and totally misses the point.

dist-epoch · 2024-07-15T16:52:54 1721062374

Most physicists don't believe that infinity can actually exist in the universe.

Put another way, the program which searches those works of art in the digits of pi will never finish (for a sufficiently complex work of art). And if it never finishes, does it actually exist?

constantcrying · 2024-07-15T16:56:22 1721062582

>Most physicists don't believe that infinity can actually exist in the universe.

Citation needed.

Believing in real numbers requires you to believe in far more than infinity. How many physicists reject real numbers?

n_plus_1_acc · 2024-07-15T20:55:37 1721076937

Yeah, last time I checked physicists use many integrals, derivatives and nablas.

staunton · 2024-07-15T22:36:09 1721082969

That's a completely different issue. Using math to solve physics problems deals with physical models. Models are imperfect and what kinds of math they use is completely separate from asking "does infinity exist in our actual universe".

To answer that question, you would have to dismiss with experimental evidence all models people can come up with that try to explain the universe without "infinities". It's neither completely clear what that would mean, nor whether it's even in principle possible to determine experimentally (it's also most likely completely irrelevant to any practical purpose).

bubblyworld · 2024-07-16T06:00:20 1721109620

It's not that shocking to me - you should try tutoring a class of mathematics undergrads! They make this class of error all the time. It's a "this sounds like it's obviously true, so the obvious reason must be right" kind of thing. Rigorous logic takes a lot of time to click for people.

RamblingCTO · 2024-07-16T12:25:27 1721132727

I'll answer here instead of all the subcomments:

feel free to prove me wrong. I never said it's efficient, the point is just that the information is out there. If pi has the following subnumbers 00, 01, 10, 11 in there, we can construct every perceivable data we can encode as binary. Even with 0 and 1. So we can construct a file by pointers to these four numbers. The bigger substrings we can match, the bigger the compression ratio. The set of pointers might even be way bigger than the file itself. It's nowhere near efficient or clever, but just entertaining

I don't think you can argue against IP because the way you arrange the pointers is IP itself, but still a funny thought experiment anyway

I'm not saying, that every piece of information is in there end to end, but that there are parts in there which can be used to construct it. I think I should've made the "encoded" part a bit more transparent haha. But I love the discussion that I kicked off!

IsTom · 2024-07-15T09:52:52 1721037172

There are many ways in which a number might not never repeat itself, but not contain all sequences (e.g. never use a specific digit). What you want is normal numbers and pi is not proven to be one (though probably it is).

its_ethan · 2024-07-15T15:28:12 1721057292

https://libraryofbabel.info/

you might find this to be pretty cool. It's similar to what you're describing. Whoever made it has an algorithm where you can look up "real" strings of text and it'll show you where in the library it exists. you can also just browse at random, but that doesn't really show you anything interesting (as you would expect given it's all random).

tetris11 · 2024-07-15T15:55:29 1721058929

the hashing algorithm should encode some locality, but disappointingly doesn't...

...and can't because there is no original corpus that the locality hashing algorithm can use as a basis

A_D_E_P_T · 2024-07-15T10:23:49 1721039029

> every piece of conceivable information (music, movies, texts) is in there, encoded

Borges wrote a famous short story, “The Library of Babel,” about a library where:

“... each book contains four hundred ten pages; each page, forty lines; each line, approximately eighty black letters. There are also letters on the front cover of each book; these letters neither indicate nor prefigure what the pages inside will say.

“There are twenty-five orthographic symbols. That discovery enabled mankind, three hundred years ago, to formulate a general theory of the Library and thereby satisfactorily resolve the riddle that no conjecture had been able to divine—the formless and chaotic nature of virtually all books. . .

“Some five hundred years ago, the chief of one of the upper hexagons came across a book as jumbled as all the others, but containing almost two pages of homogeneous lines. He showed his find to a traveling decipherer, who told him the lines were written in Portuguese; others said it was Yiddish. Within the century experts had determined what the language actually was: a Samoyed-Lithuanian dialect of Guaraní, with inflections from classical Arabic. The content was also determined: the rudiments of combinatory analysis, illustrated with examples of endlessly repeating variations. These examples allowed a librarian of genius to discover the fundamental law of the Library. This philosopher observed that all books, however different from one another they might be, consist of identical elements: the space, the period, the comma, and the twenty-two letters of the alphabet. He also posited a fact which all travelers have since confirmed: In all the Library, there are no two identical books. From those incontrovertible premises, the librarian deduced that the Library is “total”—perfect, complete, and whole—and that its bookshelves contain all possible combinations of the twenty-two orthographic symbols (a number which, though unimaginably vast, is not infinite)—that is, all that is able to be expressed, in every language.”

I've done the (simple) math on this -- in fact I'm writing a short book on the philosophy of mathematics where it's of passing importance -- and the library contains some 26^1312000 books, which makes 202T look like a very small number.

So though everything you describe is encoded in Pi (assuming Pi is infinite and normal) we're a long, long way away from having useful things encoded therein...

Also, an infinite and normal Pi absolutely repeats itself, and in fact repeats itself infinitely many times.

WillAdams · 2024-07-15T12:28:30 1721046510

And for an amusing example of this see:

https://www.piday.org/find-birthday-in-pi/

NeoTar · 2024-07-15T15:41:07 1721058067

I'm not sure why, but that website is beautifully broken for me

- it asked for my birthday (e.g. 25th Feb 1986) using a day / month / year form

- then converted to the m/dd/yy form (i.e. a string 22586),

- found that string in Pi,

- forgot my birthday and messed up displaying that somehow when converting back - saying that it found my birthday of 22 / 5 / 86

no_news_is · 2024-07-15T19:43:34 1721072614

You might be interested in the online version:

https://libraryofbabel.info/

I just submitted a sub-page of that site, which has some discussion that touches more on the layout of the library as described by Borges: https://news.ycombinator.com/item?id=40970841

_fizz_buzz_ · 2024-07-15T12:03:29 1721045009

This is not necessarily true. Pi might not repeat but it could at some point - for example - not contain the digit 3 anymore (or something like that). It would never repeat, but still not have all conceivable information.

pilaf · 2024-07-15T13:56:04 1721051764

But the number 3 is there just because we decide to calculate digits in base 10. We could encode Pi in binary instead, and since it doesn't repeat it necessarily will never be a point where there will never be another 1 or a 0, right?

bubblyworld · 2024-07-15T14:12:21 1721052741

That's true - you can quite easily prove that an eventually constant sequence of decimals codes for a rational number.

But it's also true that pi may not contain every _possible_ sequence of decimals, no matter what base you pick. Like the Riemann hypothesis, it seems very likely and people have checked a lot of statistics, but nobody has proven it beyond a (mathematical) shadow of doubt.

_fizz_buzz_ · 2024-07-15T19:15:34 1721070934

Obviously, it was just an example to illustrate what a non-periodic number could look like that doesn’t contain all possible permutations. If the number never contains the digit 3 in base 10 it will also not contain all possible permutations in all other bases.

Moosturm · 2024-07-15T09:48:41 1721036921

https://github.com/philipl/pifs

maxmouchet · 2024-07-15T09:48:43 1721036923

https://news.ycombinator.com/item?id=8018818 and https://github.com/philipl/pifs :-)

sammex · 2024-07-15T09:46:55 1721036815

Would the index number actually be smaller than the actual data?

waldrews · 2024-07-15T10:01:03 1721037663

It would average the same size as the actual data. Treating the pi bit sequence as random bits, and ignoring overlap effects, the probability that a given n bit sequence is the one you want is 1/2^n, so you need to try on average 2^n sequences to find the one you want, so the index to find it is typically of length n, up to some second order effects having to do with expectation of a log not being the log of an expectation.

psychoslave · 2024-07-15T09:57:11 1721037431

You need both index and length, I guess. If concatenating both value is not enough to gain sufficient size shrink, you can always prefix a "number of times still needed to recursively de-index (repeat,start-point-index,size) concatenated triplets", and repeat until you match a desired size or lower.

I don’t know if there would be any logical issue with this approach. The only logistical difficulty I can figure out is computing enough decimals and search the pattern in it, but I guess that such a voluminous pre-computed approximation can greatly help.

waldrews · 2024-07-15T10:12:59 1721038379

No invertible function can map every non-negative integer to a lower or equal non-negative integer (no perfect compression), but you can have functions that compress everything we care about at the cost of increasing the size of things we don't care about. So the recursive de-indexing strategy has to sometimes fail and increase the cost (once you account for storing the prefix).

psychoslave · 2024-07-15T12:08:50 1721045330

Is there some inductive proof of that? Or is that some conjuncture?

Actually any resources related to that point could be fun to explore

waldrews · 2024-07-15T17:50:57 1721065857

It's a classic application of the pigeonhole principle, the first on in this list:

https://en.wikipedia.org/wiki/Pigeonhole_principle#Uses_and_...

euroderf · 2024-07-15T10:32:00 1721039520

> every piece of conceivable information (music, movies, texts) is in there, encoded.

So that means that if we give a roomful of infinite monkeys an infinite number of hand-cranked calculators and an infinite amount of time, they will, as they calculate an infinite number of digits of pi, also reproduce the complete works of Shakespeare et al.

_joel · 2024-07-15T13:17:33 1721049453

and then do it all again, but backwards.

sxv · 2024-07-15T11:24:11 1721042651

Isn't 202TB (for comparison) way too small to contain every permutation of information? That filesize wouldn't even be able to store a film enthusiast's collection?

RamblingCTO · 2024-07-16T12:33:29 1721133209

Well it all comes down to encoding, doesn't it. We can represent almost everything with just 0 and 1 as well, can't we? The description of that data is way bigger than the elements used to describe it of course.

worewood · 2024-07-15T12:58:27 1721048307

The sad thing is that the index would take just as much space as the data itself, because in average you can expect to find a n-bit string at the 2^n position.

criddell · 2024-07-15T12:28:49 1721046529

> every piece of conceivable information is in there

Wouldn't the encoded information have to have a finite length? For example, pi doesn't contain e, does it?

tzs · 2024-07-15T15:24:32 1721057072

> For example, pi doesn't contain e, does it?

Assuming we are only interested in base 10 and that pi contains e means that at some point in the sequence of decimal digits of pi (3, 1, 4, 1, 5, 9, 2, ...) there is the sequence of decimal digits of e (2, 7, 1, 8, 2, 8, ...), then I believe that question is currently unanswered.

Pi would contain e if and only if there are positive integers n and m such that 10^n pi - m = e, or equivalently 10^n pi - e = m.

We generally don't know if combinations of e and pi of the form a pi + b e where a and b are algebraic are rational or not.

Even the simple pi + e is beyond current mathematics. All we've got there is that at least one of pi + e and pi e must be irrational. We know that because both pi and e are zeros of the polynomial (x-pi)(x-e) = x^2 - (pi+e)x + pi e. If both pi+e and pi e were rational then that polynomial would have rational coefficients, and the roots of a non-zero polynomial with rational coefficients are algebraic (that is in fact the definition of an algebraic number) and both pi and e are known to not be algebraic.

RamblingCTO · 2024-07-16T12:32:15 1721133135

I implied that, yes

voytec · 2024-07-15T10:13:03 1721038383

> As pi never repeats itself, that also means that every piece of conceivable information (music, movies, texts) is in there, encoded.

You reminded me of this Person of Interest clip: https://www.youtube.com/watch?v=fXTRcsxG7IQ

sundry_gecko · 2024-07-15T15:15:38 1721056538

Reminds me of a scene of Finch teaching in Person of Interest.

https://m.youtube.com/watch?v=yGmYCfWyVAM

Zambyte · 2024-07-15T14:12:52 1721052772

https://news.ycombinator.com/item?id=36357466

2OEH8eoCRo0 · 2024-07-15T14:54:24 1721055264

Does pi contain pi?

schoen · 2024-07-15T19:17:55 1721071075

It does, starting right at the beginning!