Become a MacRumors Supporter for $50/year with no ads, ability to filter front page stories, and private forums.

Oracle1729

macrumors 6502a
Feb 4, 2009
638
0
the 48th known Mersenne prime, 2^57,885,161 - 1, a 17,425,170 digit number....
A text file with all of the digits of this new prime number would be over 22MB in size! But it makes wonderful bedtime reading.

I find it amusing that you say that. A text file to hold a 17,425,170 digit number would be oh, say about 17,425,171 bytes long give or take a few bytes. From where did you pull your silly little number? Even more interesting is that you could compress this file below 100 bytes including the program to reexpand it without too much trouble.

I find it amusing that they say that the 47th Mersenne prime was discovered over 4 years ago. According to their own logs it was April 12, 2009, which makes it 3.79 years ago. I guess arithmetic was never their strong suit. ;)

Pot, meet the kettle. I guess arithmetic is not your strong suit either.
 

LPZ

macrumors 65816
Jul 11, 2006
1,221
2
I find it amusing that you say that. A text file to hold a 17,425,170 digit number would be oh, say about 17,425,171 bytes long give or take a few bytes. From where did you pull your silly little number?

And how large would the text file be if commas were used as separators every three digits? :mad:
 

SilentPanda

Moderator emeritus
Oct 8, 2002
9,992
31
The Bamboo Forest
I find it amusing that you say that. A text file to hold a 17,425,170 digit number would be oh, say about 17,425,171 bytes long give or take a few bytes. From where did you pull your silly little number? Even more interesting is that you could compress this file below 100 bytes including the program to reexpand it without too much trouble.

The number is a little over 22MB because of commas. You end up with 23 million some characters then divide by 1024. The number was likely pulled from the web sites reporting it as being over 22MB because the main file that is out there holding this prime number contains the commas. While your comment is accurate, so is Doctor Q's.

Also, do you have a source that shows you could compress this under 100 bytes along with the program to reexpand it? I'm not asking as a challenge. I'm a programmer and genuinely curious.
 
Last edited:

ChristianVirtual

macrumors 601
May 10, 2010
4,122
282
日本
Even more interesting is that you could compress this file below 100 bytes including the program to reexpand it without too much trouble.

Now I'm interessted; how you can compress 17m digits in less the 100 bytes; and get it back ? :confused: typo ?
Something like Huffman with binary tree would not help; the distribution of members is equal; the tree would be balanced hence no advantage due to different entropies ...
 

SilentPanda

Moderator emeritus
Oct 8, 2002
9,992
31
The Bamboo Forest
Now I'm interessted; how you can compress 17m digits in less the 100 bytes; and get it back ? :confused: typo ?
Something like Huffman with binary tree would not help; the distribution of members is equal; the tree would be balanced hence no advantage due to different entropies ...

Yeah that's what I was looking at. Even if you went into double or triple digit numbers I doubt there would be enough repetition to bother storing them in the table. Coding just the single digits doesn't reduce your bits enough on average. I have the file downloaded and have removed all non-digit characters. I compressed it with 7-zip's ultra compression (not that it is optimized for a random distribution of digits at all) and got it to 7,667,712 bytes. 7.6 is a little under half of the 17 million bytes which makes sense. Not sure how to get rid of the other 7,667,612+ bytes to get it under 100 bytes (need room for the decompressor).
 

Oracle1729

macrumors 6502a
Feb 4, 2009
638
0
Now I'm interessted; how you can compress 17m digits in less the 100 bytes; and get it back ? :confused: typo ?
Something like Huffman with binary tree would not help; the distribution of members is equal; the tree would be balanced hence no advantage due to different entropies ...

I honestly didn't mean that comment to be sarcastic, I thought it was obvious in the same spirit as the person who asked how many 0's are in the binary representation (the answer as given above was 0).

I was basically saying that the Kolmogorov complexity was under 100 bytes. I'm a bit surprised you know of Huffman coding but not complexity theory.

My compressed form is 57885161 which is 4 bytes (it fits in a 32-bit integer), and that leaves me 96 bytes to write my decompressor (an assembly program which basically says "convert that many 1's as a binary number to decimal").

I could claim that simply printing that many 1's is a valid decompressor since binary is a valid representation, but since you can do the decimal conversion in under 96 bytes easily enough, it's a moot point.

Edit: By the way, your Huffman tree is not remotely balanced. You're talking about bytes, so you have 256 possible values of which the most common 10 (11 if you insist on using commas) make up all your bytes. Huffman would get a very high compression ratio on that 17 meg text file. I'm actually surprised that 7-zip did as poorly as it did on purely numerical data stored in ascii.
 
Last edited:

Doctor Q

Administrator
Original poster
Staff member
Sep 19, 2002
39,778
7,500
Los Angeles
The Great Internet Mersenne Prime Search has a new discovery:

2 to the power 77,232,917, minus 1, is the largest and most recently discovered prime number and also the largest and most recently discovered Mersenne prime. It has over 23 million digits.​

This also means that (2^77,232,916) x ((2^77,232,917)-1) is a 46-million-digit perfect number.

This Mersenne prime is closer to its predecessor than its predecessor was to its own predecessor, leading us to speculate on whether there are more Mersenne primes than expected, or the differences don't indicate a pattern.

It's possible that there are a finite number of Mersenne primes, which means that this could even be the largest Mersenne prime number, but I'm more inclined to think that there are an infinite number of them, so we'll have reason to continue this thread for centuries to come.
 
Register on MacRumors! This sidebar will go away, and you'll see fewer ads.