ansaurus

Question

Need help optimizing solution for Project Euler problem #12.

Answer 1

A:

Have you considered breaking into prime factors, and keeping track of the primes so you don't have to recalculate them?

Thorbjørn Ravn Andersen 2010-08-01 15:52:39

@Thorbjoern: I've thought about compiling a .txt file, as it would be much quicker than recalculating, but I feel as if this is cheating the system.

Justian Meyer 2010-08-01 16:57:22

You can calculate them on demand and save the results as you go.

Thorbjørn Ravn Andersen 2010-08-01 17:26:12

Answer 2

+4 A:

Use the underlying mathematical structure; doing so will dramatically reduce your program's running time. This also applies to problem 10, by the way: if you can't solve it in a few milliseconds, you've used a massively inefficient algorithm. In fact, I advise you to work on problem 10 first, because problem 12 builds on it.

I'm going to give a better algorithm for problem 12 below, but first, here's an observation that should speed up your program significantly. If two numbers x and y are coprime (i.e. they have no common divisor other than 1), then d(x·y) = d(x)·d(y). In particular, n and n+1 are always coprime, so d(n·(n+1)) = d(n)·d(n+1). The n-th triangle number is n·(n+1)/2, and the factor of 2 can be taken out of whichever of n and n+1 is even: d(n·(n+1)/2) = d(n/2)·d(n+1) when n is even, and d(n)·d((n+1)/2) when n is odd. So instead of passing the triangle numbers themselves to d, iterate over n: this significantly reduces the size of the arguments passed to d.
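To make the observation concrete, here is a minimal sketch that checks it numerically. The helper names `count_divisors` and `triangle_divisors` are introduced here for illustration and are not from the original answer; the divisor count is deliberately naive.

```python
def count_divisors(m: int) -> int:
    """Naive divisor count by trial division up to sqrt(m); for illustration only."""
    count = 0
    i = 1
    while i * i <= m:
        if m % i == 0:
            # i and m // i form a divisor pair; count it once if they coincide.
            count += 1 if i * i == m else 2
        i += 1
    return count


def triangle_divisors(n: int) -> int:
    """d(T(n)) computed from the two coprime factors of T(n) = n * (n + 1) / 2."""
    if n % 2 == 0:
        a, b = n // 2, n + 1
    else:
        a, b = n, (n + 1) // 2
    # a and b are coprime, so d(a * b) = d(a) * d(b).
    return count_divisors(a) * count_divisors(b)
```

For example, `triangle_divisors(7)` factors T(7) = 28 as 7 · 4 and multiplies d(7) = 2 by d(4) = 3, so `count_divisors` only ever sees arguments of size about n rather than about n²/2.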

If you do that optimization, you'll notice that you compute d(n) twice in a row (once as d((n-1)+1) and once as d(n)). This suggests that caching the result of d is a good idea. The algorithm below does it, but also computes d bottom-up rather than top-down, which is more efficient because multiplying is a lot faster than factoring.
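A sketch of that reuse, under the assumption that the triangle number is split as n · (n+1)/2 into two coprime factors (one of which absorbs the division by 2). With that split, the second factor of step n is exactly the first factor of step n+1, so each value of d is computed only once. The names `naive_d` and `first_triangle_over` are hypothetical, and `naive_d` stands in for whatever divisor-counting function you already have.

```python
from typing import Callable


def naive_d(m: int) -> int:
    """Placeholder divisor count (trial division); swap in a faster one."""
    count = 0
    i = 1
    while i * i <= m:
        if m % i == 0:
            count += 1 if i * i == m else 2
        i += 1
    return count


def first_triangle_over(threshold: int, d: Callable[[int], int] = naive_d) -> int:
    """Return the first triangle number with more than `threshold` divisors."""
    n = 1
    left = d(1)  # divisor count of the first coprime factor of T(1)
    while True:
        # Second coprime factor of T(n) = n * (n + 1) / 2.
        right_arg = n + 1 if n % 2 == 0 else (n + 1) // 2
        right = d(right_arg)
        if left * right > threshold:
            return n * (n + 1) // 2
        # The second factor of T(n) is the first factor of T(n + 1): reuse it.
        left = right
        n += 1
```

This still factors top-down; it only removes the duplicated call. The bottom-up approach described next avoids factoring altogether.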

Problem 10 can be solved by a simple application of the sieve of Eratosthenes. Fill an array of booleans (i.e., a bit vector) of size 2000000 such that sieve[i]==true exactly when i is prime; then sum the indices i for which sieve[i]==true.
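A minimal sketch of that sieve, assuming a `bytearray` as the bit vector and a hypothetical function name `sum_primes_below`; the bound is a parameter so the sketch can be checked on small inputs.

```python
def sum_primes_below(limit: int) -> int:
    """Sum of all primes strictly below `limit`, via the sieve of Eratosthenes."""
    if limit < 2:
        return 0
    sieve = bytearray([1]) * limit  # sieve[i] == 1 means "i is still a candidate prime"
    sieve[0] = sieve[1] = 0
    p = 2
    # It is enough to sieve with p while p*p < limit (not while p < limit).
    while p * p < limit:
        if sieve[p]:
            # Smaller multiples of p were already crossed out by smaller primes.
            for multiple in range(p * p, limit, p):
                sieve[multiple] = 0
        p += 1
    return sum(i for i in range(limit) if sieve[i])
```

Calling `sum_primes_below(10)` adds 2 + 3 + 5 + 7. The stopping condition on p*p is the detail that most often separates a fast sieve from a slow one.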

Problem 12 can be solved by a generalization of the sieve of Eratosthenes. Instead of making sieve[i] a boolean indicating whether i is prime, make it a number indicating the number of ways in which it is non-prime, i.e. the number of divisors of i. It is easy to modify the basic sieve of Eratosthenes to do that: rather than set sieve[x*y] to false, add 1 to it, and let x range over all integers rather than only the primes, so that every divisor gets counted.
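One way to write that generalized sieve is sketched below. The name `divisor_counts` is an assumption of this example, as is the choice to count 1 and i itself among the divisors of i (so a prime ends up with a count of 2).

```python
from typing import List


def divisor_counts(limit: int) -> List[int]:
    """Return a list c where c[i] is the number of divisors of i, for 1 <= i <= limit."""
    counts = [0] * (limit + 1)
    # Unlike the prime sieve, x runs over every integer, not just the primes,
    # and the inner loop starts at x itself so that x counts as its own divisor.
    for x in range(1, limit + 1):
        for multiple in range(x, limit + 1, x):
            counts[multiple] += 1
    return counts
```

The whole table costs roughly limit · ln(limit) additions and no divisions at all, which is why the bottom-up approach beats factoring each number separately.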

Several subsequent Project Euler problems benefit from a similar approach.

One issue you may have is that in problem 12, it's not clear when to stop computing the sieve. You can go two ways about it:
1. compute the sieve by chunks on demand, a worthwhile programming exercise in itself (this will require more complex code than the second method)
2. or start by overestimating a bound: find some triangle number that has over 500 divisors; you know you'll stop before or at that number.
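As a rough middle ground between the two options (neither a true chunked sieve nor a proven bound), the sketch below guesses a sieve size and doubles it whenever the guess turns out too small. The names `divisor_counts`, `first_triangle_over` and `start_limit` are hypothetical, and the code assumes the triangle number is split as n · (n+1)/2 into two coprime factors.

```python
from typing import List


def divisor_counts(limit: int) -> List[int]:
    """c[i] = number of divisors of i for 1 <= i <= limit (sieve-style)."""
    counts = [0] * (limit + 1)
    for x in range(1, limit + 1):
        for multiple in range(x, limit + 1, x):
            counts[multiple] += 1
    return counts


def first_triangle_over(threshold: int, start_limit: int = 64) -> int:
    """First triangle number with more than `threshold` divisors."""
    limit = start_limit
    while True:
        d = divisor_counts(limit)
        # T(n) needs d up to index n + 1, so n ranges over 1 .. limit - 1.
        for n in range(1, limit):
            if n % 2 == 0:
                total = d[n // 2] * d[n + 1]
            else:
                total = d[n] * d[(n + 1) // 2]
            if total > threshold:
                return n * (n + 1) // 2
        # The guess was too small: throw the table away and retry with twice the size.
        limit *= 2
```

Recomputing the table from scratch wastes at most a constant factor of work, since the sizes grow geometrically. A chunked sieve would avoid even that, at the cost of more bookkeeping.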

You can gain more time if you realize that you only need to care about odd numbers, since d(2^k·n) = (k+1)·d(n) if n is odd, and finding k and n given only (2^k·n) is fast on a binary computer. I'll leave the details of that optimization as an exercise.
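Without working out the whole exercise, the splitting step alone can be sketched as follows. The function names are made up for this example, and `d_odd` stands for any function that counts the divisors of an odd number.

```python
from typing import Callable, Tuple


def split_power_of_two(m: int) -> Tuple[int, int]:
    """Return (k, odd) such that m == 2**k * odd and odd is odd; requires m >= 1."""
    if m < 1:
        raise ValueError("m must be a positive integer")
    lowest_bit = m & -m  # isolates the lowest set bit, which equals 2**k
    k = lowest_bit.bit_length() - 1
    return k, m >> k


def count_divisors_via_odd_part(m: int, d_odd: Callable[[int], int]) -> int:
    """Apply d(2**k * n) = (k + 1) * d(n) for odd n."""
    k, odd = split_power_of_two(m)
    return (k + 1) * d_odd(odd)
```

For m = 48 the split gives k = 4 and an odd part of 3, so d(48) = 5 · d(3). Only odd arguments ever reach `d_odd`, which is what lets a sieve skip the even entries.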

Gilles 2010-08-01 16:26:08

+1, great non-spoiler explanation!

Jim Lewis 2010-08-01 16:50:39

@Gilles: I will look into the 1st observation, but I've already tried problem 10 with the sieve of Eratosthenes with little luck. In fact, that implementation took 10 minutes to run on my machine. Of course, it's likely a programming error on my part, but I followed the instructions from the Wikipedia article to a T. +1 for an overall wonderful (non-spoiler, as Jim said) explanation. You certainly read my question correctly.

Justian Meyer 2010-08-01 17:02:24

@Gilles: I will attempt these algorithms when I next find the time.

Justian Meyer 2010-08-01 17:02:47

@Justian: I recommend you figure out what your sieve implementation is doing wrong; it's an important stepping stone (to solving Project Euler problems, at any rate). Sanity check: did you notice that the Wikipedia article has the loop repeating until p²>n, not p>n?

Gilles 2010-08-01 17:55:20

@Gilles: My original implementation looked similar to the one here: rosettacode.org/wiki/Sieve_of_Eratosthenes#Java. My new one runs far faster than my first attempt and slightly faster than my old solution. I just need to optimize my new solution, which seems a lot more possible.

Justian Meyer 2010-08-01 20:02:58

Answer 3

A:

I did this one a while ago, so I don't remember all of the optimizations, but here are some:

1. use the summation formula for sum(1...n)
2. find prime factors with the methods described in problem 7 and problem 10
3. determine how many divisors n has based on its prime factorization*

*Treat this as a counting (combinatorics) question if you don't know the "rule". E.g., you have four flavour-shots you can add to your coffee, how many options do you have?
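For readers who want the hint spelled out, here is one reading of it, offered as an interpretation rather than as the answerer's own code: each flavour shot is either in or out, giving 2·2·2·2 = 16 coffees, and likewise each prime p with exponent e can appear in a divisor 0 to e times, giving e+1 choices per prime. The function names below are hypothetical, and trial division stands in for the prime-finding methods mentioned in the list.

```python
from typing import Dict


def prime_factorization(n: int) -> Dict[int, int]:
    """Map each prime factor of n to its exponent, using plain trial division."""
    factors: Dict[int, int] = {}
    p = 2
    while p * p <= n:
        while n % p == 0:
            factors[p] = factors.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        # Whatever remains after dividing out all small primes is itself prime.
        factors[n] = factors.get(n, 0) + 1
    return factors


def count_divisors(n: int) -> int:
    """Multiply (exponent + 1) over the prime factorization of n."""
    result = 1
    for exponent in prime_factorization(n).values():
        result *= exponent + 1
    return result
```

For instance, 28 = 2² · 7 gives (2+1) · (1+1) divisors.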

Cambium 2010-08-02 01:27:05
