ansaurus

Question

How Do You Predict A Non-Linear Script's Run Time?

Answer 1

A:

Well, if you are on linux you can use 'time' command and then parse it's result.

For your problem I would do the timing for 1000s of large primes of different size and would draw a chart, so it would be easy to analize.

BarsMonster 2010-09-07 09:49:58

It isn't linear! So, that means as the numbers get larger and larger the number of primes I've found get lesser and lesser. On the other hand the list is growing and it takes longer to run the check.

JustA. 2010-09-07 09:52:58

@JustA: then you'll see the non-linearity in the chart, and you'll be able to see which non-linear function might fit the points well.

liori 2010-09-07 09:55:04

Yes, it's not linear. And on the graph you will see exactly how non-linear it is :-)

BarsMonster 2010-09-07 09:55:20

Presumably, the idea of estimating the time is in order to work out how long it will take to test 1000's of large primes, before actually doing it.

sje397 2010-09-07 10:28:26

Answer 2

+2 A:

I think you would have to use an approximation of the distribution of primes, a la PNT which (I think) states that between 1 and x you'll have approximately x/ln(x) primes (ln being natural log). So given rough estimates of the time taken for a single iteration, you should be able to create an estimate.

You have approximately x/ln(x) primes in your list. Your main code block (inside the while loop) has constant time (effectively)...so:

t(x) ~ x/ln(x) * a + b + t(x-1)

where t(x) is the time taken up to and including iteration x, a is the time taken to check each prime in the list (modulous operation), and b is the 'constant' time of the main loop. I faintly remember there is a way to convert such recursive functions to linear ones ;)

sje397 2010-09-07 09:51:43

"the time taken for a single iteration"; That's a recursive problem as it takes longer for the 'ifprime' to execute as the list grow larger. Any other suggestions?

JustA. 2010-09-07 09:55:04

[This](http://en.wikipedia.org/wiki/Recurrence_relation) might help converting to a closed-form equation.

sje397 2010-09-07 10:02:42

How come the main function has constant time ???

mb14 2010-09-07 10:35:47

What I said not quite accurate - what I meant is the inner part of the main loop (excluding the `ifprime` function call) has constant time. I'll edit.

sje397 2010-09-07 10:48:03

Answer 3

+2 A:

If you want to predict the time an arbitrary process needs until it is finished, you can't do that, as that is basically the problem behind the Halting Problem. In special cases you can estimate the time your script will take, for example if you know that it is generated in a way that doesn't allow loops.

In your special case of finding primes, it is even harder to guess the time it will take before running the process, as there is only a lower bound for the number of primes within an intervall, but that doesn't help finding them.

evnu 2010-09-07 09:58:00

This hasn't got much to do with the Halting problem - that is about finding a generic algorithm to tell if any program will ever stop. Finding out whether a specific program will stop is quite possible, whether it has loops or not.

sje397 2010-09-07 10:06:24

@sje397 You are right. What I meant to say was more sth like: In general (for any arbitrary program) it is impossible to estimate the time it takes, as that would involve figuring out if the process contains infinite loops. In this special case one can estimate the time it takes by using the theory of probability.

evnu 2010-09-07 10:19:19

But we are analyzing a specific program here.

Nathan Davis 2010-09-07 18:28:38

Answer 4

A:

Well, there is a large branch of theoretical computer science -- complexity theory -- dedicated to just this sort of problem. The general problem (of deciding on whether a code will finish for arbitrary input) you have here is what is called "NP-complete" and is therefore very hard.

But in this case you probably have two options.

The first is to use brute force. Run timeit for isprime(a) for a=1, 2, 3, 4, ..., plot the graph of the times, and try to see if it looks like something obvious: a^2, a log a, whatever.

The right -- but harder -- answer is to analyze your algorithm and see if you can work out how many operations it takes for a "typical case".

Andrew Jaffe 2010-09-07 09:59:39

Answer 5

A:

When you call isprime(pstep) you are looping pstep * ln(pstep) times, if you have a prime, of which the probability is 1/ln(pstep). So the cost of testing the primes is proportional to step. Unknown is the cost of testing the composites, because we don't know the average lowest factor of the composites between 2 and N. If we ignore it, assuming it is dominated by the cost for the primes, we get a total cost of SUM(pstep) for pstep = 3 to N+3, which is about proportional to N**2.

You can reduce this to N**1.5 by cutting off the loop in isprime() when checked > sqrt(a).

Amos Newcombe 2010-09-07 12:55:05

Why would you be looping `pstep * ln(pstep)` times? The PNT theorem suggests there are likely `pstep/ln(pstep)` primes, and the probability of `1/ln(pstep)` for a number being prime suggests the same thing.

sje397 2010-09-07 15:22:10

ansaurus

tags:

views:

answers:

How Do You Predict A Non-Linear Script's Run Time?

related questions