ansaurus

Question

Pitfalls of number values in Python, "How deep?"

Answer 1

+1 A:

You get a different behavior because print truncates numbers:

In [1]: 1.23456789012334
Out[1]: 1.23456789012334 
In [2]: print 1.23456789012334
1.23456789012

Note, at the precision used in Python's floats:

In [3]: 4.7 == 4.7000000000000002
Out[3]: True

This is because floats have a limited (relative) precision because they use a finite number of (binary) digits to represent real numbers. Thus, as above, different decimal representations of a given number can actually be equal for Python, after being approximated by the closest float. This is a general property of floating point numbers.

EOL 2010-08-22 16:28:46

Answer 2

+2 A:

When working with floating point numbers, the common approach goes like this:

a == b if abs(a-b) <= eps, where eps is the required precision.

In programming contests, eps is given along with the problem to solve. My advice is to establish an accuracy that you need for your stuff, and use it

Gabi Purcaru 2010-08-22 16:31:30

Answer 3

+3 A:

This has to do with how computers store floating point numbers. A detailed description of this is here. However, for your case, the quick solution is to check not the printed representation of point.x but if point.x is equal to 4.7. So...

>>> point = Point(4.7, 8.2)
>>> point.x == 4.7
True

Or better:

>>> point = Point(4.7, 8.2)
>>> eps = 2**-53 #get epsilon for standard double precision number
>>> -eps <= point.x - 4.7 <= eps
True

Where eps is the maximum value for rounding errors in floating-point arithmetic. For details on epsilon, see here.

EDIT: -eps <= point.x - 4.7 <= eps is equivalent to abs(point.x - 4.7) <= eps. I only add this because not everyone is familiar with Python's chaining of comparison operators.

EDIT 2: Since you mentioned numpy, numpy has a method to get the eps without calculating it yourself. Use eps = numpy.finfo(float).eps instead of 2**-53 if you're using numpy. Note that the numpy epsilon is for some reason bigger than it should be and is equal to 2**-52 rather than 2**-53. I have no idea why this is.

Chinmay Kanchi 2010-08-22 16:32:04

Machine epsilon is a bound for **relative** error. You can't use it as you did, as the absolute error will be larger for values farther away from zero. In this specific case, `point.x - 4.7` will always give exactly 0 anyway.

interjay 2010-08-22 17:29:00

Answer 4

+1 A:

This comprehensive guide explains everything.

Here are Python-specific explanations.

niscy 2010-08-22 17:21:14

Answer 5

+4 A:

>>> point.x

calls repr function which is for string representation holding more technical information than strfunction, which is called when

>>> print point.x

occurs

Odomontois 2010-08-22 17:53:27

Thank you for answering a question I should have asked

tel 2010-08-22 18:03:01

ansaurus

tags:

views:

answers:

Pitfalls of number values in Python, "How deep?"

related questions