ansaurus

Question

Determine precision and scale of particular number in Python

Answer 1

+2 A:

I think you should consider using the decimal type instead of a float. The float type will give rounding errors because the numbers are represented internally in binary but many decimal numbers don't have an exact binary representation.

Mark Byers 2010-06-10 21:43:13

Answer 2

+1 A:

Basically, you can't with floating point numbers. Using the decimal type would help and if you want really large precision, consider using gmpy, the GNU Multiple Precision library's port to Python.

Daniel DiPaolo 2010-06-10 21:44:23

Answer 3

+1 A:

Getting the number of digits to the left of the decimal point is easy:

int(log10(x))+1

The number of digits to the right of the decimal point is trickier, because of the inherent inaccuracy of floating point values. I'll need a few more minutes to figure that one out.

Edit: Based on that principle, here's the complete code.

import math

def precision_and_scale(x):
    max_digits = 14
    int_part = int(abs(x))
    magnitude = 1 if int_part == 0 else int(math.log10(int_part)) + 1
    if magnitude >= max_digits:
        return (magnitude, 0)
    frac_part = abs(x) - int_part
    multiplier = 10 ** (max_digits - magnitude)
    frac_digits = multiplier + int(multiplier * frac_part + 0.5)
    while frac_digits % 10 == 0:
        frac_digits /= 10
    scale = int(math.log10(frac_digits))
    return (magnitude + scale, scale)

Mark Ransom 2010-06-10 22:32:45

Wow, thanks! It looks like that should do it.

jrdioko 2010-06-11 17:02:05

Out of curiosity, where does the number 14 come from? Is it platform-independent?

jrdioko 2010-06-11 18:09:53

It's platform dependent, but most platforms will use IEEE-754. http://en.wikipedia.org/wiki/Double_precision_floating-point_format#Double_precision_binary_floating-point_format I probably could have made it 15, but I wanted to be conservative and make sure my rounding worked properly.

Mark Ransom 2010-06-11 18:21:48

Answer 4

+2 A:

(0) Please confirm or deny: You are given floats to use, this is unavoidable, you can't get your data as decimal, the Oracle datatypes include decimal-based types, and this fundamental mismatch is unavoidable. Please explain any full or partial denial.

(1) Your "fail for large numbers" remark is misleading/irrelevant/wrong -- you say that your starting point is a float, but 1234567890.0987654321 can't be represented as a float, as shown by the result of repr().

(2) Perhaps you could use the NEW repr (Python 2.7 and 3.1) which provides the minimum possible precision of repr(x) that still satisfies float(repr(x)) == x

E.g. old repr(1.1) produces "1.1000000000000001", new repr(1.1) produces "1.1"

About "I guess map(len, repr(num).split('.')) is the closest I'll get to the precision and scale of the float?": You need a strategy to handle (a) negative and zero numbers (b) numbers like 1.1e20

Digging in Objects/floatobject.c should turn up the C code for the new repr() of a float object, should you need to use Python 2.6 or earlier.

(3) Perhaps if you told us the specs for the relevant Oracle data types, we could help you devise checks for choosing which type can contain a given float value.

John Machin 2010-06-10 23:08:00

0. Confirm, essentially. Everything is set up to use floats and I don't believe cx_Oracle will handle Decimal types, at least as I have it set up.1. Edited to correct.2. Interesting, unfortunately I'm behind 2.7.3. I'm trying to write code that will work dynamically even if the Oracle column types change in precision or scale. (ouch, sorry for the bad formatting)

jrdioko 2010-06-11 17:07:29

Answer 5

+2 A:

Not possible with floating point variables. For example, typing

>>> 10.2345

gives:

10.234500000000001

So, to get 6,4 out of this, you will have to find a way to distinguish between a user entering 10.2345 and 10.234500000000001, which is impossible using floats. This has to do with the way floating point numbers are stored. Use decimal.

import decimal
a = decimal.Decimal('10.234539048538495')
>>> str(a)
'10.234539048538495'
>>>  (len(str(a))-1, len(str(a).split('.')[1]))
(17,15)

Chinmay Kanchi 2010-06-10 23:09:08

@Chinmay: IS possible with floats; see my answer.

John Machin 2010-06-10 23:20:03

My answer works too. The key is to limit the number of digits to the known accuracy of the float type and round the result.

Mark Ransom 2010-06-11 15:36:34

Answer 6

A:

seems like str is better choice than repr:

>>> r=10.2345678
>>> r
10.234567800000001
>>> repr(r)
'10.234567800000001'
>>> str(r)
'10.2345678'

Nas Banov 2010-06-11 04:11:54

ansaurus

tags:

views:

answers:

Determine precision and scale of particular number in Python

related questions