ansaurus

Question

long/bigint/decimal equivalent datatype in R

Answer 1

+3 A:

See help(integer):

 Note that on almost all implementations of R the range of
 representable integers is restricted to about +/-2*10^9: ‘double’s
 can hold much larger integers exactly.

so I would recommend using numeric (i.e. 'double') -- a double-precision number.

Dirk Eddelbuettel 2010-01-13 00:09:15

I looked at the as.numeric() function, but was confused by the fact that mode(1) also gives "numeric" as the type, so I thought I was already dealing with them. I then tried as.numeric("123456789123456789") and saw only a few numbers printed, so assumed that it lost the precision. I didn't know about options("digits") before.

haridsv 2010-01-13 01:34:01

Ah, yes, the digits thing. Also, if you need higher-precision or large numbers, CRAN has packages for that as e.g. the (oddly named :-) Brobdingnag package for large numbers, and there is also the gmp package to interface GNU gmp.

Dirk Eddelbuettel 2010-01-13 01:39:57

Answer 2

+1 A:

Dirk is right. You should be using the numeric type (which should be set to double). The other thing to note is that you may not be getting back all the digits. Look at the digits setting:

> options("digits")
$digits
[1] 7

You can extend this:

options(digits=14)

Alternatively, you can reformat the number:

format(big.int, digits=14)

I tested your number and am getting the same behavior (even using the double data type), so that may be a bug:

> as.double("123456789123456789")
[1] 123456789123456784
> class(as.double("123456789123456789"))
[1] "numeric"
> is.double(as.double("123456789123456789"))
[1] TRUE

Shane 2010-01-13 01:12:46

Thanks for pointing the options() and format(), they are helpful. However, these options seem to only control how the number is formatted for display, so it shouldn't change how the number is parsed while using as.double() or as.numeric(). The behavior could be a bug.

haridsv 2010-01-13 01:31:44

Answer 3

+1 A:

I understood your question a little differently than the two that answered before me. If R's largest default value is not big enough for you, you have a few choices. (Disclaimer: I have used each of the libraries i mention below, but not through the R bindings, rather either other language bindings or the native library.)

The Brobdingnag package: uses natural logs to store the values; (like Rmpfr, implemented using R's new class structure). Math for real men:

library(Brobdingnag)
googol <- as.brob(1e100)

The gmp package: R bindings to the venerable GMP (GNU Multi-precision library). This must go back 20 years because i used it in University. This Library's motto is "Arithmetic Without Limits," which is a credible claim--integers, rationals, floats, whatever, right up to the limits of the RAM on your box.

library(gmp)
x = as.bigq(8000, 21)

The Rmpfr package: R bindings which interface to both gmp (above) and MPFR, (MPFR is in turn a contemporary implementation of gmp. I have used the Python bindings ('bigfloat') and can recommend it highly. This might be your best option of the three, given its scope, given that it appears to be the most actively maintained, and and finally given what appears to be the most thorough documentation.

Note: to use either of the last two, you'll need to install the native libraries, GMP and MPFR.

doug 2010-01-13 02:02:01

Thanks, but currently I am satisfied with the limitations of numeric datatype, though it didn't really meet my original question. I will keep your suggestion in mind and will look into them in case I need to handle larger values.

haridsv 2010-01-13 19:41:54

Answer 4

A:

I fixed few issues related to integers in rpy2 (Python can swich from int to long when needed, but R does does not seem to be able to do that. Integer overflows should now return NA_integer_.

L.

2010-01-25 06:26:52

ansaurus

tags:

views:

answers:

long/bigint/decimal equivalent datatype in R

related questions