ansaurus

Question

R - optimize objective function (does lots of matrix manipulation)

Answer 1

+2 A:

Lots of loops, lots of sweeping along arrays, very few statistical functions... I'd rewrite it in C.

Keep your slow R version for checking, and rewrite this in C. Make sure your R and C give the same values for test data sets.

Oh, but first profile everything to make sure its this bit that is slow - it certainly looks like a prime candidate.

Spacedman 2010-10-05 17:46:53

Spacedman is correct here. If this is as r-like as you can code then do it in C. However, perhaps you have quite a bit of code around this that's best expressed in R. In that case you need to reconsider how you're doing things a lot.

John 2010-10-05 18:12:42

It certainly _seems_ like it could be more R-like, but I don't know R comprehensively so I don't know how to make it faster. I'm using the `rgenoud` package for the actual genetic optimization; I suppose I could make this a `.C()` subroutine but this was the way to get things going quickly.

Zack 2010-10-05 18:41:21

++ Right on. R is used for ease of trying out ideas, not for speed of execution. To do the profiling part, it's very simple. Just hit the Escape key and display the call stack a few times. If that code is typically on the stack, then that's what needs to be recoded in C.

Mike Dunlavey 2010-10-05 19:25:30

Nearly a week later, I messed around with this at length and did, in fact, end up rewriting it in C. So you get the check mark. (And now I have to start over, because now that it's not taking an hour per generation, `rgenoud` has a chance to eat all my RAM and crash. Need to find a representation of the function under optimization that doesn't involve a 1500-element matrix...)

Zack 2010-10-12 21:20:45

Answer 2

+1 A:

Maybe post a question about your pkt.matrix function by itself (seems like bad R code). That might be something that you can provide toy sample data for and give a simple description of. In fact, as near as I can tell, you'd be better off if that were a list. Do you really want it symmetric on every row? If packets are ragged then just make a list of packets. It's easier and will work faster.

Isn't jam just a vector? If so then "sum(apply(jam, 2, sum) > 0)" is sort of nonsense. It should just be sum(jam).

John 2010-10-05 18:22:08

I've updated the question with an explanation of the data structures and enough additional code that it should be possible to play with. It really doesn't make sense to make the packets be a list, that would lose information about timing that the full version of this would need. And jam is a matrix, not a vector.

Zack 2010-10-05 18:46:59

Answer 3

+2 A:

One thing that will help is to replace:

runs <- rle(ifelse(apply(tpat, 2, sum) > 0, TRUE, FALSE))  # replace this
runs <- rle(colSums(tpat) > 0)  # with this

and generally replace apply(foo, 2, sum) with colSums(foo) and apply(foo, 1, sum) with rowSums(foo).

EDIT: Here's an updated version of pkt.matrix. Nothing stunning, but it's quite a bit faster.

pkt.matrix <- function(tpat) {
  runs <- rle(colSums(tpat) > 0);
  pkt  <- matrix(FALSE, nrow=sum(runs$values),
                 ncol=sum(runs$lengths));

  endpts <- cumsum(runs$lengths)[runs$values]
  begpts <- endpts-runs$lengths[runs$values]+1

  for(i in 1:NROW(pkt)) {
    #pkt[i,seq(begpts[i],endpts[i])] <- TRUE
    pkt[i,begpts[i]:endpts[i]] <- TRUE  # eyjo's suggestion
  }

  return(pkt);
}

> # Times on my machine:
> # Original
> system.time( for(i in 1:1e4) pktm <- pkt.matrix(sat.fhss) )
   user  system elapsed 
  68.21    0.23   68.50
> # Updated
> system.time( for(i in 1:1e4) pktm <- pkt.matrix(sat.fhss) )
   user  system elapsed 
   4.28    0.00    4.28

Joshua Ulrich 2010-10-05 19:19:02

thanks, I'll try that (didn't know about *Sums).

Zack 2010-10-05 19:25:52

Change the "seq(begpts[i],endpts[i]) into just "begpts[i]:endpts[i]", one less function call (and 2.5 times faster) ...

eyjo 2010-10-05 21:05:07

ansaurus

tags:

views:

answers:

R - optimize objective function (does lots of matrix manipulation)

related questions