ansaurus

Question

Getting the excluded elements for each of the combn(n,k) combinations

Answer 1

+1 A:

Here is a more general solution (you can replace X with any vector containing unique entries):

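X <- 1:n
# keep the entries of ref that do not appear in the column x
B <- apply(A, 2, function(x, ref) ref[!ref %in% x], ref = X)
# no do.call(cbind, B) needed: every column excludes the same number of
# elements, so apply() already returns a matrix (and do.call() requires a list)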

Unlike your previous question, where x and y were not sets, the code above should work as long as the columns of A are proper sets (no duplicate values).
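A minimal sketch of the idea on a small case, assuming n = 5 and k = 2 (values picked only for illustration). The last lines show why the `%in%` test is not enough once the vector contains duplicates: every value of `x` also occurs in the chosen subset, so nothing is left over.

```r
n <- 5; k <- 2                     # illustrative values, not from the question
A <- combn(n, k)                   # one combination per column
X <- 1:n
B <- apply(A, 2, function(x, ref) ref[!ref %in% x], ref = X)
dim(B)                             # 3 10
B[, 1]                             # 3 4 5, the complement of the first column (1 2)

# With duplicates the membership test removes too much:
x <- c(0, 1, 0, 2, 0, 1)
chosen <- c(0, 1, 0, 2)
dup_left <- x[!x %in% chosen]      # empty, although 0 and 1 were left out
```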

teucer 2010-03-22 09:12:06

Thank you, but in most cases there will be duplicates, as was the case in the referenced question.

gd047 2010-03-22 09:14:30

Answer 2

+4 A:

Using Musa's idea:

B <- apply(A,2,function(z) x[is.na(pmatch(x,z))])

For the first example:

B <- apply(A,2,function(z) (1:n)[is.na(pmatch((1:n),z))])
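To see why this copes with duplicates, here is a small sketch using the vector from the second example. With its default `duplicates.ok = FALSE`, `pmatch` uses each element of `z` at most once, so surplus copies in `x` come back as `NA` and are kept as the excluded elements. The names `z` and `left` are only for illustration.

```r
x <- c(0, 1, 0, 2, 0, 1)
z <- c(0, 1, 0, 2)                 # one chosen combination
pmatch(x, z)                       # 1 2 3 4 NA NA
left <- x[is.na(pmatch(x, z))]     # 0 1

# Applied to all combinations of size 4:
A <- combn(x, 4)
B <- apply(A, 2, function(z) x[is.na(pmatch(x, z))])
dim(B)                             # 2 15
```

Each column of `A` together with the matching column of `B` should reproduce `x` up to ordering.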

gd047 2010-03-22 10:35:01

Answer 3

+2 A:

Use the setdiff function:

N <- 5
m <- 2    
A <- combn(N,m)
B <- apply(A,2,function(S) setdiff(1:N,S))

MODIFIED: The above works only when the vectors have unique values. For the second example, we write a replacement for setdiff that can handle duplicate values. We use rle to count the number of occurrences of each element in the two sets, subtract the counts, then invert the RLE:

diffdup <- function(x,y){
  rx <- do.call(data.frame,rle(sort(x)))
  ry <- do.call(data.frame,rle(sort(y)))
  m <- merge(rx,ry,by='values',all.x=TRUE)
  m$lengths.y[is.na(m$lengths.y)] <- 0
  rz <- list(values=m$values,lengths=m$lengths.x-m$lengths.y)
  inverse.rle(rz)
}

x<-c(0,1,0,2,0,1) ; k<- 4
A <- combn(x,k)
B <- apply(A,2,function(z) diffdup(x,z))
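The counting step can be traced by hand on one combination. This is a simplified sketch that assumes `y` contains every distinct value of `x`, so the two count vectors line up and can be subtracted directly; in general the counts have to be aligned by value first, which is what the `merge` in `diffdup` is for.

```r
x <- c(0, 1, 0, 2, 0, 1)
y <- c(0, 1, 0, 2)
rx <- rle(sort(x))                 # values 0 1 2, lengths 3 2 1
ry <- rle(sort(y))                 # values 0 1 2, lengths 2 1 1
# subtract the counts value by value, then expand back into a vector
rest <- inverse.rle(list(values = rx$values,
                         lengths = rx$lengths - ry$lengths))
rest                               # 0 1
```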

Jyotirmoy Bhattacharya 2010-03-24 14:24:44

Thanks. How must it be modified to work for the 2nd example too?

gd047 2010-03-24 15:18:32

Modified to add a solution for the second problem too.

Jyotirmoy Bhattacharya 2010-03-25 01:28:06

@jmoy Instead of this combination you could just reverse gd047's solution: `apply(A,2,function(S) x[setdiff(1:N,S)])` where `N<-length(x)`.

Marek 2010-03-25 08:14:06

@marek: I tried it on the original post's second example, but it doesn't work (assuming that I got the question right). The elements of S here are the values chosen, while 1:N are potential indices. Would it make sense to take their set difference?

Jyotirmoy Bhattacharya 2010-03-25 10:08:39

I was thinking about `N<-length(x); m<-k; (A<-combn(N,m)); apply(A,2,function(S) x[setdiff(1:N,S)])`, but the disadvantage of this is that we don't get `A` with the elements of `x`.

Marek 2010-03-25 11:10:40
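One way the index-based idea from the last comment could be completed is to build both matrices from the same index combinations, so the chosen values are recovered as well. This is only a sketch; the name `idx` is hypothetical and not from the thread.

```r
x <- c(0, 1, 0, 2, 0, 1); k <- 4
N <- length(x)
idx <- combn(N, k)                                  # combinations of positions, not values
A <- apply(idx, 2, function(S) x[S])                # chosen elements of x
B <- apply(idx, 2, function(S) x[setdiff(1:N, S)])  # excluded elements of x
dim(A)                                              # 4 15
dim(B)                                              # 2 15
```

Because positions are always unique, `setdiff` is safe here even though `x` itself has repeated values.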
