ansaurus

Question

unexpected behavior when extracting factor levels

Answer 1

+5 A:

probably walk.df is a subset of the factor variable with 3 levels. say,

a<-factor(1:3)
b<-a[1:2]

then b has 3 levels.

A easy way to drop extra level is:

b<-a[1:2, drop=T]

or if you cannot access the original variable,

b<-factor(b)

kohske 2010-05-03 08:56:31

You are correct, it is a subset. I've been mincing the object so hard that I forgot I'm sampling from the raw object - which has three levels.

Roman Luštrik 2010-05-03 09:26:26

Answer 2

A:

You can assign several factor levels to a factor that contains two levels:

 > set.seed(1234)
 > x <- round(runif(10, 1, 2))
 > x
  [1] 1 2 2 2 2 2 1 1 2 2
 > y <- factor(x)
 > levels(y)
 [1] "1" "2"
 > levels(y) <- c("1", "2", "3")
 > y
  [1] 1 2 2 2 2 2 1 1 2 2
 Levels: 1 2 3

or even no levels at all:

 > p <- NA
 > q <- factor(p)
 > levels(q)
 character(0)
 > levels(q) <- c("1", "2", "3")
 > q
 [1] <NA>
 Levels: 1 2 3

aL3xa 2010-05-03 19:50:21

What I really wanted was extract the levels that appear in the subset. I have solved this with list.of.walkers <- sort(unique(walk.df$label)).

Roman Luštrik 2010-05-05 06:59:41

ansaurus

tags:

views:

answers:

unexpected behavior when extracting factor levels

related questions