ansaurus

Question

How to access elements in a complex list?

Answer 1

+5 A:

R generally wants to handle these things as data.frames, so I think your best bet is to turn your list into one (or even make a data.frame instead of a list to begin with, unless you need it to be in list form).

x <- do.call(rbind,tmp)
dat <- data.frame(x)
dat$count <- as.numeric(dat$count)

> dat
    count status     menu   dbname
1 1057230     Ok   PubMed   pubmed
2  305215     Ok      PMC      pmc
3       1     Ok Journals journals

and then to get your answer(s) you can use normal data.frame subsetting operations:

> dat$dbname[dat$count<10]
$resultitem
[1] "journals"

Fojtasek 2010-05-20 15:15:04

This data.frame isn't a proper data.frame: each column is a list. It will be OK if you do `x <- do.call(rbind, lapply(tmp, unlist))` and then `dat <- data.frame(x, stringsAsFactors = FALSE, row.names = NULL)`.

Marek 2010-05-20 15:31:10
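
Marek's fix can be tried end to end with a small sketch. The `tmp` list here is a hypothetical stand-in reconstructed from the printed output in the answer, with the counts assumed to arrive as strings; the asker's real data may differ.

```r
# Hypothetical stand-in for the asker's `tmp` (assumed shape, not the real data).
tmp <- list(
  resultitem = list(count = "1057230", status = "Ok", menu = "PubMed",   dbname = "pubmed"),
  resultitem = list(count = "305215",  status = "Ok", menu = "PMC",      dbname = "pmc"),
  resultitem = list(count = "1",       status = "Ok", menu = "Journals", dbname = "journals")
)

# unlist() turns each inner list into a named character vector,
# so rbind() builds a character matrix rather than a matrix of lists.
x <- do.call(rbind, lapply(tmp, unlist))

# row.names = NULL discards the repeated "resultitem" row names.
dat <- data.frame(x, stringsAsFactors = FALSE, row.names = NULL)
dat$count <- as.numeric(dat$count)

# Columns are now plain vectors, so subsetting returns a character vector.
small <- dat$dbname[dat$count < 10]
print(small)
```

With this input the subset should come back as a plain character vector rather than a list element.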

I noticed the issue with the row names and the columns being lists, but wasn't immediately sure what to do about them. Nice fix.

Fojtasek 2010-05-20 15:53:52

This works perfectly for my example, thanks. But the problem with data frames is that they don't support columns of different lengths, and I have some other lists where this will be the case. So I'm bound to lists.

Martin 2010-05-20 16:45:30

But if they are very similar to what you've shown and you want to do similar actions, then you're still much better off organizing them as data frames, with NAs to equalize the columns for missing data. If you truly have ragged lists that can't be a data frame, then the kind of question you asked isn't really sensible: you can't ask about the count being less than 10 when there is no count field. Therefore, for any data you need to ask this kind of question of, you can use data frames and make your life much easier.

John 2010-05-20 18:17:37
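
One way to do the NA padding John describes, as a rough sketch: the `ragged` list and its field names are hypothetical, and every field is assumed to hold a single string.

```r
# Hypothetical ragged input: the second record has no `count` field.
ragged <- list(
  list(count = "1057230", dbname = "pubmed"),
  list(dbname = "pmc"),
  list(count = "1", dbname = "journals", status = "Ok")
)

# Union of all field names seen across the records.
fields <- unique(unlist(lapply(ragged, names)))

# Reindex each record by the full field set; absent fields become NA.
rows <- lapply(ragged, function(rec) {
  vals <- unlist(rec)[fields]
  names(vals) <- fields
  vals
})

dat <- data.frame(do.call(rbind, rows), stringsAsFactors = FALSE)
dat$count <- as.numeric(dat$count)

# which() drops the rows whose count is NA.
small <- dat$dbname[which(dat$count < 10)]
print(small)
```

Wrapping the comparison in `which()` is a design choice here: a bare logical index would carry the NA through into the result.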

Answer 2

+1 A:

If you're absolutely insistent that you must do this in a list, the following will work for the present case.

x <- tmp[sapply(tmp, function(x){x$count>10})]
str(x)
(the list items you wanted)

More generally, if you would like to actually use ragged lists in this way, you could use the same code but check for the presence of the item first:

testForCount <- function(x) {if ('count' %in% names(x)) x$count > 10 else FALSE}
tmp[sapply(tmp, testForCount)]

This will work for your cases where the lists are not the same length as well as the present case. (I still think you should be using data frames for both speed and sensible representation of the data).

John 2010-05-20 18:47:23
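
A self-contained variant of the presence check, as a sketch only: the `tmp` list below is hypothetical, and the counts are assumed to be strings, which is why `as.numeric()` is added before comparing (comparing a string with a number in R falls back to string comparison).

```r
# Hypothetical ragged list: the second record lacks `count`.
tmp <- list(
  list(count = "1057230", dbname = "pubmed"),
  list(dbname = "pmc"),
  list(count = "1", dbname = "journals")
)

# TRUE only when `count` exists and is numerically greater than 10.
testForCount <- function(x) {
  "count" %in% names(x) && as.numeric(x$count) > 10
}

# vapply() insists on exactly one logical value per record.
keep <- vapply(tmp, testForCount, logical(1))
big <- tmp[keep]
str(big)
```

`vapply()` is used in place of `sapply()` so that a record returning anything other than a single TRUE or FALSE raises an error instead of silently changing the result's shape.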

The problem with my data is that it comes from a web service, and it is not certain that a column exists. If the web service changes, the R package won't work anymore. Even if only the query changes, the columns might not be the same as before. So I decided to use lists to represent the results, and now I'm looking for ways to handle these lists. You helped me a lot, thank you.

Martin 2010-05-20 19:31:07

I think you're saying that you can't be sure the cell exists in the particular query. That's fine, just NA that cell. If the column doesn't exist at all then that's just a different data frame and you'd have to adjust your code anyway. I'm not trying to make your life difficult. We're all on here trying to make it easier for you. Nothing you've said precludes a data frame. Aside from all of that, given that you're keen on sticking to lists, you should mark mine as the correct answer. :)

John 2010-05-20 23:33:32
