ansaurus

Question

Fast assessment of corrupted Affymetrix CEL files

Answer 1

+3 A:

One simple suggestion:

Can you just use a tryCatch block around your read.table (or whichever read command you're using)? Then just skip a file if you get that error message. You can also compile a list of corrupted files within the catch block (I recommend doing that so that you are tracking corrupted files for future reference when running a big batch process like this). Here's the pseudo code:

corrupted.files <- data.frame()
for(i in 1:nrow(files)) {
    x <- tryCatch(read.table(file=files[i]), error = function(e) 
         if(e=="something") { corrupted.files <- rbind(corrupted.files, files[i]) } 
         else { stop(e) }, 
       finally=print(paste("finished with", files[i], "at", Sys.time())))
    if(nrow(x)) # do something with the uncorrupted data            
}

Shane 2009-11-24 16:38:02

Not bad thanks :-) It works for removing corrupted files. (To read them, I use a specific ReadAffy function from BioConductor, but that's ok).I still need something to check the name of the platform, but that is something for a bioconductor forum maybe.

Thrawn 2009-11-24 17:30:47

ansaurus

tags:

views:

answers:

Fast assessment of corrupted Affymetrix CEL files

related questions