ansaurus

Question

Trying to return a specified number of characters from a gene sequence in R

Answer 1

A:

Could you first just make a temporary string that's a trimmed from the long one?

lod3n 2009-09-28 23:04:57

How do I trim it?....sorry for naive question (I am a new user)

C_BioInfo 2009-09-28 23:06:26

I think you would use substr

lod3n 2009-09-29 15:31:46

Answer 2

+8 A:

Try

substr("cgtcgctgtttgtcaa[...]", 5, 200)

See substr().

Artelius 2009-09-28 23:15:16

THanks a lot!Chris

C_BioInfo 2009-09-30 13:22:08

That link for substr documentation appears to be dead. How about this one: http://stat.ethz.ch/R-manual/R-patched/library/base/html/substr.html

Argalatyr 2010-03-28 15:29:06

Answer 3

+4 A:

Use the substring function:

> tmp.string <- paste(LETTERS, collapse="")
> tmp.string <- substr(tmp.string, 4, 10)
> tmp.string
[1] "DEFGHIJ"

Shane 2009-09-28 23:16:19

Answer 4

+2 A:

See also the Bioconductor package Biostrings that is a good choice if you need to handle large biological sequences or set of sequences.

#source("http://bioconductor.org/biocLite.R");biocLite("Biostrings") 
library(Biostrings)
s <-paste(rep("gtcgctgtttgtcaac",20),collapse="")
d <- DNAString(s)
d[5:200]
as.character(d[5:200])

Paolo 2009-09-30 12:25:20

ansaurus

tags:

views:

answers:

Trying to return a specified number of characters from a gene sequence in R

related questions