r

R: Referencing a matched string in sub

How can I reference a matched string (or captured group) in R so as to append in a sub(). For instance, I would do it like this in sed to append a 'd' to the match echo "66xx" | sed 's/[0-9][0-9]/&d/g' 66dxx ...

jitter if multiple outliers in ggplot2 boxplot

I am trying to find a suitable display to illustrate various properties within and across school classes. For each class there is only 15-30 data points (pupils). Right now i am leaning towards a whisker-less boxplot, showing only 1.,2. and 3. quartile + datapoints more then e.g. 1 population SD +/- the sample median. This I can do. H...

Sequence reduction in R

Assume you have a vector like so: v <- c(1,1,1,2,2,2,2,1,1,3,3,3,3) How can it be best reduced to a data.frame like this? v.df <- data.frame(value=c(1,2,1,3),repetitions=c(3,4,2,4)) In a procedural language I might just iterate through a loop and build the data.frame as I go, but with a large dataset in R such an approach is ineffi...

R: how to construct a mathematical expression from a character object?

In R, I'm wanting to create a graph with x axis label expression(varname), where varname is a character object. For example: varname <- "beta[1]" hist(rnorm(20),xlab=expression(varname)) But that gives me a graph with xlab="varname" rather than xlab=expression(beta[1]). How to I convince expression() to evaluate the variable? ...

Plotting 3-tuple data points in a surface / contour plot using matplotlib

I have some surface data that is generated by an external program as XYZ values. I want to create the following graphs, using matplotlib: Surface plot Contour plot Contour plot overlayed with a surface plot I have looked at several examples for plotting surfaces and contours in matplotlib - however, the Z values seems to be a functio...

[R] if not conditions in R?

Hi there, is there anything like "if not" conditions in R? easy Example (not working): fun <- function(x) { if (!x > 0) {print ("not bigger than zero")} } fun(5) Best wishes Philipp ...

Non-graphical linearity estimation

In my previous post, I was looking for correlation ratio ( or 2) routines in R. I was surprised by the fact that no one uses for linearity checking in the GLM procedures. Let's start form a simple example: how do you check linearity of bivariate correlation? Solely with scatterplot? There are several ways of doing this, one way is to ...

R: Why does read.table stop reading a file?

I have a file, called genes.txt, which I'd like to become a data.frame. It's got a lot of lines, each line has three, tab delimited fields: mike$ wc -l genes.txt 42476 genes.txt I'd like to read this file into a data.frame in R. I use the command read.table, like this: genes = read.table( genes_file, sep="\t", na.str...

In R draw two lines, with slopes double and half the value of the best fit line

I have data with a best fit line draw. I need to draw two other lines. One needs to have double the slope and the other need to have half the slope. Later I will use the region to differentially color points outside it as per: http://stackoverflow.com/questions/2687212/conditionally-colour-data-points-outside-of-confidence-bands-in-r ...

tool to auto-format R code

Is there any tool (editor, script, whatever...) available that can automatically reformat R code? It does not need to be customizable but it must be able to recognize statements separated by either semicolons or newlines since this code has both. If it can put all statements on a separate line, consistently indent code blocks and consist...

R ggplot2: possible to customize the continuity of a time scale?

Is it possible to have a datetime scale not consider weekends as part of the time continuum? For instance, if I am plotting stock prices over a 2 week period with a line geometry, I do not want to plot a 2 day period of flattness during the weekend. I would like friday to connect with Monday. ...

Preventing what was/were previously copied on to my clipboard from appearing on R Console when using R

Hi all, I am having some problems with my clipboard contents when using R. When I run my scripts/commands in tinn-r, very often I would get something that I had ran earlier pasted onto my R Console instead of the command that I have just selected. To get over this, I would go to my clipboard and delete its content(text/syntax). Howeve...

How do I increase the number of evaluation points in geom_smooth for ggplot2 in R

I'm creating a plot and adding a basic loess smooth line to it. qplot(Age.GTS2004., X.d18O,data=deepsea, geom=c('point')) + geom_smooth(method="loess",se=T,span=0.01, alpha=.5, fill='light blue',color='navy') The problem is that the line is coming out really choppy. I need more evaluation point for the curve in certain areas. Is the...

Help Window using StatET in Eclipse

Hi All, I have been getting into R recently and have been using the StatET plugin for Eclipse. When I search for help, it prints the 'contents' of the help to the console. I am hoping to have the help open up in a new window, similar to what happens when in R itself. I am hoping this is just a setting. Any help will be greatly appre...

subset in geom_point SOMETIMES returns full dataset, instead of none.

I ask the following in the hope that someone might come up with a generic description about the problem.Basically I have no idea whats wrong with my code. When I run the code below, plot nr. 8 turns out wrong. Specifically the subset in geom_point does not work the way it should. (update: With plot nr. 8 the whole dataset is plottet, in...

Statistics Question: Kernel Smoothing in R

I have data of this form: x y 1 0.19 2 0.26 3 0.40 4 0.58 5 0.59 6 1.24 7 0.68 8 0.60 9 1.12 10 0.80 11 1.20 12 1.17 13 0.39 I'm currently plotting a kernel-smoothed density estimate of the x versus y using this code: smoothed = ksmooth( d$resi, d$score, bandwidth = 6 ) plot(...

using R.zoo to plot multiple series with error bars

I have data that looks like this: > head(data) groupname ob_time dist.mean dist.sd dur.mean dur.sd ct.mean ct.sd 1 rowA 0.3 61.67500 39.76515 43.67500 26.35027 8.666667 11.29226 2 rowA 60.0 45.49167 38.30301 37.58333 27.98207 8.750000 12.46176 3 rowA 120.0 50.22500 ...

ESS workflow for R project/package development

Can anyone share his experience on workflow for R peject development under ESS? I tried several times to learn emacs but I have not get it yet. I can understand ESS as an editor, but is there a project view in ESS? what's the efficient ways to set up/view R project directory, coding, and testing, and how's ESS has an edge to facilitate t...

Calculating all distances between one point and a group of points efficiently in R

Hi, First of all, I am new to R (I started yesterday). I have two groups of points, data and centers, the first one of size n and the second of size K (for instance, n = 3823 and K = 10), and for each i in the first set, I need to find j in the second with the minimum distance. My idea is simple: for each i, let dist[j] be the distanc...

mystified by qr.Q(): what is an orthonormal matrix in "compact" form?

R has a qr() function, which performs QR decomposition using either LINPACK or LAPACK (in my experience, the latter is 5% faster). The main object returned is a matrix "qr" that contains in the upper triangular matrix R (i.e. R=qr[upper.tri(qr)]). So far so good. The lower triangular part of qr contains Q "in compact form". One can extra...