Hi,
I find myself having to do this very often -- compare specific columns from 2 different files. The columns and formats are the same, but the columns that need comparison contain floating-point/exponential data, e.g. 0.0058104642437413175, -3.459017050577087E-4, etc.
I'm currently using the R code below:
library(sqldf)
# Read both pipe-delimited files (no header row)
test <- read.csv("C:/VBG_TEST/testing/FILE_2010-06-16.txt", header = FALSE, sep = "|", quote = "\"", dec = ".")
prod <- read.csv("C:/VBG_PROD/testing/FILE_2010-06-16.txt", header = FALSE, sep = "|", quote = "\"", dec = ".")
# Sum the columns of interest in each file and compare the totals by eye
sqldf("select sum(V10), sum(V15) from test")
sqldf("select sum(V10), sum(V15) from prod")
I read in the files, sum the specific columns -- V10 and V15 -- and then observe the values. This way I can ignore very small differences in the floating-point data per row.
However, going forward, I would like to set a tolerance percentage, i.e. flag a row if abs( (prod.V10 - test.V10) / prod.V10 ) > 0.01%, and only print the row numbers that exceed this tolerance limit.
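Roughly, what I have in mind is something like this (a sketch assuming the two files have their rows in the same order; 0.0001 is 0.01% written as a fraction):

# Relative difference per row for column V10 (rows assumed to line up)
rel_diff <- abs((prod$V10 - test$V10) / prod$V10)
# Row numbers where the relative difference exceeds 0.01%
bad_rows <- which(rel_diff > 0.0001)
bad_rows
# Side-by-side view of the offending values
data.frame(row = bad_rows, prod = prod$V10[bad_rows], test = test$V10[bad_rows])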
Also, if the data is not in the same order, how can I do the comparison by specifying columns that act as a composite primary key?
For example, if I did this in Sybase, I'd write something like:
select A.*, B.*
from tableA A, tableB B
where abs( (A.Col15 - B.Col15) / A.Col15 ) > 0.0001  -- i.e. 0.01%
and A.Col1 = B.Col1
and A.Col4 = B.Col4
and A.Col6 = B.Col6
If I try doing the same thing using sqldf in R, it does NOT work, as the files contain 500K+ rows of data.
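Is something along these lines, using merge() on the key columns, the right way to go? A rough sketch of what I mean, where V1, V4, V6 stand in for my composite key columns and V15 for the value column:

# Join the two data frames on the composite key columns
both <- merge(prod, test, by = c("V1", "V4", "V6"), suffixes = c(".prod", ".test"))
# Keep only rows where the relative difference in V15 exceeds 0.01%
tol <- 0.0001
mismatches <- both[abs((both$V15.prod - both$V15.test) / both$V15.prod) > tol, ]
mismatches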
Can anyone point me to how I can do the above in R?
Many thanks, Chapax.