ansaurus

Question

Answer 1

+2 A:

Something like:

vector<int> inint;
vector<int> inbase;
vector<int> outbase;
while (fgets(buf, fh)) {
   char *tok = strtok(buf, ", ");
   inint.push_back(atoi(tok));
   tok = strtok(NULL, ", ");
   inbase.push_back(atoi(tok));
   tok = strtok(NULL, ", ");
   outbase.push_back(atoi(tok));
}

Except with error checking.

MattSmith 2008-11-06 02:03:55

I would avoid such a "C-ish" solution for, well, aesthetics...but more importantly in this case because strtok has some serious thread-safe issues. Correct code though!

MattyT 2008-11-06 05:35:23

Answer 2

+1 A:

std::getline allows you to read a line of text, and you can use a string stream to parse the individual line:

string buf;
getline(cin, buf); 
stringstream par(buf);

char buf2[512];
par.getline(buf2, 512, ','); /* Reads until the first token. */

Once you get the line of text into the string, you can actually use any parsing function you want, even sscanf(buf.c_str(), "%d,%d'%d", &i1, &i2, &i3), by using atoi on the substring with the integer, or through some other method.

You can also ignore unwanted characters in the input stream, if you know they're there:

if (cin.peek() == ',')
    cin.ignore(1, ',');
cin >> nextInt;

Raymond Martineau 2008-11-06 02:39:53

Answer 3

+1 A:

If you don't mind using the Boost libraries...

#include <string>
#include <vector>
#include <boost/lexical_cast.hpp>
#include <boost/regex.hpp>

std::vector<int> ParseFile(std::istream& in) {
    const boost::regex cItemPattern(" *([0-9]+),?");
    std::vector<int> return_value;

    std::string line;
    while (std::getline(in, line)) {
        string::const_iterator b=line.begin(), e=line.end();
        boost::smatch match;
        while (b!=e && boost::regex_search(b, e, match, cItemPattern)) {
            return_value.push_back(boost::lexical_cast<int>(match[1].str()));
            b=match[0].second;
        };
    };

    return return_value;
}

That pulls the lines from the stream, then uses the Boost::RegEx library (with a capture group) to extract each number from the lines. It automatically ignores anything that isn't a valid number, though that can be changed if you wish.

It's still about twenty lines with the #includes, but you can use it to extract essentially anything from the file's lines. This is a trivial example, I'm using pretty much identical code to extract tags and optional values from a database field, the only major difference is the regular expression.

EDIT: Oops, you wanted three separate vectors. Try this slight modification instead:

const boost::regex cItemPattern(" *([0-9]+), *([0-9]+), *([0-9]+)");
std::vector<int> vector1, vector2, vector3;

std::string line;
while (std::getline(in, line)) {
    string::const_iterator b=line.begin(), e=line.end();
    boost::smatch match;
    while (b!=e && boost::regex_search(b, e, match, cItemPattern)) {
        vector1.push_back(boost::lexical_cast<int>(match[1].str()));
        vector2.push_back(boost::lexical_cast<int>(match[2].str()));
        vector3.push_back(boost::lexical_cast<int>(match[3].str()));
        b=match[0].second;
    };
};

Head Geek 2008-11-06 03:31:18

Answer 4

+2 A:

There's no real need to use boost in this example as streams will do the trick nicely:

int main(int argc, char* argv[])
{
    ifstream file(argv[1]);

    const unsigned maxIgnore = 10;
    const int delim = ',';
    int x,y,z;

    vector<int> vecx, vecy, vecz;

    while (file)
    {
        file >> x;
        file.ignore(maxIgnore, delim);
        file >> y;
        file.ignore(maxIgnore, delim);
        file >> z;

        vecx.push_back(x);
        vecy.push_back(y);
        vecz.push_back(z);
    }
}

Though if I were going to use boost I'd prefer the simplicity of tokenizer to regex... :)

MattyT 2008-11-06 05:30:44

Answer 5

+1 A:

There is really nothing wrong with fscanf, which is probably the fastest solution in this case. And it's as short and readable as the python code:

FILE *fp = fopen("file.dat", "r");
int x, y, z;
std::vector<int> vx, vy, vz;

while (fscanf(fp, "%d, %d, %d", &x, &y, &z) == 3) {
  vx.push_back(x);
  vy.push_back(y);
  vz.push_back(z);
}
fclose(fp);

ididak 2008-11-06 08:26:16

Answer 6

A:

why not the same code as in python :) ?

std::ifstream file("input_hard.dat");
std::vector<int> inint, inbase, outbase;

while (file.good()){
 int val1, val2, val3;
 char delim;
 file >> val1 >> delim >> val2 >> delim >> val3;

 inint.push_back(val1);
 inbase.push_back(val2);
 outbase.push_back(val3);
}

da_m_n 2008-11-06 09:02:50

Answer 7

A:

If you want to be able to scale to harder input formats, you should consider spirit, boost parser combinator library.

This page has an example which almost do what you need (with reals and one vector though)

David Pierre 2008-11-06 09:29:12

ansaurus

tags:

views:

answers:

c++ file io & splitting by separator

related questions