views: 10587, answers: 5
I work with huge data files, and sometimes I only need to know the number of lines in them. Usually I open them up and read them line by line until I reach the end of the file.

I was wondering if there is a smarter way to do that.

+2  A: 

On Unix-based systems, use the wc command on the command-line.

Peter Hilton
wc -l for the line count.
IainMH
@IainMH, your second suggestion just counts the number of entries in the current directory. Not what was intended? (or asked for by the OP)
Paul
@IainMH: that's what wc does anyway (reading the file, counting line-ending).
PhiLho
@PhiLho You'd have to use the -l switch to count the lines. (Don't you? - it's been a while)
IainMH
@Paul - you are of course 100% right. My only defence is that I posted that before my coffee. I'm as sharp as a button now. :D
IainMH
You can get wc.exe for Win32 systems: see http://unxutils.sourceforge.net/
Jason S
+1  A: 

The only way to know how many lines there are in a file is to count them. You can of course derive a metric from your data giving you the average length of one line, then get the file size and divide it by that average length, but that won't be accurate.

Esko
Interesting downvote. No matter what command-line tool you're using, they all DO THE SAME THING anyway, only internally. There's no magic way to figure out the number of lines; they have to be counted by hand. Sure, it could be saved as metadata, but that's a whole other story...
Esko
+1 to make you feel better.
Richie_W
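Esko's estimation idea can be sketched as below. This is an illustrative assumption, not code from the answer: it samples the first `sampleLines` lines to get an average line length, then divides the file size by that average. As Esko says, the result is only an approximation.

```java
import java.io.BufferedReader;
import java.io.File;
import java.io.FileReader;
import java.io.IOException;

public class LineEstimator {
    // Roughly estimate the line count: average the byte length of the
    // first sampleLines lines, then divide the file size by that average.
    public static long estimateLines(String filename, int sampleLines) throws IOException {
        long sampledBytes = 0;
        int sampled = 0;
        BufferedReader reader = new BufferedReader(new FileReader(filename));
        try {
            String line;
            while (sampled < sampleLines && (line = reader.readLine()) != null) {
                sampledBytes += line.length() + 1; // +1 for the newline terminator
                sampled++;
            }
        } finally {
            reader.close();
        }
        if (sampled == 0) return 0;
        double avgLineLength = (double) sampledBytes / sampled;
        return Math.round(new File(filename).length() / avgLineLength);
    }
}
```

The estimate is exact only when lines have uniform length; the more line lengths vary, the further off it will be.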
+12  A: 

This is the fastest version I have found so far, about 6 times faster than readLines. On a 150MB log file this takes 0.35 seconds, versus 2.40 seconds when using readLines(). Just for fun, Linux's wc -l command takes 0.15 seconds.

public int count(String filename) throws IOException {
    InputStream is = new BufferedInputStream(new FileInputStream(filename));
    try {
        byte[] c = new byte[1024];
        int count = 0;
        int readChars = 0;
        while ((readChars = is.read(c)) != -1) {
            for (int i = 0; i < readChars; ++i) {
                if (c[i] == '\n')
                    ++count;
            }
        }
        return count;
    } finally {
        is.close();
    }
}
martinus
You were right, David. I thought the JVM would be good enough for this... I have updated the code; this one is faster.
martinus
BufferedInputStream should be doing the buffering for you, so I don't see how using an intermediate byte[] array will make it any faster. You're unlikely to do much better than using readLine() repeatedly anyway (since that will be optimized towards by the API).
wds
I've benchmarked it with and without the BufferedInputStream, and it is faster when using it.
martinus
It's neat, thank you so much.
Mark
You're going to close that InputStream when you're done with it, aren't you?
bendin
If buffering helped, it would be because BufferedInputStream buffers 8K by default. Increase your byte[] to this size or larger and you can drop the BufferedInputStream. e.g. try 1024*1024 bytes.
Peter Lawrey
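Peter Lawrey's suggestion might look like the sketch below: the same counting loop as martinus's answer, but reading into a 1 MB byte[] with no BufferedInputStream. The 1 MB size is his example figure, not a benchmarked optimum.

```java
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;

public class BigBufferCounter {
    // Count '\n' bytes using a large read buffer; each read() call
    // goes straight to the file stream, with no intermediate buffering.
    public static int count(String filename) throws IOException {
        InputStream is = new FileInputStream(filename);
        try {
            byte[] c = new byte[1024 * 1024];
            int count = 0;
            int readChars;
            while ((readChars = is.read(c)) != -1) {
                for (int i = 0; i < readChars; ++i) {
                    if (c[i] == '\n')
                        ++count;
                }
            }
            return count;
        } finally {
            is.close();
        }
    }
}
```

Like the original, this counts newline characters, so a file without a trailing newline reports one less than the number of lines (the issue raised in the last answer below).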
A: 

If you don't have any index structures, you won't get around reading the complete file. But you can optimize it by avoiding reading it line by line and instead using a regex to match all line terminators.

David Schmitt
Sounds like a neat idea. Has anyone tried it and got a regexp for it?
willcodejavaforfood
I doubt it is such a good idea: it will need to read the whole file at once (martinus avoids this), and regexes are overkill (and slower) for such usage (a simple search for fixed chars).
PhiLho
@will: what about /\n/? @PhiLo: good point.
David Schmitt
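The regex idea could be sketched as follows. Note this is an assumption about what was meant, not code from the answer, and it reads the whole file into memory first, which is exactly PhiLho's objection. The pattern covers all three common terminators rather than David's bare /\n/.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class RegexCounter {
    // Count lines by matching every line terminator in the file contents.
    public static int count(String filename) throws IOException {
        String contents = new String(Files.readAllBytes(Paths.get(filename)));
        Matcher m = Pattern.compile("\r\n|\r|\n").matcher(contents);
        int count = 0;
        while (m.find()) {
            ++count;
        }
        return count;
    }
}
```

Since it counts terminators, it has the same trailing-newline caveat as the byte-counting approach.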
A: 

The answer with the count() method above gave me miscounts if a file didn't have a newline at the end: it failed to count the last line in the file.

This method works better for me:

public int countLines(String filename) throws IOException {
    LineNumberReader reader = new LineNumberReader(new FileReader(filename));
    try {
        while (reader.readLine() != null) {}
        return reader.getLineNumber();
    } finally {
        reader.close();
    }
}
Dave Bergert