ansaurus

Question

I need to generate 50 Millions Rows csv file with random data: how to optimize this program ?

Answer 1

+1 A:

Use open with /direct and /lines refinement to write directly to file without buffering the content:

file: open/direct/lines/write %myfile.txt
loop 1000 [
  t: random "abcdefghi"
  append file t
]
Close file

This will write 1000 random lines without buffering. You can also prepare a block of lines (lets say 10000 rows) then write it directly to file, this will be faster than writing line-by-line.

file: open/direct/lines/write %myfile.txt
loop 100 [
  b: copy []
  loop 1000 [append b random "abcdef"]
  append file b
]
close file

this will be much faster, 100000 rows less than a second. Hope this will help.

Note that, you can change the number 100 and 1000 according to your needs an memory of your pc, and use b: make block! 1000 instead of b: copy [], it will be faster.

endo64 2010-05-05 07:10:12

Thanks a lot, I have cheated by launching several rebol instances :) I will now try to incorporate your suggestions.

Rebol Tutorial 2010-05-05 18:01:35

ansaurus

tags:

views:

answers:

I need to generate 50 Millions Rows csv file with random data: how to optimize this program ?

related questions