ansaurus

Question

Answer 1

A:

You can't directly tell matplotlib to make a histogram from an input file - you'll need to open the file yourself and get the data from it. How you'd do that depends on the format of the file - if it's just a file with a number on each line, you can just go through each line, strip() spaces and newlines, and use float() to convert it to a number.

Daniel G 2010-04-07 06:17:15

@Daniel G: Did that, but it gets rather messy !

Arkapravo 2010-04-07 07:36:13

@Arkapravo - True, and I hadn't realized about doug's solution - use that :)

Daniel G 2010-04-07 07:45:58

Answer 2

+2 A:

i would recommend using 'loadtxt' which is actually in the NumPy library. There are related functions in Matplotlib (csv2rec) but Matplotlib is actually standardizing on loadtxt.

Here's how it works:

from matplotlib import pyplot as PLT

with open('name_of_your_file.csv') as f:
  v = NP.loadtxt(f, delimiter=",", dtype='float', comments="#", skiprows=1, usecols=None)

'v', the object returned from 'loadtxt', is an n x m NumPy array.

'loadtxt' accepts either a file or a file descriptor. The instance above has most of the method signature. 'skiprows' is an integer that specifies the number of rows counting from the top that you want to skip; it's common to set it to "1" to skip the header row; 'usecols' begins at '0' and is a list reciting the columns you want to include ('None' is the default, and means 'include all'). The other parameters work as expected.

To plot a histogram from this data:

from matplotlib import pyplot as PLT

v_hist = NP.ravel(v)   # 'flatten' v
fig = PLT.figure()
ax1 = fig.add_subplot(111)

n, bins, patches = ax1.hist(v_hist, bins=50, normed=1, facecolor='green')
PLT.show()

doug 2010-04-07 07:00:08

@doug : Many Thanks

Arkapravo 2010-04-07 07:35:38

ansaurus

tags:

views:

answers:

Histogram in Matplotlib with input file

related questions