ansaurus

Question

counting non-zero elements within each row and within each column of a 2D numpy array

Answer 1

+2 A:

import numpy as np

a = np.array([[1, 0, 1],
              [2, 3, 4],
              [0, 0, 7]])

columns = (a != 0).sum(0)
rows    = (a != 0).sum(1)

The variable (a != 0) is an array of the same shape as original a and it contains True for all non-zero elements.

The .sum(x) function sums the elements over the axis x. Sum of True/False elements is the number of True elements.

The variables columns and rows contain the number of non-zero (element != 0) values in each column/row of your original array:

columns = np.array([2, 1, 3])
rows    = np.array([2, 3, 1])

EDIT: The whole code could look like this (with a few simplifications in your original code):

ANOVAInputMatrixValuesArray = zeros([len(TestIDs), 9], float)
for j, TestID in enumerate(TestIDs):
    ReadOrWrite = 'Read'
    fileName = inputFileName
    directory = GetCurrentDirectory(arguments that return correct directory)
    # use directory or filename to get the CSV file?
    with open(directory, 'r') as csvfile:
        ANOVAInputMatrixValuesArray[j,:] = loadtxt(csvfile, comments='TestId', delimiter=';', usecols=(2,))[:9]

nonZeroCols = (ANOVAInputMatrixValuesArray != 0).sum(0)
nonZeroRows = (ANOVAInputMatrixValuesArray != 0).sum(1)

EDIT 2:

To get the mean value of all columns/rows, use the following:

colMean = a.sum(0) / (a != 0).sum(0)
rowMean = a.sum(1) / (a != 0).sum(1)

What do you want to do if there are no non-zero elements in a column/row? Then we can adapt the code to solve such a problem.

eumiro 2010-09-26 09:22:38

Answer 2

A:

Thank you. So I followed your advice and wrote:
columns = (ANOVAInputMatrixValuesArray != 0).sum(0)
rows = (ANOVAInputMatrixValuesArray != 0).sum(1)
print 'columns are: ',columns
print 'rows are: ',rows
This resulted in the following output being printed:
columns are: [2 2 2 2 2 2 2 1 1]
rows are: [9 7]

Now how do I access these numbers so that the sum of all numerical values in row 0 can be divided by 9 and the sum of all numerical values of row 1 can be divided by 7, etc.?

I will want to do the same thing with columns, but the row example may express my question more clearly, given all the duplicate values in the list of column counts above.

MedicalMath 2010-09-26 09:45:39

Please write your subsequent questions to the original post, do not create new answers.

eumiro 2010-09-26 09:48:08

I have answered in my original answer as a new EDIT...

eumiro 2010-09-26 09:54:44

ansaurus

tags:

views:

answers:

counting non-zero elements within each row and within each column of a 2D numpy array

related questions