How to get the number of nonzero elements row-wise for a numpy array?

Question

I wanna find the indices of the rows that all have entries smaller than 1e-6 or where the number of nonzero values is less than 3. Something like this would be nice:

import numpy as np

prob = np.random.rand(15, 500)
all_zero = np.where(prob.max(1) < 1e-6 | np.nonzero(prob, axis=1) < 3)

So what is your question? You just need to sum the nonzero, i.e. np.nonzero(prob).sum(axis=1). — a_guest
– a_guest, Commented Sep 11, 2019 at 15:45
Oops you’re right. I didn’t think of adding up the number of nonzeros. — Rani
– Rani, Commented Sep 11, 2019 at 16:52

Massifox · Accepted Answer · 2019-09-11 17:29:55Z

3

I tried to measure the execution times of the solutions proposed so far:
Benchmark data:

prob = np.random.rand(10000, 500)

@Massifox' solution with list:

%%timeit
[i for i, val in enumerate(prob>1e-6)if val.sum() < 3]
# 39.5 ms ± 1.4 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

@Massifox' solution only numpy:

%%timeit
np.where(np.sum(prob>1e-6, axis=1) < 3)
# 9.92 ms ± 199 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

@a_guest's solution:

%%timeit
all_zero = np.logical_or(prob.max(axis=1) < 1e-6, np.sum(prob != 0, axis=1) < 3)
np.where(all_zero)
# 13.9 ms ± 150 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

The most efficient solution seems to be the second one.

answered Sep 11, 2019 at 17:29

Massifox

4,5071 gold badge13 silver badges33 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

a_guest · Accepted Answer · 2019-09-11 15:50:58Z

1

You can use np.logical_or and np.sum the non-zero values to check which row has fewer than 3 non-zero elements:

all_zero = np.logical_or(prob.max(axis=1) < 1e-6, np.sum(prob != 0, axis=1) < 3)

answered Sep 11, 2019 at 15:50

a_guest

36.7k15 gold badges75 silver badges137 bronze badges

Comments

Massifox · Accepted Answer · 2019-09-11 17:10:24Z

1

This code returns the list of index of rows with less than 3 values other than 0 (less than 1e-6):

[i for i, val in enumerate(prob>1e-6) if val.sum()<3]

or using only numpy functions:

np.where(np.sum(prob>1e-6, axis=1)<3)

edited Sep 11, 2019 at 17:10

answered Sep 11, 2019 at 16:23

Massifox

4,5071 gold badge13 silver badges33 bronze badges

Collectives™ on Stack Overflow

How to get the number of nonzero elements row-wise for a numpy array?

3 Answers 3

@Massifox' solution with list:

@Massifox' solution only numpy:

@a_guest's solution:

Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

@Massifox' solution with list:

@Massifox' solution only numpy:

@a_guest's solution:

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related