PostgreSQL: Select unique rows where distinct values are in list

Question

Say that I have the following table:

with data as (
select 'John' "name", 'A' "tag", 10 "count"
union all select 'John', 'B', 20
union all select 'Jane', 'A', 30
union all select 'Judith', 'A', 40
union all select 'Judith', 'B', 50
union all select 'Judith', 'C', 60
union all select 'Jason', 'D', 70
)

I know there are a number of distinct tag values, namely (A, B, C, D).

I would like to select the unique names that only have the tag A

I can get close by doing

-- wrong!
select
  distinct("name")
from data
group by "name"
having count(distinct tag) = 1

however, this will include unique names that only have 1 distinct tag, regardless of what tag is it.

I am using PostgreSQL, although having more generic solutions would be great.

You are not filtering the TAG value anywhere so why would it restrict to A? — Stu
– Stu, Commented Jun 27, 2022 at 11:49
because I do not want to include names that have tags other than A — user554319
– user554319, Commented Jun 27, 2022 at 11:50
I would like to select the unique names that only have the tag A So what's wrong with select distinct("name") from data where "tag" = 'A' — Stu
– Stu, Commented Jun 27, 2022 at 11:52
this gives 3 names: Jane, John, Judith. But John also has tag B, and Judith has B and C. I want to exclude those. — user554319
– user554319, Commented Jun 27, 2022 at 11:52

Tomáš Záluský · Accepted Answer · 2022-06-27 12:12:48Z

1

You're almost there - you already have groups with one tag, now just test if it is the tag you want:

select
  distinct("name")
from data
group by "name"
having count(distinct tag) = 1 and max(tag)='A'

(Note max could be min as well - SQL just doesn't have single() aggregate function but that's different story.)

edited Jun 27, 2022 at 12:12

answered Jun 27, 2022 at 12:07

Tomáš Záluský

12.3k4 gold badges44 silver badges74 bronze badges

Sign up to request clarification or add additional context in comments.

1 Comment

user554319 Over a year ago

Nice trick with max @Tomáš, thanks!

Stu · Accepted Answer · 2022-06-27 11:59:57Z

1

You can use not exists here:

select distinct "name" 
from data d 
where "tag" = 'A'
and not exists (
  select * from data d2 
  where d2."name" = d."name" and d2."tag" != d."tag"
);

answered Jun 27, 2022 at 11:59

Stu

32.7k6 gold badges17 silver badges34 bronze badges

2 Comments

user554319 Over a year ago

thanks! how would you compare your not exists approach with my not in approach?

Stu Over a year ago

Not exists is almost always preferable - also note it's redundant to use distinct within an in(...)

user554319 · Accepted Answer · 2022-06-27 11:57:38Z

0

This is one possible way of solving it:

select
  distinct("name")
from data
where "name" not in (
  -- create list of names we want to exclude
  select distinct name from data where "tag" != 'A'
)

But I don't know if it's the best or most efficient one.

answered Jun 27, 2022 at 11:57

user554319

Collectives™ on Stack Overflow

PostgreSQL: Select unique rows where distinct values are in list

3 Answers 3

1 Comment

2 Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

1 Comment

2 Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related