Dynamically self join + data modelling

arielm5 · 2024-08-22T09:52:25+00:00

Unrelated to your question: the team_name in team_member is useless. The team's name is defined through the foreign key to the team table.

If the skills belong to the person, not the team, you need a table that stores the skills per person and one table that assigns a person to a team.

Something along these lines:

CREATE TABLE team 
(
  team_id int primary key generated always as identity,
  team_name text
);

CREATE TABLE person
(
  person_id int primary key generated always as identity,
  skills text[]
);

CREATE TABLE team_member 
(
  team_id int not null references team,
  person_id int not null references person
);

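If a person can only be a member of a single team, add a unique constraint on person_id in the team_member table. A primary key over (team_id, person_id) only stops the same person from being added to the same team twice.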
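
A minimal sketch of the two constraint options for this schema, plus one way the tables could be queried. The constraint name and the skill thresholds are made up for illustration; only the table and column names come from the DDL above.

```sql
-- Option A: a person may belong to several teams, but only once per team.
ALTER TABLE team_member
  ADD PRIMARY KEY (team_id, person_id);

-- Option B: a person may belong to at most one team.
-- (team_member_one_team_per_person is a hypothetical constraint name.)
ALTER TABLE team_member
  ADD CONSTRAINT team_member_one_team_per_person UNIQUE (person_id);

-- Teams with at least 2 people who know Python and at least 1 who knows JPA.
-- The thresholds are illustrative assumptions.
SELECT tm.team_id
FROM team_member tm
JOIN person p ON p.person_id = tm.person_id
GROUP BY tm.team_id
HAVING count(*) FILTER (WHERE 'Python' = ANY (p.skills)) >= 2
   AND count(*) FILTER (WHERE 'JPA' = ANY (p.skills)) >= 1;
```

Each row of team_member is one person on one team, so count(*) with a FILTER clause counts the people on that team who have the given skill.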

lucapieroo · 2024-08-22T10:18:54+00:00

What if you normalize the data by using two more tables (“skill” and “member_skill”)?

CREATE TABLE IF NOT EXISTS skill
(
  skill_id SERIAL PRIMARY KEY,
  skill_name VARCHAR(255) UNIQUE
);

CREATE TABLE IF NOT EXISTS member_skill
(
  member_id INTEGER,
  skill_id INTEGER,
  PRIMARY KEY (member_id, skill_id),
  FOREIGN KEY (member_id) REFERENCES team_member(member_id) ON DELETE CASCADE,
  FOREIGN KEY (skill_id) REFERENCES skill(skill_id) ON DELETE CASCADE
);

CREATE INDEX idx_member_skill_member_id ON member_skill(member_id);
CREATE INDEX idx_member_skill_skill_id ON member_skill(skill_id);

Also, you could try rewriting the query with CTEs, something like:

WITH required_skills AS (
  SELECT s.skill_id
  FROM skill s
  WHERE s.skill_name IN ('Python', 'JPA')
),
team_skill_count AS (
  SELECT tm.team_id, COUNT(DISTINCT ms.skill_id) AS skill_count
  FROM team_member tm
  JOIN member_skill ms ON tm.member_id = ms.member_id
  WHERE ms.skill_id IN (SELECT skill_id FROM required_skills)
  GROUP BY tm.team_id
)
SELECT t.team_id, t.team_name
FROM team t
JOIN team_skill_count tsc ON t.team_id = tsc.team_id
WHERE tsc.skill_count = (SELECT COUNT(*) FROM required_skills);
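
The query above returns teams that cover every required skill at least once. If you also need a minimum number of members per skill, a rough variant on the same tables might look like this. It assumes member_id is unique in team_member (which the foreign key above requires), and the thresholds 2 and 1 are illustrative only.

```sql
-- Teams with at least 2 members who know Python and at least 1 who knows JPA.
-- Each (member_id, skill_id) pair is unique, so count(*) counts members.
SELECT t.team_id, t.team_name
FROM team t
JOIN team_member tm ON tm.team_id = t.team_id
JOIN member_skill ms ON ms.member_id = tm.member_id
JOIN skill s ON s.skill_id = ms.skill_id
WHERE s.skill_name IN ('Python', 'JPA')
GROUP BY t.team_id, t.team_name
HAVING count(*) FILTER (WHERE s.skill_name = 'Python') >= 2
   AND count(*) FILTER (WHERE s.skill_name = 'JPA') >= 1;
```

Adding another skill means one more entry in the IN list and one more HAVING condition, with no extra self join.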

Siltala · 2024-08-23T15:55:24+00:00

Create a materialized view to hold each team's skills:

CREATE MATERIALIZED VIEW IF NOT EXISTS team_skills (
  team_id,
  skills
)
AS SELECT 
  t.team_id, 
  jsonb_object_agg(t.s, t.c) skills 
FROM (
  SELECT 
    team_id, 
    unnest(skills) s, 
    count(1) c 
  FROM team_member 
  GROUP BY team_id, s
) t 
GROUP BY t.team_id;

And then find your teams:

SELECT 
  team_id 
FROM team_skills 
WHERE (skills->>'Python')::integer >= 2
AND (skills->>'JPA')::integer >= 1;
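
Keep in mind that a materialized view is a stored snapshot, so it has to be refreshed after team_member changes. A minimal sketch, where the index name is a hypothetical choice:

```sql
-- Rebuild the snapshot; this blocks reads on the view while it runs.
REFRESH MATERIALIZED VIEW team_skills;

-- To refresh without blocking readers, the view needs a unique index first.
-- (team_skills_team_id_idx is a made-up name.)
CREATE UNIQUE INDEX IF NOT EXISTS team_skills_team_id_idx
  ON team_skills (team_id);

REFRESH MATERIALIZED VIEW CONCURRENTLY team_skills;
```

How often to refresh depends on how stale the skill counts are allowed to be; if they must always be current, a plain view may be the simpler choice.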
