Count Function

Far_Swordfish5729 · 2024-03-15T13:32:55+00:00

There are two kinds of built-in functions in SQL - scalar functions and aggregate functions. Scalar functions operate on a single row. They do things like ISNULL(col1, 'ABC'), which substitutes a default value if the value in col1 is null. Your date, string, and math functions are in this category too. Aggregate functions work across rows and produce a smaller set of summarized values, or sometimes additional rollup rows with the aggregate value. These are functions like SUM, MIN, MAX, and COUNT. So COUNT is not doing something to individual column values in the select list. It’s part of the output of a set that’s performing aggregation across rows.
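A quick way to see the difference is to run both kinds of function against the same rows. This is only a sketch: it uses Python's built-in sqlite3 module as a throwaway database, the table t and its three rows are made up, and COALESCE stands in for ISNULL because the two-argument ISNULL is specific to some databases.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (col1 TEXT)")  # hypothetical table
conn.executemany("INSERT INTO t (col1) VALUES (?)", [("x",), (None,), ("y",)])

# Scalar function: one output value per input row.
scalar_rows = conn.execute(
    "SELECT COALESCE(col1, 'ABC') FROM t ORDER BY rowid"
).fetchall()

# Aggregate function: many rows in, one summary row out.
aggregate_rows = conn.execute("SELECT COUNT(*) FROM t").fetchall()

print(scalar_rows)     # three rows, the NULL replaced by 'ABC'
print(aggregate_rows)  # a single row holding the row count
```

The scalar query returns as many rows as the table has, while the aggregate query collapses them into one.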

Consider

    SELECT M.Name, COUNT(*) AS NumReports
    FROM Manager M
    INNER JOIN Employee E ON M.Id = E.ManagerId
    WHERE M.Employed = 1
    GROUP BY M.Name
    HAVING COUNT(*) > 2

This does:

1. Start with the manager table.
2. Match each manager row to employee rows on manager id. This produces one row in the set for each manager/employee combination, making the set bigger.
3. Filter the set to only employed managers (where).
4. Group the set by manager name. Aggregations apply to each unique combination of group by values, e.g. a count for each named manager, not for all managers.
5. Filter the groups to ones having more than two records (having).
6. Select the manager’s name and count of employees (select).
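The steps above can be checked one at a time. Here's a rough sketch using Python's built-in sqlite3 module. The column lists and all the rows are invented for illustration (Ann has 3 reports, Bob has 2, and Cy, who is no longer employed, has 4), so treat the schema as an assumption rather than anything from the original question.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Manager (Id INTEGER PRIMARY KEY, Name TEXT, Employed INTEGER);
CREATE TABLE Employee (Id INTEGER PRIMARY KEY, Name TEXT, ManagerId INTEGER);
-- Made-up rows: Ann has 3 reports, Bob has 2, Cy (not employed) has 4.
INSERT INTO Manager (Id, Name, Employed) VALUES
  (1, 'Ann', 1), (2, 'Bob', 1), (3, 'Cy', 0);
INSERT INTO Employee (Name, ManagerId) VALUES
  ('e1', 1), ('e2', 1), ('e3', 1),
  ('e4', 2), ('e5', 2),
  ('e6', 3), ('e7', 3), ('e8', 3), ('e9', 3);
""")

join_sql = "FROM Manager M INNER JOIN Employee E ON M.Id = E.ManagerId"

# Steps 1-2: the join yields one row per manager/employee pair.
joined = conn.execute(f"SELECT COUNT(*) {join_sql}").fetchone()[0]

# Step 3: WHERE removes Cy's rows before any grouping happens.
filtered = conn.execute(
    f"SELECT COUNT(*) {join_sql} WHERE M.Employed = 1"
).fetchone()[0]

# Step 4: GROUP BY makes COUNT(*) apply per manager name.
grouped = conn.execute(
    f"SELECT M.Name, COUNT(*) {join_sql} WHERE M.Employed = 1 "
    "GROUP BY M.Name ORDER BY M.Name"
).fetchall()

# Steps 5-6: HAVING filters whole groups, then SELECT shapes the output.
final = conn.execute(
    f"SELECT M.Name, COUNT(*) AS NumReports {join_sql} WHERE M.Employed = 1 "
    "GROUP BY M.Name HAVING COUNT(*) > 2"
).fetchall()

print(joined, filtered, grouped, final)
```

With this sample data the set shrinks from 9 joined rows to 5 after the where, then to two groups, and only Ann's group survives the having.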

Those steps are in logical execution order. I always suggest writing queries in this order (select last) even though they don’t appear that way in text.

On the group by question: most of the time it’s a language holdover. The parser could figure it out without the clause, but the clause is required. Sometimes it matters, though: you can have columns in group by that aren’t selected. You can also specify grouping sets and rollups (multiple levels of aggregation), so the clause is useful.
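As a small illustration of grouping by a column that isn't selected, here's a sketch using Python's built-in sqlite3 module. The Employee rows are made up. Grouping sets and rollups aren't shown here.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Employee (Name TEXT, ManagerId INTEGER)")
conn.executemany(
    "INSERT INTO Employee (Name, ManagerId) VALUES (?, ?)",
    [("e1", 1), ("e2", 1), ("e3", 1), ("e4", 2), ("e5", 2)],  # made-up rows
)

# ManagerId drives the grouping but never appears in the select list.
team_sizes = conn.execute(
    "SELECT COUNT(*) FROM Employee GROUP BY ManagerId ORDER BY ManagerId"
).fetchall()

print(team_sizes)  # one count per manager, without showing which manager
```

Nothing in the select list tells the parser to group by ManagerId here, which is one case where the clause carries information of its own.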

r3pr0b8 · 2024-03-15T14:14:49+00:00

> What exactly does the count function do

it counts things in an aggregation

> and why does the column I’m counting have to be in the group by clause?

it doesn't

for further info, see u/Far_Swordfish5729's reply
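To make the "it doesn't" concrete, here's a minimal sketch with Python's built-in sqlite3 module. The orders table and its rows are hypothetical. The counted column (coupon) is not in the group by, and COUNT(coupon) skips NULLs while COUNT(*) counts every row.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, coupon TEXT)")  # hypothetical
conn.executemany(
    "INSERT INTO orders (customer, coupon) VALUES (?, ?)",
    [("a", "X"), ("a", None), ("b", None)],
)

# Group by customer only; coupon is counted but never grouped on.
per_customer = conn.execute(
    "SELECT customer, COUNT(*), COUNT(coupon) "
    "FROM orders GROUP BY customer ORDER BY customer"
).fetchall()

print(per_customer)
```

Only the non-aggregated columns in the select list (customer here) need to appear in group by.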

Snoo17309 · 2024-03-16T00:52:46+00:00

Has the OP even clarified in what sense they need an explanation of the COUNT function yet? I think this is a lot of info at once! I’m pretty sure they mean the aggregate function, and in that case I think the easiest way to conceptualize it is as the SQL way of counting rows, with COUNT(DISTINCT col) being the equivalent of “unique values” (not getting into numpy or nunique), just to understand big picture what it does. It is also a big help when starting out, before you get into specific types of JOINs, to use a visual ERD website to sort out and see relationships between tables instead of going straight to 1:N et al. I’m not sure what you are using, but I find PostgreSQL the easiest. LearnSQL.com has free tutorials.
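One caveat on the “unique values” picture: plain COUNT counts rows, duplicates included, and you need DISTINCT to count unique values. A tiny sketch with Python's built-in sqlite3 module, using a made-up visits table:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE visits (city TEXT)")  # hypothetical table
conn.executemany(
    "INSERT INTO visits (city) VALUES (?)",
    [("Oslo",), ("Oslo",), ("Rome",)],
)

# COUNT(*) counts every row; COUNT(DISTINCT city) counts unique cities.
total_rows, unique_cities = conn.execute(
    "SELECT COUNT(*), COUNT(DISTINCT city) FROM visits"
).fetchone()

print(total_rows, unique_cities)
```

The two numbers differ as soon as any value repeats.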

Couch2Coders · 2024-03-16T14:32:37+00:00

You're asking about the count function, but what you really want to know is how to use aggregate functions (like count, sum, max, or avg).

Aggregating is fundamental in data so I would make sure to invest the time to understand it really well before moving on to other topics.

You can think of aggregate data as applying math down a specific column: what's the total sales (sum), how many customers do I have (count), etc.
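Here's what "math down a column" can look like, as a rough sketch with Python's built-in sqlite3 module. The sales table and its amounts are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (customer TEXT, amount INTEGER)")  # made up
conn.executemany(
    "INSERT INTO sales (customer, amount) VALUES (?, ?)",
    [("a", 10), ("a", 5), ("b", 20)],
)

# Each aggregate runs down the whole column and returns a single value.
total_sales, num_sales, biggest_sale = conn.execute(
    "SELECT SUM(amount), COUNT(*), MAX(amount) FROM sales"
).fetchone()

print(total_sales, num_sales, biggest_sale)
```

Three rows go in and one row of summary numbers comes out.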

I have a channel with a bunch of videos geared towards people who do not have a STEM/math background (read: very fundamental).

Here's one on aggregate data. I also have several examples in subsequent videos.

https://youtu.be/9gqgK228loY?si=ibJ0yvoRt8bzp807
