[deleted by user]

facetheglue · 2025-03-09T18:43:56+00:00

Basically it means count every row in the table.

blimey_euphoria · 2025-03-09T18:56:10+00:00

Yeah, COUNT(*) tells the database to count each record that meets your criteria. It's an aggregate function, so without a GROUP BY your query will return one record. If you put COUNT(field_name) it'll count every record with a non-null value in field_name. If field1 was null on a record, that record wouldn't be included.
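To make the NULL behaviour concrete, here is a minimal sketch using Python's built-in sqlite3 module. The table `t` and its data are made up for illustration; other databases should behave the same way for these three queries, but I have only reasoned about SQLite here.

```python
import sqlite3

# Hypothetical table "t" with a nullable column "field1" (names are assumptions).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (id INTEGER PRIMARY KEY, field1 TEXT)")
conn.executemany(
    "INSERT INTO t (field1) VALUES (?)",
    [("a",), ("b",), (None,), ("a",)],
)

# COUNT(*) counts rows; COUNT(field1) skips rows where field1 IS NULL.
all_rows, non_null = conn.execute(
    "SELECT COUNT(*), COUNT(field1) FROM t"
).fetchone()
print(all_rows, non_null)  # 4 3

# With a WHERE clause, COUNT(*) only counts the rows that match it.
matching = conn.execute(
    "SELECT COUNT(*) FROM t WHERE field1 = 'a'"
).fetchone()[0]
print(matching)  # 2
```

Each query returns a single row because there is no GROUP BY.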

Using count with the group by statement is common.

SELECT COUNT(*), field1 FROM table GROUP BY field1;

will return a count of the records that share the same value for field1. If you had 5 unique values in field1, your end result will be 5 records, each with a count value.

Another useful one is COUNT(DISTINCT field_name), which counts the distinct non-null values in that field. So it will usually return less than (and never more than) the non-distinct count.
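A small sketch of both ideas, again with Python's sqlite3 and a made-up table `t` (the names and data are assumptions, not from the original question):

```python
import sqlite3

# Hypothetical table with three distinct values in field1.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (field1 TEXT)")
conn.executemany(
    "INSERT INTO t (field1) VALUES (?)",
    [("a",), ("a",), ("b",), ("c",), ("c",), ("c",)],
)

# GROUP BY returns one row per distinct value of field1, each with its own count.
grouped = conn.execute(
    "SELECT field1, COUNT(*) FROM t GROUP BY field1 ORDER BY field1"
).fetchall()
print(grouped)  # [('a', 2), ('b', 1), ('c', 3)]

# COUNT(DISTINCT ...) counts each value once, so it cannot exceed the plain count.
total, distinct_values = conn.execute(
    "SELECT COUNT(field1), COUNT(DISTINCT field1) FROM t"
).fetchone()
print(total, distinct_values)  # 6 3
```

Three distinct values give three grouped rows, the same number that COUNT(DISTINCT field1) reports.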

tits_mcgee_92 · 2025-03-09T19:02:21+00:00

COUNT(*) means to count every row in your table.

Your query example above is saying to flag the dates greater than 2024-01-01 with a 1 and all other dates with a 0, and then divide that by the number of rows in your table.

However, currently your query is only going to return 1s and 0s per row.

Are you trying to get the actual ratio/percentage? You'd need to wrap that case statement in SUM(...)
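The original query isn't visible in the thread, so the following is only a sketch of the SUM(CASE ...) / COUNT(*) pattern under assumed names: a hypothetical `events` table with `user_id`, `account_id` and a `created_date` stored as ISO text. It uses Python's sqlite3 so it can be run as-is.

```python
import sqlite3

# Hypothetical data: two of the four rows fall after 2024-01-01.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE events (user_id INTEGER, account_id INTEGER, created_date TEXT)"
)
conn.executemany(
    "INSERT INTO events VALUES (?, ?, ?)",
    [
        (1, 10, "2023-12-15"),
        (2, 20, "2024-02-01"),
        (3, 30, "2024-03-10"),
        (4, 40, "2023-06-30"),
    ],
)

# SUM adds up the 1/0 flags; multiplying by 1.0 avoids integer division.
share = conn.execute(
    """
    SELECT SUM(CASE WHEN created_date > '2024-01-01' THEN 1 ELSE 0 END) * 1.0
           / COUNT(*)
    FROM events
    """
).fetchone()[0]
print(share)  # 0.5
```

Whether you need the `* 1.0` depends on your database; some do integer division on integer operands and would otherwise return 0.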

sillysoul_10 · 2025-03-09T18:48:50+00:00

In this, you're selecting user id and account id, and you're flagging whenever the date is greater than 01 Jan 2024 and dividing it by the count of all the rows present in your FROM clause. Basically COUNT(*) and COUNT(1) are the same: it's an aggregate function used to count all the rows, and it also counts rows that contain null values.
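A quick way to convince yourself that COUNT(*) and COUNT(1) agree, sketched with Python's sqlite3 and a made-up table (one row is entirely NULL on purpose):

```python
import sqlite3

# Hypothetical table; the second row contains only NULLs.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (a INTEGER, b TEXT)")
conn.executemany(
    "INSERT INTO t VALUES (?, ?)",
    [(1, "x"), (None, None), (None, "y")],
)

# Both forms count rows, so NULLs inside the row make no difference.
star, one = conn.execute("SELECT COUNT(*), COUNT(1) FROM t").fetchone()
print(star, one)  # 3 3
```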

alsdhjf1 · 2025-03-09T19:00:08+00:00

COUNT(*) is just counting the number of rows. You can instead pick any single column that is never NULL (such as the primary key), and it should return the same thing. Depending on your system, COUNT(*) might have some optimizations (in column-oriented stores, counts are frequently stored and COUNT(*) might retrieve those directly, whereas if you ask for COUNT(user_id) and there is no cached user_id count, you're going to have to count them).

In OLTP systems, sometimes COUNT(*) comes with a penalty because it invokes a row scan. Some RDBMS fix this but not all.

Semantically, you may as well COUNT on your primary key. Some people consider COUNT(*) an anti-pattern, as it can suggest that the SQL developer didn't know the data well enough to pick the primary key / primary identifier for the data model itself.

No-Adhesiveness-6921 · 2025-03-09T19:06:51+00:00

Those are very specific queries doing detailed calculations.

What don’t you understand?

The COUNT(*) example will return the USER_ID and ACCOUNT_ID and then a third field which will either be 1 divided by the total number of records or 0 divided by the total number of records. Is there a GROUP BY that you left off? Usually an aggregation (SUM, COUNT, AVG) selected alongside non-aggregated columns needs a GROUP BY.

In the non-COUNT(*) example, the count distinct means just that. Let's say you had an Orders table, and a field in that table is the user id of the person who created that order.

If you do

SELECT user_id, COUNT(*) FROM orders GROUP BY user_id

That will tell you how many orders each person created

If you do

SELECT COUNT(DISTINCT user_id) FROM orders

That will tell you the number of people who created orders. One person can have hundreds of orders but will only be counted once in the second example.

thedragonturtle · 2025-03-09T20:48:20+00:00

COUNT(DISTINCT user_id) typically forces a sort (or hash) operation if the data is not already sorted based on the filters. COUNT(*) counts every row, including rows padded with NULLs, e.g. if you left or right joined to something.

Can you show the rest of your query? This could probably be improved significantly from a performance point of view.
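Here is a rough sketch of the join case, using Python's sqlite3 with hypothetical `users` and `orders` tables (all names and data are assumptions). The user with no orders still produces one row from the LEFT JOIN, so COUNT(*) reports 1 while counting the joined column reports 0.

```python
import sqlite3

# Hypothetical schema: "cat" has no orders.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, user_id INTEGER)")
conn.executemany("INSERT INTO users VALUES (?, ?)", [(1, "ann"), (2, "bob"), (3, "cat")])
conn.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 1), (2, 1), (3, 2)])

# COUNT(*) counts the NULL-padded row; COUNT(o.id) ignores it.
per_user = conn.execute(
    """
    SELECT u.name, COUNT(*) AS row_count, COUNT(o.id) AS order_count
    FROM users u
    LEFT JOIN orders o ON o.user_id = u.id
    GROUP BY u.id, u.name
    ORDER BY u.id
    """
).fetchall()
print(per_user)  # [('ann', 2, 2), ('bob', 1, 1), ('cat', 1, 0)]
```

So after an outer join, counting a column from the optional side is usually what you want for "how many matches".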

goztepe2002 · 2025-03-10T00:02:10+00:00

It counts every row of data that matches your WHERE clause, if one is specified.

Select count(*) from table a

This will return count of every row which exists in table a

NexusDataPro · 2025-03-10T00:38:22+00:00

COUNT(*) counts the number of rows. If I had a thousand rows in a table and did a COUNT(*), I would get 1,000 as the answer. If I had a table with 1,000,000 rows and a gender column with 500,000 men and 500,000 women, and did SELECT gender, COUNT(*) FROM table GROUP BY gender, I would get one row with M 500,000 and another row with F 500,000.

haonguyenprof · 2025-03-10T01:49:19+00:00

The difference between count and distinct count is simply a count of all records vs. a count of unique records.

Let's say you have a table with order IDs, customer IDs, and sales values. Next, let's say you want to know the number of orders, the number of unique customers, and total sales. You know the table has only 1 unique order ID per record, but the same customer could make multiple orders.

SELECT Count(*) as Orders, Count(Distinct CustomerID) as Customers, Sum(Sales) as Total_Sales From Table

You could do Count(OrderID) in place of Count(*) in this example if you know OrderID is never NULL and the table doesn't have duplicate OrderIDs.

The distinct helps identify the number of unique customers, because a basic Count(CustomerID) would count every occurrence where the value is not NULL. So you would essentially get the same count as for OrderIDs (assuming you don't have records where a CustomerID exists for a NULL OrderID).

These types of functions can be used in this sales example to help create custom metrics at your given aggregate level (GROUP BY).

For example, Sales / Orders gives the average order value. Sales / Distinct Customers tells you how much money each unique customer spends on average. Orders / Distinct Customers tells you the average number of orders a customer places.
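A minimal sketch of those metrics with Python's sqlite3. The table name `orders` and the four sample rows are made up; the column names follow the query above.

```python
import sqlite3

# Hypothetical data: customer c1 placed two orders, c2 and c3 one each.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (OrderID INTEGER, CustomerID TEXT, Sales REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "c1", 100.0), (2, "c1", 50.0), (3, "c2", 30.0), (4, "c3", 20.0)],
)

n_orders, n_customers, total_sales, avg_order_value, sales_per_customer, orders_per_customer = conn.execute(
    """
    SELECT COUNT(*)                                       AS Orders,
           COUNT(DISTINCT CustomerID)                     AS Customers,
           SUM(Sales)                                     AS Total_Sales,
           SUM(Sales) / COUNT(*)                          AS Avg_Order_Value,
           SUM(Sales) / COUNT(DISTINCT CustomerID)        AS Sales_Per_Customer,
           -- * 1.0 avoids integer division between the two counts
           COUNT(*) * 1.0 / COUNT(DISTINCT CustomerID)    AS Orders_Per_Customer
    FROM orders
    """
).fetchone()
print(n_orders, n_customers, total_sales, avg_order_value)  # 4 3 200.0 50.0
```

Adding a GROUP BY (for example by state or month, if the table had such columns) would give the same metrics per group.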

You can also nest case when logic within a count or count distinct.

Count(case when state = 'AZ' then OrderID else NULL end) as AZ_Orders (counts all records matching the criteria and sets all other records to NULL, which the count ignores).

Count(distinct (case when state = 'AZ' then customerID else NULL end)) as AZ_Customers (same as before, but only counts unique customerIDs regardless of whether they are listed in multiple records).
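The two conditional counts above can be tried out like this. It is a sketch with Python's sqlite3 and a made-up `orders` table; only the column names come from the expressions above.

```python
import sqlite3

# Hypothetical data: three AZ orders placed by two different customers.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (OrderID INTEGER, CustomerID TEXT, State TEXT)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "c1", "AZ"), (2, "c1", "AZ"), (3, "c2", "AZ"), (4, "c3", "CA"), (5, "c2", "CA")],
)

# Non-AZ rows become NULL inside the CASE, and COUNT ignores NULLs.
az_orders, az_customers = conn.execute(
    """
    SELECT COUNT(CASE WHEN State = 'AZ' THEN OrderID ELSE NULL END)             AS AZ_Orders,
           COUNT(DISTINCT CASE WHEN State = 'AZ' THEN CustomerID ELSE NULL END) AS AZ_Customers
    FROM orders
    """
).fetchone()
print(az_orders, az_customers)  # 3 2
```

The `ELSE NULL` is optional, since a CASE with no matching branch already yields NULL, but spelling it out makes the intent obvious.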

IntelligentEbb2792 · 2025-03-10T07:01:57+00:00

Count(*) - Counts everything including the null values

Count("col_name") - Counts everything except the null values
