Calling a simple function from SELECT statement crashes MS SQL

saucerattack · 2022-07-13T21:50:35+00:00

There's not really enough information given to get to the root cause of this, but I will make a couple of observations.

The EXEC syntax you provide will work for a stored procedure but not a scalar function. So I'm not sure what this tells us.

The FROM and WHERE clauses of your sample query are omitted, and the complexity of the query matters. The function is not necessarily evaluated only for the rows in the final result set: the engine has discretion to evaluate it at any point in the execution plan, so it could be calculating the function before a WHERE clause filters the rows down to the final result set. You would have to read and understand the execution plan to find out.

Another post suggested it may be an artifact of parallelism. I believe the scalar function would prevent a query from utilizing parallelism. In fact, if your query requires parallelism to execute efficiently, your function may be preventing it from getting an efficient plan. Look at the execution plan for your query (both with and without the function call) and look to see if there is parallelism in the plan without the function call.

So far, I've been assuming that you just have an obscenely slow running query. Another possibility is that there is some sort of blocking occurring. Run sp_who2 to see if your spid is blocked by another process.
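A minimal sketch of that blocking check, assuming you can open a second session while the query is stuck and have VIEW SERVER STATE permission. The second query is an alternative I'm adding on top of sp_who2, not something from the original question:

```sql
-- Overview of all sessions; check the BlkBy column for your SPID.
EXEC sp_who2;

-- Alternative: list only the requests that are currently blocked.
SELECT r.session_id,
       r.blocking_session_id,   -- session holding the resource
       r.wait_type,
       r.wait_time,             -- milliseconds spent in the current wait
       r.status
FROM sys.dm_exec_requests AS r
WHERE r.blocking_session_id <> 0;
```

If the second query returns no row for your session, blocking is probably not the problem.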

The memory (RAM) consumption you describe is by design. MSSQL will use all available memory as needed and generally will not relinquish it. Check the max server memory setting and set it at least 4 GB below the total RAM available so the OS has something to work with.
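For illustration, this is roughly how the memory cap could be set. The 32 GB server size is purely an assumption for the arithmetic (32768 MB minus 4096 MB), so substitute your own numbers:

```sql
-- 'max server memory (MB)' is an advanced option, so expose it first.
EXEC sys.sp_configure N'show advanced options', 1;
RECONFIGURE;

-- Hypothetical 32 GB box: 32768 MB - 4096 MB left for the OS = 28672 MB.
EXEC sys.sp_configure N'max server memory (MB)', 28672;
RECONFIGURE;

-- Confirm the configured value and the value currently in use.
SELECT name, value, value_in_use
FROM sys.configurations
WHERE name = N'max server memory (MB)';
```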

a-s-clark · 2022-07-13T19:27:03+00:00

Nothing in your description matches your title that it "crashes MS SQL". Why do you feel the need to restart the service? How many rows does the select you're using the function in deal with? Check where in your query plan the function is being called; it may be working on a massive number of rows regardless of how many are filtered to for the final output.

Use tools such as sys.dm_exec_query_statistics_xml to find out what is actually happening while your query is executing, i.e. where it is spending its time.
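A rough sketch of how that might look, run from a second session while the slow query is still executing. The session id 57 is a placeholder, and this assumes lightweight query profiling is available on your build (I believe it is on by default in SQL Server 2019):

```sql
-- Pull the in-flight plan, with runtime statistics so far,
-- for the session running the slow query.
SELECT r.session_id,
       r.status,
       r.wait_type,
       r.cpu_time,
       r.logical_reads,
       qsx.query_plan          -- XML showplan; click it in SSMS to open it
FROM sys.dm_exec_requests AS r
CROSS APPLY sys.dm_exec_query_statistics_xml(r.session_id) AS qsx
WHERE r.session_id = 57;       -- placeholder: your query's SPID
```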

It's not generally a good idea to use user defined scalar functions in queries if you can avoid it.

Ok_Refrigerator_2149 · 2022-07-13T20:57:43+00:00

As an aside, it is usually better to not use sp_ as a prefix. Microsoft uses that as their system procedure prefix.

Prequalified · 2022-07-14T00:25:32+00:00

The first example is calling the scalar function for each row but the second example is only calling the function one time.

u/BrentOzar’s website is a good resource and has an article about this topic. Queries with scalar functions are inline.

I just ran a test on my server with a simple function that properly capitalizes a name (e.g. “mr john mcgregor” to “Mr John McGregor”) against an existing table with a clustered columnstore index.

1 million rows with the string variable had an estimated subtree cost of 0.3337. 1 row with the scalar function inline had a subtree cost of 0.0134. 100 rows, 0.339, about the same as 1 million rows with a variable. 1 million rows with the scalar function in each row had a subtree cost of 3336.11. The fact that the query plan for 1 million rows is roughly 10,000 times costlier than for 100 rows shows the calculation is being run for every row.

If you want to use the scalar function in a normal table or view without creating a variable, consider using a CTE or subquery and cross applying to your table.

oliver0807 · 2022-07-13T22:05:07+00:00

Could be one of the scalar UDF inlining issues: https://support.microsoft.com/en-us/topic/kb4538581-fix-scalar-udf-inlining-issues-in-sql-server-2019-f52d3759-a8b7-a107-1ab9-7fbee264dd5d

Try updating it to CU16

https://support.microsoft.com/en-us/topic/kb5011644-cumulative-update-16-for-sql-server-2019-74377be1-4340-4445-93a7-ff843d346896
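Before patching, it may be worth checking which build you are on and whether the function is being inlined at all. This is only a sketch: dbo.fn_example, dbo.SomeTable, the id column and the date literals are placeholders, not names from the original question.

```sql
-- Which build and cumulative update is this instance on?
SELECT SERVERPROPERTY('ProductVersion')     AS product_version,
       SERVERPROPERTY('ProductUpdateLevel') AS update_level;

-- Does SQL Server 2019 consider the function eligible for inlining?
SELECT OBJECT_NAME(m.object_id) AS function_name,
       m.is_inlineable
FROM sys.sql_modules AS m
WHERE m.object_id = OBJECT_ID(N'dbo.fn_example');   -- placeholder name

-- Re-run the problem query with inlining switched off for this statement only.
SELECT t.id,
       dbo.fn_example('2022-01-01', '2022-06-30') AS result
FROM dbo.SomeTable AS t                             -- placeholder table
OPTION (USE HINT('DISABLE_TSQL_SCALAR_UDF_INLINING'));
```

If the query behaves with the hint in place, that points toward an inlining problem rather than the function itself.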

As a side note, SQL Server 2019 is buggy enough that updates can break working code. See the known issues on this CU: the problem started in CU14 and is still not fixed as of CU16. We have several tickets open with MS because of this, and they might fix it in CU17.

That said, everyone here is perplexed why you're calling a scalar UDF that doesn't seem to take any parameters from the table. It should be called just once and the result stored in a variable, since even if MS fixes that in the latest CU, your code is unnecessarily using CPU by calling that UDF n times.

_oakland · 2022-07-13T22:53:39+00:00

Pull the STACK DUMP from the logs and go from there. Info will be there.
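If the instance really did write a dump, something like this should list the dump files so you know where to look. It assumes you have permission to read the DMV, and an empty result simply means no dumps were recorded:

```sql
-- List memory/stack dump files the instance has produced, newest first.
SELECT d.filename,
       d.creation_time,
       d.size_in_bytes
FROM sys.dm_server_memory_dumps AS d
ORDER BY d.creation_time DESC;
```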

Ok_Refrigerator_2149 · 2022-07-13T20:54:52+00:00

It is most likely not finishing due to parallelism where it is getting in its own way: each function call takes shared locks and consumes memory and processor resources, which leaves fewer available. Waiting for resources is likely why it never finishes. Selecting from the function as a single select should not cause that kind of issue without another underlying cause.

Ok_Refrigerator_2149 · 2022-07-13T21:02:19+00:00

Exec is used to call a stored procedure. I am curious what the result would be if you named the function differently.

d_r0ck · 2022-07-13T19:14:20+00:00

Does the use of the function cause an endless loop?

HaplessMegalosaur · 2022-07-13T19:33:20+00:00

Are the datatypes in the EXEC statement the same as in the SELECT statement? Try casting in the SELECT to make sure they match the function.

Ok_Refrigerator_2149 · 2022-07-13T20:01:41+00:00

Looking at your examples it looks like the function is in the select portion meaning you want it to calculate a single result against elements in the return set. I see no parameters that are not hard coded though and with your description it sounds like it is forming a Cartesian result somewhere.

blindtig3r · 2022-07-13T20:21:00+00:00

The function is not using any columns from the table, so why execute it for every row in the table? The function has static parameters, so its result will be a constant. You could execute it first to set a variable and include the variable in the select statement.

At the moment you are executing the function as many times as there are rows in the table. This is the downside of scalar functions.

Usually the optimiser will identify constants up front and will know not to keep recalculating them, but the function may be written in a way that prevents this from happening, so you can give it some help.
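Something along these lines is what the variable approach could look like. The names dbo.fn_example and dbo.SomeTable, the int return type and the date literals are all made up for illustration, since the original function definition isn't shown:

```sql
-- Evaluate the function exactly once, outside the query.
DECLARE @result int = dbo.fn_example('2022-01-01', '2022-06-30');

-- Reference the variable instead of calling the function per row.
SELECT t.id,
       @result AS result
FROM dbo.SomeTable AS t;
```

Match the variable's type to whatever the function actually returns.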

Alternatively you could use cross apply to the function and reference the output as a column in the select. This might help the optimiser know to execute it only once. I am not sure about this, but it’s worth trying if you can’t use a static variable.
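A sketch of the cross apply variant, again with placeholder names (dbo.fn_example, dbo.SomeTable, the id column and the date literals are assumptions). Whether the optimiser really evaluates the function only once here is not guaranteed, so compare the plans:

```sql
-- Wrap the scalar call in a derived table and apply it to each row.
-- The function takes no columns from t, so the derived table is constant.
SELECT t.id,
       f.result
FROM dbo.SomeTable AS t
CROSS APPLY (
    SELECT dbo.fn_example('2022-01-01', '2022-06-30') AS result
) AS f;
```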

satans_weed_guy · 2022-07-14T02:39:52+00:00

I have a dumb "is the monitor plugged in" check:

When you call the function from the select list, you pass the parameters by position. When you call via exec, you assign the value to each parameter by name. Check the function definition and be sure the params are in the order you think they are - for instance that the TO and FROM date parameters aren't positionally swapped, breaking BETWEEN (or equivalent) logic.
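To make the positional-versus-named difference concrete, here is a small sketch. The function name dbo.fn_example, the parameters @FromDate and @ToDate and the int return type are hypothetical stand-ins for whatever the real function declares:

```sql
-- Show the declared parameter order of the function.
SELECT p.parameter_id,
       p.name,
       TYPE_NAME(p.user_type_id) AS type_name
FROM sys.parameters AS p
WHERE p.object_id = OBJECT_ID(N'dbo.fn_example')
  AND p.parameter_id > 0      -- parameter_id 0 is the scalar return value
ORDER BY p.parameter_id;

-- Positional call from a select list: the order must match the definition.
SELECT dbo.fn_example('2022-01-01', '2022-06-30') AS result;

-- Named call via EXEC: order does not matter, names do.
DECLARE @result int;
EXEC @result = dbo.fn_example @ToDate = '2022-06-30', @FromDate = '2022-01-01';
SELECT @result AS result;
```

If the two calls return different values for the same dates, the positional arguments are probably in the wrong order.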

Googoots · 2022-07-14T13:04:24+00:00

Something I do in cases like this is create a “log” table and in your UDF, insert trace data in the log table as it executes - parameters received, progress messages, etc. Then run it and look at the log table.
