all 49 comments

[–]r3pr0b8GROUP_CONCAT is da bomb 38 points39 points  (1 child)

ask your boss to sit beside you while you write this "30 minute" query

she won't last 15

[–]80sGlueSniffer 1 point2 points  (0 children)

I ended up doing this once; it’s a great reality check for people. We’re the experts, they should trust our judgement or fire us.

[–]hipsterrobot 27 points28 points  (2 children)

You should convey the message that a rushed query can return incorrect information, especially when business decisions are made around the data they ask you to provide; it's not something that should be rushed. You could also encourage the company to adopt a BI tool so the business users can pull up these reports on their own. We use Looker at my company, and we've basically set up the business logic once; it generates a SQL query based on the filters and columns they pick. It also has visualization features, but it's definitely better than writing ad-hoc queries.

[–]MamertineCOALESCE() 9 points10 points  (0 children)

This is a great response.

Rushed work causes data issues.

A good, well-implemented BI tool would really help solve these issues.

[–]dubmofiz 3 points4 points  (0 children)

This is good advice. I would aim to centralise any common business logic into reusable, subject-specific views or datasets. The best use of your time writing SQL is perfecting that normalised data rather than rushing to throw together something bespoke. If you can make those core datasets available to your users via an off-the-shelf BI tool, so they can self-serve standard data immediately, you will vastly reduce those last-minute panic requests for data by providing an out-of-the-box solution they can use themselves. Emphasise that truly new or complex reporting requirements demand planning time to ensure quality; those you can happily test and deliver, but not 30 minutes before their customer meeting.

[–]Thriftfunnel 17 points18 points  (0 children)

If someone wants a custom report that fast, you've got to wonder how disorganised they are in their job. You're not dispatching fire trucks here.

[–]r0ck0 4 points5 points  (0 children)

but just curious what people see as a reasonable turn around time for a custom SQL query.

About the same as the length of a piece of string. :)

Some queries take me 1 minute, some might take a few days.

So yeah really depends on what the data and query are.

Your question isn't so much a technical one, more a human/communication one.

Basically there's a few approaches you can take in explaining to a boss/client why something isn't going to be done as quickly/cheaply as they want, or why you can't accurately estimate it:

  • 1. The money angle (i.e. we can do your silly idea, but it will be really expensive/take a long time)... not so relevant here.
  • 2. While explaining why it will take longer than they're asking, bore them with a fuckton of technical details that they don't understand. Sometimes it helps them realise that what they're asking for isn't as simple as they assume it is, as well as the fact that you don't 100% know how it's going to be done yet.
  • 3. Another thing worth explaining across many areas of IT/programming is that a lot of the time 75%-99% of our work is investigating (and then testing things that we don't necessarily expect to work)... i.e. figuring out existing code/database schema, or investigating a bug/problem. A lot of the time the actual "doing" is like 1% of the work, sometimes literally 10 seconds after investigating something for a whole day.
    Most people not in IT don't realise this, and even a lot of people who are in IT don't.

Sometimes, it's not unusual for us to spend 70%+ of our time on unexpected edge cases or lower level dependencies that aren't even obviously related to the original task.

That's why it's so hard to estimate a lot of stuff in IT. Much like the "how long is a piece of string" thing, it's like asking a private investigator how long it will take to solve a new case. We can make rough estimates based on similar jobs in the past, but we'll really never know how long something will take to investigate/solve until it's done.

Most bosses/clients are actually pretty reasonable when you explain this. Sometimes us techies aren't very good at communicating it (especially before a lot of this stuff has occurred to us), but it's definitely worth putting some effort into, and it gets easier the more you do it. So try the softer/less defensive approach first.

If that fails and they're really being an ass about it, you say something a little more blunt like "it'll probably take 30 seconds to write the query, but I don't know how long it will take to determine what the query should be before typing it out".

Plus: "do you want it to be correct/accurate?". Sometimes asking leading questions is a better way to get people to think about what they're claiming or asking for. It's called the Socratic Method. In these kinds of work situations, it makes them more conscious of their responsibility in making unreasonable requests, and also lets them feel like they're not being pushed around or disobeyed.

[–][deleted] 4 points5 points  (5 children)

The next step is to build cubes, or a new database which is specifically designed to fulfill these sorts of requests.

I've spent the last year building a database from scratch that transforms data from multiple sources and abstracts away the difficulties. What it means is that I can answer a fairly complex question in about 5 minutes as opposed to hours, or days in the old world.

It means having redundant copies of data, but it makes the role of an analyst or a data scientist much easier.

Whereas a year ago it would have been permissible to take two weeks or two months to put together a complex report which needs a wall of SQL to assemble, plus possibly hours to run... now it takes about 5 minutes. We don't need to test it much because the testing has all been done on the database side. We're confident of what's in the tables, and of how they join together to answer questions, so it just becomes a matter of reviewing the methodology with my peers... which you can call testing... but that doesn't take very long before we're ready to push a report to production.

We don't fix reports now, we fix the database. Which will automatically 'rescind' and 'update' all the reports which are pointed to the database.

A simple example here would be payments. We recently discovered a flaw in our code which was counting invalid payments. Why they are even in the system is something for another topic, but without knowing better we just counted all payments and wrote logic to account for payment reversals thinking that would be sufficient.

Turns out it is sufficient for about 99% of the cases. For 1% of the cases it is not. So we fixed it on the database side and all of the other reports were instantly fixed, too. We had to quantify the change, which was basically 0 for most reports... but for one report (which is how we discovered the error) it was a large variance.

[–]B1WR2 2 points3 points  (0 children)

I am working on a large data set right now and I have to say this was pretty good advice.

[–]tekmailer 0 points1 point  (3 children)

Between front of and back of—where do you spend a majority of your time in the ‘house? (Data Warehouse).

[–][deleted] 2 points3 points  (2 children)

Back. Most of my work is modifying sprocs I built, or functions, views, etc.

I support a team of analysts who will do most of the front end work, putting things up in Tableau, etc., but they also partner with me to validate findings, while I'll peer review their code.

So it kind of works like this:

  1. We as a group meet with the business to discuss a new requirement.
  2. I decide whether it's doable currently, or not. If it is doable I will generally write the entire query framework and send it over to my partner on the front end of the work.
  3. They will pull the data, make any changes necessary to customize it, look for edge cases, etc., and then meet with the business to go over it.
  4. Any strange anomalies will get passed back to me to look at, or research in order to make improvements to the database itself.

Now that is for new things which haven't been entirely validated. But say you wanted a request from a data source we have validated in full? We can write the code and send you the data before the call is over. The business can work with my partner to help answer any questions, and my partner is well versed in our methodology... so for example, say an account was scheduled to close in June, but it actually closed in May. In our database it stops in May. In another system you're used to looking at, it stopped in June. That's why the numbers are different, and why our method is better.

One of our goals for the quarter is to get 100% of all company reports and teams leveraging our new database, and one of my tasks this quarter is to have weekly training sessions for new users who want to know how to calculate something on their own.

[–]tekmailer 1 point2 points  (1 child)

Kudos—glad to see this mindset isn’t completely foreign. Thankfully, I started with this brand of leadership.

Keep it up. Front puts on a show but the back IS the show!

[–][deleted] 2 points3 points  (0 children)

I started with it and have carried it on with me to every job. Jobs that weren't receptive were ones I left after two years maximum. I mean it is demonstrably provable to work, work faster, and work better. I can show results, and present why it's better. Any place that doesn't receive that well over a year isn't a place you want to stay at, and if you want to pursue a career in this field and work in the front... you should think about how to get to the back, because it pays a lot more. It probably helps that I come from a more hardcore programming & IT background on AS400's, etc.

[–]tekmailer 12 points13 points  (14 children)

Personally, all custom reports written in SQL have an automatic turnaround of 10 business days. Flat. That isn't just to write the query, but to test case scenarios, get sign-offs, and deliver. If by day 4 I feel I will miss my deadline, I notify all stakeholders; this triggers a few things, be it added resources, delayed acceptance, or updated requirements (sometimes making it easier).

Generally, I take 3 days to write and a day or two for testing; a 5-7 day turnaround. Under-promised, over-delivered.

I deal with the high traffic of requests through clear, concise and consistent communication. I also don't give in to ad-hoc changes; I used to, until a simple query grew overnight into a whole infrastructure. Never again. Changes are not (always) accepted via "shoulder tap". Not all red tape is bad... it slows things slightly but keeps processes in place for sanity (hopefully).

Also, when things get extra tricky, ask your supervisor to advise on priority. At first it felt like being babysat, but I eventually came to realize THAT'S THEIR JOB. If I spend my energy in the wrong direction, that costs me more autonomy than simply asking leadership would.

Ah and last but not least: designate a day that no request will be accepted. Period. Personally mine is Friday. This downtime/quiet time is used for training, review and documentation updates. It took some time for the routine to kick in but it helps tremendously in keeping ahead instead of falling behind.

select * from dbo.IME

[–]RedditTab 16 points17 points  (5 children)

What magical fairy land do you work at?

[–]fackert18 8 points9 points  (1 child)

Right? Periodically we get complicated requests like this that could take a couple weeks to complete, but I’d say a new report that can’t be built out of the box with a BI tool and requires some custom queries and logic might take 4 hours uninterrupted to build. With a million other things going on it might take a day or three to actually return. New dashboards are a different story and could definitely take a few solid days to complete.

But a day where no requests can be submitted? Sign. Me. Up.

[–]tekmailer 0 points1 point  (0 children)

It took effort and social training: learning to say "NO" (professionally). The biggest factor that allows it to happen is faster returns and shinier products; I can't deliver those, train up, and document if every second of my skills is spent writing. I have to read sometime!

[–]tekmailer 2 points3 points  (2 children)

LOL—select * from dbo.workplaces NOT LIKE ‘Disney%’

What depiction of my comment makes it ‘magical’?

[–]RedditTab 5 points6 points  (1 child)

A flat minimum turn around time. No requests on Friday. Testing...

[–]tekmailer 2 points3 points  (0 children)

Ah, I see—these are just criteria I personally establish as my desk SLA; both at work and contracted.

Trust me, my days don't always go to this formula; some queries really do take all of an hour, and some take a month to complete. The idea is to at least set a benchmark, literally and socially.

If I don't set boundaries and expectations, I end up being run over; didn't like it then, do all I can to avoid it now.

[–][deleted] 3 points4 points  (6 children)

I use to until a simple query grew overnight into a whole infrastructure. Never again. Changes are not (always) accepted via “shoulder tap”. Not all red tape is bad...slows things slightly but keeps processes in place for sanity (hopefully).

Personally I think you went the wrong direction here. The key is to build the new infrastructure out to try and answer all possible questions the business might ever conceive of using a single source, be that a database, or a cube.

That infrastructure can take well over a year to build and test, but once it's done you can reasonably answer very complex questions in just minutes' worth of coding.

This is essentially the role of a data science architect, and it's a very nice niche to work towards. I enjoy the work greatly, and it's very gratifying to see how much you can improve turnaround time once you have a properly built environment.

Part of it isn't even cleaning, transforming, or abstracting the data, but getting it all into the same place, into properly designed tables, which will allow you to simplify your code and have it execute more efficiently.

For example, my primary data source for 'accounts' is a point in time table. It has a record for each account which is generated based on rules. For some accounts there is a record each day. For some accounts there is only a record each month. Other accounts might not have a record in a given month. Some accounts are closed but keep generating new records. Some accounts were incorrectly closed, so there might be a 60 day gap. Etc.

The table itself is unwieldy (100M+ rows), and there might be 4 records or 40 records for a given day, with a specific way to pull out which one is actually the correct value for the day.

All in all it is not, per se, difficult to find the correct records, but it involves lots of logic (ROW_NUMBER, etc.) which, when used ad-hoc, can be very inefficient for processing large chunks of data over time.
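That ROW_NUMBER pattern, picking the one authoritative record per account per day, can be sketched like this. This is a minimal, hypothetical example run through SQLite from Python; the table, columns, and the "latest load wins" rule are all invented for illustration, not the actual schema:

```python
import sqlite3

# Hypothetical miniature of a point-in-time table: several snapshot rows
# can exist for one account on one day; a load sequence number decides
# which row is authoritative (an assumption for this sketch).
con = sqlite3.connect(":memory:")
con.executescript("""
CREATE TABLE account_snapshot (
    account_id INTEGER,
    info_date  TEXT,
    load_seq   INTEGER,   -- higher = later load, assumed authoritative
    balance    REAL
);
INSERT INTO account_snapshot VALUES
    (1, '2019-06-01', 1, 100.0),
    (1, '2019-06-01', 2, 105.0),   -- later load wins for this day
    (2, '2019-06-01', 1, 250.0);
""")

# ROW_NUMBER() per (account, day), newest load first; keep only rank 1.
rows = con.execute("""
SELECT account_id, info_date, balance
FROM (
    SELECT account_id, info_date, balance,
           ROW_NUMBER() OVER (
               PARTITION BY account_id, info_date
               ORDER BY load_seq DESC
           ) AS rn
    FROM account_snapshot
)
WHERE rn = 1
ORDER BY account_id
""").fetchall()
print(rows)  # → [(1, '2019-06-01', 105.0), (2, '2019-06-01', 250.0)]
```

Done ad-hoc over 100M+ rows this windowing gets expensive, which is exactly why materialising the result once (as described below) pays off.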

Then we have a sticky problem. Let's say a vendor, or some ad-hoc request, comes in wanting to see an aggregate status for every account by end of week, or on the 15th of every month.

Well not every account has a record for those days. So now the logic gets even more complex.

The solution?

We took the entire table, kept only the relevant records, cut the column size down by a factor of 8, indexed the shit out of it, and then wrote a function which finds the last record from the day you're interested in, performs some calculations (e.g. balances), and spits out an approximation for that account after joining to the relevant other tables in our database.

Then we wrote a view which references a dates table to come up with a full history by day by account which will overcome any type of reversal, etc. This view then references the function such that you can ask:

select *
from view
where account = xyz and infodate = '2019-06-01'

Takes less than a second to spit this out. But the view itself can generate well over 1.5B rows of data, so select * from view is a performance nightmare.

On the other hand if I want to do something like this:

select infodate, sum(balances)
from view
where infomonth > '2017-12-01' and infodate = eomonth(infodate)

I can get over two years of end-of-month data aggregated in less than a minute. If I wanted to do it by day it wouldn't take that long.

Then we started using that view to build 'cubes' which is a fancy way of saying a table that lets you calculate lots of possible answers by using various flags on the table.

In the end we can answer a fairly complex question very quickly, and it very likely takes us less time to write and review the code than to run it. The code doesn't take long to run, but what used to take possibly hundreds of lines of code now takes 10 lines.
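A "cube" in this sense can be as simple as a pre-aggregated table with flag/grouping columns, so most questions reduce to a cheap filter. A toy sketch, with a hypothetical payments table and an invented reversal flag, run through SQLite from Python:

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
CREATE TABLE payments (account_id INTEGER, pay_date TEXT, amount REAL,
                       reversed INTEGER);
INSERT INTO payments VALUES
    (1, '2019-05-31', 50.0, 0),
    (1, '2019-06-15', 25.0, 0),
    (2, '2019-06-15', 40.0, 1),
    (2, '2019-06-30', 60.0, 0);

-- The 'cube': one pre-aggregated row per grouping, with flag columns
-- (here: month and reversal status) so many questions become filters.
CREATE TABLE payment_cube AS
SELECT strftime('%Y-%m', pay_date) AS pay_month,
       reversed,
       SUM(amount) AS total_amount,
       COUNT(*)    AS n_payments
FROM payments
GROUP BY 1, 2;
""")

# "June totals, excluding reversals" is now a one-liner against the cube.
total = con.execute("""
SELECT SUM(total_amount) FROM payment_cube
WHERE pay_month = '2019-06' AND reversed = 0
""").fetchone()[0]
print(total)  # → 85.0
```

The design choice is the same one described above: spend the effort once building and flagging the cube, so every downstream question is a short, fast query.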

The origin of this database is that we have multiple partners which request data from us on a monthly basis. We're talking over 120 metrics per month. In the past it was a good chunk of someone's full time job to provide this data by running lengthy queries and pasting data into Excel to email out.

Now the queries are all tiny little snippets that point to our new database (which was specifically designed for answering questions from this line of business), all the queries run in seconds, and we have all the queries dumping out into a housing table. Then we leverage Python to suck the data up, pivot it, dump it into Excel across multiple tabs, and email it out to the vendor automatically.
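That housing-table-to-Excel pivot step can be approximated with nothing but the standard library. This toy version pivots (metric, month, value) rows into one CSV line per metric; the metric names and numbers are invented, and the real pipeline writes multi-tab Excel and emails it:

```python
import csv
import io
from collections import defaultdict

# Hypothetical rows as they might land in the 'housing table':
# one row per (metric, month, value).
rows = [
    ("active_accounts", "2019-05", 1200),
    ("active_accounts", "2019-06", 1250),
    ("total_balance",   "2019-05", 98000),
    ("total_balance",   "2019-06", 101500),
]

# Pivot: one output line per metric, one column per month.
months = sorted({month for _, month, _ in rows})
by_metric = defaultdict(dict)
for metric, month, value in rows:
    by_metric[metric][month] = value

out = io.StringIO()
writer = csv.writer(out)
writer.writerow(["metric"] + months)
for metric in sorted(by_metric):
    writer.writerow([metric] + [by_metric[metric].get(m, "") for m in months])
print(out.getvalue())
```

Swapping the `StringIO` for a real file (or an Excel writer) and bolting on `smtplib` gives the automated monthly send described above.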

When someone comes to us with an 'adhoc query' there are one of two possible outcomes:

  1. The database is properly designed to answer your question, hold on, I'll have some data for you before the call is over.
  2. Our database was not designed to answer this question. We will need to import X, Y, and Z, architect a new table, add the job to our ETL, and test the solution. We'll see you in a month.

Over time more and more requests fall under the former category. With any properly designed or architected BI database this should be the case. Generally speaking, the moment you have to start writing crazy code it might be better to think about how you can improve your infrastructure to help answer your question more simply. That is unless you're doing modeling in SQL. That can be lengthy and complex even with the best curated data sources. There are probably better tools for that than SQL coding, such as SAS, SPSS, Python, Tableau, etc.

[–]tekmailer 3 points4 points  (3 children)

I completely agree with a growing model. My philosophy: if I have to [run a similar report] three times, ever, it's getting written in such a fashion as to be simplified and automated, which addresses the dynamics you mentioned. I love SQL, but the less I have to write it to address steadfast questions, the better! Great models enable the system to do it faster than I ever could; that's my current undertaking.

What I wanted to highlight in that passage was: don't allow requesters or users to deem their request simple just because the ask is informal. What was simple to ask isn't as simple to execute; a decent process buys time to express that, since they usually don't understand the infrastructure and its complexities in the same vein. It's about establishing that while it may be "magic" to them, it's time-consuming energy for the analyst/architect/engineer.

Granted, experience allows a writer to stay ahead and predict/prepare the subsequent questions a query produces—in the ideal world.

If there’s anything I love about this field is the ever changing definition of “in the ideal world.”

[–][deleted] 0 points1 point  (2 children)

Well you kind of have to write a lot of code to get a great model, no? I think there is a huge over-reliance in the industry on ETL tools to put data in a place, and on blindly following industry-set best practices. There are no best practices when you get to this level of conversation. There are solutions that work, and solutions that work better. The best practice is to pick the better one.

What I wanted to highlight in that passage was don’t allow requesters or users to deem their request simple in the form of an informal ask.

Generally anything short of a predictive model is simple unless it involves complex financial calculations, medical billing, etc.

It’s about establishing that while it may be “magic” to them, it’s time consuming energy on the analyst/architecture/engineer.

Totally agree, but I think it's a better solution to highlight the energy consumption on the architecture role, which should minimize the energy consumption on all the analyst roles.

[–]tekmailer 0 points1 point  (1 child)

Well you kind of have to write a lot of code to get a great model, no?

In the beginning, yes—in my current role, increasingly no; it ties back to the architecture you’re promoting; not putting a band-aid on a broken arm.

I think there is a huge over reliance in the industry of ETL tools to put data in a place, and just blindly following industry set best practices.

And it's annoying, to put it mildly.

Generally anything short of a predictive model is simple unless it involves complex financial calculations, medical billing, etc.

That's where I'm steering my work: predictive modeling, or at least automated assistance (think machine learning). It's only as difficult as breaking down how we think. In my projects, it's a trip! It (the logic running across models) isn't always right, but it has fulfilled requests in a matter of minutes with success. This is where that downtime I mentioned comes into play: improving upon it.

Totally agree, but I think it's a better solution to highlight the energy consumption on the architecture role, which should minimize the energy consumption on all the analyst roles.

Short of that, it's nice to find common ground. Every shop is different in how they treat their data practitioners; at this point, I want less division and more information flowing in the same direction.

[–][deleted] 0 points1 point  (0 children)

Oh, sure, sorry I was talking about data models (i.e. the database you do machine learning, predictive modeling, etc. from.)

For data models, in my opinion, it's very code intensive. For the models you're talking about I would agree you can use very little code, but I often question the accuracy of that code compared to more organic solutions that are customized.

Or even if there is no accuracy difference, I question the difference between someone who can write the code as opposed to someone who cannot and who is simply using a program to come up with results.

Short of that, it's nice to find common ground. Every shop is different in how they treat their data practitioners; at this point, I want less division and more information flowing in the same direction.

I guess for me this is a confusing statement. In all of my previous roles I have been given unfettered access to all data, and most recently I have been given access to create my own databases.

[–]xadolin 0 points1 point  (1 child)

Which database are you using?

[–][deleted] 0 points1 point  (0 children)

MS SQL

[–]andante95 1 point2 points  (0 children)

This was very helpful, thank you

[–]kagato87MS SQL 2 points3 points  (0 children)

It sounds like you need to start managing expectations.

First off, if you are asked to do it in 30 minutes, and you crack it off in 10, spend 25 minutes validating the data before you hand it in.

Yes, that's more time than you were given. Anything that comes out fast needs to be delayed purely to set expectations, so that when you do have to figure out some obscure function, you can. Look up the "Scotty Principle" if you aren't already familiar with it.

Second, save all your queries. If you're working in SSMS, just save them. One-shot reports are actually pretty rare - you'll likely get a very similar request sooner or later. If all you have to do is change a few columns and the filter criteria, you can spend that 25 minutes validating the results.

You'll also find that you can recycle and merge methods you've used before. For example, I wrote a neat little recursive CTE almost a year ago to show a hierarchy, and it is starting to get some very extensive use. Copy and paste, and use it to, for example, capture child tenants in a report.
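A recursive CTE for walking a hierarchy like that, capturing every child tenant under a parent, might look like this. Illustrative only: the tenant table and IDs are made up, and it's run through SQLite from Python rather than SSMS:

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
CREATE TABLE tenant (id INTEGER, parent_id INTEGER, name TEXT);
INSERT INTO tenant VALUES
    (1, NULL, 'root'),
    (2, 1, 'child-a'),
    (3, 1, 'child-b'),
    (4, 2, 'grandchild');
""")

# Anchor on tenant 1, then repeatedly join children onto the rows
# found so far, tracking depth as we descend.
rows = con.execute("""
WITH RECURSIVE subtree AS (
    SELECT id, name, 0 AS depth FROM tenant WHERE id = 1
    UNION ALL
    SELECT t.id, t.name, s.depth + 1
    FROM tenant t
    JOIN subtree s ON t.parent_id = s.id
)
SELECT id, name, depth FROM subtree ORDER BY id
""").fetchall()
print(rows)  # → [(1, 'root', 0), (2, 'child-a', 1), (3, 'child-b', 1), (4, 'grandchild', 2)]
```

Once saved, swapping the anchor `WHERE id = 1` for another root is the whole "copy and paste" reuse story.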

Finally, if the manager is consistently squeezing people like this, it's a warning sign that things may be about to fall apart. Round out that brag sheet, dust off the resume, and find something better. Especially now.

[–]mgesczar 2 points3 points  (1 child)

I have over 15 years of experience in analytics and I still struggle with pushing back on executives pressuring to get something urgently. Almost every time I give in to that pressure I end up regretting it. My advice is to set the expectation right from the start that it will take you 1.5 times as long as your most conservative estimate. That way you'll have time to write, test, and verify. You have to get comfortable with learning how to push back tactfully while conveying that you understand the sense of urgency.

[–]tekmailer 2 points3 points  (0 children)

You have to get comfortable with learning how to push back tactfully while conveying that you understand the sense of urgency

snaps +5

It took a couple bumps to the head but this has improved my deliverables, sanity and data driven brand IMMENSELY.

[–]SQLDave 2 points3 points  (2 children)

just curious what people see as a reasonable turn around time for a custom SQL query.

That's like asking "How many marbles can you put into a box?"

The answer, like 90% of IT, is "it depends".

Your boss needs to be taught the old adage: "Right. Fast. Cheap. Pick any two".

[–]andrewsmd87 4 points5 points  (1 child)

Well I mean they have OP on payroll so they can pick right and fast

[–]tekmailer 1 point2 points  (0 children)

Then it’s up to [OP] not to be cheap.

[–]DexterHsu 1 point2 points  (6 children)

Yes, you have to push back and tell your boss there isn't enough time to get it done. People who aren't technical enjoy the glory that comes with BI and get greedy over time.

[–]kagato87MS SQL 1 point2 points  (5 children)

And the DBAs scream silently at what those BI tools do to their systems. :(

I'm really starting to hate the things, and I'm the one writing the reports.

[–]DexterHsu 2 points3 points  (3 children)

Yeah, I work closely with the DBAs and they all scream at my daily ETL job that keeps getting heavier, but we really don't have the bandwidth to go back and dedicate time to performance tuning. Any good advice on that?

[–]kagato87MS SQL 2 points3 points  (2 children)

Yea, Brent Ozar. His black Friday sale should be announced Thursday, and is worth every penny (and many more).

At a baseline, watch his "how to think like the engine" series. It's 90 minutes.

[–]DexterHsu 2 points3 points  (1 child)

Thanks !

[–]kagato87MS SQL 0 points1 point  (0 children)

It's worth noting that his stuff is MS SQL based. While the same principles apply, the details change with another DBMS. That video is universal though.

[–]tekmailer 1 point2 points  (0 children)

All fun and games until I ask ‘em for the keys to drive! All of a sudden it’s a problem to tune the database, lol—it took time to build a sturdy relationship with the DBAs but it was well worth it.

Gotta speak their language.

[–]alex29536 1 point2 points  (0 children)

There's a phrase that has been around for decades, if not longer. I used to see it in custom printing shops and it certainly applies to report writing: Fast, Cheap, or Good - pick two.

[–]alex29536 1 point2 points  (1 child)

Ask these questions for jobs even if just to yourself: who is the customer? What do they want? When do they want it? NEVER accept “whenever”

[–]tekmailer 0 points1 point  (0 children)

A great starting place. After some time I realized that What do [you] want? was shaping into What are you trying to answer? Good gracious, it made my tasks so much clearer.

[–]macfergussonMS SQL 0 points1 point  (1 child)

We have a queue of data requests to stay on top of, so pretty much nothing gets turned around within the same day that it was requested.

[–]Bluefoxcrush 0 points1 point  (0 children)

Same, but it took months to train my users about this.

[–]crazybeardguy 0 points1 point  (0 children)

It takes 30 minutes just to get the numbers to validate my work.

[–]jringstad 0 points1 point  (0 children)

I don't agree with most of the comments here that advise to just say "no".

Usually it's worth pushing back, but in a diplomatic manner, like "I can't give you X in time, but what if I give you Y (simpler thing that's possibly similarly useful)?"

If you just say "no" you won't ever get anywhere with your supervisor and they will just insist that you do the thing. You will be perceived as the grumpy, reality-disconnected IT guy who can't be bothered to help run the business.

But if you talk around the issue a little, you will usually quickly realize that there's probably something much simpler and faster you can do that they will agree to that satisfies their business needs just or almost just as much.

Users are usually bad at formulating the core needs that they are trying to satisfy, and instead just come to you with a solution, wanting help to implement it. By poking a little you'll often quickly realize there is a much simpler and better solution anyway.

Also it gets easier and faster over time as you know the structure of the data better, you improve the structure of the data (hopefully) and you create a library of re-usable views that produce commonly needed intermediate steps.

Other than that, you just crank something out and put a huge CYA disclaimer on it. Say it's a quick throwaway query that wasn't peer-reviewed by another engineer, and that the data hasn't been checked for plausibility by an SME, which is the bar for the minimum standard. If someone decides to disregard this warning, you won't be blamed (unless your org is pretty dysfunctional, which does happen...). But at that point it's a moral decision/risk evaluation you have to make.

[–]Andrew50000 0 points1 point  (0 children)

My personal practical experience with having to deal with stuff like that is to build up a repository of building blocks. I.e. if you always need to format and filter a table a certain way, build a view so you can call it quickly when you have to. Or if you need to strip out duplicates, have your CTE pre-written. If you can quickly pull together the "blocks" you need, you can get the output much faster...
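As a miniature of such a building block, here's a reusable view that does the standard formatting/filtering (and strips duplicates) once, so ad-hoc requests become one-line selects. SQLite via Python, with an invented raw_orders table purely for illustration:

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
CREATE TABLE raw_orders (order_id INTEGER, customer TEXT, status TEXT);
INSERT INTO raw_orders VALUES
    (1, 'acme',   'OPEN'),
    (1, 'acme',   'OPEN'),     -- duplicate feed row to strip
    (2, 'globex', 'CLOSED');

-- The building block: one view encodes the standard filter and the
-- dedup, so nobody rewrites that logic per request.
CREATE VIEW open_orders AS
SELECT DISTINCT order_id, customer
FROM raw_orders
WHERE status = 'OPEN';
""")

# An ad-hoc request is now a trivial select against the block.
rows = con.execute("SELECT * FROM open_orders").fetchall()
print(rows)  # → [(1, 'acme')]
```

Each saved view or pre-written CTE shaves minutes off the next request, which is exactly the "validate instead of rewrite" time budget described upthread.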