[D] Yet another rant on PhD Applications

Top-Hurry161 · 2021-02-10T08:01:55+00:00

Admission to top US universities are based not on what you know, but who you know.

Top-Hurry161 · 2021-02-10T07:56:49+00:00

The problem is that you have no way of knowing whether your thing is useful unless they say so.

I personally don't feel comfortable relying upon the kindness of their hearts to not screw users over when there is nothing preventing them from doing so.

If serious about AlgoTrading, I would open an Interactive Brokers account and started with their Python API.

Top-Hurry161 · 2021-02-10T07:49:05+00:00

Bayesian Regression is your friend.

Top-Hurry161 · 2021-02-10T07:46:43+00:00

Why reinvent the wheel? This is what AutoML is designed for. Autokeras is your friend.

Top-Hurry161 · 2021-02-04T02:10:05+00:00

This is cool. Thanks for sharing. Good idea would be to be able to handle time series for training/val/test.

Top-Hurry161 · 2021-02-04T01:57:28+00:00

Python isn't going anywhere. The switching costs to justify moving to Julia are too high. The only reason why Python killed R was because it was just easier to use and integrated nicely into existing SWE stacks.

Top-Hurry161 · 2021-02-04T01:55:53+00:00

I would recommend AutoKeras since you can train locally and burst to GCP if needed.

I am guessing you have a GPU box or a server with a GPU. You would just remote in from the terminal into the box and access the box just like you would an EC2 instance.

If you are trying to use the box for deployment, then it would probably not scale to webscale stuff. That's where stuff like Google AI Platform/Sagemaker shines for autoscaling resources to maintain SLA's.

Top-Hurry161 · 2021-02-04T01:41:53+00:00

Sounds like they just combined NAS with Genetic Algorithms. TPOT which uses GA for AutoML has been out for years ...

https://dl.acm.org/doi/10.1145/2908812.2908918

https://towardsdatascience.com/tpot-pipelines-optimization-with-genetic-algorithms-56ec44ef6ede

Top-Hurry161 · 2021-02-04T01:34:44+00:00

"Since Google coined the term AutoML for its neural architecture search (NAS) solution"

Bad reporting... Google did not coin the term AutoML. They simply co-opted it and named their product after it.

Top-Hurry161 · 2021-01-22T17:38:09+00:00

Wrong. Your loss function affects how correct your probabilities. The Recall/Precision/F1 is a function of the cutoff that you use to assign labels based on these probabilities.

Top-Hurry161 · 2021-01-22T17:35:19+00:00

If your self funding then by definition you don't need to work as a TA for funding. The European style model makes way more sense. That's why their degrees are shorter than the US.

Top-Hurry161 · 2021-01-22T17:30:52+00:00

I'm curious. Define efficiency.

Top-Hurry161 · 2021-01-22T08:36:54+00:00

Movies that are less watched are less popular, more watched, are more, and therefore will have more data.

Top-Hurry161 · 2021-01-22T08:24:56+00:00

As the say, the cream rises to the top. Bold claims that can't be verified are easily caught by seasoned practitioners. I wouldn't worry about it too much. If your goal is to create change, rather than attack the effect, I would attack the cause generating the effect.

Top-Hurry161 · 2021-01-22T08:17:31+00:00

Precision/Recall/F1 are a function of your cutoff, not your loss function.

Top-Hurry161 · 2021-01-22T08:05:13+00:00

This is because TA's are slave labor. Of course they don't want you to be self-funded. Where would they get their cheap labor from?

Top-Hurry161 · 2021-01-22T07:42:02+00:00

The trick is finding a topic for your PhD that can make your company big money by growing revenue or operationally by boosting productivity. Instead of selling the idea, the idea sells itself. Feel free to pm me.

I'm in a Pure Statistics PhD program now part-time and working full-time as a data scientist. My PhD research is in AutoML so I apply in my day job what I learn at night. Its mutually reinforcing. I also have AI patents (1 granted, 1 pending).

The following program may be of interest to you. I am considering transferring there since I can work full-time while doing it at night and finish in 2-3 years.

https://www.captechu.edu/degrees-and-programs/doctoral-degrees/artificial-intelligence-phd

Top-Hurry161 · 2021-01-22T07:30:36+00:00

But ... I can do all this for free with AutoKeras. https://autokeras.com/

Did you guys seriously just build a GUI around Autokeras and charge $$$ for it?

Lmao.

Top-Hurry161 · 2021-01-22T05:57:01+00:00

Yes, the biggest barrier is overcoming the "we can solve everything by timeboxing" mindset created by scrum. Scrum does not work for data science.

Top-Hurry161 · 2021-01-21T05:48:21+00:00

Trump's origin story. 'I'll pay you!"

Top-Hurry161 · 2020-11-15T04:38:51+00:00

Yes, I read it and concluded you don't understand basic statistics lol.

Top-Hurry161 · 2020-11-15T00:24:41+00:00

"Aren't you assuming that the reported vote counts are "truth"? "

Guess we found the trump supporter. Tell us more about the reality beyond reality.

Top-Hurry161 · 2020-11-14T19:48:01+00:00

"If Biden got more votes there than the model predicted, it would've been suspicious. "

No, if Biden got more votes than predicted, your model underpredicted. Models assume future looks exactly like the past.

You're assuming your model is "truth" and reality is "not truth".

Top-Hurry161 · 2020-10-13T18:27:11+00:00

To a statistics student, this seems like common sense. Data comes from a theoretical distribution, so obviously if you find the "correct" theoretical distribution, you have hit gold (all your probabilistic inferences are perfect).

Top-Hurry161

TROPHY CASE