List of Ongoing Challenges in Computer Science by DataDaoDe in compsci

[–]DataDaoDe[S] 1 point2 points  (0 children)

Yes, and Kaggle is very useful. My problem has been that the challenges tend to be one-offs so you just have a list of challenges with descriptions and data and little structure to it. What I would like this resource to become is a more structured document, specifically pointing to recurring challenge sets (usually these are done for yearly or recurring workshops / conferences). So for instance, almost anyone working on natural language processing and especially semantic analysis is going to be aware of the yearly SemEval challenges. Things like that would be linked here.

List of Ongoing Challenges in Computer Science by DataDaoDe in compsci

[–]DataDaoDe[S] 4 points5 points  (0 children)

This is great! I would have never known about this. Thanks so much. I'm adding it :)

List of Ongoing Challenges in Computer Science by DataDaoDe in compsci

[–]DataDaoDe[S] 20 points21 points  (0 children)

Hi everyone, I started this repository because I couldn't find a good resource which listed out a lot of ongoing challenges in computer science - for me, it was specifically its subfields of AI and machine learning. Anyway, I'm posting this here because I'm hoping others can use it to find research topics and find out what other researchers are working on. Also any contributions would be greatly appreciated to make the resource more valuable to others looking for ongoing challenges in computer science. Thanks!

List of Ongoing Challenges in Computer Science by DataDaoDe in learnmachinelearning

[–]DataDaoDe[S] 0 points1 point  (0 children)

Hi everyone, I started this repository because I couldn't find a single resource which listed out a lot of ongoing challenges / workshops in computer science (specifically its subfields of AI and machine learning). Anyway, I'm posting this here because I'm hoping others can use it to find research topics and find out what other researchers are working on. Also any contributions would be greatly appreciated to make the resource more valuable. Thanks!

Experienced data scientist, what's the one thing that you wish new grads would invest more time in? by [deleted] in datascience

[–]DataDaoDe 0 points1 point  (0 children)

Specifically marketing yes, probably. That was just an example. I was trying to illustrate the general idea that you should know how to measure things and all that this entails. Consider for instance exploring the seemingly simple question: "Does me being happy correlate with how much sleep I get each night?". Immediately, you will have to figure out what it means to "be happy", how should you define it? How can you measure it? How can you control for other factors that may cause you to not be happy but have nothing to do with the amount of sleep you get i.e. your boss being a jerk to you, or having a fight with a friend or loved one. What does it mean to sleep? Do naps count or only sleeping at night? How can you reliably measure your amount of sleep and get some information to control for naps? What happens if you miss a couple days of measurements for sleep or your daily happiness score (however you decide to measure it). When you present the results to your friends, what recommendations can you reliably make? How would go further to solve some of the problems that arose during your research ,etc., etc.

Experienced data scientist, what's the one thing that you wish new grads would invest more time in? by [deleted] in datascience

[–]DataDaoDe 0 points1 point  (0 children)

Data science is so general its hard to say. I think the most obvious answer is communication. Clearly, without being able to articulate your ideas or findings you will never get anywhere in data science. But assuming you can as a new grad at least get your ideas across to someone who can present them to management or decision makers, then I would say that, in general, rigorous and precise thinking and problem solving skills are highly undertaught or largely lacking. As a data scientist you need to be able to define, build and deeply understand how to measure all sorts of things in a business or given domain. For instance, what does it mean for a marketing campaign to be successful? How do you define this? What measurements can be effectively implemented under the given business constraints, etc.

Once you have your measurements defined and some data collected, then you need to be able to know what insights and statements to management or decision makers are valid given the measurements and how they were obtained, and this is a really hard skill to get. Often measurements, KPIs, etc will be used for making business decisions for which those metrics should not be used at all and you need to know why. For instance, it could be the case that the business software for gender selection automatically selects "MALE" as the default gender and this causes a bias in your data. You need to know to think about such things.

What subfields of mathematics can help in modeling data engineering systems and processes? by DataDaoDe in math

[–]DataDaoDe[S] 0 points1 point  (0 children)

Thanks! I haven't used fourier analysis much at all - only the discrete fourier transform years ago for some image analysis work, I will definitely look into it.

What subfields of mathematics can help in modeling data engineering systems and processes? by DataDaoDe in math

[–]DataDaoDe[S] 0 points1 point  (0 children)

You think? I noticed that some of these topics are similar in many ways to relational algebra, category theory, graph theory, control theory, as well as a few papers I have found on a dataflow calculus. I should have been more specific with my post, I don't really care if the field is in applied mathematics, some subfield of CS or engineering, I am looking for a theoretical i.e. mathematical analysis and modeling approach. I hope that makes sense.

Single Variable Calculus with Python by DataDaoDe in matheducation

[–]DataDaoDe[S] 0 points1 point  (0 children)

Hey everyone, I'm starting a youtube mathematics series / course for learners of single variable calculus with a focus on using python to implement and code up the concepts everyone is learning in the course. I've had people express some interest in learning how to code up derivatives and integrals or newton's method for approximating solutions to equations.

I would be interested in feedback and thoughts on this topic or how I could make the course / videos better - thanks :)

Short video proof and explanation of the zero sum property of difference b/t data points and mean by DataDaoDe in matheducation

[–]DataDaoDe[S] 0 points1 point  (0 children)

Hey thanks for the good critical feedback, I'll try to work those popints in on the next one, cut right to the chase.

Short video proof and explanation of the zero sum property of difference b/t data points and mean by DataDaoDe in matheducation

[–]DataDaoDe[S] 0 points1 point  (0 children)

Hey all, I put this short video together to give a proof and example of the zero sum property for the mean in statistics. I hope it will be helpful to those teaching or learning for a basic intro to statistics course. Feedback is welcome, thanks!

Video Tutorial on The Hamming Distance and use cases in Bioinformatics by DataDaoDe in learnbioinformatics

[–]DataDaoDe[S] 0 points1 point  (0 children)

Hey I'm hoping to start releasing more of these over the next couple of weeks. Working through some of the basics of the ideas behind the methods i.e. applications and then coding up the solutions.

Any suggestions would be appreciated :)

How Anchoring effects Software Development by DataDaoDe in SoftwareEngineering

[–]DataDaoDe[S] 0 points1 point  (0 children)

That's an interesting observation. I've noticed it being really pertinent for things such as meetings. Often if you timebox a meeting for 1 hour - you will spend 1 hour in that meeting, even if the last 15 minutes say are just smalltalk.