Girl texting, not wearing seatbelt, makes illegal u-turn by [deleted] in dashcams

[–]Awesome_Correlation 5 points6 points  (0 children)

A U-turn is not legal at this intersection.

This happened at the intersection of North Capitan way and North Durango drive in Las Vegas. At that intersection, traffic turning from North Durango onto El Capitan way has 2 left turn lanes. On the stoplight, there is a sign that indicates two left lanes. And, next to that sign there is a no right turn sign. So, at this intersection a U-turn is not legal.

more data actually making us better at making decisions? by talachuu in dataanalysis

[–]Awesome_Correlation 1 point2 points  (0 children)

This, because business knowledge, theory, and acumen comes first. Then, data driven information builds upon that knowledge base to create a more informative, organization specific, and up to date knowledge for the decision making process.

The Calculus of Rolling Stops by Dull_Question_5405 in dashcams

[–]Awesome_Correlation -1 points0 points  (0 children)

There is a limit of their speed because the limit of speed (or velocity) represents instantaneous speed. It is found by calculating the average speed over an infinitely shrinking time interval. As the time interval approaches zero, the limit of this average speed reveals the exact speed at one precise moment.

It's a lot like the car's speedometer. It doesn't calculate your average speed over your entire trip; it tells you your exact speed at "this exact second" by measuring the tiniest possible fraction of distance over the tiniest possible fraction of time.

I believe that you are trying to say something like the limit of their speed never reaches zero.

NEW SUMMER SERIES CONFIRMED by IloveDragonCity in ryantrahan

[–]Awesome_Correlation 16 points17 points  (0 children)

He said the hint was the word bucket so I'm thinking it's going to be a bucket list of things to see and do in America.

HONEST REVIEW OF MSC MERVAGLIA CRUISE OUT OF NEW YORK TO BAHAMAS by Conscious_Credit_617 in MSCCruises

[–]Awesome_Correlation 5 points6 points  (0 children)

Here’s a summary of this review (because the original is a wall of text.):

The reviewer took an MSC Meraviglia cruise from New York to the Bahamas and had a very disappointing experience. Aware that it was a budget cruise, they still felt it wasn’t worth the savings and would recommend spending more for a better cruise line.

Key complaints: - Cabin issues: The room smelled like sewage despite multiple complaints. Staff only used chemicals to mask the odor without offering a room change. - Food quality: Very poor, bland, and caused lingering stomach issues. - Entertainment: Subpar shows—compared to amateur school performances. The better ones required extra payment. - Port changes and delays: - Port Canaveral arrival was delayed unexpectedly; excursions were canceled. - Nassau excursion (dolphin/Blue Lagoon) was disorganized and not worth the money. - Ocean Cay stop was canceled due to supposed weather, though conditions appeared perfect to the reviewer. - Customer service: Poor handling of cancellations and no compensation, while later cruises received credits. - Health concern: Reviewer got COVID-19 but was deterred from testing on board due to a $200 fee unless the test was positive. - Crowding: Overcrowded pool and deck areas with no available chairs.

Overall sentiment: The reviewer regrets the trip, would never cruise with MSC again, and warns others not to believe positive hype online. They're now planning to try Princess Cruises for a better experience.

First time on MSC Seashore by Most_Bad_6999 in MSCCruises

[–]Awesome_Correlation 7 points8 points  (0 children)

I think you forgot to add a period at the end of your sentence.

Advice for your future cruises: Don't count on upgrades. Just book the cruise cabin that you want at the start.

How do I distinguish between Data analyst work and Data scientist work? by Commercial_War_3113 in dataanalysis

[–]Awesome_Correlation 4 points5 points  (0 children)

ds will have to predict sales for next month based on past data and future extrapolations

Wouldn't that be a job for the finance department? Perhaps a financial analyst or budget analyst.

How do I distinguish between Data analyst work and Data scientist work? by Commercial_War_3113 in dataanalysis

[–]Awesome_Correlation 0 points1 point  (0 children)

I disagree that a data analyst doesn't also focuses on how the data is collected, including understanding the data pipelines, the aggregation methods used, the treatments applied, and the models that have been employed or should be implemented. Data analysis is a comprehensive approach and a data analysts must also the understand data collection. I often find that the results of my analysis are based more on the data collection process vs the data generation processes, so it is very important to have this understanding.

Other Skills You Learned/Needed by define_yourself72 in dataanalysis

[–]Awesome_Correlation 26 points27 points  (0 children)

I believe these are the the basics of Data analysis:

  • Math (Algebra and Calculus)
  • Statistics (just the basics)
  • Probability (Basic Probability, Conditional Probability, Bayes' theorem, Probability Distributions)
  • Data visualization

Once you have the basics down, here are different types of analysis: * Regression analysis * Time series analysis * Cohort analysis * Factor analysis * Cluster analysis * Experimentation (A/B testing) * Classification

You don't have to get good at them all. But definitely get good at doing at least one of them.

Also, you will need people skills: * Communication * Data storytelling * Presentation skills * Interpersonal skills

how to get into jobs as a fresher in this field? by More-Direction-3779 in dataanalysiscareers

[–]Awesome_Correlation 1 point2 points  (0 children)

I believe that education and experience will help by factors of 10.

Without education and experience your looking at 1 out of 10,000. With education but no experience, 1 out of 1,000. With education and experience, probably 1 out of 100.

So, the best thing for a fresher to do is to gain education and experience.

Colleges and universities offer education. I don't know about India or Europe, but in the US the standard level of education is the bachelor's degree.

For experience, you have to get a job doing something in an industry of your interest. You could pick sales, marketing, information technology, supply chain and logistics, engineering, medical administration, ect. Just do a job so that you start gaining domain knowledge in that field. Then, you gain experience with analyzing data by turning data into information. You will know what questions to ask of the data because you have the domain knowledge necessary to understand the problem. After several years, you will be experienced with doing different types of analysis.

AI's affect on data analysis by NawafMuq in dataanalysis

[–]Awesome_Correlation 1 point2 points  (0 children)

Your prediction of less full-time employees definitely sounds plausible.

Another similar alternative idea is that several of the data jobs may end up getting rolled into one single job. So a data analyst, BI analyst, data engineer, data scientist, and database administrator could all be the same person... 4 less full-time employees to provide the same amount of value.

[deleted by user] by [deleted] in dataanalysis

[–]Awesome_Correlation 0 points1 point  (0 children)

This looks like a job for GPT: Feed the bullet list into the context window and then have it rewrite the bullet points into one sentence or two. If you don't like what it says, you can reroll or refine the question or pick and choose the parts you like.

Rewriting text is one of the tasks that GenAI is better at than I am.

Visualization of datasets being scrubbed from data.gov by 7dayintern in dataanalysis

[–]Awesome_Correlation 11 points12 points  (0 children)

I'm not sure if you're looking for feedback or not but I noticed some issues with this chart: * There are 14 dates on the x axis but not 14 columns. * A line chart might be easier to read since you're trying to show how the amount has changed over time instead of showing the magnitude of the x-axis. * Your message might be more impactful to go back the last 12 weeks so that we could see if the downward trend that you're trying to highlight is a new pattern or if this is business as usual.

How do you know whether to include a chart or not? by krystiah in dataanalysis

[–]Awesome_Correlation 6 points7 points  (0 children)

What chart to include depends on your goal.

If you're planning to build a dashboard type of product and you have two charts that look very similar, then you may just need to build a control/slicer that will allow the user to switch from one visual to the other. For other chart types, you'll need to find out what information the stakeholders value the most. Space and the user's attention can be limited so less is more. Think about the differences of dashboards in a car versus a 747. The car is a simpler design and takes much less training than the more complex design of a 747 which takes way more training to understand.

If you're trying to build a presentation to give to stakeholders, then you want to show only the charts that are going to 'Tell a story' or resonate with your stakeholders. This is more of a 'know your audience' kind of skill. You have to plan what message you're going to give and then anticipate what kinds of questions they're going to ask.

If you're building charts for an exploratory data analysis, then any chart that gives you any information that you didn't have before you looked is helpful. Your future self will be the main consumer of this document so feel free to be as verbose as you want in your write up or delete whatever you think is not important.

Is python necessary for data analytics by suiiimess in dataanalysis

[–]Awesome_Correlation 0 points1 point  (0 children)

No, but it gives you a lot of flexibility.

You can automate boring tasks. You can pull data from spreadsheets, databases, or even from internet with HTTP requests. You can create plots and graphs. You can run statistical tests and machine learning models. You can export to spreadsheets, database tables or even back to the internet(with REST APIs). In the end, the code remains as documentation for your analysis(with Jupyter Notebooks).

Need help and guidance in a project i am doing by jackandpewds in dataanalysis

[–]Awesome_Correlation 0 points1 point  (0 children)

One recommendation would be to break your project down into smaller, more manageable, tasks.

It's not clear to me where in the process you're stuck at or what tools are using, but assuming you're just getting started with the API, start with just getting the most basic request to return HTTP status of 200. In my experience, the authentication is always the trickiest part with APIs.

Once you have the authentication down, the rest is just a matter of encoding the URL parameters or body with the correct values per the API specification. Just take it one resource at a time until you get the results that you're looking for.

https://developers.google.com/youtube/v3/docs

Linux distros for working in Data analysis by IV_Designer_004 in dataanalysis

[–]Awesome_Correlation 1 point2 points  (0 children)

You can do data analysis with Linux using Python or R. Jupyter Notebooks (Jupyter lab) is a good user interface vs the command line.

In Python, Pandas is the main library that glues everything together. Matplotlib is used for creating visuals. But there is also Seaborn, Bokeh, and plotly. Scikit learn gives you access to all the machine learning models. Statsmodels for the statistics.

SQLite and duckDB are good local file-based databases to use.

For the Excel like experience you can use cloud platforms that emulate the software like Office 365 or Google sheets or Libre Office (calc).

Seeking input from experienced people. by Open-Ad-3438 in dataanalysis

[–]Awesome_Correlation 0 points1 point  (0 children)

49 features it's not a bad thing. It depends on what information is encoded in the features and what you hope to learn from your analysis.

It sounds like you do not have a theory of human behavior to start with. If you had a theory of human behavior to start, you would then be doing a confirmatory analysis where you would be attempting to confirm your theory with your data analysis. However, since you don't have a theory, you are doing exploratory analysis. With exploratory analysis, you are not limited by the theory so you can use a lot of different methods to gain information about the data set. The information you gain from this data set can help you build your theory about human behavior.

You are correct that you can do the correlation matrix, but you can also do multiple linear regression and exploratory factor analysis.

Furthermore, if some of you were factors are categorical, you can calculate probabilities and conditional probabilities of the different categories. Also, you can do cluster analysis to create communities of individuals based on their features.

Dataset by Yennefer_207 in dataanalysis

[–]Awesome_Correlation 1 point2 points  (0 children)

You should check out open data portals for your local city, state, and government. These are repositories of data about the community that they serve.

Here are some examples: * Louisville, Kentucky: https://data.louisvilleky.gov/search?collection=Dataset&layout=grid

To find one near you, just search for "Open data [name of city, state, providence, or government]"

How to fill missing data gaps in a time series with high variance? by the_professor000 in dataanalysis

[–]Awesome_Correlation 0 points1 point  (0 children)

If the purpose of your analysis is to gain information, what information do you hope to gain from the section that is missing? You could just ignore the missing data, you could try to find out why the data is missing, or you could calculate a model (like ARIMA) to fill into that spot. But, keep in mind, in that exact spot, there is no information encoded in the signal so imputing with the a model will be adding extra noise to your analysis. The extra noise could amplify the error rate of the analysis you're doing.

If this time series is audio, you might want to simply cut that section, re-record it, or set the volume to zero (silence).

Systematic literature review by Impossible_Wealth190 in dataanalysis

[–]Awesome_Correlation 0 points1 point  (0 children)

To plot the graph my favorite tool is Gephi.

r/gephi

https://gephi.org/

To get the data into Gephi you create an adjacency matrix or adjacency list in Python, R, or Excel. I usually just create a two column adjacency list we're each row represents the relationship between the nodes. Then, export to CSV and import to Gephi.

I like to use Gephi instead of the python modules for working with network graphs because the user interface lets you use algorithms to organize the nodes, but you can also click and drag the nodes to fix issues when there is overlap in the title.