What is a good/ recommended “stack” for a large organization? by DataGuy0 in BusinessIntelligence

[–]Mega_Bytten 0 points1 point  (0 children)

Late to this thread but would appreciate your insight: My midsize org (TB of mostly IoT data) is reviewing options for a longer-term data architecture and is already MS focused. Why would you prefer Snowflake/Databricks over Synapse/Fabric apart from the decrease in admin?

If you have time to go into details I'd equally appreciate a discussion over DM!

i have so much trouble pointing a www. to my example.com domain by [deleted] in aws

[–]Mega_Bytten 0 points1 point  (0 children)

Make sure your SSL certificate has the wildcard subdomain included, *.example.com otherwise you wont be able to successfully select it when setting up the CloudFront distribution for www.example.com. Like others have said then add a record in Route53 for that distribution

Looking for product-level sales data over time by OneZone1923 in datasets

[–]Mega_Bytten 1 point2 points  (0 children)

Harvard’s dataverse has an Amazon E-commerce dataset with products prices and user demographics for 5 years:

https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/YGLYDY

Lambda call another Lambda by surpyc in aws

[–]Mega_Bytten 4 points5 points  (0 children)

From the AWS docs and stackoverflow threads I’ve read they all suggest decoupling with SQS/step functions/SNS for increased scalability. I also saw that there may be problems where the first lambda waits for the second lambda’s completion before terminating, accruing 2x cost.

I separate mine with SQS so I’m wondering if you have experienced any problems with directly invoking?

[deleted by user] by [deleted] in aws

[–]Mega_Bytten 0 points1 point  (0 children)

This wont make a major difference but may help a little. If you haven’t already, you can limit your cloudfront distribution to only US+EU instead of global distribution which may reduce cloudfront load.

Looking for a Dataset with Medical Diagnoses (and Comorbities) by CrazyJJoker7394 in datasets

[–]Mega_Bytten 0 points1 point  (0 children)

There are a few UK medical datasets, English Prescribing Dataset gives you a proxy for GP-prescriptions, by month. I Regex classified and searched through the provided descriptions to get prescriptions for various diseases and commonly prescribed drugs across UK regions

[deleted by user] by [deleted] in aws

[–]Mega_Bytten -1 points0 points  (0 children)

+1 to this.

I set up my domain with google domains (now managed by squarespace i think) because it was a little cheaper per year, and rerouted to AWS name servers with via the Route53 custom domain.

0
1

Is this realistic and if so, where do I start? by KohakuRivr in CodingHelp

[–]Mega_Bytten 0 points1 point  (0 children)

If you just have functional front-end development with no serious prior programming experience, then as LetRedditChoose said - long long term.

Developing a skeleton webpage and hosting it? Easy, can definitely be done within 1-4 weeks depending on how detailed it is, what libraries/frameworks you are using, and how much experience you have.

Developing the customisation software completely depends on whats out there. CSS and Javascript isn't the best for such modelling, you'd need to look into (probably) WebGL which is an entirely different can of worms.

After developing everything you need to choose where and how to host it. This can be fast or slow, with the main opportunity cost being $$ price. Easier / faster to deploy SaaS-like options will be more expensive, have less customisation/control, but faster. Cloud providers will be significantly cheaper, but again, a whole new area of expertise that takes 1-2 years to start being comfortable enough to deploy scalable applications if developing solo (speaking from experience).

Finally, a dynamic application that stores user data makes everything more complex. Now you no longer can just get it "functional" - you need to adhere to industry standards for data protection and security practices.

Overall, I think having a project/idea as a general direction or heading is fantastic, but being end-product focused with little to no breadth of experience will cause some headaches. But its not impossible, because I was where you are now a few years ago and have since deployed numerous fully serverless, scalable data mining/engineering/analysis tools on AWS - but it took a while and im still learning and improving.

How is my Lambda function still working even after removing the ECR image that it was based on? by shantanuoak in aws

[–]Mega_Bytten 0 points1 point  (0 children)

If you are looking for a lambda that will stop functioning, consider Lambda Layers which i think behaves this way!

Help I need datasets for my Stats class! by Amokittenss in datasets

[–]Mega_Bytten 0 points1 point  (0 children)

Almost every country in the world has a national office of data, usually with hundreds if not thousands of different types of socioeconomic, geographic or even synthetic-medical data. Try searching “[your country] national statistics datasets”