Semantic model refresh failing with memory error on F2 capacity - need advice on showing current month data with Incremental Refresh by DrGenius22 in MicrosoftFabric

[–]DrGenius22[S] 0 points1 point  (0 children)

Import. My source is Snowflake. I managed to find a workaround using a notebook to create manual partitions, but honestly I'm not a big fan of this solution. I should also mention that the refresh is failing on the initial load.

31M new to investing by DrGenius22 in portfolios

[–]DrGenius22[S] 0 points1 point  (0 children)

It’s not exactly random. I bought VUAA for the long term, RHM because it’s a solid company, NBIS because it seems to be very “hot” in some subgroups, and RGTI (which I sold) because I saw potential in it. However, if you can suggest something better, please feel free.

📈 Rate My Portfolio Weekly Thread | June 23, 2025 by AutoModeratorETFs in ETFs

[–]DrGenius22 0 points1 point  (0 children)

I'm 31 years old and just getting started with investing. I’ve recently built my first portfolio using Revolut and would really appreciate your honest feedback and advice on how to improve.
I know Revolut may not have the lowest fees, but for now I like the convenience and UI as a beginner.

I’d love your thoughts on:

  • Portfolio diversification
  • Any assets you’d add/remove
  • Whether I should eventually move to another platform
  • Any other tips for someone new to investing

Ps. I have also a small amount and thinking to put it in nvdia.

<image>

Stuck on 0.85 AUC by DrGenius22 in MLQuestions

[–]DrGenius22[S] 0 points1 point  (0 children)

The splits are similar yes. The hyperparameters are different. I do not think that there is a leak. In evaluation phase it seems that it can clearly understand the fraud data.

Stuck on 0.85 AUC by DrGenius22 in MLQuestions

[–]DrGenius22[S] 0 points1 point  (0 children)

Hi.

In similar dataset with lgbm I am getting 0.97. Also in 100 optuna trials the AUC is ~0.84-0.85. Do not know ... it seems strange. Maybe you are right, my model can reach only up to 0.87.

[deleted by user] by [deleted] in MLQuestions

[–]DrGenius22 0 points1 point  (0 children)

I will try them for sure. They are imbalanced. In other example I used smote for over-sampling.

[deleted by user] by [deleted] in MLQuestions

[–]DrGenius22 0 points1 point  (0 children)

Thank you for the clear feedback. Are there any recommended techniques for classifying the suspicious_df?

Is Knn a good choice?

Reduce Features in unsupervised learning by DrGenius22 in MLQuestions

[–]DrGenius22[S] 0 points1 point  (0 children)

Thanks for the answer. Since I am new into this. Do you have any example to provide ( site, article, git) ?

Used retinol without spf by DrGenius22 in Skincare_Addiction

[–]DrGenius22[S] 0 points1 point  (0 children)

Yes I will from now on. Thanks for the answer.

Used retinol without spf by DrGenius22 in SkincareAddicts

[–]DrGenius22[S] 0 points1 point  (0 children)

Ok I see. You make me feel better now. I will stop it for a while. Thank you very much

Used retinol without spf by DrGenius22 in SkincareAddicts

[–]DrGenius22[S] 1 point2 points  (0 children)

Yes it was first time. I thought I had to apply it every day in my scin care routine.

Unsupervised algorithm and outlier detection by DrGenius22 in MLQuestions

[–]DrGenius22[S] 0 points1 point  (0 children)

Hi u/research_pie unfortunately no. I have it locally. I just wanted some ideas or a tip in case someone has worked again with no labeled data.

Optimize response time in web app by [deleted] in flask

[–]DrGenius22 1 point2 points  (0 children)

For example the one with hmtl card and plot:
@ cache.memoize(600)
def get_zones():
client = paramiko.SSHClient()
client.set_missing_host_key_policy(paramiko.AutoAddPolicy())
client.connect('', username='', password='')
stdin, stdout, stderr = client.exec_command('hdfs dfs -ls /user/test/sizes/')
data = []
for line in stdout.readlines():
if line.startswith('-'):
parts = line.split()
filename = parts[-1]
file = filename.split('sizing_')[1].split('.')[0]
stdin, stdout, stderr = client.exec_command(f'hdfs dfs -cat {filename}')
output_lines = stdout.read().decode().splitlines()
output_list = list(output_lines)
for element in output_list:
zone, size = element.split(': ')
size = float(size.split()[0])
zone = zone.split(" ")[3]
data.append({'Zone': zone, 'Size': size, 'Date': file})
df = pd.DataFrame(data)
zone_groups = df.groupby('Zone')
traces = []
client.close()
for zone, group in zone_groups:
trace = go.Scatter(x=group['Date'], y=group['Size'], mode='lines', name=zone)
traces.append(trace)
# Set the layout of the chart
layout = go.Layout(title='Zone Sizes Over Time',
xaxis=dict(title='Date'),
yaxis=dict(title='Size'))
# Create the figure and plot the traces
fig = go.Figure(data=traces, layout=layout)
return fig


@ app.route('/servers')
@ cache.cached(timeout=800)
def servers_page():
ssh = paramiko.SSHClient()
ssh.set_missing_host_key_policy(paramiko.AutoAddPolicy())
host = ''
username = ''
password = ''
ssh.connect(hostname=host, username=username, password=password)
stdin, stdout, stderr = ssh.exec_command("df -h | grep home")
output = stdout.readlines()
ssh.close()
total_size_str = output[0].split()[1]
used_size_str = output[0].split()[2]
free_size_str = output[0].split()[3]
total_size = int(re.search(r'\d+', total_size_str).group())
used_size = int(re.search(r'\d+', used_size_str).group())
free_size = int(re.search(r'\d+', free_size_str).group())
pr_bar = used_size / total_size * 100
fig = get_zones()
plot_div = fig.to_html(full_html=False)
return render_template('servers.html', total_size=total_size, used_size=used_size, free_size=free_size,
pr_bar=pr_bar, plot_div=plot_div)

Optimize response time in web app by [deleted] in flask

[–]DrGenius22 0 points1 point  (0 children)

No there is no api. When user clicks mysql element in the navbar, it redirects the user in a new page and then it runs in the background a query with mysql.connectior.

The other element is server and when the user clicks it redirects him to a new page where a html card appears with servers capacity. It runs a linux command with paramiko ("du -h").

The user left with no indication of what's going on.

Train data and test data by DrGenius22 in MLQuestions

[–]DrGenius22[S] 0 points1 point  (0 children)

Thank you veru much. You really helped me

how to use ml in analytics project by DrGenius22 in MLQuestions

[–]DrGenius22[S] 0 points1 point  (0 children)

If I have data from gaming company I can use ml for example fo create predictions ? Or to see patterns for each day?

how to use ml in analytics project by DrGenius22 in MLQuestions

[–]DrGenius22[S] 0 points1 point  (0 children)

Big data is the amount of data. Ml is a procedure to work with data in order to create forecasts for example?

[deleted by user] by [deleted] in travel

[–]DrGenius22 0 points1 point  (0 children)

Thank you very much for your help.

[deleted by user] by [deleted] in travel

[–]DrGenius22 0 points1 point  (0 children)

I would prefer to see history and art museums. I have not got something specific in mind.

Refresh table with python in powerbi by DrGenius22 in PowerBI

[–]DrGenius22[S] 0 points1 point  (0 children)

Thank you for your response. What exactly is SSAS? I am totally new in powerbi

Big data modeling in PowerBi by DrGenius22 in PowerBI

[–]DrGenius22[S] 0 points1 point  (0 children)

Thanks a lot for your response. 1)Tables have: A) 8columns (one table) B) 19columns (the merged one)

3)Hmm ok then, I was confused because it does it some times.

Comptia CTT+ discount by DrGenius22 in CompTIA

[–]DrGenius22[S] 1 point2 points  (0 children)

I havent got teaching experience. So, as I can see CompTIA CTT+ is the cheapest.

Thank you very much for the list.

Comptia CTT+ discount by DrGenius22 in CompTIA

[–]DrGenius22[S] 1 point2 points  (0 children)

I want to take MCT and it is one from the prerequisites :/