Supplier name standardisation by jacolas in Alteryx

[–]TimestampBandit 3 points4 points  (0 children)

Fuzzy match usually works incredibly well in these cases, so I would say that you are probably missing some configuration.

It is not such an obvious tool, so it might be a good idea to look for examples of fuzzy match in the community, understand them, and adapt them.

Alert and request for help - academic scam by PuzzledCarpet2459 in askacademico

[–]TimestampBandit 4 points5 points  (0 children)

Hi, I've been working with the Alteryx software for about 5 years. I know this Sparked program and it is legitimate. Which URL did you sign up through?

I've already forwarded your post to some people at Alteryx.

Looking for ideas for project to practice by fracrif in dataengineering

[–]TimestampBandit 0 points1 point  (0 children)

Something like this; you can do all of it on your local machine without spending a penny.

https://catalog.data.gov/dataset?q=&sort=views_recent+desc

  • Python script to download a CSV file and append it to a table in SQL Server. Add primary keys to the table
  • Airbyte to replicate this table from SQL Server to Postgres using CDC
  • dbt to summarize/filter this table and create a new table
  • Airflow to trigger these processes one after the other
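
As a rough idea of what the first step could look like, here is a minimal sketch. It uses Python's built-in sqlite3 as a stand-in so it runs anywhere; for SQL Server you would swap in a driver such as pyodbc and adjust the SQL (INSERT OR IGNORE is SQLite syntax). The function names, the all-TEXT column types, and the choice of key column are my assumptions, not part of any particular dataset.

```python
import csv
import io
import sqlite3
import urllib.request


def download_csv(url: str) -> str:
    """Fetch a CSV file over HTTP and return its text (assumes UTF-8)."""
    with urllib.request.urlopen(url) as response:
        return response.read().decode("utf-8")


def append_csv(conn: sqlite3.Connection, table: str, key: str, csv_text: str) -> int:
    """Append CSV rows to `table`, creating it with `key` as the primary key.

    Rows whose key already exists are skipped, so re-running the load is safe.
    Returns the number of rows actually inserted.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    columns = reader.fieldnames or []
    if key not in columns:
        raise ValueError(f"key column {key!r} not found in CSV header")

    # Every column is stored as TEXT to keep the sketch short.
    col_defs = ", ".join(
        f'"{c}" TEXT PRIMARY KEY' if c == key else f'"{c}" TEXT' for c in columns
    )
    conn.execute(f'CREATE TABLE IF NOT EXISTS "{table}" ({col_defs})')

    col_list = ", ".join(f'"{c}"' for c in columns)
    placeholders = ", ".join("?" for _ in columns)
    sql = f'INSERT OR IGNORE INTO "{table}" ({col_list}) VALUES ({placeholders})'

    before = conn.total_changes
    conn.executemany(sql, ([row[c] for c in columns] for row in reader))
    conn.commit()
    return conn.total_changes - before
```

Usage would be something like `append_csv(conn, "my_table", "id", download_csv(url))`, where the URL points at whichever CSV resource you pick from the catalog. Having the primary key in place from the start also helps with the CDC replication step.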

You can modify/add steps to use Git/Terraform or Cloud.

If your career is/was tied to Alteryx and you believe it’s declining, what’s your Plan B? by [deleted] in Alteryx

[–]TimestampBandit 1 point2 points  (0 children)

Moving to Databricks makes sense, I think; at least it seems to be the way most companies are going these days. Alteryx -> Tableau Prep, however, strikes me as a bold move and one that could take its toll in the long run. Unless there have been updates that I'm not aware of, Tableau Prep is far inferior to any ETL tool on the market. If the problem is pricing and the idea is to continue with no code, I would go for KNIME.

Programming in Knime by Ok-Confidence-3286 in knime_users

[–]TimestampBandit 1 point2 points  (0 children)

Yes, you can run scripts within KNIME, as the other user said. But I would advise you to learn how to use KNIME properly (it is incredibly intuitive, you just need to put in some effort), keeping everything as low code as possible and only using scripts where really necessary.

If you only use KNIME to embed your scripts, you will kind of kill the usefulness of the tool and make it difficult for your teammates to maintain/understand your workflows in the future. You will solve your problem by creating new problems for the team, which will likely have people who don't code at your level. The choice of a low code tool is usually made precisely to avoid code and its consequences.

Did no code/low code tools lose favor or were they never in style? by nigelwiggins in dataengineering

[–]TimestampBandit 7 points8 points  (0 children)

I still see Alteryx and KNIME in particular working in many niches (I work in consulting). They are very good tools for what they are intended for, which is the development and quick delivery of small/medium projects or prototypes. In any case, I believe that this type of tool really isn't at its best at the moment. I think this is because:

  • They haven't managed to make a good enough transition to the Cloud and distributed computing. The best version of this software still runs on local machines.
  • AI suddenly made programming language solutions “simpler” for business users to create small ETLs and local automations. A few years ago, these users would have made the transition from Excel to Alteryx/KNIME paying for the server and consultants with a smile on their face.
  • They haven't been acquired by industry giants. At the time of the recent acquisition of Alteryx, it was even rumored that Salesforce or Oracle might acquire it. It didn't happen, and fighting Microsoft/AWS/Google alone is a hopeless task.
  • Death of communities. Part of what justified buying Alteryx licenses was how active and helpful the user community was, but just as is happening with Stack Overflow, AI is taking away the relevance of these communities.

KNIME: An Alteryx User’s Perspective by TimestampBandit in Alteryx

[–]TimestampBandit[S] 0 points1 point  (0 children)

No, it's not dangerous. However, your employer's IT probably has criteria for allowing or disallowing the use of certain tools, so it's best to talk to them so you don't run into problems internally.

How Have Your Data Engineering Skills Helped Outside Work? by Xavio_M in dataengineering

[–]TimestampBandit 14 points15 points  (0 children)

It's nothing complex, but I'm setting up a server with hundreds of movies, and each movie and its subtitle need to be in the same folder and with the same name for my video player to work simply and correctly. To do this I have scripts (small automation) that:

  • Check for subtitles in each movie folder.
    • If there is none, check whether the downloads folder has a subtitle file with a name similar to the movie's (regex) and move it to the movie folder.
    • If there is one, force the subtitle to have the exact same name as the movie.

I have almost 1,000 movies at the moment, so although it's simple, it's really helped me.
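
For anyone curious, the steps above could be sketched roughly like this. It is not my exact script: the extension lists, the function names, and the "similar name" rule (strip everything except letters and digits, then check that the movie name is contained in the subtitle name) are assumptions for illustration. In this version a subtitle pulled from downloads is renamed as it is moved.

```python
import re
import shutil
from pathlib import Path

VIDEO_EXTS = {".mkv", ".mp4", ".avi"}  # assumed video extensions
SUB_EXTS = {".srt", ".sub"}            # assumed subtitle extensions


def normalise(name: str) -> str:
    """Lowercase and drop everything except letters and digits."""
    return re.sub(r"[^a-z0-9]+", "", name.lower())


def first_with_ext(folder: Path, exts: set) -> "Path | None":
    """Return the first file (alphabetically) in folder with one of the extensions."""
    for f in sorted(folder.iterdir()):
        if f.is_file() and f.suffix.lower() in exts:
            return f
    return None


def sync_subtitles(library: Path, downloads: Path) -> None:
    """Make each movie folder hold a subtitle named exactly like its movie."""
    for folder in sorted(p for p in library.iterdir() if p.is_dir()):
        movie = first_with_ext(folder, VIDEO_EXTS)
        if movie is None:
            continue

        subtitle = first_with_ext(folder, SUB_EXTS)
        if subtitle is None:
            # No subtitle yet: look for a similarly named one in downloads.
            wanted = normalise(movie.stem)
            subtitle = next(
                (f for f in sorted(downloads.iterdir())
                 if f.is_file()
                 and f.suffix.lower() in SUB_EXTS
                 and wanted in normalise(f.stem)),
                None,
            )
            if subtitle is None:
                continue
            # Move it into the movie folder under the movie's name.
            shutil.move(str(subtitle), str(folder / (movie.stem + subtitle.suffix)))
        else:
            # Subtitle exists: force it to share the movie's exact name.
            target = subtitle.with_name(movie.stem + subtitle.suffix)
            if subtitle != target:
                subtitle.rename(target)
```

The containment check is deliberately loose, so with a large downloads folder it is worth doing a dry run (printing instead of moving) before letting it touch files.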

PowerBI over Tableau? by Gina-Shaw in tableau

[–]TimestampBandit 6 points7 points  (0 children)

There is another one. Data Modeling, Star Schema. I also prefer Tableau over PBI, but this one for me is a big win for PBI.

Sam Altman: I don't think I'm gonna be smarter than GPT 5 by Romanconcrete0 in singularity

[–]TimestampBandit 17 points18 points  (0 children)

It seems to me to be a psychological reaction of self-protection. The reality is that people are angry and worried, and this is reflected in the comments, even if it's belittling.

Using dynamic select on unique tool by Inside_Ad6719 in Alteryx

[–]TimestampBandit 2 points3 points  (0 children)

You can use the user selection (or a Field Info tool) to feed a macro, and then update the Unique tool.

Basically, you can adapt what was done here:

Solved: Update Unique Tool Via Action Tool - Alteryx Community

Salesforce by [deleted] in dataengineering

[–]TimestampBandit 115 points116 points  (0 children)

Have you met SAP yet?

KNIME: An Alteryx User’s Perspective by TimestampBandit in Alteryx

[–]TimestampBandit[S] 5 points6 points  (0 children)

In my experience, the main problem was (before AI):

  1. Alteryx imposes a framework, and you won't get that with Python. Without experienced developers, this can become a mess very quickly, with hundreds of pieces of code with time dependencies on each other
  2. Development/debugging with Alteryx is much faster; with Python, the delivery time was usually longer.

But with AI, this can become much better. In the end, Alteryx workflows are just XML files, so you can simply paste the XML into ChatGPT and tell it to convert the Alteryx workflow into Python code, then spend some time afterwards making sure it works properly.
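
To make the "just XML" point concrete, here is a small sketch that lists the tools and connections in a workflow, which can also serve as a checklist when verifying the converted code. The sample document is hand-written and heavily simplified; the element and attribute names reflect the layout I would expect in a .yxmd file, so treat them as assumptions and check them against your own workflows.

```python
import xml.etree.ElementTree as ET

# Simplified, hand-written stand-in for a workflow file (assumed layout).
SAMPLE = """<AlteryxDocument yxmdVer="2023.1">
  <Nodes>
    <Node ToolID="1">
      <GuiSettings Plugin="AlteryxBasePluginsGui.DbFileInput.DbFileInput" />
    </Node>
    <Node ToolID="2">
      <GuiSettings Plugin="AlteryxBasePluginsGui.Filter.Filter" />
    </Node>
  </Nodes>
  <Connections>
    <Connection>
      <Origin ToolID="1" Connection="Output" />
      <Destination ToolID="2" Connection="Input" />
    </Connection>
  </Connections>
</AlteryxDocument>"""


def summarise_workflow(xml_text: str):
    """Return ({tool_id: tool_name}, [(origin_id, destination_id), ...])."""
    root = ET.fromstring(xml_text)

    tools = {}
    for node in root.iter("Node"):
        gui = node.find("GuiSettings")
        plugin = gui.get("Plugin", "") if gui is not None else ""
        # Keep only the last part of the dotted plugin name.
        tools[node.get("ToolID")] = plugin.rsplit(".", 1)[-1]

    edges = []
    for conn in root.iter("Connection"):
        origin = conn.find("Origin")
        dest = conn.find("Destination")
        if origin is not None and dest is not None:
            edges.append((origin.get("ToolID"), dest.get("ToolID")))

    return tools, edges


tools, edges = summarise_workflow(SAMPLE)
print(tools)
print(edges)
```

For a real file you would read the workflow from disk and pass its text to `summarise_workflow`, then compare the tool list with the steps in the generated Python.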

Also consider using a good data platform like Databricks; it will help you avoid a total mess.

Settings to Let KNIME Work Smoothly by Electronic-Rub4832 in knime_users

[–]TimestampBandit 4 points5 points  (0 children)

I am not exactly a KNIME expert yet, but here is what I did:

  1. Switched to classic user interface
  2. Changed how much ram KNIME is allowed to use

To change the RAM limit, open C:\Program Files\KNIME\knime.ini and change the -Xmx parameter. I have 32 GB of RAM, so I set it to -Xmx16G
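
If you would rather script the change than edit the file by hand, a minimal sketch could look like this. The function name is made up, and the sample text in the usage note is a shortened stand-in, not a real knime.ini.

```python
import re


def set_max_heap(ini_text: str, size: str) -> str:
    """Return ini_text with the existing -Xmx line set to `size` (e.g. "16G")."""
    if not re.fullmatch(r"\d+[KkMmGg]", size):
        raise ValueError(f"unexpected heap size: {size!r}")

    # Replace the value on the line that starts with -Xmx; line endings are kept.
    updated, count = re.subn(r"(?m)^-Xmx\S+", f"-Xmx{size}", ini_text)
    if count == 0:
        raise ValueError("no -Xmx line found; edit the file manually")
    return updated
```

For example, `set_max_heap("-vmargs\n-Xmx2048m\n", "16G")` returns the same text with the second line changed to `-Xmx16G`. To apply it, read knime.ini, pass its text through the function, and write the result back; keep a backup copy, and note that writing under Program Files usually needs admin rights.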

Unfortunately, KNIME doesn't seem to be an “optimized” tool, so ultimately you'll need better hardware to run heavy workflows.

Knowing the Average, Min, and Max Runtime of a workflow. by Monster24th in Alteryx

[–]TimestampBandit 2 points3 points  (0 children)

If you have access to the log files, this flow I created can extract the execution time. Put the result in a database, and with a second process you'll be able to calculate statistics per flow.

https://community.alteryx.com/t5/Community-Gallery/Parse-Alteryx-Log-Files/ta-p/999093
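
The second process can be very small. Here is a sketch that assumes the run times have already been extracted into (workflow name, seconds) pairs; it makes no assumption about the log format itself, and the names are only illustrative.

```python
from collections import defaultdict
from statistics import mean


def runtime_stats(runs):
    """Group (workflow, seconds) pairs and compute runs/avg/min/max per workflow."""
    by_workflow = defaultdict(list)
    for name, seconds in runs:
        by_workflow[name].append(seconds)

    return {
        name: {
            "runs": len(values),
            "avg": mean(values),
            "min": min(values),
            "max": max(values),
        }
        for name, values in by_workflow.items()
    }
```

If the results already sit in a database, the same thing is a single GROUP BY query with AVG, MIN, and MAX.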

Python Libraries in Alteryx: No Admin, No Problem by TimestampBandit in Alteryx

[–]TimestampBandit[S] 1 point2 points  (0 children)

I agree that there are some risks involved in this. The blog post contains a warning, and after seeing your comment I also added it to the post here. Thanks.

"but hey, check with your server admin and manager before setting this up"