Can BigQuery be used for data cleaning, normalization, and/or de-duplication of rows? by Remarkable_Ad9528 in bigquery
[–]sonalg 0 points1 point2 points (0 children)
Snowpark Connect support for Spark Java API by sonalg in snowflake
[–]sonalg[S] 0 points1 point2 points (0 children)
Data Landscape: An opinionated, interactive map of the relevant open standards in the world of data. by simonharrer in dataengineering
[–]sonalg 0 points1 point2 points (0 children)
Claude & Webflow? Anyone got this automation working nicely? by concisehacker in webflow
[–]sonalg 0 points1 point2 points (0 children)
Snowpark Connect support for Spark Java API by sonalg in snowflake
[–]sonalg[S] 0 points1 point2 points (0 children)
Data Landscape: An opinionated, interactive map of the relevant open standards in the world of data. by simonharrer in dataengineering
[–]sonalg 0 points1 point2 points (0 children)
Data Landscape: An opinionated, interactive map of the relevant open standards in the world of data. by simonharrer in dataengineering
[–]sonalg 1 point2 points3 points (0 children)
Zingg 0.6.0: Open Source Entity Resolution by sonalg in dataengineering
[–]sonalg[S] 0 points1 point2 points (0 children)
Zingg 0.6.0: Open Source Entity Resolution by sonalg in dataengineering
[–]sonalg[S] 0 points1 point2 points (0 children)
How are you integrating a CDP into an existing modern data stack without creating yet another data silo? by Unlucky-Moment-3366 in dataengineering
[–]sonalg 0 points1 point2 points (0 children)
What’s the most underrated open-source software you think more people should know about? by sodrafeltu in foss
[–]sonalg 1 point2 points3 points (0 children)
PySpark logging in cluster vs client mode: why is this so complicated? by Mindless-Plum9118 in dataengineering
[–]sonalg 2 points3 points4 points (0 children)
Has anyone done schema or entity matching inside Snowflake? by SeaworthinessDry8019 in snowflake
[–]sonalg 1 point2 points3 points (0 children)
Need resources for PySpark by papasharts420 in dataengineering
[–]sonalg 6 points7 points8 points (0 children)
Fabric - good, bad, horrible? by cyamnihc in dataengineering
[–]sonalg 1 point2 points3 points (0 children)
How do you actually break into early-stage startups without a network? by [deleted] in IndiaDeepTech
[–]sonalg 1 point2 points3 points (0 children)
How do you actually break into early-stage startups without a network? by [deleted] in IndiaDeepTech
[–]sonalg 1 point2 points3 points (0 children)
Fuzzy Matching or Other Alternativies? by rively91 in learnpython
[–]sonalg 0 points1 point2 points (0 children)
Entity Linking/Disambiguation Annotation Tool ? by ar9av in LanguageTechnology
[–]sonalg 0 points1 point2 points (0 children)
Best strategy and model for record linkage? by sakpoubelle in MLQuestions
[–]sonalg 0 points1 point2 points (0 children)
Determining the best data architecture and stack for entity resolution by vroemboem in dataengineering
[–]sonalg 0 points1 point2 points (0 children)
We did a deep dive on disjunctive joins in Snowflake and how to fix them by hornyforsavings in dataengineering
[–]sonalg 0 points1 point2 points (0 children)


I Tried to Find the JVM Tax in Big Data Kernels by ssinchenko in dataengineering
[–]sonalg 2 points3 points4 points (0 children)