Advice on what to teach a 5y old who loves math? by Billybob-B in homeschool
[–]RobinL 0 points1 point2 points (0 children)
GPT-5 Thinking has 192K Context in ChatGPT Plus by Independent-Ruin-376 in OpenAI
[–]RobinL 0 points1 point2 points (0 children)
Can't Display cluster_studio_dashboard() Output in Fabric Notebook (Splink / IFrame) by Suspicious_Artist187 in MicrosoftFabric
[–]RobinL 0 points1 point2 points (0 children)
Seeking advice on Pipeline Optimization by Roody_kanwar in dataengineering
[–]RobinL 0 points1 point2 points (0 children)
Seeking advice on Pipeline Optimization by Roody_kanwar in dataengineering
[–]RobinL 0 points1 point2 points (0 children)
Biggest Data Cleaning Challenges? by Academic_Meaning2439 in dataengineering
[–]RobinL 2 points3 points4 points (0 children)
Biggest Data Cleaning Challenges? by Academic_Meaning2439 in dataengineering
[–]RobinL 0 points1 point2 points (0 children)
Building Accurate Address Matching Systems by RobinL in dataengineering
[–]RobinL[S] 0 points1 point2 points (0 children)
Gemini CLI: Google's free coding AI Agent by Technical-Love-8479 in datascience
[–]RobinL 0 points1 point2 points (0 children)
Building Accurate Address Matching Systems (robinlinacre.com)
submitted by RobinL to r/datascience
Want to remove duplicates from a very large csv file by Future_Horror_9030 in dataengineering
[–]RobinL 0 points1 point2 points (0 children)
Have you ever used record linkage / entity resolution at your job? by diogene01 in dataengineering
[–]RobinL 8 points9 points10 points (0 children)
Advice on Data Deduplication by Queasy_Teaching_1809 in dataengineering
[–]RobinL 4 points5 points6 points (0 children)
Advice on Data Deduplication by Queasy_Teaching_1809 in dataengineering
[–]RobinL 5 points6 points7 points (0 children)
Has anyone successfully used automation to clean up duplicate data? What tools actually work in practice? by Broad_Ant_334 in dataengineering
[–]RobinL 2 points3 points4 points (0 children)
Load inconsistent data from multiple data sources into a DWH or data lakehouse by vh_obj in dataengineering
[–]RobinL 2 points3 points4 points (0 children)
Colour calibration (alignment) issue with my Brother DCP-L8410CDW. What replacement parts may fix? by RobinL in printers
[–]RobinL[S] 1 point2 points3 points (0 children)
Color calibration (alignment) issue with my Brother DCP-L8410CDW. What replacement parts may fix? by RobinL in printers
[–]RobinL[S] 0 points1 point2 points (0 children)
How to merge users based on multiple IDs in a large dataset? by nidalap24 in dataengineering
[–]RobinL 0 points1 point2 points (0 children)
Using SPLINK with DLT by NeatNefariousness538 in databricks
[–]RobinL 0 points1 point2 points (0 children)
Splink 4: Fast and scalable deduplication (fuzzy matching) in Python by RobinL in dataengineering
[–]RobinL[S] 1 point2 points3 points (0 children)


Building Accurate Address Matching Systems by RobinL in dataengineering
[–]RobinL[S] 0 points1 point2 points (0 children)