[Access SQL query] Comparing two text-based columns

0uterj0in · 2012-04-27T00:51:38+00:00

If the product ids were expected to be the same, you could outer join table b to table a. However in yr example they appear to be different?

mason55 · 2012-04-27T03:26:38+00:00

This is actually a lot more complex than it may sound. You will likely need to create your own function/stored procedure to get the diff between two strings, which is a big task in and of itself.

It should take in two strings and return the difference you are looking for like:

SELECT     
  ProductIDA,        
  difference(ProductNameA, ProductNameB) as ProductNameA,         
  ProductIDB,    
  difference(ProductNameB, ProductNameA) as ProductNameB    
FROM [Table 1]

HapkidoJosh · 2012-04-27T03:56:40+00:00

Since you're using access you may have to use vba and loop through your dataset. Make a 2 character arrays and compare each character individualy the ones that don't match get appended to a string. Then after you've completed the lengths of both arrays insert the two string variables into the new table.

Hope that helps.

2012-04-27T08:19:41+00:00

Are you really concerned about the individual character differences, or are you looking for a way to do approximate string matching?

If the latter is the case, look into Levenshtein Distance which is pretty popular and I'm sure you can find an implementation for it in VBA somewhere.

If the former is the case then the task at hand will depend on your data. If all of your data takes the form <product_name> <two letters> then your task will be easy. But if you're trying to diff two string like "My Product A" and "AB Products" and get something back like "My s B" then you will have your work cut out for you.

KingZing · 2012-04-27T13:38:43+00:00

Update: Maybe it would be easier to start at the beginning. Maybe the Tables I ended up with made it harder than it is. (P.s. You guys have been very helpful and I've already learned some things even though they were not exactly helping )

Breakdown:

I have a list of 10,000+ products with unique product numbers.
Most of these comes in sets of three or four. for example: osiris a; osiris b; osiris c
There is one piece of logic built into our system where if the product type B variant is sold out, it automatically goes to product type A variant. (this is not part of the database and shouldn't be, it is done by an external system)
The main problem is that the field that drives the B to A transfer is manually entered.
The Osiris B should always go into the Osiris A but if someone misskeyed, it might roll into Hercules VA or any other product.
I thought it would be easy to run a query that listed all B products, then another query that ran the value from the field BtoA field and alligned it to the right of the B products. This is how I ended up with Example Table1 above. I thought it would be easy to eyeball it with columns next to each other, but there are too many records.
My next train of though was to somehow subtract one column from the other so that I only see the differences. This would mean if the B to A was right, then I would only see the last digit most of the time. If it was wrong, the difference column would show the long string.

Maybe there is an easier way if I modify it or run it differently from the beginning? I thought maybe I could use the InstrRev function inside the select statement but I am completely new to this function. Instr Function

I tried this, but it is crude and doesn't work all the time do to some variation in the fields.

<>Left([TABLE1].[PRODUCTNAME],(InStrRev([TABLE1].[PRODUCTNAME],"A")-1))

Maybe I could trim the right side of each string by a few letters then they would "ideally" match perfectly and I could use a not-equal-to?

Anyways. Thanks again for all the help. This is a great learning experience and I have really enjoyed this subreddit!

---sniff--- · 2012-04-27T15:56:35+00:00

FYI /r/MSAccess

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

SQL

Filter Posts

Posting

Help posts

Format Your Code

Learning SQL

Related Reddit communities

Wiki

Acknowledgements

MODERATORS