Partial Fuzzy Lookup? -
06-03-2008
, 12:02 PM
I need to do a partial fuzzy lookup in SSIS. This means that some columns
need to match exactly, while others are fuzzy matched. Picture an incoming
data stream T1 with two columns T1_C1 and T1_C2, and a lookup table T2 with
columns T2_C1 and T2_C2. Stream T1 has 16,000 rows, and lookup T2 has
215,000 rows. I want to tell the system to fuzzy match T1_C2 to T2_C2 with a
threshold of 80%, but only where T1_C1 and T2_C1 match exactly (100%).
That's the basic problem. To complicate this slightly, T1_C1 and T2_C1
actually will be matched using a SQL "LIKE" operator. T1_C1 contains values,
and T2_C1 contains a pattern which we store. FYI, I'm trying this in SQL
2008, though I don't believe there's been any change in fuzzy capabilities. |