dbTalk Databases Forums  

Re: Are there any MySQL queries or software packages for "findingsimilar items"

comp.databases.mysql comp.databases.mysql


Discuss Re: Are there any MySQL queries or software packages for "findingsimilar items" in the comp.databases.mysql forum.



Reply
 
Thread Tools Display Modes
  #1  
Old   
Robert Billing
 
Posts: n/a

Default Re: Are there any MySQL queries or software packages for "findingsimilar items" - 07-05-2010 , 04:42 PM






We, the Senate of Arcturus, take note that Ignoramus12110 said:

Quote:
So... Any suggestion for software to ran strings by similarity and
provide "top 5" or something like that?
I'd suggest that you make a dictionary of all the words in the questions,
then for each question make a map of the words it uses. Looking for
similarity of the maps might then get what you want.

Reply With Quote
  #2  
Old   
Ignoramus12110
 
Posts: n/a

Default Re: Are there any MySQL queries or software packages for "findingsimilar items" - 07-05-2010 , 05:10 PM






On 2010-07-05, Robert Billing <unclebob (AT) tnglwood (DOT) demon.co.uk> wrote:
Quote:
We, the Senate of Arcturus, take note that Ignoramus12110 said:

So... Any suggestion for software to ran strings by similarity and
provide "top 5" or something like that?

I'd suggest that you make a dictionary of all the words in the questions,
then for each question make a map of the words it uses. Looking for
similarity of the maps might then get what you want.
Robert, thanks. The devil is in the details (and speed of
implementation) and that is why I would love to find an existing
solution. But if worst comes to worst, I will implement something
along your lines.

i

Reply With Quote
  #3  
Old   
Ignoramus12110
 
Posts: n/a

Default Re: Are there any MySQL queries or software packages for "findingsimilar items" - 07-05-2010 , 05:11 PM



On 2010-07-05, Erick T. Barkhuis <erick.use-net (AT) ardane (DOT) c.o.m> wrote:
Quote:
Ignoramus12110:
...
So... Any suggestion for software to ran strings by similarity and
provide "top 5" or something like that?

All I can come up with is Levenshtein (not much experience using it,
though).
May I suggest you use "levenshtein mysql" or "levenshtein php" as a
search phrase?


Eric, thanks a lot. You got me started. Levenshtein distance and
"approximate string matching" are big topics that are well described
in Wikipedia. I am sure that if something exists, I can find it by
going from these two terms. Thank you again.

i

Reply With Quote
Reply




Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off



Powered by vBulletin Version 3.5.3
Copyright ©2000 - 2012, Jelsoft Enterprises Ltd.