[KLUG Members] Match-making data types and algorithms

Scott Wood members@kalamazoolinux.org
Thu, 28 Jun 2001 11:10:54 -0700 (PDT)


Well, I have a couple of different things for which I would like to create a
matching type scenario which I could only really compare to a dating service. 
For example:  On one DVD Ram I have close to 10000 sound bytes.  They range
from cartoon sounds, to sound effects, to movie snippets etc.  Ultimately, I
would like to set up a corresponding SQL database to include information on the
snippets.  I would prefer more than just a simple search against a description
or a transcription.  I would like to include something related to the style of
the snippet (e.g. humorous, sarcastic, dramatic, inane, etc) or other similar
parameters.  Then ultimately, should I have one sound that I like to use, to
have a tool available to pick the 'next closest' in content, style, etc.

The point being that for the various things I am looking to use something like
this on, there are multiple categories of relevant data that I would like to
create a result from with some sort of ranking for the closest, next closest
and so forth and to have the various categories be weighted or at least
weightable.  (i.e. search for something related to 'cars' would weight "content
LIKE '%cars%'" while looking for sarcastic snippets about government might
equally weight "style LIKE '%sarcastic%' AND description = 'sarcasm'".

Of course this is a simple example.  Ultimately, I will have close to a dozen
or more criteria set up for searching with different weightings based on
relevance, but would like to find examples of or text on searches like this.

Scott


--- Adam Tauno Williams <adam@morrison-ind.com> wrote:
> >Does anyone know of any good sources for information on data matching
> >formats and the algorithms that match them up?  The kind of stuff I am
> looking
> >for might be things such that - as hokie as it might sound - a dating
> >service might use.  I would be even interested in any references as to any
> >'psychology' that goes into formulating questions to create the answers to
> be 
> >matched.  I am especially interested in anything that would include 
> >'weighting' of some answers to take priority over others.
> >If anyone knows of a data source, be it online, in print or elsewhere -
> >that might help me get more information, it will be greatly appreciated.
> 
> I'm a little fuzzy as to what you mean... Are you looking for sources of data
> (tables, files, etc...) about a certain topic (dating services) or are you
> looking for tools to analyze data?  I'm currently researching data access
> tools
> for Linux so I may be able to help you with the latter.
> 
> Systems and Network Administrator
> Morrison Industries
> 1825 Monroe Ave NW.
> Grand Rapids, MI. 49505
> _______________________________________________
> Members mailing list
> Members@kalamazoolinux.org
> 


__________________________________________________
Do You Yahoo!?
Get personalized email addresses from Yahoo! Mail
http://personal.mail.yahoo.com/