Bayesian Filters Are Cool

3/29/2007 7:44:32 AM
I've been working on a little side project and found the need to filter data into "good" and "bad" types of data. After a bit of research, I settled on giving a simple Bayesian filter a try. I essentially modelled my approach off of what I had seen in spam arena since the ideas about good/bad data were similiar (though my data includes both words and numbers).

Well let me just say - cool stuff. Surprisingly easy to implement and once you get them trained, they do a very good job. I've trained my filters on about 1000 pieces of data and so far, the filter is able to correctly filter out the bad data at about a 90-95% rate, which is more than good enough for my scenario.

I read a quote somewhere once that said Google used Bayesian Filters like Microsoft used if-then statements. Well, if true, that is a scary thought now that I have experienced them first hand.

Be the first to rate this post

  • Currently 0/5 Stars.
  • 1
  • 2
  • 3
  • 4
  • 5

Tags:

Slick Thoughts

Related posts

Comments are closed

Powered by BlogEngine.NET 1.3.1.0
Theme by Mads Kristensen

About the author

Jeff Brand Jeff Brand

This is the personal web site of Jeff Brand, self-proclaimed .NET Sex Symbol and All-Around Good guy. Content from my presentations, blog, and links to other useful .NET information can all be found here.

E-mail me Send mail


Calendar

<<  October 2008  >>
MoTuWeThFrSaSu
293012345
6789101112
13141516171819
20212223242526
272829303112
3456789

View posts in large calendar

Twitter Updates

XBOX
Live

Recent comments

Disclaimer

The opinions expressed herein are my own personal opinions and do not represent my employer's view in anyway.

© Copyright 2008

Sign in