gecko-dev/mailnews/base/util
scott%scott-macgregor.org ea0646f211 Bug #230093, Bug #181534, Bug #237095 --> Port thunderbird junk mail improvements to the trunk.
Replace the core bayesian junk mail algorithm with a chi-squared probability distribution
modeled after spam bayes and Gary Robinson's work.

Change the model for how we count tokens across messages.

Token counts get out of alignment when re-training against already classified messages.

Revamp the junk mail tokenizer. Make it a hdr sink listener and add custom tokens for attachment
information. Ignore tokens larger than 13 characters. Tokenize purely off of white space.
Ignore tokens less than 3 bytes in length. There is still a lot more work to be done to the tokenizer.


Many thanks to Miguel Varga for working out the initial core algorithm improvement and to all
of the folks at spam bayes and of course Gary Robinson for helping to make this happen.
2004-05-12 18:16:32 +00:00
..
.cvsignore
Makefile.in Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
msgbaseutil.pkg Beginning to land the PACKAGING_20030906_BRANCH for bug 20640. Not part of the build, yet. 2004-01-07 01:22:31 +00:00
nsAdapterEnumerator.cpp Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsAdapterEnumerator.h Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsLocalFolderSummarySpec.cpp Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsLocalFolderSummarySpec.h Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgDBFolder.cpp Bug 226270 Rationalization of folder listeners, part 2 r=bienvenu sr=mscott 2004-04-26 22:51:27 +00:00
nsMsgDBFolder.h Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgGroupRecord.cpp Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgGroupRecord.h Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgI18N.cpp Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgI18N.h Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgIdentity.cpp Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgIdentity.h Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-18 13:54:27 +00:00
nsMsgIncomingServer.cpp Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgIncomingServer.h Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgKeyArray.cpp Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgKeyArray.h Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgKeySet.cpp Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgKeySet.h Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgLineBuffer.cpp Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgLineBuffer.h Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgMailNewsUrl.cpp Bug #230093, Bug #181534, Bug #237095 --> Port thunderbird junk mail improvements to the trunk. 2004-05-12 18:16:32 +00:00
nsMsgMailNewsUrl.h Bug #230093, Bug #181534, Bug #237095 --> Port thunderbird junk mail improvements to the trunk. 2004-05-12 18:16:32 +00:00
nsMsgProtocol.cpp landing patch for bug 241124 "move GSSAPI code behind nsIAuthModule so it can be used by mail protocols" r=cneberg sr=bryner 2004-04-28 02:12:52 +00:00
nsMsgProtocol.h Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgTxn.cpp Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgTxn.h Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsMsgUtils.cpp Bug 242970 Server Settings/local directory was changed to \\.\D:\pop3.server.name (Other than C: drive,After 2004-04-29 build) 2004-05-12 06:11:07 +00:00
nsMsgUtils.h bug 240558 : mail filters moving emails to a folder with '@' or '!' in its name get disabled (r=bienvenu, sr=mscott) 2004-04-21 10:39:37 +00:00
nsNewsSummarySpec.cpp Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsNewsSummarySpec.h Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsUint8Array.cpp Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsUint8Array.h fix for performance bug #109557. stand alone message display time depends 2002-05-11 01:52:02 +00:00
nsUInt32Array.cpp Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
nsUInt32Array.h Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-17 18:33:16 +00:00
win32.order 65845 - new order files 2001-04-05 06:02:32 +00:00