How is Running % calculated from Token %
April 12, 2011
8:40 pm
Hobart
New Member
Forum Posts: 2
Member Since:
April 12, 2011
Offline

This may be non-trivial to answer (given the Bayesian system behind it),
but some reference to it would surely be worthwhile in the help. The
relationship is anything but intuitive, and it is disorienting at first.
It seems that a high Token % will nudge the Running % up slightly … but
by how much?

April 13, 2011
8:52 am
Admin
Forum Posts: 323
Member Since:
July 12, 2008
Offline

Yes, it is difficult to answer that, as it depends on the total number of tokens. What I have found useful to understand is that if an individual token is greater than 50%, it pushes the score up, and if it is less than 50%, it pushes the score down. If you want more detail, you should really look at the underlying code in the extension or in the base code.

I did not write the underlying Bayes code. I only tried to make the running % an accurate representation of individual token effects that sums to the same value calculated by the Bayes algorithm. One thing to note is that I first sort the tokens by their absolute distance from 50%, so that you can see which are the most important tokens.
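
To make the idea concrete, here is a minimal sketch of a running percentage. It is not the code from the extension or from the base Bayes filter. It assumes a plain naive-Bayes combination of per-token probabilities in log-odds form, and the function name `running_percent` and the clamp limits are hypothetical. It only illustrates the two points above: tokens are ordered by their absolute distance from 50%, and each token above 50% pushes the running value up while each token below 50% pushes it down.

```python
import math


def running_percent(token_probs, lo=0.01, hi=0.99):
    """Return (token_prob, running_prob) pairs, most decisive token first.

    Assumption: tokens are combined with a simple naive-Bayes rule,
    which may differ from what the real filter does.
    """
    # Most important tokens first: largest absolute distance from 50%.
    ordered = sorted(token_probs, key=lambda p: abs(p - 0.5), reverse=True)

    log_odds = 0.0  # even odds (50%) before any token is seen
    rows = []
    for p in ordered:
        # Clamp so that 0% or 100% tokens do not give infinite odds.
        clamped = min(max(p, lo), hi)
        # Above 50% adds positive log-odds, below 50% adds negative.
        log_odds += math.log(clamped / (1.0 - clamped))
        running = 1.0 / (1.0 + math.exp(-log_odds))
        rows.append((p, running))
    return rows


if __name__ == "__main__":
    for token, running in running_percent([0.9, 0.6, 0.2]):
        print(f"token {token:.0%}  running {running:.1%}")
```

Under this assumed rule, the size of each nudge depends on how far the token is from 50% and on where the running value already stands, which is one reason the relationship is hard to state as a single number.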

August 9, 2011
2:18 am
New Member
Forum Posts: 1
Member Since:
August 9, 2011
Offline

Where is the source of Junquilla available, and how do I build and install a customized version if I want to do some debugging? It would be interesting to know why some of my messages have a "Junk %" of 100% in the message list, but when I look at Junk Analysis Detail for the message, there is only a single token with a high Token % (94%), and a ton of tokens with rather low Token % (<10%).

August 26, 2011
9:16 am
Admin
Forum Posts: 323
Member Since:
July 12, 2008
Offline

The addon file, which ends in .xpi, is just a zipped version of the source, so if you unzip it you will have the source. You can then zip it back up again to reinstall it. There are better ways of doing this if you do development, though; see https://developer.mozilla.org/en/Extensions
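
As a rough illustration of the unzip and re-zip round trip, here is a small sketch that uses only Python's standard `zipfile` module. The function names and the file names in the usage comments are hypothetical, and it assumes the files should sit at the root of the archive rather than inside an extra folder.

```python
import zipfile
from pathlib import Path


def unpack_xpi(xpi_path, dest_dir):
    """Extract every file in the .xpi (a zip archive) into dest_dir."""
    with zipfile.ZipFile(xpi_path) as archive:
        archive.extractall(dest_dir)


def pack_xpi(src_dir, xpi_path):
    """Zip the contents of src_dir into xpi_path, with paths relative to src_dir."""
    src = Path(src_dir)
    with zipfile.ZipFile(xpi_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for path in sorted(src.rglob("*")):
            if path.is_file():
                # Store paths relative to src_dir so files stay at the archive root.
                archive.write(path, path.relative_to(src).as_posix())


# Hypothetical usage (file names are placeholders):
# unpack_xpi("junquilla.xpi", "junquilla-src")
# ... edit files under junquilla-src ...
# pack_xpi("junquilla-src", "junquilla-custom.xpi")
```

Write the new .xpi outside the source folder, otherwise the archive would try to include itself.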

As for your symptoms, the Junk % in the message list shows the junk % at the time the message was first analyzed, while the Junk Analysis Detail shows the analysis using the current token set. If you ask for a message to be reclassified, then the Junk % column should match the detail.

Forum Timezone: UTC -8
