Token segmentation of spamassassin
Hello!
I'm researching about Vietnamese antispam by improving Spamassassin.
I think in spamassassin program have a Bayesian Filter that detects SPAM email depend on tokens . According me , tokens are segmented by blanks . This is suitable for English language but in Vietnamese language isn't suitable. So i want to change "Token segmentation of spamassassin" to accordance with Vietnamese language, but i don't know position of the "Token segmentation" code is writted in spamassassin.
Hope you let me know.
Thanks so much!
|
Recent comments
13 hours 29 min ago
18 hours 28 min ago
19 hours 54 min ago
20 hours 47 min ago
22 hours 30 min ago
1 day 2 hours ago
1 day 3 hours ago
1 day 5 hours ago
1 day 19 hours ago
1 day 20 hours ago