Token segmentation in SpamAssassin
Hello!
I am researching Vietnamese anti-spam filtering by improving SpamAssassin.
As I understand it, SpamAssassin includes a Bayesian filter that detects spam email based on tokens, and I believe those tokens are produced by splitting the text on whitespace. That suits English but not Vietnamese, where spaces separate syllables and a single word often spans several syllables. I would like to change SpamAssassin's token segmentation to suit Vietnamese, but I don't know where in the SpamAssassin source the token segmentation code is written.
I hope you can let me know.
Thanks so much!