IRedMail/FAQ/Store.SpamAssassin.Bayes.In.SQL

From iRedMail

Revision as of 14:05, 16 May 2012 by ZhangHuangbin (Talk | contribs)
Jump to: navigation, search

Contents


THIS ARTICLE IS STILL A DRAFT, DO NOT APPLY IT IN PRODUCTION SERVER.

Summary

This article is used to configure related components to store SpamAssassin Bayes data in SQL server.

Tested with:

  • iRedMail-0.8.0 with MySQL backend.
  • CentOS 6.2 (x86_64)
  • SpamAssassin-3.3.1
  • Amavisd-new-2.6.6
  • MySQL-5.1.61

Notes:

  • This article should work with all iRedMail releases. We take iRedMail-0.8.0 for example.
  • This article should work with all backends: OpenLDAP, MySQL, PostgreSQL. We take MySQL backend for example.
  • This article should work with Amavisd-new-2.6.0 and later versions, includes Amavisd-new-2.7.x.

Create required SQL database used to store bayes data

We need to create a SQL database and necessary tables to store SpamAssassin bayes data. The RPM package installed on CentOS 6 doesn't ship SQL template for bayes database, but we can download it from Apache web site. We're running SpamAssassin-3.3.1, so what we need is this SQL template file: http://svn.apache.org/repos/asf/spamassassin/tags/spamassassin_release_3_3_1/sql/bayes_mysql.sql (if you're running different version, please find the proper SQL file here: http://svn.apache.org/repos/asf/spamassassin/tags/)

Terminal:
# cd /root/
# wget http://svn.apache.org/repos/asf/spamassassin/tags/spamassassin_release_3_3_1/sql/bayes_mysql.sql

Create MySQL database and import SQL template file:

Terminal:
# mysql -uroot -p
mysql> CREATE DATABASE sa_bayes;
mysql> SOURCE /root/bayes_mysql.sql;

Create a new MySQL user (with password sa_user_password) and grant permissions:

  • Note: Please replace password sa_user_password by your own.
Terminal:
mysql> GRANT SELECT, INSERT, UPDATE, DELETE ON sa_bayes.* TO sa_user@localhost IDENTIFIED BY 'sa_user_password'; 
mysql> FLUSH PRIVILEGES;

Enable Bayes modules in SpamAssassin

Edit /etc/mail/spamassassin/local.cf, add (or modify below settings):

File: local.cf
use_bayes          1
bayes_auto_learn   1
bayes_auto_expire  1

# Store bayesian data in MySQL
bayes_store_module Mail::SpamAssassin::BayesStore::MySQL
bayes_sql_dsn      DBI:mysql:sa_bayes:127.0.0.1:3006

# Store bayesian data in PostgreSQL
#bayes_store_module Mail::SpamAssassin::BayesStore::PgSQL
#bayes_sql_dsn      DBI:Pg:sa_bayes:127.0.0.1:5432

bayes_sql_username sa_user
bayes_sql_password sa_user_password

# Override the username used for storing
# data in the database. This could be used to group users together to
# share bayesian filter data. You can also use this config option to
# trick sa-learn to learn data as a specific user.
bayes_sql_override_username vmail

Make sure SpamAssassin will load bayes modules:

Terminal:
# /etc/init.d/amavisd stop
# amavisd -c /etc/amavisd/amavisd.conf debug 2>&1 | grep -i 'bayes'
May 16 09:59:33 ... SpamAssassin loaded plugins: ..., Bayes, ...

Looks fine. Now press 'Ctrl-C' to terminate above command.

Start Amavisd service:

Terminal:
# /etc/init.d/amavisd restart

It is required we initialize the database by learning a message. We use the sample spam email shipped in the RPM package provided by CentOS 6:

Terminal:
# rpm -ql spamassassin | grep 'sample-spam'
/usr/share/doc/spamassassin-3.3.1/sample-spam.txt

# sa-learn --spam --username=vmail /usr/share/doc/spamassassin-3.3.1/sample-spam.txt
Learned tokens from 1 message(s) (1 message(s) examined)
Personal tools