sa-learn is not working

kantiran
Posts: 11
Joined: 2006-03-28 19:41

Post by kantiran » 2007-04-05 13:42

Hello,

I have a vserver with qmail and SpamAssassin.
I wanted to use a script (sa-wrapper.pl) to teach it that e-mails
I send to spam@meine-domain.de should be recognized as spam in future.

But it doesn't do that. It learns nothing at all.

Here is an excerpt from sa-learn.log:

Code:

[18218] dbg: logger: adding facilities: all
[18218] dbg: logger: logging level is DBG
[18218] dbg: generic: SpamAssassin version 3.1.7
[18218] dbg: config: score set 0 chosen.
[18218] dbg: util: running in taint mode? yes
[18218] dbg: util: taint mode: deleting unsafe environment variables, resetting PATH
[18218] dbg: util: PATH included '/var/qmail/bin', keeping
[18218] dbg: util: PATH included '/usr/local/sbin', keeping
[18218] dbg: util: PATH included '/sbin', keeping
[18218] dbg: util: PATH included '/bin', keeping
[18218] dbg: util: PATH included '/usr/sbin', keeping
[18218] dbg: util: PATH included '/usr/bin', keeping
[18218] dbg: util: final PATH set to: /var/qmail/bin:/usr/local/sbin:/sbin:/bin:/usr/sbin:/usr/bin
[18218] dbg: message: ---- MIME PARSER START ----
[18218] dbg: message: main message type: text/plain
[18218] dbg: message: parsing normal part
[18218] dbg: message: added part, type: text/plain
[18218] dbg: message: ---- MIME PARSER END ----
[18218] dbg: dns: is Net::DNS::Resolver available? yes
[18218] dbg: dns: Net::DNS version: 0.48
[18218] dbg: config: using "/etc/mail/spamassassin" for site rules pre files
[18218] dbg: config: read file /etc/mail/spamassassin/init.pre
[18218] dbg: config: read file /etc/mail/spamassassin/v310.pre
[18218] dbg: config: read file /etc/mail/spamassassin/v312.pre
[18218] dbg: config: using "/usr/share/spamassassin" for sys rules pre files
[18218] dbg: config: using "/usr/share/spamassassin" for default rules dir
[18218] dbg: config: read file /usr/share/spamassassin/10_misc.cf
[18218] dbg: config: read file /usr/share/spamassassin/20_advance_fee.cf
[18218] dbg: config: read file /usr/share/spamassassin/20_anti_ratware.cf
[18218] dbg: config: read file /usr/share/spamassassin/20_body_tests.cf
[18218] dbg: config: read file /usr/share/spamassassin/20_compensate.cf
[18218] dbg: config: read file /usr/share/spamassassin/20_dnsbl_tests.cf
[18218] dbg: config: read file /usr/share/spamassassin/20_drugs.cf
[18218] dbg: config: read file /usr/share/spamassassin/20_fake_helo_tests.cf
[18218] dbg: config: read file /usr/share/spamassassin/20_head_tests.cf
[18218] dbg: config: read file /usr/share/spamassassin/20_html_tests.cf
[18218] dbg: config: read file /usr/share/spamassassin/20_meta_tests.cf
[18218] dbg: config: read file /usr/share/spamassassin/20_net_tests.cf
[18218] dbg: config: read file /usr/share/spamassassin/20_phrases.cf
[18218] dbg: config: read file /usr/share/spamassassin/20_porn.cf
[18218] dbg: config: read file /usr/share/spamassassin/20_ratware.cf
[18218] dbg: config: read file /usr/share/spamassassin/20_uri_tests.cf
[18218] dbg: config: read file /usr/share/spamassassin/23_bayes.cf
[18218] dbg: config: read file /usr/share/spamassassin/25_accessdb.cf
[18218] dbg: config: read file /usr/share/spamassassin/25_antivirus.cf
[18218] dbg: config: read file /usr/share/spamassassin/25_body_tests_es.cf
[18218] dbg: config: read file /usr/share/spamassassin/25_body_tests_pl.cf
[18218] dbg: config: read file /usr/share/spamassassin/25_dcc.cf
[18218] dbg: config: read file /usr/share/spamassassin/25_dkim.cf
[18218] dbg: config: read file /usr/share/spamassassin/25_domainkeys.cf
[18218] dbg: config: read file /usr/share/spamassassin/25_hashcash.cf
[18218] dbg: config: read file /usr/share/spamassassin/25_pyzor.cf
[18218] dbg: config: read file /usr/share/spamassassin/25_razor2.cf
[18218] dbg: config: read file /usr/share/spamassassin/25_replace.cf
[18218] dbg: config: read file /usr/share/spamassassin/25_spf.cf
[18218] dbg: config: read file /usr/share/spamassassin/25_textcat.cf
[18218] dbg: config: read file /usr/share/spamassassin/25_uribl.cf
[18218] dbg: config: read file /usr/share/spamassassin/30_text_de.cf
[18218] dbg: config: read file /usr/share/spamassassin/30_text_fr.cf
[18218] dbg: config: read file /usr/share/spamassassin/30_text_it.cf
[18218] dbg: config: read file /usr/share/spamassassin/30_text_nl.cf
[18218] dbg: config: read file /usr/share/spamassassin/30_text_pl.cf
[18218] dbg: config: read file /usr/share/spamassassin/30_text_pt_br.cf
[18218] dbg: config: read file /usr/share/spamassassin/50_scores.cf
[18218] dbg: config: read file /usr/share/spamassassin/60_awl.cf
[18218] dbg: config: read file /usr/share/spamassassin/60_whitelist.cf
[18218] dbg: config: read file /usr/share/spamassassin/60_whitelist_dk.cf
[18218] dbg: config: read file /usr/share/spamassassin/60_whitelist_dkim.cf
[18218] dbg: config: read file /usr/share/spamassassin/60_whitelist_spf.cf
[18218] dbg: config: read file /usr/share/spamassassin/60_whitelist_subject.cf
[18218] dbg: config: using "/etc/mail/spamassassin" for site rules dir
[18218] dbg: config: read file /etc/mail/spamassassin/local.cf
[18218] dbg: plugin: loading Mail::SpamAssassin::Plugin::URIDNSBL from @INC
[18218] dbg: plugin: registered Mail::SpamAssassin::Plugin::URIDNSBL=HASH(0x8ce9714)
[18218] dbg: plugin: loading Mail::SpamAssassin::Plugin::Hashcash from @INC
[18218] dbg: plugin: registered Mail::SpamAssassin::Plugin::Hashcash=HASH(0x8d082b8)
[18218] dbg: plugin: loading Mail::SpamAssassin::Plugin::SPF from @INC
[18218] dbg: plugin: registered Mail::SpamAssassin::Plugin::SPF=HASH(0x8d2e470)
[18218] dbg: plugin: loading Mail::SpamAssassin::Plugin::Pyzor from @INC
[18218] dbg: pyzor: network tests on, attempting Pyzor
[18218] dbg: plugin: registered Mail::SpamAssassin::Plugin::Pyzor=HASH(0x8d30894)
[18218] dbg: plugin: loading Mail::SpamAssassin::Plugin::Razor2 from @INC
[18218] dbg: razor2: razor2 is available, version 2.67
[18218] dbg: plugin: registered Mail::SpamAssassin::Plugin::Razor2=HASH(0x8d0add4)
[18218] dbg: plugin: loading Mail::SpamAssassin::Plugin::SpamCop from @INC
[18218] dbg: reporter: network tests on, attempting SpamCop
[18218] dbg: plugin: registered Mail::SpamAssassin::Plugin::SpamCop=HASH(0x9217f20)
[18218] dbg: plugin: loading Mail::SpamAssassin::Plugin::AWL from @INC
[18218] dbg: plugin: registered Mail::SpamAssassin::Plugin::AWL=HASH(0x915df54)
[18218] dbg: plugin: loading Mail::SpamAssassin::Plugin::AutoLearnThreshold from @INC
[18218] dbg: plugin: registered Mail::SpamAssassin::Plugin::AutoLearnThreshold=HASH(0x924c60c)
[18218] dbg: plugin: loading Mail::SpamAssassin::Plugin::TextCat from @INC
[18218] dbg: textcat: loading languages file...
[18218] dbg: textcat: loaded 73 language models
[18218] dbg: plugin: registered Mail::SpamAssassin::Plugin::TextCat=HASH(0x9261abc)
[18218] dbg: plugin: loading Mail::SpamAssassin::Plugin::WhiteListSubject from @INC
[18218] dbg: plugin: registered Mail::SpamAssassin::Plugin::WhiteListSubject=HASH(0x94cfb94)
[18218] dbg: plugin: loading Mail::SpamAssassin::Plugin::MIMEHeader from @INC
[18218] dbg: plugin: registered Mail::SpamAssassin::Plugin::MIMEHeader=HASH(0x94d71f4)
[18218] dbg: plugin: loading Mail::SpamAssassin::Plugin::ReplaceTags from @INC
[18218] dbg: plugin: registered Mail::SpamAssassin::Plugin::ReplaceTags=HASH(0x94e7654)
[18218] dbg: config: adding redirector regex: /^http://chkpt.zdnet.com/chkpt/w+/(.*)$/i
[18218] dbg: config: adding redirector regex: /^http://www(?:d+)?.nate.com/r/w+/(.*)$/i
[18218] dbg: config: adding redirector regex: /^http://.+.gov/(?:.*/)?externalLink.jhtml?.*url=(.*?)(?:&.*)?$/i
[18218] dbg: config: adding redirector regex: /^http://redir.internet.com/.+?/.+?/(.*)$/i
[18218] dbg: config: adding redirector regex: /^http://(?:.*?.)?adtech.de/.*(?:;||)link=(.*?)(?:;|$)/i
[18218] dbg: config: adding redirector regex: m'^http.*?/redirect.php?.*(?<=[?&])goto=(.*?)(?:$|[&#])'i
[18218] dbg: config: adding redirector regex: m'^https?:/*(?:[^/]+.)?emfd.com/r.cfm.*?&r=(.*)'i
[18218] dbg: config: adding redirector regex: m'/(?:index.php)??.*(?<=[?&])URL=(.*?)(?:$|[&#])'i
[18218] dbg: config: adding redirector regex: m'^http:/*(?:w+.)?google(?:.w{2,3}){1,2}/url?.*?(?<=[?&])q=(.*?)(?:$|[&#])'i
[18218] dbg: config: adding redirector regex: m'^http:/*(?:w+.)?google(?:.w{2,3}){1,2}/search?.*?(?<=[?&])q=[^&]*?(?<=%20|..[=+s])site:(.*?)(?:$|%20|[s+&#])'i
[18218] dbg: config: adding redirector regex: m'^http:/*(?:w+.)?google(?:.w{2,3}){1,2}/search?.*?(?<=[?&])q=[^&]*?(?<=%20|..[=+s])(?:"|%22)(.*?)(?:$|%22|["s+&#])'i
[18218] dbg: config: adding redirector regex: m'^http:/*(?:w+.)?google(?:.w{2,3}){1,2}/translate?.*?(?<=[?&])u=(.*?)(?:$|[&#])'i
[18218] info: config: failed to parse line, skipping: use_dcc 0
[18218] info: config: failed to parse line, skipping: detailed_phrase_score 1
[18218] dbg: plugin: Mail::SpamAssassin::Plugin::ReplaceTags=HASH(0x94e7654) implements 'finish_parsing_end'
[18218] dbg: replacetags: replacing tags
[18218] dbg: replacetags: done replacing tags
[18218] dbg: bayes: tie-ing to DB file R/O /var/spool/spamassassin/bayes_toks
[18218] dbg: bayes: tie-ing to DB file R/O /var/spool/spamassassin/bayes_seen
[18218] dbg: bayes: found bayes db version 3
[18218] dbg: bayes: DB journal sync: last sync: 0
[18218] dbg: bayes: not available for scanning, only 1 spam(s) in bayes DB < 200
[18218] dbg: bayes: untie-ing
[18218] dbg: bayes: untie-ing db_toks
[18218] dbg: bayes: untie-ing db_seen
[18218] dbg: config: score set 1 chosen.
[18218] dbg: learn: initializing learner
[18218] dbg: bayes: bayes journal sync starting
[18218] dbg: bayes: bayes journal sync completed
[18218] dbg: bayes: expiry starting
[18218] dbg: locker: safe_lock: created /var/spool/spamassassin/bayes.lock.h1232389.stratoserver.net.18218
[18218] dbg: locker: safe_lock: trying to get lock on /var/spool/spamassassin/bayes with 0 retries
[18218] dbg: locker: safe_lock: link to /var/spool/spamassassin/bayes.lock: link ok
[18218] dbg: bayes: tie-ing to DB file R/W /var/spool/spamassassin/bayes_toks
[18218] dbg: bayes: tie-ing to DB file R/W /var/spool/spamassassin/bayes_seen
[18218] dbg: bayes: found bayes db version 3
[18218] dbg: locker: refresh_lock: refresh /var/spool/spamassassin/bayes.lock
[18218] dbg: bayes: DB expiry: tokens in DB: 1336, Expiry max size: 150000, Oldest atime: 1175767402, Newest atime: 1175772269, Last expire: 0, Current time: 1175772271
[18218] dbg: bayes: expiry completed
[18218] dbg: learn: learning spam
[18218] dbg: dns: name server: 81.169.163.104, family: 2, ipv6: 0
[18218] dbg: dns: testing resolver nameservers: 81.169.163.104, 81.169.163.106
[18218] dbg: dns: trying (3) google.com...
[18218] dbg: dns: looking up NS for 'google.com'
[18218] dbg: dns: NS lookup of google.com using 81.169.163.104 succeeded => DNS available (set dns_available to override)
[18218] dbg: dns: is DNS available? 1
[18218] dbg: metadata: X-Spam-Relays-Trusted:
[18218] dbg: metadata: X-Spam-Relays-Untrusted:
[18218] dbg: metadata: X-Spam-Relays-Internal:
[18218] dbg: metadata: X-Spam-Relays-External:
[18218] dbg: plugin: Mail::SpamAssassin::Plugin::TextCat=HASH(0x9261abc) implements 'extract_metadata'
[18218] dbg: message: ---- MIME PARSER START ----
[18218] dbg: message: main message type: text/plain
[18218] dbg: message: parsing normal part
[18218] dbg: message: added part, type: text/plain
[18218] dbg: message: ---- MIME PARSER END ----
[18218] dbg: message: no encoding detected
[18218] dbg: textcat: classifying, skipping: yi sco lv is bs sl la ga sa eu et rm cy eo fy gd lt
[18218] dbg: textcat: language possibly: de
[18218] dbg: textcat: X-Languages: "de", X-Languages-Length: 740
[18218] dbg: uri: parsed uri found, http://www.infotransport.cc/?MID=105633
[18218] dbg: uri: cleaned parsed uri, http://www.infotransport.cc/?MID=105633
[18218] dbg: uri: parsed domain, infotransport.cc
[18218] dbg: uri: parsed uri found, http://www.infotransport.cc/?MID=105633
[18218] dbg: uri: parsed domain, infotransport.cc
[18218] dbg: uri: parsed uri found, mailto:unsubscribe@powerdefense.cc?subject=unsubscribe
[18218] dbg: uri: cleaned parsed uri, mailto:unsubscribe@powerdefense.cc?subject=unsubscribe
[18218] dbg: uri: parsed domain, powerdefense.cc
[18218] dbg: uri: parsed uri found, mailto:unsubscribe@powerdefense.cc
[18218] dbg: uri: cleaned parsed uri, mailto:unsubscribe@powerdefense.cc
[18218] dbg: uri: parsed domain, powerdefense.cc
[18218] dbg: locker: refresh_lock: refresh /var/spool/spamassassin/bayes.lock
[18218] dbg: bayes: e8967ccb2d95838d4ee6cbf3e4cc86a42d95bd93@sa_generated already learnt correctly, not learning twice
[18218] dbg: bayes: untie-ing
[18218] dbg: bayes: untie-ing db_toks
[18218] dbg: bayes: untie-ing db_seen
[18218] dbg: bayes: files locked, now unlocking lock
[18218] dbg: locker: safe_unlock: unlink /var/spool/spamassassin/bayes.lock
Learned tokens from 0 message(s) (1 message(s) examined)

Does anyone have an idea?

kantiran
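
In a debug log this long, only a few lines describe what the learner actually did: the "not available for scanning" line, the "already learnt correctly, not learning twice" line, and the final summary. The following is a minimal Python sketch for pulling those lines out of a saved log. The function name `summarize_sa_learn_log` and the regular expressions are assumptions based only on the lines quoted in this thread; they are not part of SpamAssassin and may not match other versions.

```python
import re

# Patterns derived from the log lines quoted in this thread (SpamAssassin 3.1.7).
# They are assumptions about the debug format, not a documented interface.
ALREADY_LEARNT = re.compile(
    r"dbg: bayes: (\S+) already learnt correctly, not learning twice"
)
NOT_AVAILABLE = re.compile(
    r"dbg: bayes: not available for scanning, only (\d+) (spam|ham)\(s\) in bayes DB < (\d+)"
)
SUMMARY = re.compile(
    r"Learned tokens from (\d+) message\(s\) \((\d+) message\(s\) examined\)"
)


def summarize_sa_learn_log(text):
    """Collect the lines that explain an sa-learn run (hypothetical helper)."""
    result = {"already_learnt": [], "too_few": None, "learned": None, "examined": None}
    for line in text.splitlines():
        m = ALREADY_LEARNT.search(line)
        if m:
            # Message ID that the Bayes DB reports as already learnt.
            result["already_learnt"].append(m.group(1))
            continue
        m = NOT_AVAILABLE.search(line)
        if m:
            # (kind, count in DB, minimum required before Bayes is used for scanning)
            result["too_few"] = (m.group(2), int(m.group(1)), int(m.group(3)))
            continue
        m = SUMMARY.search(line)
        if m:
            result["learned"] = int(m.group(1))
            result["examined"] = int(m.group(2))
    return result
```

Applied to the log above, this would report one message ID under `already_learnt`, 0 of 1 messages learned, and 1 spam in a database that needs 200 before Bayes is used for scanning. Read that way, the log suggests the message was already recorded as spam in an earlier run rather than being rejected by the learner.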

Re: sa-learn is not working

Post by kantiran » 2007-04-05 16:13

I just noticed that SA sets the X-Spam-Status header
of the e-mail in question incorrectly.

Code:

No, score=1.9 required=3.0 tests=AWL,UNPARSEABLE_RELAY 	autolearn=ham version=3.1.7

The "No" means that it does not classify the mail as spam. So far, so good.
Could it be that "autolearn=ham" has classified this mail as HAM, and that
this is why it cannot be reclassified as SPAM afterwards via sa-learn?

I just can't make sense of this anymore.

kantiran
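
To check the X-Spam-Status header across several mails instead of reading it by eye, a small Python sketch using the standard `email` module may help. The helper name `parse_spam_status` is hypothetical, and the `key=value` layout is assumed from the single header quoted above; other SpamAssassin configurations may format the header differently.

```python
import re
from email import message_from_string


def parse_spam_status(raw_message):
    """Split an X-Spam-Status header into its fields (hypothetical helper)."""
    msg = message_from_string(raw_message)
    header = msg.get("X-Spam-Status")
    if header is None:
        return None
    # Folded headers carry newlines and tabs; collapse all whitespace runs.
    flat = " ".join(header.split())
    # A fold inside the tests list leaves a space after a comma; remove it.
    flat = re.sub(r",\s+", ",", flat)
    verdict, _, rest = flat.partition(",")
    fields = dict(re.findall(r"(\w+)=(\S+)", rest))
    fields["verdict"] = verdict.strip()
    if "tests" in fields:
        fields["tests"] = fields["tests"].split(",")
    return fields
```

For the header quoted above, the result would contain `verdict` "No", `autolearn` "ham", and the two test names AWL and UNPARSEABLE_RELAY. The sketch only reads what the scanner wrote into the mail; it does not show what the Bayes database has stored for that message.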