1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71
|
These eml files are sample spam emails to test your installation of FuzzyOCR.
Use spamassassin -t < samplefile.eml to test :)
ATTENTION: If FuzzyOcr does not trigger on one of the messages, then make sure you have the focr_autodisable_score set high enough.
Otherwise, if a message gets enough hits by SA, FuzzyOcr will not scan it. This is generally depending on your other SA rules.
ocr-gif.eml: Contains a corrupted gif image, additionally I changed the content-type to jpeg, so the output should show:
1.5 FUZZY_OCR_WRONG_CTYPE BODY: Mail contains an image with wrong
content-type set
Image has format "GIF" but content-type is
"image/jpeg"
2.5 FUZZY_OCR_CORRUPT_IMG BODY: Mail contains a corrupted image
Corrupt image: GIF-LIB error: Image is
defective, decoding aborted.
8.8 FUZZY_OCR BODY: Mail contains an image with common spam text inside
Words found:
"target" in 1 lines
"service" in 1 lines
"stock" in 2 lines
"price" in 2 lines
"company" in 1 lines
"recommendation" in 1 lines
(12 word occurrences found)
ocr-animated.eml: Contains an animated gif. If all deanimation routines are working properly on your system, the output should contain:
6.5 FUZZY_OCR BODY: Mail contains an image with common spam text inside
Words found:
"price" in 1 lines
"company" in 1 lines
"alert" in 1 lines
"news" in 1 lines
ocr-obfuscated.eml: Contains an obfuscated gif image, to test the ocrad-decolorize scansets. If you want to test this scanset, either set the minimal_scanset option to 0 or put the decolorize scanset temporarily at the beginning of the scansets file. The output should be:
5.9 FUZZY_OCR BODY: Mail contains an image with common spam text inside
Words found:
"target" in 1 lines
"profit" in 1 lines
"trade" in 1 lines
(4.5 word occurrences found)
ocr-jpg.eml: Contains a jpeg file. Output should show:
5.9 FUZZY_OCR BODY: Mail contains an image with common spam text inside
Words found:
"levitra" in 1 lines
"viagra" in 2 lines
(4.5 word occurrences found)
ocr-png.eml: Contains a png file. Output should show:
14 FUZZY_OCR BODY: Mail contains an image with common spam text inside
Words found:
"buy" in 1 lines
"target" in 2 lines
"service" in 1 lines
"stock" in 1 lines
"investor" in 1 lines
"price" in 3 lines
"company" in 2 lines
"trade" in 1 lines
"software" in 1 lines
"recommendation" in 1 lines
"news" in 3 lines
(25.5 word occurrences found)
|