Youâve been there before. A spam email pops up in your inbox and itâs utter nonsense. A rancid word jambalaya of âHello dears,â a bunch of names you never even knew could be names, and sentences not of this world. Why? Why does anyone do that? Let me explain. With the help of Harry Potter quotes.
A new breed of spam emails popping up especially in Ireland have taken to prefacing their information thieving efforts with random one-or-two-line Harry Potter quotes. Stuff like, âAS THE WEREWOLF REARED SNAPPING ITS LONG JAWS SIRIUS DISAPPEARED FROM HARRYS SIDE HE HAD.â Weird, right? Well, only sort of.
While hilariously out-of-context quotes from JK Rowlingâs spellbinding opus seem to pair with âyoU have 17 nEww private mesages fr0M local singles!!!111â about as well as wine and a big olâ bowl of sour cream, itâs actually an attempt to outsmart spam blocking software. As The Daily Edge explains, anti-spam engines tend to work on three levels: RBLs (blacklists of known spammer IP addresses), URIBLs (blacklists of known spam sites), and a Bayes Engine, which handles everything the other two parts canât.
The latter is key because Bayes Engines use statistics to figure out which words and phrases are associated with spam. âViagra,â for instance, is usually a pretty obvious tell. So the Harry Potter quotes are meant to fool the systemâyou know, talk about the sort of wand that might be made of sacred oak and unicorn hair instead of, er⊠the other kind (although now I really want to see Harry-Potter-themed male enhancement spam).
Hereâs where it gets crazy: Bayes Engines learn over time by way of emails being marked as spam. They catch on pretty quickly, in other words. This why it frequently seems like every spammer is kinda doing the same thing, using the same sorts of phrases, sentences, etc. It comes in waves. Then, when a topic is exhausted and the mighty Spam Skynet has assimilated it completely, spammers move on to something newâperhaps a bizarre set of words or a news item or a celebrityâin an attempt to fool Bayes Engines again.
Thatâs why spam emails are so weird, but in an almost consistent way. Spammers are paying attention to what does and doesnât get blocked and reacting accordingly.
By that logic, youâd think todayâs hardened veteran spam filters wouldnât let a single even slightly suspicious word slide. Heartfelt letter from grandma? She said âmagic,â so itâs probably spam. That, however, isnât quite how it actually works. Spam filters are also programmed to essentially forget with time, precisely so that we donât end up with a situation where the entire dictionary ends up on their banned book list.
TMI is a branch of Kotaku dedicated to telling you everything about my adventures in the gaming industry (and sometimes other offbeat and/or uncomfortable subjects). Itâs an experiment in disclosure, storytelling, interviewing, and more. The gaming industry is weird. People are weird. I am weird. You are weird. Why hide that? Letâs explore it.