The initial point I was getting at is that pure vocabulary is not sufficient for useful story categorization. It's rare for me to even say "fembot" in Virus Alert, and the sleeper nature of the 'bots themselves means that I couch a lot of things in innuendo. Yearbook Pictures may well be the sexiest, disassembly-est, malfunction-est chapter so far, but there's nothing about a word cloud analysis of it that would suggest so. Beyond 1.00, there's been virtually no usage of even the term "sleeper".dale coba wrote:We're going to need a bigger exclusion list.
All common words, all words below four letters... not hard to set that up
(but did you ever notice how no one around here ever seems to actually be a programmer?)
- Dale Coba
p.s. your cloud is semi-side-waysish, and the colors make my eyes bleed)
I hadn't noticed the thing about programmers, largely because I am, by profession, a kind of programmer. My area of expertise simply isn't suitable to creating a word cloud generation tool. It seems a moot point, since they exist in the wild, with highly configurable options. I just don't think it would provide a useful basis for story categorization.
Ps: I used wordle.net, and picked purely random settings. The color probably didn't bother me because I'm colorblind.