Hey, remember how I used to do these random word clouds of my blog? No? Cos I haven't done them in about a year? Right, makes sense.
The other day I remembered I hadn't done one of those in awhile and decided I wanted to again. But whenever I do build one it's just made up of my 6 most recent posts. But I want my word cloud to represent MOAR DATA. So I decided to play around with a social listening tool to pull in more posts. Here's what that looks like
Yeah I know, not that interesting. Oh I say "book" and "read" a lot? Who would have guessed.
BUT while I was in this tool I decided to take a look at the sentiment section and see what it says. And it told me I am mostly negative. Wait, what?
It said, for the past year, 22% of my content as been positive, 37% neutral and 41% negative. Granted, my first reaction to those stats was to IM a friend and go "Apparently my blog is 41% negative. Well fuck you then!". So yeah, maybe they have a point.
Then I really looked at why the system was being so judgy. Essentially anytime I used a negative word like "bad" or "don't like" it said the section was negative. Of course, as with any automated system, it can't take into account context. The system can't tell I said "Now normally I don't like this, but this book nailed it perfectly. Kudos" It also marks my entire rant about wanting to set a character on fire because I hate them so much as "positive" because I said the character was "SO PERFECT".
Moral of this post:
Word clouds are pretty boring but have shapes and colors so people like them (me included)
Context is important
Computers do not get sarcasm
Automated sentiment is pointless
Thursday, October 18, 2012
Comments (18)

Sort by: Date Rating Last Activity
Loading comments...
Post a new comment
Comments by IntenseDebate
I'm not negative, you jerk
2012-10-18T09:15:00-04:00
Red
Book Word Cloud|
Subscribe to:
Post Comments (Atom)
readingrambo 112p · 647 weeks ago
You should start saying flibbertigibbet all the time.
What Red Read 121p · 647 weeks ago
I'm going to start saying flibbertigibbet. Or more likely I'll start typing it and it will end up flibbertighahdada
readingrambo 112p · 647 weeks ago
Also my most-used word is 'like.' Gotta start watching that. I TYPE LIKE I SPEAK AND I OVERUSE THIS WORD.
What Red Read 121p · 647 weeks ago
Tikabelle 87p · 646 weeks ago
What Red Read 121p · 646 weeks ago
Laura · 647 weeks ago
Re: Your lessons learned through this experience- You'd think that by now, someone would have made computers get sarcasm! I just... IT'S REALLY FRUSTRATING WHEN YOU'RE A SARCASTIC PERSON! Also it's annoying how you can't do sarcasm in writing. Well, you *can*, but then you have to explain it, and that just sucks all the fun out of everything!
What Red Read 121p · 647 weeks ago
Most auto-sentiment just uses keyword analysis and says "they said the word "bad" which is on our negative list, therefore this post is negative".
If you have an archive of data, some systems let you "retrain" their sentiment and learn from how you update the sentiment.
There are other systems where from the start you have to train the system on sentiment before it runs anything
Then there are "natural language processing" sentiment systems which are the best. And super expensive.
Loni · 647 weeks ago
What Red Read 121p · 647 weeks ago
Loni · 647 weeks ago
What Red Read 121p · 647 weeks ago
Loni · 647 weeks ago
Tikabelle 87p · 646 weeks ago
It's not sarcasm, but it's utterly hilarious.
libereadingrayna 58p · 646 weeks ago
READ LIKE UGH GIF BOOKS TBR READATHON ENJOY AUTHOR LOVED HATED YES PAGES WILKIEEEEE
What Red Read 121p · 646 weeks ago
I think you do need a certain version of Java for it to work
You should make a word cloud and post it!
briefraser 73p · 646 weeks ago
What Red Read 121p · 646 weeks ago