Sentiment 101 - FAQs from our Customers

Just wanted to push some information out there addressing some of the common questions we get here @lymbix. We love talking about this kind of stuff, so any more questions don't hesitate to fire them at us! (or comment)

"We’ve checked out Sentiment and it sucks…"

Thats often what people say in the space, mostly because there is a misunderstanding with respect to the capabilities of sentiment. First off, there's no "right" way to measure it. There are some different schools of thought when it comes to how to directly measure the effectiveness of sentiment analysis, but there has yet to be a standard. Mostly because of our human subjectivity that comes into affect. You feel differently about something than I do (just the way that we see things differently). A simple example of this would be to consider talking about a new pen that you bought the other day to a friend; "Man, this pen is comfortable but its not good to write with". Here we have 2 different sentiments expressed in one simple statement. So then asking, if you were were to tag this as positive, negative, or neutral.. what would it be? Eventually if you ask enough people you'll find that the statement is too ambiguous to determine whether it is in fact a positive or negative statement. That's where we come in; we've created the first ever engine that can measure the levels of subjectivity that could arrise in a given statement, to through off the "accurate" judgement of the sentiment. Of course, its helpful to still know which is mostly likely the "best guess" of the sentiment, but when it comes down to actually using this, you might consider it to be less impactful than a statement that is a lot more clear. Lymbix calls this "clarity".

"Do you do sentiment in other languages?"

Because roughly 60% of north america speaks english (96% US), we jumped on board to developing our lexicons to that language first. The beauty of our system is the fact that we developed a methodology to understand language and sentiment, so we're very loosely coupled to english; meaning of course that it would be easy for us to focus our learning efforts on another language, such as spanish, german, french etc,. The only factor in this is cost, and what we've been doing is looking for the right partner to specify which language we should cover next. That is to say, once we've established a positive working relationship with english, we'd be happy to jump into another language once we've identified that with this partner that it will be profitable right away. Safer for us to proceed this way, rather than gamble on which language will be the next "right" choice for us.

 

"How accurate is your sentiment technology..really?"

We love this question, because it's the one that lets us talk a bit about how other people(sentiment companies) are gauging their "accuracy". Let's first start by making a determination between what is article versus aggregate sentiment. Nine times out of ten, if you hear a public statement regarding the accuracy of a sentiment engine, it will be on an aggregate level. Aggregate means you take all of the individual articles in a given dataset and determine the overall sentiment or trend of the data. This is helpful, and is usually highly "accurate" (roughly 90%). Thats because the error is hidden. And when I say error, I'm really referring to the misnomers in the sentiment processing. Error is inherent, its always there, and we have to accept it. It's an imperfect technology, based on imperfect data, review by an imperfect process. Then again, does 10% really potential error really make a difference in what value you're taking form consuming millions of articles and making determinations on the results? Now looking a bit deeper, lets consider article sentiment. This is any sentiment engines worst enemy, and thats because now we have to deal with you, the imperfect (but incredibly competent) reader. This really refers to the question 1, the subjectivity in how you feel when you read something compared to someone else. On average, you'll see "accuracy" scores around 50-60% with a good sentiment engine. Always look at the fine print, or impose the question when trying to measure the "quality" of sentiment results. So how does that pertain to Lymbix? Well, on aggregate, we're 90-95% - and thats validated by our customers who use our aggregate measurements. On an article level, which by the way, we absolutely love, we're variable based on your levels of need. If you want 90%, guess what, we can achieve that! but it takes some work. We're a learning network, meaning that we get better with more feedback (our automated feedback mechanisms is a whole other conversation). First off, we start assuming that 50% is bad (not that it is, but we push our engine as hard as we can). So we call that, the inherent 50-50 recall. Now, on the 50% of data thats "good", or "acceptable", we don't measure "accuracy". We measure, "agreeability". That is to say, based on our clarity measurements, how much does our readers "agree" with our results. We've done massive tests and studies to indicate that our article results idle around 75-85%... not bad.. now how do you get better? Well, thats for our paying customer when we turn on our learning algorithms specifically for your datasets.

 

Game Time: Get ready to Name that Tone!

As promised, here's something fun we've been working on! We've created a little game that tests how well you understand the tone of your words with the help of some really enthusiastic talent. 

Studies show that the majority of people think they have a good idea of what their words convey but you’d be surprised at how often people don’t consider the emotional impact they have. (That's why we designed ToneCheck!) Now, we're bringing you another way to test your skills: Name that Tone - the game!  

Name that Tone uses our Sentiment Intelligence API to analyze the tone of tons of random phrases. For every correct answer, you'll get a special dance from one of our emotional friends.

Want to try your own? Go ahead and try rating your own phrases - you might even find some hidden “Easter eggs” and surprises. The mission is to make people more aware of the emotions in text, and perhaps even point them to ToneCheck. 

We'd love to hear what you think and how you like the game. We'd also be super pumped if you'd consider sharing it (if you like it) on your Facebook page or Twitter feed. 

Are you ready? Go play the game now.  

 

 

Charlie Sheen spoofs amuse Technorati

Image001

You'd have to be living under a rock to not have caught wind of the recent media frenzy surrounding Charlie Sheen. In case you're just unburying yourself from the landslide of recent buzz, it shakes down like this:

Sheen (who has always battled drug and alcohol addictions) spends time in and out of rehab. While 'out' in October of 2010 he's thrown out of a NY City hotel for causing $7,000 worth of damage to the room. Next, he's sued by an 'adult film star', goes on a bender, and gets checked into hospital. Sheen enters rehab. Producers of Charlie's hit show Two and a Half Men put the show on hiatus

Well, this recent stint in rehab has done one of two things for Charlie. Either sent him over the edge completely, or he's learning to express his feelings. He slams Two and a Half Men creator and executive producer Chuck Lorre on The Dan Patrick Show and the show is put on indefinite hiatus. Now the fun part begins: Charlie joins Twitter.

So what does the social web think of all of the controversy? Take a guess. (Answer: they love it.)

Check out a quick snapshot of the current sentiment surrounding Sheen:

Screen_shot_2011-03-03_at_9

An easy 72% of all twitter mentions of Sheen fall within a positive category: Friendliness, Amusement, Enjoyment or Contentment. 

-----------------------------------------------------------------------------

Some of our favourite Sheen mentions this week: 

@JohnStamos: contrary to the rumors, i am not replacing charlie sheen on two and half men. however, martin sheen has asked me to be his son. reply

@LoniLove: The bar just announced that I don't have to go home but I can go to Charlie Sheen's house...

@outtacontext BREAKING: Charlie #Sheen announces he'll star in new sitcom: 2 1/2 Personalities. No other costars needed.

And... Charlie Sheen quotes as New Yorker Cartoons.

 

 

The Ultimate Rejection Letter [HUMOUR]

Rejected

Rejection is never easy. The next time you get a rejection e-mail, instead of feeling dejected, you may want to consider this tone-conscious response. It's full of positivity, well-written grammatically, and is certain to at least get a second glance from that recruiter! 

 

Dear Bob,

Thank you for your letter of January 16. After careful consideration, I regret to inform you that I am unable to accept your refusal to offer me a position in your marketing department.

This year I have been particularly fortunate in receiving an unusually large number of rejection letters. With such a varied and promising field of candidates, it is impossible for me to accept all refusals.

Despite your company’s outstanding qualifications and previous experience in rejecting applicants, I find that your rejection does not meet my needs at this time. Therefore, I will assume the position of marketing assistant in your department this March. I look forward to seeing you then.

Best of luck in rejecting future applicants.

Sincerely,

Danielle

 

Would you dare send this? We think you should give it (or your own version) a try. ToneCheck may be able to help!

Resurrected from several places on the web like here, here, and here and edited somewhat. Know the original source? Post it in the comments!

 

Measuring Gross National Happiness (or Unhappiness): is it possible?

We think so.

You've heard us share a lot about ToneCheck, and how our API can be used to measure Twitter sentiment, however another use for our Sentiment Intelligence has been on our radar for quite a while and it would appear we're not alone in seeing it's usefulness.

What if government organizations could measure their country's Gross National Happiness. An emotional index, if you will.

Every now and again someone in our network shares with us something that really gets us excited about the validation of our technology beyond the corporate world. While enjoying a holiday read of Dan Brown's The Lost Symbol, a friend nearly fell out of his chair reading pages 74 and 75. Dan details a meeting of two intellects passionate about monitoring a populations response to everything from election candidates, events, or holidays or, even more interesting: before, during or following natural disasters, terrorist threats, or acts of war.

He writes: "My software," Trish explained, "was designed to help government agencies better evaluate and respond appropriately to wide-scale crisis - pandemic diseases, national tragedies, terrorism, that sort of thing."  

Pretty powerful stuff. And insightful, too. 

With real-time monitoring using our API, government would be able to do just exactly what was foretold in the book. After all, as Brown points out in the book: "Following 9/11, the government was intercepting and crunching enormous data fields - civilian e-mail, cell phone, fax, text, Web sites - sniffing for keywords associated with terrorist communications." Our API goes far beyond keywords and dives into true understanding of the emotions present in any form of text-based communication.

And, imagine how understanding the response a population might have in advance of such an event might actually allow governments to better analyze, react, and respond to it.

While we certainly haven't yet been in touch with Mr. Brown, we're fans of his work and found this coincidence too interesting not to share.

What do you think? Should governments monitor their country's Gross National Happiness? We'd be interested in your feedback on the subject.

*You can find Dan Brown's The Lost Symbol at Amazon.com

*For info on licensing our API, contact us at sales@lymbix.com.

 

NY Times dubs ToneCheck among best ideas in 2010

What does Lady Gaga's meat dress, Kanye West's comeback and LeBron James 'Decision' have in common with ToneCheck?

We all made the New York Times Magazine's 10th Annual Year in Ideas issue.

View the interactive edition here. 

A huge shout-out to @michaelbeirut for his amazing animation of the proof of concept for ToneCheck and to @jaschavsjascha for the interview.

 

Twitter users (somewhat) support Wikileaks, angry about Assange

Should Wikileaks be condemned or a celebrated? While opinions differ one thing is for certain: Wikileaks and its director Julian Assange are trending topics to consider. While we expect most everyone will have caught a news story on one or more of these topics we appreciate some may be in need of a catch up. Find excellent summaries of the scandal here, here, and here.  

We were naturally curious about which side of the debate the majority of Twitter users were on, so we began tracking both Wikileaks and Assange using our Sentiment API.  Much like the division of opinion on the subject matter itself, sentiment seems to be split as to whether the recent leaked release of U.S. State department diplomatic cables by the media organization is a brave stand in favour of openness and against government conspiracy or rather, an irresponsible and inappropriate act worthy of prosecution

Without getting into the politics surrounding the leaked documents, here’s a quick snapshot of how Twitter users really feel about both the subject matter and Assange.

Keyword: Wikileaks
Tweets Processed: 39,734
Dominant Emotion: Enjoyment/Elation

Screen_shot_2010-12-13_at_2

Keyword: Assange
Tweets Processed: 9,320
Dominant Emotion: Anger/Loathing

Screen_shot_2010-12-13_at_2

It's safe to say that general sentiment surrounding Wikileaks is leaning towards the positive, however the conversation skews more negatively when Assange is concerned. Why? Could it be backlash due to his recent arrest or a negative opinion of him in general? We welcome your thoughts in the comments! Bring them on.

Google to acquire Groupon: talk about buying power!

Google-buy-groupon
**Update: Groupon said to have rejected Google offer. Read more at BusinessWeek.com

Word on the street is that Google is on the cusp of acquiring group-buying site Groupon for a staggering 6 billion dollars. While many wouldn't actually consider that a deal, a 6 billion dollar price tag isn't actually out of the question for a Groupon acquisition, given their recent uptake.

In case you're joining us from outside of the tech sect: Groupon is huge. They currently get 79% of all site traffic for group-buying visits, it’s now active in 52 cities and counting, and brings in over $50 million dollars a month. And hey: it has by far the catchiest name. There’s Group and then there’s CouponGroupon!… Get it? 

Now, while we have no way of knowing what goes on behind closed doors or when if the acquisition will go down, but what we can do is speculate on how Google’s Groupon acquisition would go over with the tweeting public. 

 We sampled 3125 twitter messages and can report that overall: the news is positive. 

Screen_shot_2010-12-03_at_9

Our analysis indicates that people love Groupon. Considered by many to be the future of commerce, the site makes money by offering local daily deals to the group-buying public. And who doesn’t need steep discounts on laser hair removal or bubble tea smoothies, anyway? 

Screen_shot_2010-12-03_at_9

What's particularly interesting is *how* passionate Groupon fans are. Of the tweets surveyed, we scanned them looking for ones that were emotionally intense. Those that love Groupon and whos tweets were in the Affection, Enjoyment, Amusement, and Gratitude categories *really* love the group-buying site. Those expressing negative sentiments in the Sadness, Anger, Fear, and Humiliation categories were significantly less enthusiastic. 

Screen_shot_2010-12-03_at_9

So - what do you think? Will Groupon sign with Google? Is the estimated 5 Billion dollar price tag too steep? Tell us in the comments.

Want to know how we got this info? Contact Us regarding our Sentiment API or visit www.ToneAPI.com

 

 

Oh Sarah… Will you ever learn?

Sarah_palin

A day doesn’t go by lately without a Palin-ism making it’s way into mainstream media. We’ve all shared a laugh about her invention of the words refudiate, misunderestimate and wee-wee'd up and despite intense criticism of it all, she continues to thrust herself into uncomfortable situations.  Between her book tour, her reality show, and supporting her daughter on ABC’s Dancing with the Stars – she doesn’t need to do or say much to get attention.

Her latest triumph? Announcing “We Gotta Stand With Our North Korean Allies” in an interview with Glenn Beck when asked how she would handle a situation like the one currently developing in North Korea. While many defend Palin’s slipup (she presumably meant South Korea), this recent misstep does add one more criticism to her already overflowing plate.

We took to the Twitterverse to find out what the general consensus was on Sarah’s most recent blunder. 

Using TweetTone, we pulled a sample of 1490 tweets mentioning "Sarah Palin" following her slip-up. Not surprising, the overall sentiment of the sample is humiliating. Not ideal for someone considering running for Presidency in 2012. 

Screen_shot_2010-11-25_at_4

You can download or view the entire PDF report on Sarah Palin below. 

Click here to download:
Sarah%20Palin_20101125_1452.pdf (230 KB)
(download)

A few of our favourite tweets:

Sarahpalintweet3
Sarahpalintweet1
Sarahpalintweet2

You can give Tweettone a try with a topic of your choice at tweettone.com.