One thing that I never want to be.

We all have a tendency to learn up to a point, we get comfortable and keep chugging along rarely investing in our ongoing education.

I call it the slow but sure path to irrelevancy.

Let me share my prescription for avoiding irrelevancy: Try new things.

Simple right?

At any given time I have six or seven interesting tools running on this website. That's not including others I actively seek out around the web. Most of them are not even related to my current job or problems I know of. And that's on purpose.

I want to constantly be in the know of new and more clever ways of working with data, tools that are often solutions to problems we don't know we have yet or tools that are sometimes seeking problems to solve!!

Irrelevancy is not fun. Stale people are not appealing (just like, as your mom taught you, a week old bread). If there is one thing you take away from it post I hope it is the importance in investing in yourself / your education / your ongoing awesomeness.

In this blog post I want to share four analytics tools that I have been playing with for a while… tools that solve an interesting problem… tools that point to what might be in terms of our near term analytical future… and in almost all cases they don't even know!

I love doing this, I hope you'll have as much fun as I do.

Terra Cotta Warriors Xian

First Some Context.

Remember I am the creator of the 10/90 rule of investment in web analytics.

I had created the rule many years ago, early into my job at Intuit, and quite simply it states:

If you have a budget of $100 to make smarter decisions on the web…. invest $10 in tools + vendor contracts and invest $90 in people (big human brains inside or outside the company to do analysis and the process of producing insights).

When I had created the rule Google Analytics did not even exist!

The rule was borne out from my own experience having inherited a world class tool we were paying $250k a year for and produced crap. Well not crap… lots of data that no one cared about or actioned. I threw out the world class tool, purchased ClickTracks for a fraction of the cost and put money into Analysts and boom!

Ok not boom overnight… but over the course of a few months the org started to be more data driven, because analysts we hired produced analysis. That fed a virtuous cycle. More analysts. More insights. More desire to be data driven.

So as you look at the tools below remember the 10/90 rule.

In the end it does not matter who has the coolest or the biggest tool. Or for that matter how many tools.

People matter.

You matter.

Remember that, at least for the rest of this post. Ok?

Let's go look at some tools…

Measuring "Invisible Virality": Tynt.

Tynt's promise is simple. Add a piece of javascript to your web page (do a View Source on this page to see it), and it will tell you how often your content is being copied.

Copied! Say it ain't so! :)

tynt report summary sm

[Please click on the above image for a higher resolution version, including all the other metrics.]

In the last month data was copied off one of my posts 5,616 times, with most of it being content and some of it images.

But that's not all.

If you look at the higher resolution version (click above) you'll see it also reports other data like Visits Generated etc.

The way it works is that when someone copies a piece of content Tynt adds a little bit of additional text and a trackable code with a hash (#) at the end of the url from where content was copied.

Like so… the text that was copied from my blog is the first two lines… the Read More and link was added automatically by Tynt…

tynt copied text

When people click on that link Tynt can report visits generated, page views, where the links were posted (in case there is a referrer) etc.

There is additional data like how many of your copies created links that were posted and then clicked on…

tynt silver gold data

Gold are places were the copied text was pasted with the additional "Read more: http://…" text+link were also posted AND someone clicked on it.

You'll note that Tynt's selling point is connected to SEO. The idea that your copied text creates links back to you which in turn creates visits back to you, and per Tynt, better SEO goodness. You know links and page rank and what not!

I *personally* do not see much value in all that data. Two reasons:

1. Most likely the additional text+link will be posted as is only by someone who is quite careless and perhaps only on the least desirable sites. I mean if someone smart's going to copy they'll be clever enough to get rid of the link+text. :)

2. Search engines are complicated little beings. The days of just inbound links counting towards SEO goodness are long behind us.

So I am less enamored by Tynt data that focuses on all that.

I love the data you saw in the very first screenshot, and I absolutely love this…

tynt most engaging content sm

[Please click on the above image for a higher resolution version, including all the other metrics.]

The first screenshot shows how often content is being copied and the above indicates the blog post / web page where the content is being copied from.

Why is this cool?

If you are a regular reader you'll notice that at the end of every blog post (before the start of the comments section) is a Topsy widget.

blog topsy widget

It measures how often a blog post is tweeted/retweeted. Goes viral. Higher the number the better, makes sense?

I also measure the # of Comments Per Post as a measure of how "engaging" / "valuable" people found the content to be. Looking at how often it was tweeted/retweeted is one more layer of information in understanding what subject / ideas in a post / things I write are well received by people and which are not.


Both the above attempts measure two minorities.

1. The rarest of the rare who post a comment.

Context: I write twice a month. This blog has around 70k Visits a month, 39k Feed Subscribers and the average number of comments on each blog post is just 35. Minority perspective right?

2. The rarest of the rarest of the rare who are on social media. Who tweets after all. :)

The cool thing about Tynt is that it allows me to get some sense of "engagement" / "perceived value" / "Like" with the v a s t majority of people who will neither submit a comment nor write a tweet.

People who still use email. People who like something I wrote so much (or hate it so much) that they copy the text and paste it and forward it to others. Or copy the text and post it on their blogs (without attribution of course :)).

I like that a lot.

This entire interaction that was completely invisible to me is now a bit more visible. I can measure the "invisible virality" / "spread" by this big huge non-commenting, non-tweeting audience.

In the time period above I had written 4 posts (5,616 times copies). Check this out… It turns out the post with the fewest comments, just 25, and the fewest tweets, just 100…

tynt invisible virality

…was copied an astonishing 506 times, when all other posts were copied in small double digits.

See what I mean… something I would have perhaps considered to be only a small success turns out was a huge hit with the blog's audience. I just would not have known that so far.

Here's another interesting application. . . Lots of people are measuring "influence" of a blogger (/ piece of content) using data from the "minority activity" (comments, retweets etc) and selling it as the complete truth. But how can you do that without some insight from the majority?

Tynt shares one very interesting piece to the puzzle that perhaps in the future fit some place where we can use it with all other context we have.

Invisible Virality. Cool right?


Applying Smarter Ideas to Measuring "Sentiment": Analyze Words.

Raise you hand if you are in the "If I am any more excited about doing sentiment analysis then I'll pee in my pants".

So many raised hands!

Here's the problem: Most solutions stink. Not just stink… dinosaur's breath after a meal stink.

We are algorithmically trying something that as yet does not lend itself to algorithmic measurement… "emotion". It is darn near impossible to cleanly buckets feelings and nuance into clean Positive, Negative, Neutral buckets.

We, computer programs, are simply not there yet. [Though I am absolutely confident that we will get there at some point.]

For now you are most likely wasting time (and money). Sorry.

sentiment analysis Here's the other problem…

Even if it works… I don't think it works. [What!]

Let's say you have a 100% perfect human read and 100% human categorized analysis on hundreds of thousands of rows of text. Clean into the three desired categories. Like in the image above.

Now pause for a second and think… what could you do with this?

You have aggregated data into three pieces and we all know aggregated data stinks at delivering insights!

That does not mean wanting to identify insights from lots and lots of text is not prudent. It is.

I like a much more nuanced approach.

Analyze Words applies one such nuanced approach to text analysis.

It uses the well established and long use LIWC (Linguistic Inquiry and Word Count) methodology to categorize all your delightful text (in this case your tweets).

Why the LIWC? Here's the idea behind the LIWC:

"The ways that individuals talk and write provide windows into their emotional and cognitive worlds."

Cool right?

You go to Analyze Words and you punch in your twitter id and bam (!) your "psychological" profile, or in this case mine…

analyze words avinashkaushik analysis

Nice eh?

No simplified over promise under deliver aggregates!

The three categories and 11 sub categories provide much much much more nuanced understanding of what your text is saying, in this case for your twitter profile.

Why is this cool?

In this case measuring "Personable": Engaged in other people's well-being and at peace with expressing your own uncertainty about the world. High Scores in personable use positive emotion words, ask questions, express their own ambivalence and reference others frequently.

Better than positive, negative, neutral right?

Or "Analytic": "If law school exams were a persona, they would rank real high in this category. Ample large words and phrases that include complex thinking styles (e.g. "if – but not …")."

Love it!

Two magnificent things about this approach (remember it's not the tool, its what you do with it :))…

1. It is very sophisticated in the approach it is applying. Nuance and segmentation rule the day. There is nothing, nothing, more sexy in the world of web analytics.

2. It is immensely actionable. You can quickly see areas where you are scoring well, where you are not and you can start to take action to fix things!

Of course you can do even more.

You know how you are doing… now compare it to your "competition" and find their strengths and weaknesses…

analyze words competitive intelligence analysis

When you do competitive analysis, like above, find contrasts with your own profile, what your brand stands for in the world and their brand stands for.

Highlight differences where you brand strength is strong. Hopefully they'll discover where they stink and for the sake of humanity fix that.

Nice eh?

Analyze Words provides a glimpse of an approach that I hope others follow.

Rather than trying to find short cuts, where none exist, and provide aggregate data, where it just gets crapified, follow a well established methodology while leveraging segmentation and nuance.

We've applied it just for Twitter in the above case but you can easily see how you could apply it to call center data, tech support websites, forums, online survey open text voc answers and so much more.

Applying Simpler Ideas to Measuring "Sentiment": StatsIt.

StatsIt started off as a differentiated web analytics tool, but has morphed into a delightful social media monitoring tool.

[Update: Oct 18: StatsIt is evolving its solution. But in this section my hope is to focus less on the tool itself and more a type of analysis that we can use in our daily life.]

It's approach is to index blogs and tweets and delicious and twitter and youtube and on and on and analyze that data to find yummy actionable insights about your social media presence / activity.

Like all tools it gives you pretty charts…

statsit mentions analysis sm

Sweet, now you know how much "activity" is happening. Give it to your boss, she'll be impressed. You on the other hand realize "activity" rarely has insights.

I want to focus on just one part of StatsIt that I adore because of how simple it is in its brilliance when it comes to finding insights from lots of text.

StatsIt has indexed a ton of content from all the social web activity. When you tell it your brand terms (or just your brand name, in my case "avinash kaushik") and it churns through that social web data to provide you with something awesome…. a tag cloud!

statsit emotional tag cloud sm

[Click on the image for a higher resolution version, along with a peek at other metrics.]

Why is this cool?

Mikko and his team have taken 1,000 words from the English language that are connected to emotion. Good emotion, bad emotion, ugly emotion.

They look at their social web data and in that they look at the words around your brand mention and finally identify the emotional words people are using in context of… you!

The tag cloud above shows the emotional words use around mentions of me for a month's worth of time.

Without having to read all the text I can at a glance now get a really good understanding of the tone and texture of activity around my presence. More importantly it does not take all that long to figure out what emotions should be there but aren't.

A very simple, effective and elegant solution to a complicated problem.

Oh and guess that happens when you click on one of the words in the tag cloud?

You are right… it takes you directly to the text from all the data that StatsIt has collected!

By clicking on the words you are essentially segmenting your data and drilling down to the text (tweets, blog posts) where you can learn more about what the person was saying when they express, say, "great" as an emotion. :)

Effective "sentiment analysis" baby!

Why can't we be this simple in other places?

We are always seeking complexity. Here are two ideas that popped into my head from the StatsIt's approach that might apply in other places.

We collect lots of open text from our online surveys right?

Rather than finding the perfect answer to what's expressed in the text, and of course getting it wrong, why don't the vendors show us a emotional tag cloud?

Can there be a better / easier / faster way to allow us to make sense of all that text, leverage as a segmentation tool and find insights every day?

Vendors! Come on!!

Another idea.

Reviews are important. Most ecommerce sites have them.

But why is it that we only see "quantitative" analysis of the reviews? 5 stars. 3.2 moons. 61% rotten tomatoes. Etc etc.

The richness of the review is only partly in the quantitative analysis of the rating. The real sweet nectar is in the words people write in reviews.

I recently gave a talk at eBay. So let's use that as an example.

You get quick quant rating on eBay that you typically use. But perhaps the real gold is here….

ebay reviews

This seller, me, is 100% positively rated.

Now let's say that you want to buy a Sony digital camera that is listed by both me and Emer. We both have 100% positive ratings for our 60 or so prior eBay auctions.

How can you best decide if you should buy from me or Emer? You can't possibly read 120 reviews, or even scan them quickly.

Now would your life be much much easier if eBay choose to provide an "emotional tag cloud" for both Emer and Avinash?

Very quickly you could see that while we both have same quant ratings it turns out that my emotional cloud shows a neutral to positive feelings expressed while Emer's is outrageously positive.

Now is it easier to decide who to buy from?

As our dear friend Sarah Palin would say: You betcha!

So why does eBay not provide this simple emotional tag cloud?

Or for that matter Trip Advisor or Amazon or any site that hosts reviews and ratings?

Simplicity rocks. Especially when it's actionable.

Quick, Efficient, Effective Mobile Analytics: Percent Mobile.

It is always a really good idea in web analytics to understand how data is captured (case in point the delightful blog post on Competitive Intelligence data capture).

No where is this more true than when it comes to mobile analytics.

There are many methods of collecting data depending on the platform you are on, and if Steve Jobs gets upset he can totally shut you down with a mere update of his TOS! :)

I am not going to cover all that here today. For those of you who already have my second book Web Analytics 2.0 please jump to Page 250 to learn all about data collection options, platform limitations, challenges with campaign analysis and finally reports and KPI's you should measure for mobile.

In this blog post I want to share a lightweight wonderful mobile analytics platform called Percent Mobile.

Now most web analytics tools, like Google Analytics and WebTrends and others, will capture and report data for javascript enabled smart phones (like the iPhone, Android and some Nokia phones). Honestly that is all the traffic that is of commercial value, so even if you miss the rest it is not the hugest of deals.

But all these "big boys" have simply "added on" mobile analytics to their tools. The result is that they suffer from both a lack of imagination and, this is important, truly great databases when it comes to devices and carriers and other unique mobile information.

Not Percent Mobile.

They have two incredible benefits:

1. A really expansive and accurate database and detection mechanism when it comes to mobile platforms.

2. A really simple UI and reporting layer, even your mom will understand the data.

They also have four different methods of enabling data collection, I am using their standard javascript tag on this blog (do a View Source).

Here is what the resulting data looks like…

percent mobile dashboard sm

[Please click on the above image for a higher resolution version.]

No hunting and pecking to find the data, like you would in Google Analytics or Site Catalyst or CoreMetrics. A quick at a glance view of how much traffic is mobile, key stats about the devices, the devices themselves (go iPad!!), vendors and operating systems.

If you compare this to your web analytics tool you'll notice almost immediately how much better this data is compared to what the "big boys" are reporting.

Click on the image above and you'll see a bit more clearly other really sweet metrics. % of mobile devices accessing your site via WiFi. Phones with touch screens and full keyboards etc.

[Can you imagine how cool it would be to segment your mobile traffic for full keyboard phone vs none and see which convert better. Or does access via WiFi mean more content consumption than via 3G? Etc. So cool.]

That is not all… if you scroll a bit more you can get a country map view, the networks used to access your site (AT&T still #1 for me!) and countries etc.

Of course it would be hard for me to like any tool that does not allow segmentation. :) You simply drag and drop on to the box on top..

segmented mobile analytics

And what would an analytics tool be without the normal search, referrer and all that data we have so come to love (and hate!).

percent mobile search site data

I particularly like the "Activity Types" box at the bottom left, I don't know why web analytics tools don't categorize referrers by default.

I am also surprised at the long tail of referrers. Yes Google is big but there are 91 other referrers for this segment. More mobile SEO!

key mobile metrics

Why is this cool?

It might seem odd that I would like a tool that would give me similar data that I can get out of WebTrends or Omniture or Xiti or whatever.

The first reason is that, as mentioned above, the data is actually much better because of the databases that power Percent Mobile.

The other thing is that getting this data causes less pain than pulling my two front teeth.

Finally I so do like supporting pretty tools, especially if they have good data!

The one thing Percent Mobile lacks is some way of measuring any outcomes. I can certainly dig to my "conversion pages" but it would be great if they just let me just input them into the tool and then they could measure outcomes for me (even if it is like the Goals process in GA).

But if you want a light weight easy to use free mobile analytics tool just throw Percent Mobile on your site and have fun. Go to www.percentmobile.com , click Sign Up (top right) and use the Invitation Code "Avinash" (no quotes).

Mobile rocks no?

Summary Of Our Lovely "Let's Keep Learning" Cruise.

It is important to point out that I am not affiliated in any way with any of these tools / companies. I am also not recommending overtly or covertly that you buy / use them. That is totally your call.

Of course I would not personally use them or write about them if I did not thing they had value. :)

My sincere hope is that you'll internalize:

1. How important your ongoing education is. DBS: Don't be stale!

2. What it is that each tool does that is so unique, what unique problem each solves.

3. Why it is important that you can separate the wheat from the chaff, notice how I quickly put aside most data from Tynt to focus on just what was important to me.

4. Where are new places in your business you can apply things you learn from analytics, like in my example of emotional tag clouds for Ebay or Amazon.

5. Why simple and effective is better than expensive and complicated (even if "perfect").

I hope you got that, more than names of interesting tools.

I cannot tell you how much fun it is to step outside the world of Omniture and Google Analytics and other traditional web analytics tools. It stretches your mind and sometimes you look at these new techniques and data and you notice you are smiling and feel so happy.

Try it, and have fun.

[In case you were curious at the moment I am playing with these incredibly cool tools: PostRank, Next Stage Sentiment Analysis, SEO Effect, and Colligent. Each in its own way does something magical and quite unlike anyone else.]

Ok your turn now.

What do you think of the work that Tynt, Analyze Words, StatsIt & Percent Mobile do? Have you tried any of 'em? What obvious flaws did I overlook? Are there other tools you are using in the Viral, Social, Sentiment, Mobile space that you really love? If so would you please post them in comments?

Please share your feedback / critique / ideas.


Couple other related posts you might find interesting:

Social Bookmarks:

  • services sprite
  • services sprite
  • services sprite
  • services sprite
  • services sprite
  • services sprite
  • services sprite
  • services sprite
  • services sprite
  • services sprite
  • services sprite