Web Analytics Blogs

Judah Phillips is an experienced web analytics practitioner and Internet expert currently working as a Senior Director at a large, global Internet company. His blog is full of useful, unbiased, actionable insights learned from the real-world practice of a process-oriented, integrated approach to strategic Web Analytics for improving business performance.

Subscribe to Judah Phillips weblog

Archive for 'Web Analytics'

« Previous Entries Next Entries »

Tracking Rich Internet Applications with Google Analytics

About a year ago, I wrote a guest blog post over on Robbin Steif’s blog about using Google Analytics for tracking Javascript and Flash events.  This weekend Jeremy Geelan, SVP over at Sys-Con Media, asked if he could republish the work.  Of course I said “yes.”  Then I noticed that a lot has happened to GA in a year (and more to come, ahem, API’s!).  What I had wrote was now incomplete, so what you’ll find below is my attempt to sum up “event tracking” using ga.js and the Great Google’s Event Tracking Data Model.  Let me know how I did covering it, and if you think I should clarify of expand on anything.

Since we all know about page tags, let’s get down to business with “the Google” and how it tracks “the Rich Media.”  Google Analytics currently has two different javascript page tags:

  • urchin.js.  The legacy version of the Google Analytics page tag.
  • ga.js.  The current, rebranded version of the Google Analytics page tag.

How you track rich media depends on which page tag you are using.  I’ll discuss using urchin.js first, then ga.js.  I’ll also provide some information about Google’s Event Tracking function for capturing specific “events” within their event architecture.

Tracking Rich Media using Urchin.js

In the legacy version of Google Analytics, the smarties at Google created a little JavaScript function called urchinTracker() that enables event tracking.  Use the JavaScript function with an argument specifying a name for the event. For example, the function:

javascript:urchinTracker(’/mysite/flashrichmedia/playbutton’); 

logs each occurrence of that Flash event as a page view of:

/mysite/flashrichmedia/playbutton

Some caveats:

  1. Always use a forward slash to begin the argument.
  2. Actual pages with these filenames do not need to exist.
  3. You can organize your events into any structure or hierarchy you want.

Important: Google says to place your tracking code “between the opening tag and the JavaScript call” if your pages include a call to urchinTracker(), utmLinker(), utmSetTrans(), or utmLinkPost(). For example, if the page view is the major event and the “play” event a minor event; then, your hierarchy would be Page View > Event, where the page contains an event, such that:

/mysite/ria_bittons/playbutton
/mysite/ria_bittons/pausebutton
/mysite/ria_bittons/playbutton
/mysite/ria_clips/clip

Some examples of the code (from Google Help):

on (release) {
// Track with no action
getURL(”javascript:urchinTracker(’/folder/file’);”);
}

This one above tracks when you click and release (although technically, it just notices the release) of a flash button (and records the file you specify as a page view).

on (release) {
//Track with action
getURL(”javascript:urchinTracker(’/folder/file’);”);
_root.gotoAndPlay(3);
myVar = “Flash Track Test”
}

The second one is the same, but by using a function, passing it a parameter, and identifying the instance you want to track, you can measure when your file was used in a specific scene in a little flash movie. So it is a more specific method for handling event tracking in Flash.

onClipEvent (enterFrame) {
getURL(”javascript:urchinTracker(’/folder/file’);”);
}

And the third one repeats the action throughout the movie so that each time the file is loaded, it gets tracked as an event. If you were to pass a unique file at the end of the movie, you could recognize it using this method (or the other methods) to know that the whole movie was watched (as long as your session doesn’t time out). Next, wait until Google updates your analytics, then check the Top Content report to see if it all worked. Now let’s discuss how to the exact same thing using the new trackPageview function released with ga.js.

Tracking Rich Media using ga.js

In the current version of Google Analytics, the brainiacs at Google created a little JavaScript function called trackPageview() that enables event tracking.  Use the JavaScript function with an argument specifying a name for the event.For example, the function:  

javascript:pageTracker._trackPageview (“/mysite/flashrichmedia/playbutton”);

logs each occurrence of that Flash event as a page view of:

/mysite/flashrichmedia/playbutton

Some caveats:

  1. Always use a forward slash to begin the argument and use quotes around the argument.
  2.  Actual pages with these filenames do not need to exist.
  3. You can organize your events into any structure or hierarchy

You must put calls to _get._getTracker and _initData above the call to _trackPageView.  For example, you would insert the following code:

<script type=”text/javascript”>
var pageTracker = _gat._getTracker(”UA-xxxxxx-x”);
pageTracker._initData();
pageTracker._trackPageview();
</script>

Here are some examples of the ga.js code (from Google Help) that replicate what I described above using the most recent code:

on (release) {
// Track with no action
getURL(”javascript:pageTracker._trackPageview(’/folder/file.html’);”);
}

This one above tracks when you click and release (although technically, it just notices the release) of a flash button (and records the file you specify as a page view).

on (release) {
//Track with action
getURL(”javascript:pageTracker._trackPageview(’/folder/file.html’);”);
_root.gotoAndPlay(3);
myVar = “Flash Track Test”;
}

The second one is the same, but by using a function, passing it a parameter, and identifying the instance you want to track, you can measure when your file was used in a specific scene in a little flash movie. So it is a more specific method for handling event tracking in Flash.

onClipEvent (enterFrame) {
getURL(”javascript:pageTracker._trackPageview(’/folder/file.html’);”);
}

And the third one repeats the action throughout the movie so that each time the file is loaded, it gets tracked as an event. If you were to pass a unique file at the end of the movie, you could recognize it using this method (or the other methods) to know that the whole movie was watched (as long as your session doesn’t time out).

Tracking Rich Media using Google Analytics Event Tracking

When Google released ga.js in fourth quarter 2007, Google also released a data model for tracking events.  It provides more flexibility and ease of customization than the methods I described above.   The data model makes use of:

  • Objects. These are named instances of the eventTracker class and appear within the reporting interface.

var videoTracker = pageTracker._createEventTracker(”Movies”);

  • Actions. A string you pass to an event tracker class instance as a parameter.

videoTracker._trackEvent(”Stop”);

  • Labels. An optional parameter you can supply for a named object.

downloadTracker._trackEvent(”Movies”, “/mymovies/movie1.mpg”);

  • Values. A numerical value assigned to a tracked object.

To set up event tracking you should:

1. Identify the events you want to track.
2. Create an event tracker instance for each set of events.
3. Call the _trackEvent() method on your page.
4. Enable “event tracking” in your profile.

To instantiate an event tracker object, you might do something like this:

var myEventObject = pageTracker._createEventTracker(”Object Name”);
myEventObject._trackEvent(”Required Action Name”, “Optional Label”, optionalValue);

createEventTracker() is order dependent and must be called after the main tracking code (ga.js) has been loaded.Next you would call the _trackEvent() method in your source code either on every page that contains the event or as part of the tracking code for every page:

_trackEvent(action, optional_label, optional_value)

If you wanted to track interaction with the Flash UI, such as the button on a Flash Video Player, you would create a videoTracker object with name “Video”:

var videoTracker = pageTracker._createEventTracker(’Video’);

Then, in your Flash code for the video player, you would call the videoTracker object and pass a value for the action and label for the event:

onRelease (button) { 
   ExternalInterface (”javascript:videoTracker._trackEvent(’Play’, ‘MyVideo’);”)
}

You could also use the ExternalInterface ActionScript function as an eval() function to parse FlashVars and attach them to every Flash UI element that needs a tracking action.  For example, the code below associates a Stop action for the Video object and retrieves the provided label and value from the FlashVars:

onRelease (button) { 
   ExternalInterface (”javascript:videoTracker._trackEvent(’Stop’” + label + “,” + value + “);”)
}

Adding event tracking code would generate event reports in the Content section of the Google Analytics Interface.  Pretty cool stuff, Google!

google-analytics-event-tracking.png

Web Analytics Report 2.0 is Out!

One of my favorite personalities in Web Analytics is Phil Kemelor - author of the Web Analytics Report.  Phil is also Vice President of Strategic Consulting Services at SEMphonic and an analyst at CMS Watch

I first encountered Phil in late 2006 when he called me up to discuss my take on the vendor landscape at that time.  We instantly hit it off because, like him, I seek truth, and have a lot of trouble believing vendor spin and hype.   We found each other’s insights “refreshing,” and we’ve kept in touch ever since, grabbing food together/swinging back a few beers/hanging out at conferences, chatting via email, and just generally staying in touch.

Fast forward to late 2007, early 2008, when I find out that Phil was revising his awesomely comprehensive Web Analytics Report and releasing the 2.0 version.  We’ll the time has come, and WAR 2.0 is out.  If you can swing the cost (it’s over $1000 US), I highly recommend purchasing it, especially if you are new to the industry and trying to make a vendor selection, or if you are old to the industry and want to get a solid sense of current vendor capabilities.

The 343-page report goes over all sort of juicy stuff - from beginner information about “what is web analytics” to concepts around the “web analytics business case” to deeper dives into “web analytics technology.”  It covers an abundance of useful information about data collection, data sampling, data exporting, dashboarding/reporting, segmentation, licensing, organizational requirements, and how to select a web analytics vendor.  He’s also done a good job, imho, discussing 15 different vendor tools, which shows you that the vendor landscape is a lot larger than just Google Analytics and Omniture - that’s for sure.

So if you have the $$$ to spend, check it out.  It’s an excellent addition to my analytics library.  Good work, Phil!

Web Analytics needs IT and the Business needs Web Analytics

I’ve been so busy folks, I’ve had no time to blog, so forgive me for my two week hiatus.   

The classic problem of “marketing versus IT” is real.  If you are lucky, you work with an excellent IT team (like me!), then this problem will be minimal if at all.  But in most cases, based on what I hear from my industry colleagues, the analytics team often has issues with IT resources being sufficiently delegated to supporting a web analytics implementation and program.

The classic problem goes something like this:

  1. Marketing:  We need advanced customizations, deep integrations, increased scalability, better performance, and more control overall over Web Analytics.
  2. IT: We don’t have resources, time, or budget to help you right now.  Fill out these forms and in the future maybe we can help.

In a nutshell, this is one of the reason why hosted solutions exist (SaaS, ASP, on-demand, whatever).  While it’s hard to do web analytics, it’s even harder to do it internally using actual software that you run.

Wouldn’t we prefer it to go something like this:

  1. Marketing: We need advanced customizations, deep integrations, increased scalability, better performance, and more control overall over Web Analytics.
  2. IT: Yes.  Can do.  Will do.  What do you need and when do you need them by?

My belief is that to “do web analytics” the right way, you need an allocation of IT resources to support your implementation and extend it to fulfill strategy and improve business performance.   After all, I firmly believe web analytics is for optimizing business performance, guiding strategy, and supporting tactical decisions.   And to do all that, you need resources when you need them.  The larger your site or portfolio of sites, the more resources you need.  It’s all pretty logical.  Getting back to IT, if you’re using a hosted solution, you need fewer IT resources.  The vendor takes care of a lot of IT stuff.  If you are running your analytics in-house, you need a team of IT resources because you will be doing it all yourself.  

I would prefer those technical resources report into Web Analytics, but I’m not sure if the general business world (as in non-Internet companies) sees the ROI of Web Analytics clearly enough to immediately delegate a full-time “mini IT” team to support analytics at phase zero (i.e. when you first get hired and plan the rollout).  And that’s why you need to be very wary of what vendors tell you about IT requirements and web analytics. 

If management expects that you just need to tag the pages and you the analyst can do that yourself, your company will be in for surprise.  It’s never that simple.  Smaller companies with one or a few sites that use the same technology may be able to pull off the solo cowboy analyst including tags and doing all the tech work.  Google has made that fairly easy.  But larger companies that have many sites and many different technologies serving those sites are a much different animal. 

My advice is that you can’t be fooled by vendor messaging that claims “you don’t need IT.”  That’s bull$4!+.  Marketers can’t do Web Analytics alone and in isolation.  You will need IT to help you extend your web analytics solution.   And as I’ve already stated, the level at which you need IT will vary on how you “do” web analytics.  It differs greatly if you are running an in-house proprietary solution, an internal vendor solution, or a hosted solution. 

If you are doing web analytics using a proprietary solution you created internally, you may probably then already understand what I mean when I say ”web analytics needs IT.”  Chances are you are using an OLAP-based solution that has huge BI infrastructure behind it and the cubes contain latent information.  Your data model may be limited compared to the major vendors.  Your tool may be overly complex, hard for business users to use, and limited in terms of features, or it may be the coolest thing since sliced bread, and the people who created it may know more than the vendors.  Still, unless resources are adequately delegated to support analytics and extending the implementation, your tool users and report consumers will make thousands of requests to IT, and they will go unfulfilled leading to user frustration.

If you are running an in-house software solution, such as that provided by Unica, WebTrends or Visual Sciences, you will rely on IT for all sorts of things, like hardware and software maintenance, database administration, network support, and will need to leverage help desk and ticketing systems.  In addition, web analytics projects become part of the IT project planning cycle with budget requests and consideration.

If you are having your web analytics tool hosted.  IT may be the ones who actually put the tags you field on your web site.  Modifications to any javascript may need to be done by IT.  You will need to reach out to IT for help with setting up cookies, changing the DNS, and writing any code that assists with web analytics.  “Change management” will be required. ;)

If a business wants to succeed with Web Analytics, it must determine how to effectively resource the implementation and ongoing extension of an analytics platform.  Here are some tips for ensuring you get the resources you need:

  • Factor web analytics resource needs into the capital budgeting and yearly planning process.  Business stakeholders must identify the IT resources they need in advance, and then align the IT team according to business goals.  Resources must be allocated according to financial guidelines that maintain corporate profitability. 
  • Document your web analytics projects and business requriements and share the documentation with IT.  Whether your web analytics projects are related to implementation, campaign optimization, data description, or integration, you need to share that information with IT so they can determine how to support analytics. 
  • Identify and document why you need IT resources.   In other words, identify and document what IT will be doing for web analytics and how their work is necessary for improving corporate performance.  On the business side, explain that you won’t be able to fulfill X business goal without IT resources.
  • Leverage a project manager.  Project managers are critical and important to cross-functional team success.  They focus work, align people, determine tasks, monitor completion, and allow a multifaceted team of business marketers and IT to do what they do without worrying about managing the project.
  • Share your analytics success with IT and let stakeholders know how IT has helped you.  Often times corporations forget that these very talented IT folks are working really hard behind the scenes, often without getting much (or any) credit for the complex work they do.  When you have an analytics success, share it with the folks that helped you tag the pages or configure your servers.  When people are singing your praises in the cafeteria because they now have the data they need to do their jobs and/or you’ve improved their business performance, let them know IT backed you up and helped you deliver.  There’s enough glory to go around.

If you do what I’m saying in this blogviation, the problem  of ”marketing versus IT” will be minimized.  IT will be able to keep up with all of your constantly-evolving business requirements and the dynamic, high-maintenance nature of your web analytics program.   And your marketing department, business stakeholders, and executive team will be very happy with the results. 

Anil Batra needs your help with a Bounce Rate Survey!

My friend Anil “Batman” Batra, over at ZeroDash1, created a new survey on “bounce rates.”  He’d really like you, good reader, to take the survey, and so would I.  It doesn’t take very long to complete.  I’m looking forward to him sharing the results, for free, with the entire industry. 

His survey can be found here: 

http://www.surveymonkey.com/s.aspx?sm=IFDf5Jtenl_2fsq_2fuwemHmJA_3d_3d

If you’ve never met Anil, he’s a wonderful dude who keeps up an awesome blog over at: http://webanalysis.blogspot.com.  I’m a regular reader of all his thought-provoking blogviations.  He batted 5 for 5 on his Web Analytics 2007 predictions, and has posted his predictions for 2008 too.  Check it out.  And thanks in advance for taking his survey!

Web Analytics Prognostications for 2008

What’s the future hold for Web Analytics in 2008?  Here are a few predictions:

  • Google Analytics releases a real API for getting (and perhaps setting) data.  As you know, I think GA is a fine tool for web analytics, but has severe limitations when you want to control over your data or to feed data into other systems.  Thus, I predict Google Analytics will go beyond the “Tracking API” and release a real API that allows you to at least get data out of the tool (if not set data as well).  Think of what Feedburner does with their REST-based Awareness API.  Wouldn’t that be nice to have with GA?!
  • HBX Analytics goes away.  I’d be more than a bit nervous if I were an HBX customer because Omniture is going to sunset HBX and migrate everyone to SiteCatalyst, then try to aggressively sell them the rest of the suite. 
  • Long live Visual Sciences.  VS is a powerful tool quite superior in some regards and very different than anything else Omniture offers.  It’s also real in-house software, not some blackbox.  VS’ extensible schema, flexibility in reporting, scalability, and performance is quite unparalleled in the industry.  I can’t envision Omniture killing it (unless they peel it apart in order to create Discover 3), like they will HBX. 
  • WebTrends rebrands.  I’m not sure if you agree, but imho WebTrends Marketing Lab was an attempt to rebrand WebTrends.  I expect that interim management will continue attempting to differentiate WebTrends by rebranding products and perhaps the entire company.
  • New and updated standards are released.  As a member of the IAB’s Measurement Council I can tell you that the IAB is getting ready to release the IAB Audience Measurement Reach Guidelines, which attempt to clarify and take a stand on various aspects of server/client-side analytics and audience measurement.  I also envision the WAA increasing the number of terms they define.  But standards are just dandy and quite meaningless unless they are adopted… thus…
  • Standards enforcement is attempted in order to propel adoption. Existing and forthcoming standards will be enforced in 2008.  Enforcement from the WAA will probably come in the form of a publication of a matrix or documentation citing which vendors adhere to the standards and to what degree, what’s missing, what’s different, and so on.  If decision-makers who control budgets believe in standards, this type of document will cause the question ”do you adhere?” to be asked.  If vendors start losing deals because the answer is “no, not at all,” vendors will adopt the standards. 
  • Internal data integration becomes more important for companies and problematic for ASP’s.  When we talk about “integration” I often think people can be a bit shortsighted.  They want to integrate data from other third-party services and tools (like Salesforce.com and their ad server).  While there is certainly real value in integrating external data with web analytics data, significant value comes from integrating web analytics with internal data, such as data residing in internally-hosted CRM systems, finance, subscription, and lead generation databases. Most vendors have barely figured out how to deal with detail-level external data integration in 2007, even though many customers are demanding it.  I expect that in 2008, internal data integration will be more commonly demanded and even more problematic for ASP’s. 
  • BI tools provide better support for and integration with Web Analytics tools.  The current allotment of “enterprise” level web analytics tools are inferior to the capabilities provided by business intelligence tools from companies like Business Objects or Cognos.  Expect these BI vendors to create features for dealing with web analytics data in 2008.  Either that, or these web analytics tools need to grow up and learn a few things from BI. 
  • Web Analytics as performance management.  KPI-based site optimization means using data to guide the modification of user experience to deliver on goals.   Since goals are measurable and can be plotted against performance, it’s totally logical to use web analytics as a performance management tool.  Expect to see that gestalt in tool usage come into vogue and be discussed more in 2008. 
  • Web Analytics as part of business process automation.  Having the marketing department fielding page tags with campaign codes may work for some (small) companies, but when you work for an enterprise with thousands of clients and simultaneous campaigns across multiple channels, endemic tagging and subsequent tool configuration becomes challeging.  As part of the web analytics process, I expect to see tools support some level of business process automation enabling web analytics.
  • Features for measuring the Mobile Web.  Right now, with a log file based tool, I can segment out Mobile traffic based on user agent.  If I want to use a page tag, I have to consider js limitations.  The mobile web is the next frontier, and I only know of one web analytics vendor who is doing a decent job measuring it right now, so I expect to see more features released this year for measuring Mobile.  

So that’s that.  Like a band named PIL once said in the song called Rise “I could be wrong, could be right!”  Am I off-base, misguided, accurate, do you disagree, agree, then let me know… I’d love to hear your thoughts and your predictions for Web Analytics 2008…

crystalball1.jpg

Thinking Back on Online Metrics in 2007 and Looking Forward to 2008

Every month I write column for MediaPost.  This month I wrote  a short summary piece I thought I’d share with you in case you missed it.  Here it is:

As 2007 ends, I thought it worth looking back, from the practitioner perspective, at just a few of the issues that have shaped Internet measurement and thus online metrics over the last year:

  • The Page View is Dead, Long Live the Page View.  During 2007, technologies like AJAX and Flash continued to erode the construct of the page view.  These technologies render content in a browser but do not always make requests to the server to do so.  If the technology you are using to measure behavior requires the page request and you do not have a page request, what do you measure?  The major vendors of online metrics tried to answer that question. 

Various audience measurement companies claimed “total minutes” and other time-based derivatives were better alternatives to measuring the page view.  Web Analytics companies rolled out features for measuring “events” subordinate or equal to the page view (and highlighted already existing time-based metrics).  Ad serving companies made inroads in reconciling view-through to assist advertisers in understanding the latent effect of ad exposure on the purchasing lifecycle.  Yet all these technologies still count and report page views.

  • Engagement, Engagement, Engagement.  One of the hot topics in 2007 was a carryover from 2006.  Definitions for “engagement” emerged from the worlds of advertising, social media, online metrics, and more.  Engagement has been described as “turning on a prospect to a brand idea enhanced by the surrounding context” to “repeated, satisfied interactions that strengthen the emotional connection a customer has with the brand” to “apparent interest” to the more metrical “estimate of the degree and depth of visitor interaction against a clearly defined set of goals.” 

“Engagement” is very specific to the site being measured and full of nuance.  This fact has led agencies, consultants, and various companies to create complex engagement indices consisting of measures of key behaviors.  Behaviors are tallied and segmented in order to calculate an engagement metric, which is then used as the basis for site optimization.  These indices go far beyond often-cited simple time-based measures of engagement.  For a well-thought-of example, see Eric Peterson’s Engagement Metric.

  • Cookie Deletion, Again!  Jupiter Research, in 2005, first uncovered and quantified how cookie deletion can affect unique visitor numbers in web analytics systems.  The effect of cookie deletion is not quantifiable in the basic way audience measurement companies want to quantify it in 2007 – by only examining cookie deletion rates from self-selecting panelists who visited one portal site and an ad server. 

Cookie deletion behavior varies greatly by audience segment and by site.  It may be as much of an accuracy problem in web analytics as selection bias and coverage errors are in panel measurement.  It is worth noting that some audience measurement firms use cookies to collect panel data. 

  • Black Box Audience Measurement.  Many questions were asked about whether audience measurement companies adequately measure “unique visitors” or “unique users” or just the frame of self-selecting “unique panelists.”  In audience measurement, counts of “unique visitors” are generated using complex, black-box mathematics that project observed metrics to the entire online universe.  The projections are always unequal to the same data provided by other audience measurement companies or web analytics tools.  Panel inconsistencies (across at-home, at-work, at-university, or specific to the geography being measured) may cause some level of bias and error. 

Accounting for the difference between actual, observed panel metrics and projected metrics is perhaps even more challenging to clarify and resolve than the measurable effect of cookie deletion. 

  • The Continuing Need for Standards Enforcement.  2007 was the year two significant industry bodies continued working on standards related to online metrics: the Internet Advertising Bureau and the Web Analytics Association.  While each organization serves the needs of different constituencies, they both share the inability to enforce standards.  Both bodies can say what you should do, but not what you “must” do. 

Throughout 2007, these issues (and others) brought increased attention and scrutiny to online metrics.  Corporations are inextricably linking online metrics to site and channel strategy and performance, and thus to overall corporate profitability.  The “numbers” are now more important than ever for managing an online business and maximizing online revenue.  Nevertheless, questions are still being asked about accuracy, precision, usage, and sources of online metrics.  We have a ton of collaborative work to do in 2008 to provide the best answers and numbers we can. 

Happy New Year!

happynewyear.jpg

A Few Guiding Principles for when you’re new to Web Analytics Management

If you’re new to web analytics, or the management of web analytics, it’s always helpful to get advice from someone whose been in the industry.  Here are a couple of simple principles I’ve found helpful:

  • Investigate and be skeptical of claims by vendors. Upgrading a tool?  Migrating to a new tool?  Completely replacing one?  Adding a tool to your analytics arsenal?   Someone is then trying to sell you something.  Vendors really want your money.  And when it comes down to it, it seems the bigger the vendor and the smaller you are, the less attention you will get, even when you need their help really badly.  Vendors attempt to have their own, differentiated spin on web analytics - one of the reason the WAA developed standards.  Avoid falling into the mental model of any one vendor… push their envelope… champion for standards.  In doing so, make sure to realize that a good relationship with a vendor based on an understanding of mutual strengths and weaknesses will lead to collaboration important to your success.
  • Don’t bias yourself when it comes to data collection.  Page tags are imperfect.  Orchestrating and maintaining comprehensive, deep tagging is never easy.  Log file analysis, while often disdained, has utility for situations where page tags are insufficient (say you want to detect how search bots crawl your site) or when tags are impractical (mobile).  Packet sniffing can be beneficial when you can’t tag or access log files.  In fact, some combination of hybrid data collection, for example page tags and logs, might be the best option for a given situation.  It’s up to you to work with your team to determine what method is most appropriate to use to reach your goals. 
  • Realize that your IT department is your ally.  Marketers can’t do web analytics without IT.  Whether they help you include page tags, locate and synch your log files, or monitor the implementation of your packet sniffer, you will need to develop and maintain a relationship with IT.  You, the business marketer, may think quite differently about web analytics than IT, so work toward a mutual understanding that supports business and technology goals.
  • Reconcile what it means to you when you hear that “Web Analytics isn’t easy… Web Analytics is hard!”  That’s a quote from Eric T Peterson.  He’s right.  You need people, processes, workflows, dedicated resources, teamwork, teamwork, teamwork.  How do you prepare an organization for something that is hard?  How do you prepare a team?   Training, education, evangelism?  Yes, that helps… You can make it easier by reading the books, checking out the blogs, going to the conferences, taking a UBC course, and participating in industy organizations like the Web Analytics Association.   

principles.gif

Part 2: Web Analytics Tools – How Do I Know I’ve Outgrown Mine?

Web analytics tools can be outgrown, like houses, clothes, shoes, music, books, and ways of thinking about the world…  But how do you know when you’ve outgrown your web analytics tool?  In part 1, I began the list of five symptoms of an outgrown web analytics solution, which was spawned out a preso I recently gave.  The five symptoms include:

So without further adieu, here’s the rest of the list and some thoughts regarding these symptoms (click here for Part 1):

  • Limited Integration.  Soon after deploying a web analytics solution, you will become intimately familiar with clickstream data and simple counts of things (like page views) and measurements (like time).  You will hopefully have deployed Key Performance Indicators for understanding how effective your web site is at converting visits and meeting defined business goals.  Depending on the web analytics tool you use you may even have insight into the behavior and KPI’s of visitors from online channels like newsletters, search, and rss because you have applied “tracking codes.”

Soon will come a time when you may ask yourself how do I integrate data from other data sources?  You may want to bring data from an email service provider, CRM system, and registration databases, so that you can see delivery rates next to conversion rates from newsletters, or so that you may pass behavioral information about a visitor who registered on your web site.

You may want to move data out of your web analytics system.  Perhaps you want to feed your data warehouse?  Or you may want to feed web analytics data into a targeting system.   Simple XML-based feeds from a hosted solution may not suffice.  You will need access to your data in a open database. You may even want to stop non-human readable text and character strings from appearing in your reports. To do so you may need to lookup data using various methods in order to make reporting comprehensible.  All of these goals require some level of integration.

If your current web analytics tool can’t:

  • Provide insight into all online channels
  • Enable you to bring data from other systems into your web analytics platform
  • Pass Web Analytics data to other systems.
  • Manipulate data by looking up values, resolving urls, and decoding parameters

And do all of this at a reasonable cost in a maintainable way using in-house resources, then you may have outgrown your web analytics solution.

  • Cost.  Web analytics done right isn’t cheap.  It costs money to maintain and extend whether you run an in-house solution or external solution.

When running an in-house web analytics costs are spread out across hardware and software and resources:

  • Software license.
  • Recurring maintenance costs.
  • Servers (one or more).  Perhaps you virtualize (it cost money for the virtualization software).
  • Database license(s).
  • Storage.
  • IT resources - people like project managers, application engineers, and dba’s.

As you expand your web analytics operation, all of these resources and technologies will need to scale.  Time will need to be devoted to maintaining it all, and time costs salary dollars. 

Your company will, hopefully, grow.  Then you will have more sites.  These will need to be tracked.  New reports will need to be created, tested, and rolled out into production.  New data and systems integration requirements will spring up.  All this has cost.

On the other hand, if you are using a hosted solution, you will need to extend the page tag and tool configuration when you want use more features or integrate systems and data.  That means spending money to use vendor professional services or consultants, unless you want to dedicate internal resources in IT who may already be overburdened.

At some point you say enough is enough. The COST is too much!  You then decide to invest in well-negotiated vendor solution that provides a lower TCO over some horizon.  When you start to run up against the barrier of cost, you may have outgrown your web analytics solution.

So that’s the list.  I can think of many more reasons why companies outgrow their web analytics tools…  What have I omitted?  What do you think?

  • web_analytics_cost1.jpg

The Yin and Yang of Online Metrics: Audience Measurement and Web Analytics

I write a monthly column for Mediapost’s Metrics Insider.  This month I wanted to talk about the different schools of thought in online metrics because at the end of the day we are all in Internet measurement together. Hope you enjoy the read:

Audience measurement and Web analytics systems are like the yin and yang of online metrics. Yin and yang are different, opposing forces, but they also complement each other. Think of Web analytics and audience measurement data in the same way: different, sometimes in opposition, but complementary.

The major difference between these systems is data collection:

  • Audience measurement companies don’t collect data directly from the sites being measured. They all rely on proprietary methods. Hitwise gets data from ISPs. Compete uses a toolbar that you can download as well as ISP and panel information. Nielsen and comScore use data collected from panels to create online metrics that they believe accurately represent overall Internet usage. Due to all these different data collection methods and no shared standards across companies, metrics from audience measurement firms are never identical with each other.
  • In Web analytics, data is collected directly from actual site activity. Methods include client-side data collection via javascript page tagging, server-side data collection via log file processing, or network data collection via packet sniffing. Sometimes methods such as page tagging and log file processing are combined in what’s called “hybrid data collection.” Vendors include Coremetrics, Webtrends, Unica, Visual Sciences, Omniture, Google, and others. The challenge with Web analytics tools is that each tool will calculate different numbers from the same source for identical metrics. In other words, Omniture numbers won’t match Google’s. That’s because each tool has its own “secret sauce” for “sessionization” — the fancy term for the way metrics are counted and measured by analytics technology. For example, certain tools may be configured to include or exclude certain filetypes or server responses. Robotic traffic may or may not be filtered.

It’s worth noting that a company named Quantcast uses panel data and also enables a site to add page tags to collect actual site data, which are then merged together in a completely different type of “hybrid” model.

All these different approaches to data collection lead to opposition when these systems are used for the same purpose. For example, conflict arises between the yin and yang when identifying reach using unique visitor metrics. Audience measurement firms may cry “cookie deletion” when analytics tools are used to count unique visitors, and Web analytics firms may shout back “coverage error” and “selection bias” at the unique visitor numbers from panel-based firms. Another area of opposition is demographics. I’ve been told that only audience measurement firms provide demographic data, and that you can’t get demographic data from Web analytics systems. That’s not true at all.

All enterprise-level Web analytics systems provide demographic location information at the country, city, state, and MSA levels. This information will be different than that provided by audience measurement companies.

Demographics that are harder to elicit from a Web analytics system, but are easily provided by audience measurement, include attributes like a visitor’s age, gender, occupation, income, and education.

But it is possible to integrate very detailed demographic attributes per visitor into a Web analytics system! Once demographic information is captured in a registration database, it can be joined with behavioral data in the Web analytics system and reported on. For a real-world example of analytics/demographic integration, take a look at what Microsoft is doing with Gatineau, the company’s free Web analytics offering currently in beta. Microsoft is joining Web site behavioral data with rich demographic data from MS Live profiles.

Even with differences and oppositions between these online metrics systems, companies find ways to use the data in complementary ways:

  • Audience measurement data is useful for competitive intelligence. All the paid and free services provide data for comparing the performance of a site to other sites, for understanding audience behavior across one or more sites by demographics, and for understanding generalized Internet traffic trends and search terms.
  • Web analytics data is useful for understanding site effectiveness, for defining key performance indicators, for determining conversion rates for marketing campaigns by channel (such as search, email, rss), for understanding what sites and keywords are driving traffic to your site, and for segmenting and reporting online metrics.

You can even use both data sources as part of the same site optimization activity. For example, you could use audience measurement data to determine that a competitor is gaining ground on a particular product or search term. Then you could look at your Web analytics tool to see how you’re doing for the same term and how visitors who searched for that keyword behave on your site. You may find a high bounce rate and low conversion rate for the keyword, so you segment that data perhaps by demographics! Next you suggest a hypothesis to minimize bounce and maximize conversion for each segment. Then you test your hypothesis, and reexamine the data. Based on the results, you then continuously improve your online performance through controlled experimentation. At the end of the day, you will drive more online revenue by understanding how the yin of audience measurement and the yang of Web analytics complement each other, than by worrying about how they differ and oppose.

yin_yang.jpg

Web Analytics and Targeting: A Quick Blogviation

Targeting refers to the process of identifying characteristics of a segment so that relevant content may be matched to it and delivered at a time when the segment is most open to the message. The idea is the right content to the right visitor at the right time (optimally in real time). 

For example, you may visit a site, and see some type of ad unit calling out at you to “meet singles in <insert_your_city>.” When browsing real estate you may see ad units for realtors and mortgage companies.  After entering a keyword such as “car prices” and clickingthrough the SERP, you may see an ad for a local car dealer.   That’s targeting in a nutshell.  It’s simple: 

  1. Visitor X has these attributes. 
  2. We have content that we think will appeal to Vistor X’s attributes. 
  3. Let’s show that content. 

While targeting has helped to increase ad clickthrough rates, it’s far from an ideal science.  Current methods for targeting have inefficiencies.  What if Visitor X just bought a new car after his recent marriage?  Unless the targeting engine is made aware of the visitor’s current state, the targeting may be off and not yield desired results. 

Even with limitations around “current awareness” targeting is perceived in the Internet industry as a crucial activity for maximizing the effectiveness of advertising and content.  Targeting is the next stage after A/B and multivariate testing.  Once you determine the preference of segments based on testing, you identify content to target. 

In new media, targeting is something associated with paid search campaigning, ad serving, and content optimization.  It’s not uncommon for targeting activities to be based on:

  • Category and sub-category.  Conceptual constructs like “categories” of topics on a media web site or products on an ecommerce site can be targeted to include certain types of ads or messages.  The notion of a “zone” fits in here as well.  The idea is that if visitors are browsing in your category for “hardware floors” you could offer them an ad or content specific to “flooring installation services.” 
  • Geography.  Country, region, city, state, DMA are all targetable constructs.  You may choose target people surfing from 02141 (Cambridge, MA) an ad for pre-sale Red Sox tix or content about Mike Lowell’s recent contract.
  • Browsing environment such as the connection speed, type of browser, operating system, user software, domain, and ISP.  An ad network serves an ad for Verizon DSL to a modem-based surfer by detecting the visitor’s browsing environment.
  • Time.  The idea of only showing content during specific periods of time is called “parting.”  Common types include day-parting and season-parting.  For example, a B2B site only choosing to show ads for a