<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Data Quality Sucks, Let&#8217;s Just Get Over It</title>
	<atom:link href="http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html/feed" rel="self" type="application/rss+xml" />
	<link>http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html</link>
	<description>Pluralitas non est ponenda sine neccesitate.</description>
	<pubDate>Mon, 08 Sep 2008 12:44:45 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.1</generator>
		<item>
		<title>By: Trending Upward &#124; Are you a skeptic? Come on, I know you are!</title>
		<link>http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-468887</link>
		<dc:creator>Trending Upward &#124; Are you a skeptic? Come on, I know you are!</dc:creator>
		<pubDate>Wed, 06 Aug 2008 00:06:07 +0000</pubDate>
		<guid isPermaLink="false">http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-468887</guid>
		<description>[...] A lot of skepticism around analytics centers around the fact that the numbers are never perfect. Cookie deletion, cookie rejection, turning off javascript can all skew numbers. True. But, let’s heed Avanish Kaushik’s advice and get over it. Why? Because it doesn’t matter. Why doesn’t it matter? Because it’s *off* consistently across your site. That’s why trends are so important. You can argue that the statement, “I got 1,500 visits last month” may or may not be perfectly true. You can’t argue, however, the truth to the statement, “our visits are up 10% over last month.” See the difference? The 1,500 number doesn’t mean anything anyway, even if it were a concrete, true, golden number. Putting it in context with the previous month (or other date range) makes all the difference. You can do the same thing with all your other metrics. [...]</description>
		<content:encoded><![CDATA[<p>[...] A lot of skepticism around analytics centers around the fact that the numbers are never perfect. Cookie deletion, cookie rejection, turning off javascript can all skew numbers. True. But, let’s heed Avanish Kaushik’s advice and get over it. Why? Because it doesn’t matter. Why doesn’t it matter? Because it’s *off* consistently across your site. That’s why trends are so important. You can argue that the statement, “I got 1,500 visits last month” may or may not be perfectly true. You can’t argue, however, the truth to the statement, “our visits are up 10% over last month.” See the difference? The 1,500 number doesn’t mean anything anyway, even if it were a concrete, true, golden number. Putting it in context with the previous month (or other date range) makes all the difference. You can do the same thing with all your other metrics. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Gilligan on Data by Tim Wilson &#187; Blog Archive &#187; Capturing Web Traffic Data &#8212; Two Methods that Suck</title>
		<link>http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-398541</link>
		<dc:creator>Gilligan on Data by Tim Wilson &#187; Blog Archive &#187; Capturing Web Traffic Data &#8212; Two Methods that Suck</dc:creator>
		<pubDate>Wed, 02 Jan 2008 23:21:22 +0000</pubDate>
		<guid isPermaLink="false">http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-398541</guid>
		<description>[...] First off, it is largely a waste of time to try to completely reconcile data from two different web analytics tools. This post really isn’t about that. Mark Twain, Lee Segall, or perhaps someone else coined the saying, “A man with one watch knows what time it is; a man with two watches is never quite sure.” The same is true for web analytics. Thanks to different data capture methods, different data processing algorithms, different data storage schemas, and different definitions, no two tools running concurrently will ever report the same results. The good news, though, is that most tools will show very similar trends. WebTrends preaches, “in web analytics, it’s the trends that matter — that’s why it’s part of our name!” But, even in the broader web analytics community, this is widely accepted. Avinash Kaushik had a great post titled Data Quality Sucks, Let’s Just Get Over It way back in 2006, but it still applies. Read more there! [...]</description>
		<content:encoded><![CDATA[<p>[...] First off, it is largely a waste of time to try to completely reconcile data from two different web analytics tools. This post really isn’t about that. Mark Twain, Lee Segall, or perhaps someone else coined the saying, “A man with one watch knows what time it is; a man with two watches is never quite sure.” The same is true for web analytics. Thanks to different data capture methods, different data processing algorithms, different data storage schemas, and different definitions, no two tools running concurrently will ever report the same results. The good news, though, is that most tools will show very similar trends. WebTrends preaches, “in web analytics, it’s the trends that matter — that’s why it’s part of our name!” But, even in the broader web analytics community, this is widely accepted. Avinash Kaushik had a great post titled Data Quality Sucks, Let’s Just Get Over It way back in 2006, but it still applies. Read more there! [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Judah Phillips at Web Analytics Demystified &#187; Blog Archive &#187; Part 1: Your Web Analytics Data Quality Stinks and Here&#8217;s Why!</title>
		<link>http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-196582</link>
		<dc:creator>Judah Phillips at Web Analytics Demystified &#187; Blog Archive &#187; Part 1: Your Web Analytics Data Quality Stinks and Here&#8217;s Why!</dc:creator>
		<pubDate>Sat, 11 Aug 2007 16:56:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-196582</guid>
		<description>[...] Sampling, sampling, sampling.  My friend Avinash Kaushikdoes a good job covering issues around sampling in web analytics.  I recommend reading his post!  I&#8217;ll add that statistical methods applied to web analytics data are completely valid; however, sampling at the site, page, or database level opens the possibility that you miss key data.  Sites looking for the &#8220;long tail&#8221; of visitors and using data sampling or other data trimming methods may not find it.  [...]</description>
		<content:encoded><![CDATA[<p>[...] Sampling, sampling, sampling.  My friend Avinash Kaushikdoes a good job covering issues around sampling in web analytics.  I recommend reading his post!  I&#8217;ll add that statistical methods applied to web analytics data are completely valid; however, sampling at the site, page, or database level opens the possibility that you miss key data.  Sites looking for the &#8220;long tail&#8221; of visitors and using data sampling or other data trimming methods may not find it.  [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tooling Around on the IBM Information Server</title>
		<link>http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-91041</link>
		<dc:creator>Tooling Around on the IBM Information Server</dc:creator>
		<pubDate>Fri, 27 Apr 2007 04:33:54 +0000</pubDate>
		<guid isPermaLink="false">http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-91041</guid>
		<description>&lt;strong&gt;Blogtipping Day: fixing the world one data quality problem at a time...&lt;/strong&gt;

If you slapped the words "data quality" on a hotdog stand someone would offer to buy it before the end of the day.  Data quality tools are hot properties so I am BlogTipping three vendor blogs....</description>
		<content:encoded><![CDATA[<p><strong>Blogtipping Day: fixing the world one data quality problem at a time&#8230;</strong></p>
<p>If you slapped the words &#8220;data quality&#8221; on a hotdog stand someone would offer to buy it before the end of the day.  Data quality tools are hot properties so I am BlogTipping three vendor blogs&#8230;.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Web Analytics is so 2006</title>
		<link>http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-74480</link>
		<dc:creator>Web Analytics is so 2006</dc:creator>
		<pubDate>Mon, 02 Apr 2007 02:02:43 +0000</pubDate>
		<guid isPermaLink="false">http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-74480</guid>
		<description>[...] 
I&#8217;m hearing it all over. There&#8217;s a new day on the horizon, a day when we in the web world recognizes that none of this is really an exact science anyway, so why pretend? 

I’m hearing it all over. There’s a new day on the horizon, a day when we in the web world recognize that none of this is really an exact science anyway, so why pretend?

Enough with the weighted regressions and Taguchi Methods already. It’s time to take the anal out of analysis. Instead of Web Analytics, I propose Web Casualytics. Or Fuzzylytics maybe. Or Estimytics.
[...]</description>
		<content:encoded><![CDATA[<p>[...]<br />
I&#8217;m hearing it all over. There&#8217;s a new day on the horizon, a day when we in the web world recognizes that none of this is really an exact science anyway, so why pretend? </p>
<p>I’m hearing it all over. There’s a new day on the horizon, a day when we in the web world recognize that none of this is really an exact science anyway, so why pretend?</p>
<p>Enough with the weighted regressions and Taguchi Methods already. It’s time to take the anal out of analysis. Instead of Web Analytics, I propose Web Casualytics. Or Fuzzylytics maybe. Or Estimytics.<br />
[...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: People Over Process &#187; Blog Archive &#187; links for 2007-02-17</title>
		<link>http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-44427</link>
		<dc:creator>People Over Process &#187; Blog Archive &#187; links for 2007-02-17</dc:creator>
		<pubDate>Sat, 17 Feb 2007 07:26:12 +0000</pubDate>
		<guid isPermaLink="false">http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-44427</guid>
		<description>[...] Data Quality Sucks, Let’s Just Get Over It » Occam’s Razor by Avinash Kaushik &#8220;We should simply not expect that data quality will exist on the internet, atleast for now.&#8221; Nice list of &#8220;my butt hurts!&#8221; type excuses&#8230; (tags: via:mrchippy data integration databases web2.0 ajax stats) [...]</description>
		<content:encoded><![CDATA[<p>[...] Data Quality Sucks, Let’s Just Get Over It » Occam’s Razor by Avinash Kaushik &#8220;We should simply not expect that data quality will exist on the internet, atleast for now.&#8221; Nice list of &#8220;my butt hurts!&#8221; type excuses&#8230; (tags: via:mrchippy data integration databases web2.0 ajax stats) [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andy Choi</title>
		<link>http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-10239</link>
		<dc:creator>Andy Choi</dc:creator>
		<pubDate>Tue, 07 Nov 2006 18:41:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-10239</guid>
		<description>Insightful post! Just wonder if you have tried any of the data quality tools from the enterprise software world?</description>
		<content:encoded><![CDATA[<p>Insightful post! Just wonder if you have tried any of the data quality tools from the enterprise software world?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: this just in &#187; Please Configure Google Analytics Correctly</title>
		<link>http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-2606</link>
		<dc:creator>this just in &#187; Please Configure Google Analytics Correctly</dc:creator>
		<pubDate>Sat, 02 Sep 2006 02:46:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-2606</guid>
		<description>[...] We all know that web analytics data quality sucks and we&#8217;re all OK with that. Thanks Avinash! But there comes a point when the data quality is just abysmal. This usually means that there is something wrong with the web analytics application configuration. [...]</description>
		<content:encoded><![CDATA[<p>[...] We all know that web analytics data quality sucks and we&#8217;re all OK with that. Thanks Avinash! But there comes a point when the data quality is just abysmal. This usually means that there is something wrong with the web analytics application configuration. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Chris Furlong</title>
		<link>http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-493</link>
		<dc:creator>Chris Furlong</dc:creator>
		<pubDate>Fri, 30 Jun 2006 18:03:10 +0000</pubDate>
		<guid isPermaLink="false">http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-493</guid>
		<description>Excellent post. I wish that more people outside the analytics community understood that you are never going to get two different tools to give you the same numbers, and that your time is much better spent doing analysis.</description>
		<content:encoded><![CDATA[<p>Excellent post. I wish that more people outside the analytics community understood that you are never going to get two different tools to give you the same numbers, and that your time is much better spent doing analysis.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: this just in &#187; It&#8217;s The Trending</title>
		<link>http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-139</link>
		<dc:creator>this just in &#187; It&#8217;s The Trending</dc:creator>
		<pubDate>Mon, 05 Jun 2006 16:33:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-139</guid>
		<description>[...] Second, Avinash Kaushik has a fantastic blog post titled Data Quality Sucks, Let&#8217;?s Just Get Over It concerning the quality of analytics data and how to make informed decisions based on your comfort level with the data. He stresses that you will not get 100% accurate data but that doesn&#8217;t matter. You can still make informed decisions with the data that your analytics solution provides. It is a really great post and I highly recommend giving it a read. [...]</description>
		<content:encoded><![CDATA[<p>[...] Second, Avinash Kaushik has a fantastic blog post titled Data Quality Sucks, Let&#8217;?s Just Get Over It concerning the quality of analytics data and how to make informed decisions based on your comfort level with the data. He stresses that you will not get 100% accurate data but that doesn&#8217;t matter. You can still make informed decisions with the data that your analytics solution provides. It is a really great post and I highly recommend giving it a read. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Justin Cutroni</title>
		<link>http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-137</link>
		<dc:creator>Justin Cutroni</dc:creator>
		<pubDate>Mon, 05 Jun 2006 14:56:19 +0000</pubDate>
		<guid isPermaLink="false">http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-137</guid>
		<description>I couldn't agree more.  I can't tell you how many times our clients ask us to compare statistics from Analytics Package A to Analytics Package B.  It's a time consuming process that usually ends in failure.  We try to stress two things to people:

1.  Look at the trends in your data
2.  Make one change at a time so you can accurately measure the results.

Measuring change to your website is a scientific, controlled experiment.  If you start with a baseline statistic and implement controlled change then there will be little doubt as to the results of that change.  It doesn't matter if your baseline is off by 2, 5 or 10 percent from the actual numbers.  If you implement a single change then there will be little doubt of its effect when you're looking at the trends in your data.

I think &lt;a href="http://www.kaushik.net/avinash/2006/05/the-10-90-rule-for-magnificient-web-analytics-success.html#comment-26"&gt;Caleb&lt;/a&gt; touched on this in his comment a couple of weeks ago:

&lt;blockquote&gt;
Start with benchmarks, make a report that can be easily shared throughout the organization. Host a meeting to talk about it, then pick one area to take action on that will show some real results and give the credit to the people who actually made the changes (the IT, developers, designers, etc…)
&lt;/blockquote&gt;</description>
		<content:encoded><![CDATA[<p>I couldn&#8217;t agree more.  I can&#8217;t tell you how many times our clients ask us to compare statistics from Analytics Package A to Analytics Package B.  It&#8217;s a time consuming process that usually ends in failure.  We try to stress two things to people:</p>
<p>1.  Look at the trends in your data<br />
2.  Make one change at a time so you can accurately measure the results.</p>
<p>Measuring change to your website is a scientific, controlled experiment.  If you start with a baseline statistic and implement controlled change then there will be little doubt as to the results of that change.  It doesn&#8217;t matter if your baseline is off by 2, 5 or 10 percent from the actual numbers.  If you implement a single change then there will be little doubt of its effect when you&#8217;re looking at the trends in your data.</p>
<p>I think <a href="http://www.kaushik.net/avinash/2006/05/the-10-90-rule-for-magnificient-web-analytics-success.html#comment-26">Caleb</a> touched on this in his comment a couple of weeks ago:</p>
<blockquote><p>
Start with benchmarks, make a report that can be easily shared throughout the organization. Host a meeting to talk about it, then pick one area to take action on that will show some real results and give the credit to the people who actually made the changes (the IT, developers, designers, etc…)
</p></blockquote>
]]></content:encoded>
	</item>
	<item>
		<title>By: Laurent Flores</title>
		<link>http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-135</link>
		<dc:creator>Laurent Flores</dc:creator>
		<pubDate>Mon, 05 Jun 2006 09:33:13 +0000</pubDate>
		<guid isPermaLink="false">http://www.kaushik.net/avinash/2006/06/data-quality-sucks-lets-just-get-over-it.html#comment-135</guid>
		<description>Hi Avinash -

Not sure I have the ability to comment on the technical apsects of your post, but as you said the key is to recognise that data quality is never optimal...once, you know this, what I say to clients, and as decision maker myself, I'd rather know NOW 70 or 80% of the truth rather than 100% (knowing that truth is a "relative" concept of course), to take a decision... Too many "technically oriented people" forget this (I was probaly too much like this in the past, my intial training is in computer science as well), and at the end, the key is to decide and learn over time...specifically from mistakes...At a time where marketers finally recognize that a a lot consumer decisions are "irrational", data is good but so is Intuition...data dive helps shape the right intuition...to make the right decision...

Cheers
Laurent</description>
		<content:encoded><![CDATA[<p>Hi Avinash -</p>
<p>Not sure I have the ability to comment on the technical apsects of your post, but as you said the key is to recognise that data quality is never optimal&#8230;once, you know this, what I say to clients, and as decision maker myself, I&#8217;d rather know NOW 70 or 80% of the truth rather than 100% (knowing that truth is a &#8220;relative&#8221; concept of course), to take a decision&#8230; Too many &#8220;technically oriented people&#8221; forget this (I was probaly too much like this in the past, my intial training is in computer science as well), and at the end, the key is to decide and learn over time&#8230;specifically from mistakes&#8230;At a time where marketers finally recognize that a a lot consumer decisions are &#8220;irrational&#8221;, data is good but so is Intuition&#8230;data dive helps shape the right intuition&#8230;to make the right decision&#8230;</p>
<p>Cheers<br />
Laurent</p>
]]></content:encoded>
	</item>
</channel>
</rss>
