<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: &#8220;Anonymized&#8221; Data Really Isn&#8217;t</title>
	<atom:link href="http://www.discoveringidentity.com/2009/09/10/anonymized-data-really-isnt/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.discoveringidentity.com/2009/09/10/anonymized-data-really-isnt/</link>
	<description>Just another WordPress weblog</description>
	<lastBuildDate>Tue, 24 Aug 2010 03:29:46 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Mark Dixon</title>
		<link>http://www.discoveringidentity.com/2009/09/10/anonymized-data-really-isnt/comment-page-1/#comment-568</link>
		<dc:creator>Mark Dixon</dc:creator>
		<pubDate>Fri, 11 Sep 2009 04:20:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.discoveringidentity.com/2009/09/10/anonymized-data-really-isnt/#comment-568</guid>
		<description>&lt;p&gt;Thanks for sharing the story!  Just another example of the flaws of anonymizing.&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>Thanks for sharing the story!  Just another example of the flaws of anonymizing.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mike waddingham</title>
		<link>http://www.discoveringidentity.com/2009/09/10/anonymized-data-really-isnt/comment-page-1/#comment-567</link>
		<dc:creator>mike waddingham</dc:creator>
		<pubDate>Fri, 11 Sep 2009 04:08:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.discoveringidentity.com/2009/09/10/anonymized-data-really-isnt/#comment-567</guid>
		<description>&lt;p&gt;(Forgive me if this story is included in your posted articles... didn&#039;t have time to read through all.)&lt;/p&gt;
&lt;p&gt;My favorite tale related to this was the story of a Yahoo or Google database of search queries that was &#039;anonymized&#039; for research purposes.  The data provider took great pains to remove any identifying characteristics from the records, for example they replaced the ip address with a scrambled one.  The data went to a group of researchers, then the entire data set got posted online.&lt;/p&gt;
&lt;p&gt;Each anonymous user&#039;s data could still be grouped by the random ip address.&lt;/p&gt;
&lt;p&gt;After some days/weeks, the fatal flaw was found: users, while searching for all kinds of things, also did vanity searches on their names...  So, in the grouped data you would see the vanity searches -- with their name in clear text -- right next to some fetish or decidedly politically incorrect search item!&lt;/p&gt;
&lt;p&gt;Mike&lt;/p&gt;
</description>
		<content:encoded><![CDATA[<p>(Forgive me if this story is included in your posted articles&#8230; didn&#8217;t have time to read through all.)</p>
<p>My favorite tale related to this was the story of a Yahoo or Google database of search queries that was &#8216;anonymized&#8217; for research purposes.  The data provider took great pains to remove any identifying characteristics from the records, for example they replaced the ip address with a scrambled one.  The data went to a group of researchers, then the entire data set got posted online.</p>
<p>Each anonymous user&#8217;s data could still be grouped by the random ip address.</p>
<p>After some days/weeks, the fatal flaw was found: users, while searching for all kinds of things, also did vanity searches on their names&#8230;  So, in the grouped data you would see the vanity searches &#8212; with their name in clear text &#8212; right next to some fetish or decidedly politically incorrect search item!</p>
<p>Mike</p>
]]></content:encoded>
	</item>
</channel>
</rss>
