<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Why Carry the Cost of Linked Data?</title>
	<atom:link href="http://tomheath.com/blog/2010/06/why-carry-the-cost-of-linked-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://tomheath.com/blog/2010/06/why-carry-the-cost-of-linked-data/</link>
	<description></description>
	<lastBuildDate>Thu, 30 Jun 2011 10:54:54 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=</generator>
	<item>
		<title>By: Ed Summers</title>
		<link>http://tomheath.com/blog/2010/06/why-carry-the-cost-of-linked-data/comment-page-1/#comment-25725</link>
		<dc:creator>Ed Summers</dc:creator>
		<pubDate>Mon, 21 Jun 2010 01:55:23 +0000</pubDate>
		<guid isPermaLink="false">http://tomheath.com/blog/?p=166#comment-25725</guid>
		<description>Not sure if this was covered in any of the comments already ... but there are also costs associated with the consumer: csv loads pretty nice into excel. What does the average joe load RDF into?</description>
		<content:encoded><![CDATA[<p>Not sure if this was covered in any of the comments already &#8230; but there are also costs associated with the consumer: csv loads pretty nice into excel. What does the average joe load RDF into?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Confluence: Ãrea de TecnologÃ­a</title>
		<link>http://tomheath.com/blog/2010/06/why-carry-the-cost-of-linked-data/comment-page-1/#comment-25713</link>
		<dc:creator>Confluence: Ãrea de TecnologÃ­a</dc:creator>
		<pubDate>Sun, 20 Jun 2010 21:50:14 +0000</pubDate>
		<guid isPermaLink="false">http://tomheath.com/blog/?p=166#comment-25713</guid>
		<description>&lt;strong&gt;Argumentario Linked Data...&lt;/strong&gt;

Argumentario APIs Vs Linked Data...</description>
		<content:encoded><![CDATA[<p><strong>Argumentario Linked Data&#8230;</strong></p>
<p>Argumentario APIs Vs Linked Data&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Scott Banwart&#39;s Blog &#187; Blog Archive &#187; Distributed Weekly 55</title>
		<link>http://tomheath.com/blog/2010/06/why-carry-the-cost-of-linked-data/comment-page-1/#comment-25614</link>
		<dc:creator>Scott Banwart&#39;s Blog &#187; Blog Archive &#187; Distributed Weekly 55</dc:creator>
		<pubDate>Fri, 18 Jun 2010 14:53:23 +0000</pubDate>
		<guid isPermaLink="false">http://tomheath.com/blog/?p=166#comment-25614</guid>
		<description>[...] Why Carry the Cost of Linked Data? [...]</description>
		<content:encoded><![CDATA[<p>[...] Why Carry the Cost of Linked Data? [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kingsley Idehen</title>
		<link>http://tomheath.com/blog/2010/06/why-carry-the-cost-of-linked-data/comment-page-1/#comment-25577</link>
		<dc:creator>Kingsley Idehen</dc:creator>
		<pubDate>Thu, 17 Jun 2010 15:51:56 +0000</pubDate>
		<guid isPermaLink="false">http://tomheath.com/blog/?p=166#comment-25577</guid>
		<description>Just realized this post would benefit from a link to the Wikipedia article about: EAV/CR [1].

Link:

1. http://en.wikipedia.org/wiki/Entity-attribute-value_model

Kingsley</description>
		<content:encoded><![CDATA[<p>Just realized this post would benefit from a link to the Wikipedia article about: EAV/CR [1].</p>
<p>Link:</p>
<p>1. <a href="http://en.wikipedia.org/wiki/Entity-attribute-value_model" rel="nofollow">http://en.wikipedia.org/wiki/Entity-attribute-value_model</a></p>
<p>Kingsley</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Matthias Samwald</title>
		<link>http://tomheath.com/blog/2010/06/why-carry-the-cost-of-linked-data/comment-page-1/#comment-25561</link>
		<dc:creator>Matthias Samwald</dc:creator>
		<pubDate>Thu, 17 Jun 2010 13:19:28 +0000</pubDate>
		<guid isPermaLink="false">http://tomheath.com/blog/?p=166#comment-25561</guid>
		<description>Somehow I am in the mood for playing devil&#039;s advocate today...

*) You take into account that choosing linked data over CSV comes with added costs for the publisher. You omit that in the current state of the world, it comes with added costs for the consumers as well. Most developers don&#039;t know much about RDF and surrounding tools and standards, so they have to learn about it in order to consume your dataset. These costs can easily outweigh potential benefits. Of course, the mission of the linked data community is to change that fact by popularizing RDF technologies and standards, so that might not be true anymore 5 years from now.
*) I guess interlinking could also be partially be tackled by adopting simple conventions for adding URLs and URIs as values ins CSV rows. For example, referencing Wikipedia URLs. CSV files and rows in these files could then be related to each other via shared Wikipedia references. This would not be that different from the current Linked Data cloud, where DBpedia acts as a central hub for many other datasets.




Cheers,
Matthias</description>
		<content:encoded><![CDATA[<p>Somehow I am in the mood for playing devil&#8217;s advocate today&#8230;</p>
<p>*) You take into account that choosing linked data over CSV comes with added costs for the publisher. You omit that in the current state of the world, it comes with added costs for the consumers as well. Most developers don&#8217;t know much about RDF and surrounding tools and standards, so they have to learn about it in order to consume your dataset. These costs can easily outweigh potential benefits. Of course, the mission of the linked data community is to change that fact by popularizing RDF technologies and standards, so that might not be true anymore 5 years from now.<br />
*) I guess interlinking could also be partially be tackled by adopting simple conventions for adding URLs and URIs as values ins CSV rows. For example, referencing Wikipedia URLs. CSV files and rows in these files could then be related to each other via shared Wikipedia references. This would not be that different from the current Linked Data cloud, where DBpedia acts as a central hub for many other datasets.</p>
<p>Cheers,<br />
Matthias</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kingsley Idehen</title>
		<link>http://tomheath.com/blog/2010/06/why-carry-the-cost-of-linked-data/comment-page-1/#comment-25532</link>
		<dc:creator>Kingsley Idehen</dc:creator>
		<pubDate>Wed, 16 Jun 2010 20:21:58 +0000</pubDate>
		<guid isPermaLink="false">http://tomheath.com/blog/?p=166#comment-25532</guid>
		<description>As per my response on Twitter. My focus is demystification of Linked Data. To achieve this goal I am rewinding the Linked Data story back to the very beginning. Positioning RDF as a Data Model (rather than Markup for a Data Model) is problematic. Our 12 year comprehension odyssey is living proof.

Inserting EAV into the Linked Data conversation is about creating foundation for understanding roots of the model that drives RDF data representation formats. Basically, how its been enhanced by the addition of URIs re., basic RDF and generic HTTP URIs re. RDF based Linked Data. 

Kingsley</description>
		<content:encoded><![CDATA[<p>As per my response on Twitter. My focus is demystification of Linked Data. To achieve this goal I am rewinding the Linked Data story back to the very beginning. Positioning RDF as a Data Model (rather than Markup for a Data Model) is problematic. Our 12 year comprehension odyssey is living proof.</p>
<p>Inserting EAV into the Linked Data conversation is about creating foundation for understanding roots of the model that drives RDF data representation formats. Basically, how its been enhanced by the addition of URIs re., basic RDF and generic HTTP URIs re. RDF based Linked Data. </p>
<p>Kingsley</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John S. Erickson, Ph.D.</title>
		<link>http://tomheath.com/blog/2010/06/why-carry-the-cost-of-linked-data/comment-page-1/#comment-25521</link>
		<dc:creator>John S. Erickson, Ph.D.</dc:creator>
		<pubDate>Wed, 16 Jun 2010 18:15:46 +0000</pubDate>
		<guid isPermaLink="false">http://tomheath.com/blog/?p=166#comment-25521</guid>
		<description>Regarding Kingsley&#039;s comment concerning EAV: I think we all get that &lt;a href=&quot;http://en.wikipedia.org/wiki/Entity-attribute-value_model&quot; rel=&quot;nofollow&quot;&gt;EAV&lt;/a&gt; has historical precedence and is a more fundamental model than RDF, but for practical purposes &lt;i&gt;why should adopters concern themselves with it?&lt;/i&gt;

To my knowledge EAV has no equivalent community like &lt;a href=&quot;http://openrdf.org&quot; rel=&quot;nofollow&quot;&gt;OpenRDF.org&lt;/a&gt; supporting an open source reference implementation; there is no Python library like &lt;a href=&quot;http://www.rdflib.net/&quot; rel=&quot;nofollow&quot;&gt;RDFlib&lt;/a&gt; for working with EAV; there is no standardized set of attributes like &lt;a href=&quot;http://www.w3.org/TR/xhtml-rdfa-primer/&quot; rel=&quot;nofollow&quot;&gt;RDFa&lt;/a&gt; for embedding assertions within Web documents...

You&#039;ve stated above, &lt;blockquote&gt;...EAV model data can be expressed in a myriad of representation formats (which includes the RDF family)...&lt;/blockquote&gt;

From a practical standpoint, what are those formats and why should adopters care?</description>
		<content:encoded><![CDATA[<p>Regarding Kingsley&#8217;s comment concerning EAV: I think we all get that <a href="http://en.wikipedia.org/wiki/Entity-attribute-value_model" rel="nofollow">EAV</a> has historical precedence and is a more fundamental model than RDF, but for practical purposes <i>why should adopters concern themselves with it?</i></p>
<p>To my knowledge EAV has no equivalent community like <a href="http://openrdf.org" rel="nofollow">OpenRDF.org</a> supporting an open source reference implementation; there is no Python library like <a href="http://www.rdflib.net/" rel="nofollow">RDFlib</a> for working with EAV; there is no standardized set of attributes like <a href="http://www.w3.org/TR/xhtml-rdfa-primer/" rel="nofollow">RDFa</a> for embedding assertions within Web documents&#8230;</p>
<p>You&#8217;ve stated above,<br />
<blockquote>&#8230;EAV model data can be expressed in a myriad of representation formats (which includes the RDF family)&#8230;</p></blockquote>
<p>From a practical standpoint, what are those formats and why should adopters care?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tony Hirst</title>
		<link>http://tomheath.com/blog/2010/06/why-carry-the-cost-of-linked-data/comment-page-1/#comment-25518</link>
		<dc:creator>Tony Hirst</dc:creator>
		<pubDate>Wed, 16 Jun 2010 17:28:48 +0000</pubDate>
		<guid isPermaLink="false">http://tomheath.com/blog/?p=166#comment-25518</guid>
		<description>I think one of the things you get in a CSV data set is a view over a set of data.
When I discover a CSV data set, via Google or wherever (;-), using human search terms, I may or may not get some of the following:

- the data I was looking for;
- an identifiable source;
- related data (so eg I search for population of london and get a dataset that contains the population of other places too).

To the extent that a CSV table is a full text searchable file, if it has been indexed I can throw in my arbitrary and possibly not very good search terms, and maybe the index will throw a dataset back (eg as a CSV file) that contains the datum I want.

In terms of discovering the data, this is really hit and miss, really flaky, and dead simple. 

Cf. how many people would use Google books or Amazon&#039;s one search box rather than enter the right things in the right boxes in a Library OPAC (or hack an even purer Z39.50 query from a terminal command line (can you even do this?!))

I know I come coming across as anti-SPARQL and anti-Linked Data, but I&#039;m not. I just think that the barriers to entry are too many at the moment for enough people to see how they might benefit...

IMHO, if the datastores started publishing interesting and maybe frequently requested data views as full text searchable and discoverable csv, and alongside linking to a sparql query that would return that same data in a richer form, it would help....

Then I as a novice an stumble across the thing I was maybe looking clumsily for, do what I want with it, maybe realise it&#039;s not quite what I want see the Sparql, not understand a world of what it says but see one part of it that I can guess at changing (hmmm.... /bluerghhh London wibble/.... I wonder if /bluerghhh Manchester wibble/ works too...? Hmmm, looks like between two dates? what happens if i chance those dates... and so on...)

I think in terms of weird surface areas relating to discoverability and usability of this stuff. CSV is discoverable over a large area, (search engines can handle it, local datastores, etc); it&#039;s useable over a wide area (lots of tools import it, lots of peope use tools that can import it).

You&#039;re a scientist, I&#039;m a pragmatist. But you&#039;re maybe thinking a little bit too like  the librarians (we&#039;ll be having no tags here, we just want proper controlled vocabularies) as they upheld the proper way of doing things and they users Googled ?

Doh! I don&#039;t mean to say Linked Data is not the way forward, I just don&#039;t think it&#039;s yet a representation that large numbers of people would feel comfortable or capable of working with, given what they currently know, what they currently do, and they culturally currently do it...</description>
		<content:encoded><![CDATA[<p>I think one of the things you get in a CSV data set is a view over a set of data.<br />
When I discover a CSV data set, via Google or wherever (;-), using human search terms, I may or may not get some of the following:</p>
<p>- the data I was looking for;<br />
- an identifiable source;<br />
- related data (so eg I search for population of london and get a dataset that contains the population of other places too).</p>
<p>To the extent that a CSV table is a full text searchable file, if it has been indexed I can throw in my arbitrary and possibly not very good search terms, and maybe the index will throw a dataset back (eg as a CSV file) that contains the datum I want.</p>
<p>In terms of discovering the data, this is really hit and miss, really flaky, and dead simple. </p>
<p>Cf. how many people would use Google books or Amazon&#8217;s one search box rather than enter the right things in the right boxes in a Library OPAC (or hack an even purer Z39.50 query from a terminal command line (can you even do this?!))</p>
<p>I know I come coming across as anti-SPARQL and anti-Linked Data, but I&#8217;m not. I just think that the barriers to entry are too many at the moment for enough people to see how they might benefit&#8230;</p>
<p>IMHO, if the datastores started publishing interesting and maybe frequently requested data views as full text searchable and discoverable csv, and alongside linking to a sparql query that would return that same data in a richer form, it would help&#8230;.</p>
<p>Then I as a novice an stumble across the thing I was maybe looking clumsily for, do what I want with it, maybe realise it&#8217;s not quite what I want see the Sparql, not understand a world of what it says but see one part of it that I can guess at changing (hmmm&#8230;. /bluerghhh London wibble/&#8230;. I wonder if /bluerghhh Manchester wibble/ works too&#8230;? Hmmm, looks like between two dates? what happens if i chance those dates&#8230; and so on&#8230;)</p>
<p>I think in terms of weird surface areas relating to discoverability and usability of this stuff. CSV is discoverable over a large area, (search engines can handle it, local datastores, etc); it&#8217;s useable over a wide area (lots of tools import it, lots of peope use tools that can import it).</p>
<p>You&#8217;re a scientist, I&#8217;m a pragmatist. But you&#8217;re maybe thinking a little bit too like  the librarians (we&#8217;ll be having no tags here, we just want proper controlled vocabularies) as they upheld the proper way of doing things and they users Googled ?</p>
<p>Doh! I don&#8217;t mean to say Linked Data is not the way forward, I just don&#8217;t think it&#8217;s yet a representation that large numbers of people would feel comfortable or capable of working with, given what they currently know, what they currently do, and they culturally currently do it&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John S. Erickson, Ph.D.</title>
		<link>http://tomheath.com/blog/2010/06/why-carry-the-cost-of-linked-data/comment-page-1/#comment-25512</link>
		<dc:creator>John S. Erickson, Ph.D.</dc:creator>
		<pubDate>Wed, 16 Jun 2010 15:31:34 +0000</pubDate>
		<guid isPermaLink="false">http://tomheath.com/blog/?p=166#comment-25512</guid>
		<description>Great post, Tom!

In my view the question is analogous to that of (web) API design: ultimately, to be success, it must be &lt;i&gt;usable!&lt;/i&gt; 

In his great Javapolis keynote, &lt;a href=&quot;http://bit.ly/c9AgpL&quot; rel=&quot;nofollow&quot;&gt; How to Design a Good API and Why it Matters&lt;/a&gt; Josh Bloch of Google laid out these characteristics of a &quot;Good API&quot;:

&lt;blockquote cite=&quot;http://bit.ly/c9AgpL&quot;&gt;
â€¢ Easy to learn
â€¢ Easy to use, even without documentation
â€¢ Hard to misuse
â€¢ Easy to read and maintain code that uses it
â€¢ Sufficiently powerful to satisfy requirements
â€¢ Easy to extend
â€¢ Appropriate to audience
&lt;/blockquote&gt;

Bloch argues that with these properties in mind, a public API can be an organization&#039;s &quot;greatest asset.&quot; I would argue that for data-intensive organizations, datasets published as &lt;a href=&quot;http://linkeddata.org&quot; rel=&quot;nofollow&quot;&gt;linked data&lt;/a&gt; with these same characteristic could well be &lt;i&gt;their&lt;/i&gt; most valuable asset!</description>
		<content:encoded><![CDATA[<p>Great post, Tom!</p>
<p>In my view the question is analogous to that of (web) API design: ultimately, to be success, it must be <i>usable!</i> </p>
<p>In his great Javapolis keynote, <a href="http://bit.ly/c9AgpL" rel="nofollow"> How to Design a Good API and Why it Matters</a> Josh Bloch of Google laid out these characteristics of a &#8220;Good API&#8221;:</p>
<blockquote cite="http://bit.ly/c9AgpL"><p>
â€¢ Easy to learn<br />
â€¢ Easy to use, even without documentation<br />
â€¢ Hard to misuse<br />
â€¢ Easy to read and maintain code that uses it<br />
â€¢ Sufficiently powerful to satisfy requirements<br />
â€¢ Easy to extend<br />
â€¢ Appropriate to audience
</p></blockquote>
<p>Bloch argues that with these properties in mind, a public API can be an organization&#8217;s &#8220;greatest asset.&#8221; I would argue that for data-intensive organizations, datasets published as <a href="http://linkeddata.org" rel="nofollow">linked data</a> with these same characteristic could well be <i>their</i> most valuable asset!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kingsley Idehen</title>
		<link>http://tomheath.com/blog/2010/06/why-carry-the-cost-of-linked-data/comment-page-1/#comment-25510</link>
		<dc:creator>Kingsley Idehen</dc:creator>
		<pubDate>Wed, 16 Jun 2010 14:48:54 +0000</pubDate>
		<guid isPermaLink="false">http://tomheath.com/blog/?p=166#comment-25510</guid>
		<description>Tom,

To cut a long story short. RDF formats provide one of several vehicles for publishing Linked Data on public or private HTTP based networks. 

RDF isn&#039;t accepted as a Data Model, its a Data Representation mechanism (Markup). The base Data Model is EAV (with EAV/CR when going into the realms of TBox and ABox partitioning). Linked Data adds the use of HTTP based URIs for Names re. Entity, Entity Attribute, and Entity Attribute Value slots re. the base models 3-tuple structure.

EAV model data can be expressed in a myriad of representation formats (which includes the RDF family).

RDF has its own merits wrt. to real object modeling fidelity. Thus, we don&#039;t need to conflate Data Representation and Data Model in order for RDF to succeed, based on its on tangible virtues. 

Important Note:
TimBL&#039;s original design issues doc never made mention of RDF or SPARQL as mandatory. Even today, I strongly believe his current edition  references RDF and SPARQL (in parenthesis) as examples of relevant standards that aid publication of Linked Data.

Links:

1. http://bit.ly/cA0zxw -- Data 3.0 Manifesto (decouples RDF from Linked Data).</description>
		<content:encoded><![CDATA[<p>Tom,</p>
<p>To cut a long story short. RDF formats provide one of several vehicles for publishing Linked Data on public or private HTTP based networks. </p>
<p>RDF isn&#8217;t accepted as a Data Model, its a Data Representation mechanism (Markup). The base Data Model is EAV (with EAV/CR when going into the realms of TBox and ABox partitioning). Linked Data adds the use of HTTP based URIs for Names re. Entity, Entity Attribute, and Entity Attribute Value slots re. the base models 3-tuple structure.</p>
<p>EAV model data can be expressed in a myriad of representation formats (which includes the RDF family).</p>
<p>RDF has its own merits wrt. to real object modeling fidelity. Thus, we don&#8217;t need to conflate Data Representation and Data Model in order for RDF to succeed, based on its on tangible virtues. </p>
<p>Important Note:<br />
TimBL&#8217;s original design issues doc never made mention of RDF or SPARQL as mandatory. Even today, I strongly believe his current edition  references RDF and SPARQL (in parenthesis) as examples of relevant standards that aid publication of Linked Data.</p>
<p>Links:</p>
<p>1. <a href="http://bit.ly/cA0zxw" rel="nofollow">http://bit.ly/cA0zxw</a> &#8212; Data 3.0 Manifesto (decouples RDF from Linked Data).</p>
]]></content:encoded>
	</item>
</channel>
</rss>

