acme (email not shown publicly)
http://www.astray.com/
Leon Brocard (aka acme) is an orange-loving Perl eurohacker with many varied contributions to the Perl community, including the GraphViz module on the CPAN. YAPC::Europe was all his fault. He is still looking for a Perl Monger group he can start which begins with the letter 'D'.
It's awful wordy (Score:1)
Re:It's awful wordy (Score:4, Insightful)
RDF may be more suitable and appropriate for aggregation of the various metadata files relating to a single distribution. Much of it will be primarily for PAUSE and the indexers like search.cpan and the various tools people already use like cpan.pm so users generally won't ever need to look at the raw metadata unless they really want to.
Reply to This
Parent
Re:It's awful wordy (Score:2)
So an interesting triple would be "LBROCARD" "is the author of" "Acme-Buffy-1.2". Or, in the RDF fragment about Acme-Buffy-1.2: "<cpan:id>LBROCARD</cpan:id>". Noti
Re:It's awful wordy (Score:1)
Identifying people and things (Score:1)
That's what RDF vocabularies are for.
If you stick
xmlns:foaf="http://xmlns.com/foaf/0.1/"
in the RDF declaration, that lets you do something like this:
<cpan:author>s um>
<cpan:id>EMARTIN</cpan:id>
<foaf:name>Earle Martin</foaf:name>
<foaf:mbox_sha1sum>8699ba79a95abf86e0055c133bf5d87ceab921e9</foaf:mbox_sha1
</cpan:author>
Of course, there's going to have to be a CPAN vocabula
Re:Identifying people and things (Score:1)
<foo>represents a FOO in the real world, and to write the code to handle FOOs correctly. That is, it requires just as much work as understanding what 'author' means in a structure such as:$VAR1 = { 'author' => 'Sheerluck Holmes', 'title' => 'true crimes and how to avoid them' }or a YAML equivalent.
Using XML-ish things does not help to define what your data is, regardless of what it says on the bottle of Kool-aid.
Re:Identifying people and things (Score:1)
Oh, OK, maybe I didn't follow your meaning. I wasn't meaning to imply that using RDF (and in the vocabulary itself, OWL [w3.org]) would actually define what the data is. But yes, isn't that always going to be the case, until we have smart computers? At the moment, the closest thing to "encapsulated meaning" we have is Cyc [opencyc.org], and that's a long way off from being the real thing. RDF vocabularies, as you say, are good for defining relationships between things.
I don't think, though, that RDF was ever intended to be hu
Re:Identifying people and things (Score:1)
Re:Identifying people and things (Score:1)
sky
rdf/cpan (Score:4, Interesting)
Wow, I really like this idea. Is the idea to serialize CPAN metadata in a similar way to how the Open Directory Project [dmoz.org] makes their data [dmoz.org] available? Speaking as an ex-librarian, your use of RDF and DublinCore is commendable. People in the library and information science communities have been getting all excited about RDF and DublinCore for years, and it's is very cool to see someone putting it to practical use. I bet the the semantic web folks [w3.org] would also be very interested to hear about your experiments.
On a somewhat related note: while it's a kind of eclectic the Open Archives Initiative [openarchives.org] has developed a protocol [openarchives.org] for sharing large sets of metadata. The OAI-PMH provides a very simple framework for building data providers and data harvesters using a set of 6 verbs over XML/HTTP: Identify(), ListIdentifiers(), GetRecord(), ListmetadataFormats(), ListRecords(), ListSets(). While it might not be of direct use, it could be of interest if you are looking for ideas on how to allow people to update their local copies of CPAN metadata without grabbing the whole lot each time. The OAI-PMH has its roots in the arxiv [arxiv.org] pre-print server at Los Alamos, and is currently being used by quite a mix [arxiv.org] of data providers. Oh, and I wrote Net::OAI::Harvester [cpan.org] for interacting with repositories :-)
Reply to This
Using RDF (Score:3, Informative)
The RDF format is rather, um, ugly to behold. It's good for interchange between apps, but greatly obfuscates the meaning for wetware parsers. I think the following is a faithful interpretation of the above example in Notation 3 [w3.org]:
Here are some important elements that are missing but should be trivial to add:- Author ID
- DSLIP values
- MD5 Checksum
- Module Prerequisites (as determined by Meta.yml or whatnot)
- Minimum Perl version required
Nevertheless, this snippet of RDF is a very good start. Thanks!Reply to This
Re:Using RDF (Score:2)
RDF/YAML (Score:1)
Re:RDF/YAML (Score:2)
Re:RDF/YAML (Score:2)
My cat would argue even more strongly that we should design a database schema and shove all the data into {SQLite|MySQL|PostgreSQL}. Even Cats can understand third normal form. ;-)
The one thing we really need is to agree on the triples and the meaning of the assertions that describe CPAN metadata. Everything else is just syntax. Mapping from one syntax or another (or deeming one syntax "preferred") is an e
About (Score:1)