Harvesting entry links into RDF

December 30th, 2002  |  Published in rdf

Continuing the theme of harvesting information from MT entries into RDF summaries, I wrote another MT plugin (src) that uses HTML::LinkExtor to extract <a> links.
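To give a feel for what the extraction step does, here is a minimal sketch in Python rather than the plugin's Perl. It is not the plugin source: it uses the standard library's `html.parser` as a stand-in for HTML::LinkExtor, and the names `AnchorLinkParser` and `extract_links` are made up for illustration.

```python
from html.parser import HTMLParser


class AnchorLinkParser(HTMLParser):
    """Collect the href value of every <a> tag, in document order."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling us.
        if tag != "a":
            return
        for name, value in attrs:
            # Skip anchors without an href (e.g. <a name="...">).
            if name == "href" and value:
                self.links.append(value)


def extract_links(entry_html):
    """Return the list of URLs linked from <a> tags in an entry body."""
    parser = AnchorLinkParser()
    parser.feed(entry_html)
    parser.close()
    return parser.links
```

Calling `extract_links` on an entry body returns the linked URLs in the order they appear, which is all the later steps need.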

The RDF for this entry now has an xlink:href triple for each link in the entry. I'm not sure whether that’s a reasonable use of the xlink vocabulary…
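As a rough picture of those triples, the sketch below builds plain (subject, predicate, object) tuples in Python. It assumes the entry's permalink is the subject, which this post does not actually state, and `link_triples` is a hypothetical helper. One thing worth knowing: RDF/XML forms a property URI by concatenating the namespace URI and the local name, and the XLink namespace has no trailing `#` or `/`, so the predicate comes out as a slightly odd-looking URI.

```python
# XLink namespace URI; note there is no trailing '#' or '/'.
XLINK_NS = "http://www.w3.org/1999/xlink"


def link_triples(entry_uri, links):
    """Build one (subject, predicate, object) tuple per distinct link.

    Assumption: the entry's own URI is the subject of each triple.
    """
    # Namespace + local name, as RDF/XML would expand xlink:href.
    predicate = XLINK_NS + "href"
    seen = set()
    triples = []
    for url in links:
        if url in seen:
            continue  # a repeated link adds no new triple
        seen.add(url)
        triples.append((entry_uri, predicate, url))
    return triples
```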

The point of the extraction is to build better “more like this” links between entries: two entries that link to the same URL are probably related in some way, so they should be linked together.
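The "same URL means related" idea can be sketched in a few lines of Python. This is only one possible approach, not something the plugin does today: the function name `related_entries` and the ranking by number of shared URLs are assumptions made for the example.

```python
from collections import defaultdict
from itertools import combinations


def related_entries(entry_links):
    """Map each entry to the other entries that share a linked URL with it.

    entry_links: dict of entry id -> list of URLs linked from that entry.
    Related entries are ordered by how many URLs they share (most first),
    with ties broken by entry id.
    """
    # Invert the mapping: URL -> set of entries that link to it.
    by_url = defaultdict(set)
    for entry, urls in entry_links.items():
        for url in set(urls):
            by_url[url].add(entry)

    # Count shared URLs for every pair of entries linking to the same URL.
    shared = defaultdict(lambda: defaultdict(int))
    for entries in by_url.values():
        for a, b in combinations(sorted(entries), 2):
            shared[a][b] += 1
            shared[b][a] += 1

    return {
        entry: sorted(others, key=lambda other: (-others[other], other))
        for entry, others in shared.items()
    }
```

Entries that share no links with anything else simply do not appear in the result, so a template could skip the "more like this" block for them.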

