Creating the take home database

From Knot Atlas
Revision as of 13:35, 18 July 2007 by Drorbn (talk | contribs) (→‎Quick start)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

This page is really only for Scott and Dror. It documents the procedure for creating RDF dumps from the Knot Atlas. If you're only interested in the actual data, go to The Take Home Database.

Quick start

  1. Go to /www/html/data/
  2. As root, run ./create-rdf.sh
  3. Get a coffee, or perhaps go to sleep, travel to a conference etc.
  4. Eventually, check that [1] has been created.
  5. Then re-make the smaller data files - type "make all".

What does this do?

XML dump

First, we dump the current version of every page in the Knot Atlas to XML, using the /w/maintenance/dumpBackup.php script from mediawiki. This produces the file katlas.xml.

Convert to RDF

This is actually a two step process. First, we create RDF statements in an on-disk RDF repository. Second, we dump these statements back to the file katlas.rdf

Cleaning up

The file katlas.xml gets deleted, as it's not so useful, and the file katlas.rdf gets gzipped to katlas.rdf.gz. It's then available at [2].