%META:TOPICPARENT{name="VirtBulkRDFLoader"}% ---+ Virtuoso Bulk Load Example: DBpedia data sets The following example demonstrates how to upload the DBpedia data sets into Virtuoso using the Bulk Loading Sequence. 1 Assuming there is a folder named "tmp" in your filesystem, and it is within a directory specified in the [[http://docs.openlinksw.com/virtuoso/databaseadmsrv.html#fp_acliniallowed][DirsAllowed]] param defined in your virtuoso.ini file. 1 Load the required DBpedia data sets into the "tmp" folder * The latest data sets can be downloaded from the [[http://wiki.dbpedia.org/Downloads][DBpedia Download]] page. Note the compressed bzip'ed ".bz2" data set files need to be uncompressed first as the bulk loader scripts only supports the auto extraction of gzip'ed ".gz" files. 1 If it hasn't already been, execute the [[VirtBulkRDFLoaderScript][Bulk Loading script]]. 1 Register the graph IRI under which the triples are to be loaded, e.g., "http://dbpedia.org": SQL> ld_dir ('tmp', '*.*', 'http://dbpedia.org'); Done. -- 90 msec. * Note that while this procedure will also work with gzip'ed files, it is important to keep the pattern: <name>.<ext>.gz, e.g., 'ontology.owl.gz' or ontology.nt.gz * Note that if there are other data files in your folder (tmp), then their content will also be loaded into the specified graph. %BR%%BR% 1 Create a file named global.graph in the "tmp" folder, with its entire content being the URI of the desired target graph, e.g., http://dbpedia.org 1 Finally, execute the rdf_loader_run procedure. This may take some time, depending on the size of the data sets. SQL> rdf_loader_run (); Done. -- 100 msec. 1 As a result, the Virtuoso log should contain notification that the loading has completed: 10:21:50 PL LOG: Loader started 10:21:50 PL LOG: No more files to load. Loader has finished 1 Run a checkpoint to commit all transactions to the database. SQL> checkpoint; Done. -- 53 msec. 1 To check the inserted triples for the given graph, execute a query similar to -- SQL> SPARQL SELECT COUNT(*) FROM WHERE { ?s ?p ?o } ; 1 Install the [[https://virtuoso.openlinksw.com/download/][DBpedia]] and [[https://virtuoso.openlinksw.com/download/][RDF Mappers]] VAD packages, using either the Virtuoso Conductor or the following manual commands: SQL> vad_install ('dbpedia_dav.vad', 0); SQL> vad_install ('rdf_mappers_dav.vad', 0); 1 The Virtuoso-hosted data set can now be explored using a HTML browser, or queried from the SPARQL or Faceted Browser web service endpoints. For example, with the DBpedia 3.5.1 data sets, a description of the resource Bob Marley can be viewed as: http://<your-cname>:<your-port>/resource/Bob_Marley %BR%%BR% %BR%%BR% ---++Related * [[VirtBulkRDFLoader][Virtuoso Bulk data set loader]]