This HTML5 document contains 56 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

PrefixNamespace IRI
n11http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cra12.
n15http://rdfs.org/sioc/services#
n7http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cra5.
dchttp://purl.org/dc/elements/1.1/
n21http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cra14.
n8http://vos.openlinksw.com/dataspace/owiki#
n2http://vos.openlinksw.com/dataspace/owiki/wiki/VOS/
dctermshttp://purl.org/dc/terms/
n16http://vos.openlinksw.com/dataspace/services/wiki/
rdfshttp://www.w3.org/2000/01/rdf-schema#
n9http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cra8.
rdfhttp://www.w3.org/1999/02/22-rdf-syntax-ns#
n33http://docs.openlinksw.com/virtuoso/xmlservices.html#
atomhttp://atomowl.org/ontologies/atomrdf#
n34http://cname/
n39http://vos.openlinksw.com/dataspace/dav#
xsdhhttp://www.w3.org/2001/XMLSchema#
n20http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cra3.
siochttp://rdfs.org/sioc/ns#
n38http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cra1.
n17http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cra11.
n36http://data.libris.kb.se/nationalbibliography/feed/
n37http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cr3.
n31http://vos.openlinksw.com/dataspace/person/owiki#
n4http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cra6.
oplhttp://www.openlinksw.com/schema/attribution#
n12http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cra13.
n23http://vos.openlinksw.com/dataspace/person/dav#
n6http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cra4.
n22http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cra15.
n10http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cra9.
foafhttp://xmlns.com/foaf/0.1/
n19http://docs.openlinksw.com/virtuoso/rdfinsertmethods.html#
n5http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cra7.
n35http://librisbloggen.kb.se/2011/09/21/swedish-national-bibliography-and-authority-data-released-with-open-license/
siocthttp://rdfs.org/sioc/types#
n30http://vos.openlinksw.com/dataspace/owiki/wiki/VOS/VirtCrawlerGuideAtom/sioc.
n14http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cra10.
n25http://vos.openlinksw.com/dataspace/owiki/wiki/
n18http://vos.openlinksw.com/wiki/main/VOS/VirtCrawlerGuideAtom/cra2.
Subject Item
n23:this
foaf:made
n2:VirtCrawlerGuideAtom
Subject Item
n39:this
sioc:creator_of
n2:VirtCrawlerGuideAtom
Subject Item
n16:item
n15:services_of
n2:VirtCrawlerGuideAtom
Subject Item
n8:this
sioc:creator_of
n2:VirtCrawlerGuideAtom
Subject Item
n25:VOS
sioc:container_of
n2:VirtCrawlerGuideAtom
atom:entry
n2:VirtCrawlerGuideAtom
atom:contains
n2:VirtCrawlerGuideAtom
Subject Item
n2:VirtSetCrawlerJobsGuideSemanticSitemapsFuncExample
sioc:links_to
n2:VirtCrawlerGuideAtom
Subject Item
n2:VirtCrawlerSPARQLEndpoints
sioc:links_to
n2:VirtCrawlerGuideAtom
Subject Item
n2:VirtCrawlerGuideAtom
rdf:type
sioct:Comment atom:Entry
dcterms:created
2017-06-13T05:43:06.895602
dcterms:modified
2017-06-29T07:36:18.912740
rdfs:label
VirtCrawlerGuideAtom
foaf:maker
n31:this n23:this
dc:title
VirtCrawlerGuideAtom
opl:isDescribedUsing
n30:rdf
sioc:has_creator
n8:this n39:this
sioc:attachment
n4:png n5:png n6:png n7:png n9:png n10:png n11:png n12:png n14:png n17:png n18:png n20:png n21:png n22:png n37:png n38:png
sioc:content
%META:TOPICPARENT{name="VirtSetCrawlerJobsGuide"}% ---+Virtuoso Crawler Guide for populating Virtuoso Quad Store using ATOM feed %TOC% ---++What? This Guide demonstrates populating the Virtuoso Quad Store using ATOM feed. ---++Why? Populating the Virtuoso Quad Store can be done in different ways Virtuoso supports. The Conductor -> Content Import UI offers plenty of options, one of which is the XPath expression for crawling RDF resources URLs and this feature is a powerful and easy-to-use for managing the Quad Store. ---++How? To populate the Virtuoso Quad Store, in this Guide we will use a XPAth expression for the URLs of the RDF resources references in a given ATOM feed. For ex. [[http://data.libris.kb.se/nationalbibliography/feed/][this one]] of the "National Bibliography" Store. ---+++Sample Scenario 1 Go to http://cname/conductor 1 Enter dba credentials 1 Go to Web Application Server -> Content Management -> Content Imports: %BR%%BR%<a href="%ATTACHURLPATH%/cra1.png" target="_blank"><img src="%ATTACHURLPATH%/cra1.png" width="600px" /></a>%BR%%BR% 1 Click "New Target": %BR%%BR%<a href="%ATTACHURLPATH%/cra2.png" target="_blank"><img src="%ATTACHURLPATH%/cra2.png" width="600px" /></a>%BR%%BR% 1 In the presented form specify respectively: * <b>Crawl Job Name</b>: for ex. National Bibliography ; * <b>Data Source Address (URL)</b>: for ex. [[http://data.libris.kb.se/nationalbibliography/feed/][http://data.libris.kb.se/nationalbibliography/feed/]] ; * <b>Note</b>: the entered URL will be the graph URI for storing the imported RDF data. You can also set it explicitly by entering another graph URI in the "If Graph IRI is unassigned use this Data Source URL:" option. * <b>Local WebDAV Identifier </b>: for ex. <verbatim> /DAV/temp/nbio/ </verbatim> * <b>XPath expression for links extraction</b>: <verbatim> //entry/link/@href </verbatim> * <b>Update Interval (minutes)</b>: for ex. 10 ; * <b>Run Sponger</b>: hatch this check-box ; * <b>Accept RDF</b>: hatch this check-box ; * <b>Store metadata</b>: hatch this check-box ; * <b>RDF Cartridge</b>: hatch this check-box and specify what cartridges will be used: %BR%%BR%<a href="%ATTACHURLPATH%/cra3.png" target="_blank"><img src="%ATTACHURLPATH%/cra3.png" width="600px" /></a> %BR%<a href="%ATTACHURLPATH%/cra4.png" target="_blank"><img src="%ATTACHURLPATH%/cra4.png" width="600px" /></a> %BR%<a href="%ATTACHURLPATH%/cra5.png" target="_blank"><img src="%ATTACHURLPATH%/cra5.png" width="600px" /></a>%BR%%BR% 1 Click "Create": 1 The new created target should be displayed in the list of available Targets: %BR%%BR%<a href="%ATTACHURLPATH%/cra7.png" target="_blank"><img src="%ATTACHURLPATH%/cra7.png" width="600px" /></a>%BR%%BR% 1 Click "Import Queues": %BR%%BR%<a href="%ATTACHURLPATH%/cra8.png" target="_blank"><img src="%ATTACHURLPATH%/cra8.png" width="600px" /></a>%BR%%BR% 1 Click for "National Bibliography" target the "Run" link from the very-right "Action" column: 1 Should be presented list of Top pending URLs: %BR%%BR%<a href="%ATTACHURLPATH%/cra9.png" target="_blank"><img src="%ATTACHURLPATH%/cra9.png" width="600px" /></a>%BR%%BR% 1 Finally when the import is finished, should be shown the total URLs that were processed: %BR%%BR%<a href="%ATTACHURLPATH%/cra10.png" target="_blank"><img src="%ATTACHURLPATH%/cra10.png" width="600px" /></a>%BR%%BR% 1 Click "Back" %BR%%BR%<a href="%ATTACHURLPATH%/cra11.png" target="_blank"><img src="%ATTACHURLPATH%/cra11.png" width="600px" /></a>%BR%%BR% 1 Click "Retrieved Sites". %BR%%BR%<a href="%ATTACHURLPATH%/cra12.png" target="_blank"><img src="%ATTACHURLPATH%/cra12.png" width="600px" /></a>%BR%%BR% 1 Out target should be presented in the list of available retrieved sites. From here you could manage the retrieved URLs by editing the imported URLs or exporting to External/Internal WebDAV destination. Click for ex. the "Edit" link of the very-right "Action" column for our retrieved site. 1 Should be presented all downloaded URLs of RDF resources referenced in our initial <b>ATOM</b> feed. %BR%%BR%<a href="%ATTACHURLPATH%/cra13.png" target="_blank"><img src="%ATTACHURLPATH%/cra13.png" width="600px" /></a>%BR%%BR% 1 To view the imported RDF data, go to http://cname/sparql and enter a simple query for ex.: <verbatim> SELECT * FROM <http://data.libris.kb.se/nationalbibliography/feed/> WHERE { ?s ?p ?o } </verbatim> %BR%%BR%<a href="%ATTACHURLPATH%/cra14.png" target="_blank"><img src="%ATTACHURLPATH%/cra14.png" width="600px" /></a>%BR%%BR% 1 Click "Run Query". 1 The imported RDF data triples should be shown: %BR%%BR%<a href="%ATTACHURLPATH%/cra15.png" target="_blank"><img src="%ATTACHURLPATH%/cra15.png" width="600px" /></a>%BR%%BR% ---++Related * [[http://docs.openlinksw.com/virtuoso/rdfinsertmethods.html#rdfinsertmethodvirtuosocrawler][Setting up a Content Crawler Job to Add RDF Data to the Quad Store]] * [[VirtSetCrawlerJobsGuideSitemaps][Setting up a Content Crawler Job to Retrieve Sitemaps]] (when the source includes RDFa) * [[VirtSetCrawlerJobsGuideSemanticSitemaps][Setting up a Content Crawler Job to Retrieve Semantic Sitemaps]] (a variation of the standard sitemap) * [[VirtSetCrawlerJobsGuideDirectories][Setting up a Content Crawler Job to Retrieve Content from Specific Directories]] * [[VirtCrawlerSPARQLEndpoints][Setting up a Content Crawler Job to Retrieve Content from SPARQL endpoint]] * [[http://docs.openlinksw.com/virtuoso/xmlservices.html#xpath_sql][Virtuoso XPATH Implementation and SQL]] * [[http://librisbloggen.kb.se/2011/09/21/swedish-national-bibliography-and-authority-data-released-with-open-license/][Collection examples of live ATOM and OAI-PMH feeds.]]
sioc:id
39696faaf5e697da124a9bcfc6054541
sioc:link
n2:VirtCrawlerGuideAtom
sioc:has_container
n25:VOS
n15:has_services
n16:item
atom:title
VirtCrawlerGuideAtom
sioc:links_to
n19:rdfinsertmethodvirtuosocrawler n2:WebDAV n2:VirtSetCrawlerJobsGuideDirectories n33:xpath_sql n34:conductor n35: n36: n2:VirtSetCrawlerJobsGuideSitemaps n34:sparql
atom:source
n25:VOS
atom:author
n23:this
atom:published
2017-06-13T05:43:06Z
atom:updated
2017-06-29T07:36:18Z
sioc:topic
n25:VOS