This HTML5 document contains 51 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

PrefixNamespace IRI
dctermshttp://purl.org/dc/terms/
n12http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideSitemaps/cr12a.
n7http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideSitemaps/cr11a.
atomhttp://atomowl.org/ontologies/atomrdf#
foafhttp://xmlns.com/foaf/0.1/
n31http://vos.openlinksw.com/dataspace/services/wiki/
oplhttp://www.openlinksw.com/schema/attribution#
n2http://vos.openlinksw.com/dataspace/owiki/wiki/VOS/
dchttp://purl.org/dc/elements/1.1/
n29http://vos.openlinksw.com/dataspace/dav#
rdfshttp://www.w3.org/2000/01/rdf-schema#
n32http://rdfs.org/sioc/services#
siocthttp://rdfs.org/sioc/types#
n13http://vos.openlinksw.com/dataspace/person/dav#
n17http://vos.openlinksw.com/dataspace/owiki/wiki/VOS/VirtSetCrawlerJobsGuideSitemaps/sioc.
n9http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideSitemaps/cr1.
n10http://vos.openlinksw.com/dataspace/owiki/wiki/
n15http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideSitemaps/cr11ab.
n18http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideSitemaps/cr11b.
rdfhttp://www.w3.org/1999/02/22-rdf-syntax-ns#
n11http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideSitemaps/cr12.
n19http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideSitemaps/cr15.
n21http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideSitemaps/cr13.
n22http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideSitemaps/cr14.
n6http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideSitemaps/cr11.
n28http://cname:port/conductor,
n26http://vos.openlinksw.com/dataspace/owiki#
n20http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideSitemaps/cr2.
n25http://docs.openlinksw.com/virtuoso/rdfinsertmethods.html#
xsdhhttp://www.w3.org/2001/XMLSchema#
n24http://vos.openlinksw.com/dataspace/%28NULL%29/wiki/VOS/
n27http://vos.openlinksw.com/dataspace/person/owiki#
siochttp://rdfs.org/sioc/ns#
n23http://vos.openlinksw.com/wiki/main/VOS/VirtSetCrawlerJobsGuideSitemaps/cr3.
Subject Item
n13:this
foaf:made
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n29:this
sioc:creator_of
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n31:item
n32:services_of
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n26:this
sioc:creator_of
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n10:VOS
sioc:container_of
n2:VirtSetCrawlerJobsGuideSitemaps
atom:entry
n2:VirtSetCrawlerJobsGuideSitemaps
atom:contains
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n2:VirtSetCrawlerJobsGuideSemanticSitemapsFuncExample
sioc:links_to
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n2:VirtSetCrawlerJobsGuideSemanticSitemaps
sioc:links_to
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n2:VirtSetCrawlerJobsGuideSitemaps
rdf:type
sioct:Comment atom:Entry
dcterms:created
2017-06-13T05:48:33.027482
dcterms:modified
2017-06-13T05:48:33.027482
rdfs:label
VirtSetCrawlerJobsGuideSitemaps
foaf:maker
n13:this n27:this
dc:title
VirtSetCrawlerJobsGuideSitemaps
opl:isDescribedUsing
n17:rdf
sioc:has_creator
n26:this n29:this
sioc:attachment
n6:png n7:png n9:png n11:png n12:png n15:png n18:png n19:png n20:png n21:png n22:png n23:png
sioc:content
%META:TOPICPARENT{name="VirtSetCrawlerJobsGuide"}% ---+Setting up a Content Crawler Job to retrieve Sitemaps The following guide describes how to set up a crawler job for getting content of a basic Sitemap where the source includes RDFa. 1 From the Virtuoso Conductor User Interface i.e. http://cname:port/conductor, login as the "dba" user. 1 Go to "Web Application Server" tab. %BR%%BR%<a href="%ATTACHURLPATH%/cr1.png" target="_blank"><img src="%ATTACHURLPATH%/cr1.png" width="600px" /></a>%BR%%BR% 1 Go to the "Content Imports" tab. %BR%%BR%<a href="%ATTACHURLPATH%/cr2.png" target="_blank"><img src="%ATTACHURLPATH%/cr2.png" width="600px" /></a>%BR%%BR% 1 Click on the "New Target" button. %BR%%BR%<a href="%ATTACHURLPATH%/cr3.png" target="_blank"><img src="%ATTACHURLPATH%/cr3.png" width="600px" /></a>%BR%%BR% 1 In the form displayed: * Enter a name of choice in the "Crawl Job Name" text-box: <verbatim> Basic Sitemap Crawling Example </verbatim> * Enter the URL of the site to be crawled in the "Data Source Address (URL)" text-box: <verbatim> http://psclife.pscdog.com/catalog/seo_sitemap/product/&nbsp </verbatim> * Enter the location in the Virtuoso WebDAV repository the crawled should stored in the "Local WebDAV Identifier" text-box, for example, if user demo is available, then: <verbatim> /DAV/home/demo/basic_sitemap/ </verbatim> * Choose the "Local resources owner" for the collection from the list-box available, for ex: user demo. * Select the "Accept RDF" check-box. %BR%%BR%<a href="%ATTACHURLPATH%/cr11.png" target="_blank"><img src="%ATTACHURLPATH%/cr11.png" width="600px" /></a>%BR%<a href="%ATTACHURLPATH%/cr11a.png" target="_blank"><img src="%ATTACHURLPATH%/cr11a.png" width="600px" /></a>%BR%%BR% 1 Click the "Create" button to create the import: %BR%%BR%<a href="%ATTACHURLPATH%/cr12.png" target="_blank"><img src="%ATTACHURLPATH%/cr12.png" width="600px" /></a>%BR%%BR% 1 Click the "Import Queues" button. 1 For the "Robot targets" with label "Basic Sitemap Crawling Example " click the "Run" button. 1 This will result in the Target site being crawled and the retrieved pages stored locally in DAV and any sponged triples in the RDF Quad store. %BR%%BR%<a href="%ATTACHURLPATH%/cr13.png" target="_blank"><img src="%ATTACHURLPATH%/cr13.png" width="600px" /></a>%BR%%BR% 1 Go to the "Web Application Server" -> "Content Management" tab. %BR%%BR%<a href="%ATTACHURLPATH%/cr14.png" target="_blank"><img src="%ATTACHURLPATH%/cr14.png" width="600px" /></a>%BR%%BR% 1 Navigate to the location of newly created DAV collection: <verbatim> /DAV/home/demo/basic_sitemap/ </verbatim> 1 The retrieved content will be available in this location. %BR%%BR%<a href="%ATTACHURLPATH%/cr15.png" target="_blank"><img src="%ATTACHURLPATH%/cr15.png" width="600px" /></a>%BR%%BR% ---++Related * [[VirtSetCrawlerJobsGuide][Setting up Crawler Jobs Guide using Conductor]] * [[http://docs.openlinksw.com/virtuoso/rdfinsertmethods.html#rdfinsertmethodvirtuosocrawler][Setting up a Content Crawler Job to Add RDF Data to the Quad Store]] * [[VirtSetCrawlerJobsGuideSemanticSitemaps][Setting up a Content Crawler Job to Retrieve Semantic Sitemaps (a variation of the standard sitemap)]] * [[VirtSetCrawlerJobsGuideDirectories][Setting up a Content Crawler Job to Retrieve Content from Specific Directories]] * [[VirtCrawlerSPARQLEndpoints][Setting up a Content Crawler Job to Retrieve Content from SPARQL endpoint]]
sioc:id
8e3b28cf81a7848dc1ce50585dfeebf2
sioc:link
n2:VirtSetCrawlerJobsGuideSitemaps
sioc:has_container
n10:VOS
n32:has_services
n31:item
atom:title
VirtSetCrawlerJobsGuideSitemaps
sioc:links_to
n2:VirtSetCrawlerJobsGuideDirectories n2:VirtSetCrawlerJobsGuide n24:VirtCrawlerSPARQLEndpoints n25:rdfinsertmethodvirtuosocrawler n28: n2:WebDAV
atom:source
n10:VOS
atom:author
n13:this
atom:published
2017-06-13T05:48:33Z
atom:updated
2017-06-13T05:48:33Z
sioc:topic
n10:VOS
Subject Item
n2:VirtCrawlerSPARQLEndpoints
sioc:links_to
n2:VirtSetCrawlerJobsGuideSitemaps
Subject Item
n2:VirtCrawlerGuideAtom
sioc:links_to
n2:VirtSetCrawlerJobsGuideSitemaps