ls
Semantic Web Search Engine. People pronounce it "swuasziee", "swishy", or "s-w-s-e". We prefer "swizzy".
SWSE is a search engine for the RDF Web on the Web, and provides the equivalent services a search engine currently provides for the HTML Web. The system explores and indexes the Semantic Web and provides an easy-to-use interface through which users can find the information they are looking for. Because of the inherent semantics of RDF and other Semantic Web languages, the search and information retrieval capabilities of SWSE are potentially much more powerful than those of current search engines.
SWSE is a research project being carried out by DERI Galway. Contributors include:
SWSE indexes RDF data which is retrieved from many sources. The main sources are OWL, RDF and RSS files. RSS2 is converted to RDF. We will add GRDDL sources soon.
We are aiming for weekly crawls. The latest dataset is from the week of 2008-03-31.
SWSE's default search mode will turn up results with any of the words in your query. To search for information using SWSE
Once you have selected an entity, you can select an incoming or outgoing link at the sidebar. You then get a list of entities associated with the current entity.
The endpoint supporting core SPARQL queries is available at http://swse.deri.org/yars2/.
The SPARQL endpoint is experimental. Please let us know about problems you might encounter.
We try to get link rel links from HTML pages to FOAF files, but we might have missed your HTML homepage.
The safe way is to get a rdfs:seeAlso link from somebody's FOAF file that is already in the index, or alternatively, you can add your file to the FOAF Bulletin Board page which is included in our crawl.
We normalise the data we put into our index, and sometimes statements get removed due to syntax errors. Please make sure your URIs do not contain illegal characters. One of the more common case of illegal URIs are those with spaces. Escape these characters if you really need them. We handle Unicode URIs.
Our pages are XHTML 1.0/CSS2 compliant, using client-side XSLT. The pages should display on any modern browser, such as Firefox, Opera, and Internet Explorer. However, we have tested the user interface currently only on Firefox and Opera under Linux.
We used a version of MultiCrawler to crawl the Semantic Web. An ISWC publication about MultiCrawler is available online.
MultiCrawler transforms all crawled documents to RDF and creates an index over them.
The focus is to find and transform Semantic Web documents, like RDF files and RSS feeds.
To find URLs to Semantic Web files we extract URLs from the HTML Web, and follow rdfs:seeAlso links in RDF files.
Approximately data from about 400k sources, but we are constantly adding new sources.
We stress-tested the system with 7 billion machine-generated statements. For the current demo, we use real-world data from the Semantic Web.
The components of SWSE are:
More information on SWSE is available here.
Yes. Other semantic web search engines include: