13 February 2018
ECS735P The Semantic Web Coursework
This coursework makes up 25% of the total marks for the module.
Submit your solution to the EECS coursework submission system by 26 March
2018. You should be able to see the details of the coursework submission on your Submit your solution QM+ by the date specified in the coursework area.
personal page.
You should be able to see the details of the coursework submission on your personal page.
Page 1 of 3
ECS735P Coursework
The goal of this coursework is to put into practice the ontology modelling and semantic data development skills that have been exercised in the Semantic Web labs with a larger project.
The task is to define, populate and query an ontology on a topic of your choice. The ontology must be able to integrate and reuse already available semantic data. At least two concepts of the ontology T-Box must be taken from external semantic data repositories. This way, the ontology will have an A-Box that can be populated with already existing data. You will use Python- based semantic tooling to populate the ontology with real world data.
Specifically, you should achieve the following:
Basic Task (80% marks): Define your ontology using Prot¨g¨-OWL2; populate the knowledge base from an external semantic data repository using SPARQL(1.1). Verify that you can also query the local ontology also using SPARQL.
Bonus Task (20% marks): As above, but your ontology should fuse information from at least two distinct external semantic data repositories. The query to your local ontology query should answer a question that cannot be answered by either remote knowledge base alone.
You should submit a zip file with the following elements:
¡ì A pdf document describing, in English, how you constructed the ontology: you should say where you got the data from, and you should also say what difficulties you encountered and how you solved them. The document must have also a final section explaining what source code files and models are included, and the required steps to run the code.
¡ì A Prot¨g¨-OWL ontology.
¡ì A python script (.py) that can be used to populate the ontology from a
SPARQL endpoint.
¡ì A python script that queries the local store to demonstrate to the user that information can be easily accessed. To test the system, the user should be able to execute any arbitrary query supported by your ontology.
Page 2 of 3
ECS735P Coursework
This assignment is intended to be open ended and exploratory in nature. However for illustration, some examples of possible tasks could be: create and populate an ontology covering movies and cities which could be queried to find movies filmed in cities with a population less than 1M; or create and populate an ontology about companies including location, employees and profits, which could be queried to find the UK based companies with the largest profit per employee.
For convenience, we provide you some publicly available semantic web data sources that can be useful for this coursework:
Dbpedia provides an RDF version of the information available in the regular Wikipedia. It also provides an SPARQL endpoint for remote access http://wiki.dbpedia.org/OnlineAccess
You can access governmental datasets from data.gov.uk, and data.gov. In some cases, you might need to download the dataset, as they don¡¯t provide a SPARQL endpoint.
Freebase (http://www.freebase.com/) is a community-generated semantic web data repository. It does not implement an SPARQL endpoint, but you can query it remotely using a specific library. You can see Python examples for accessing the Freebase endpoint in the O¡¯Reilly textbook.
http://labs.mondeca.com/sparqlEndpointsStatus/index.html. This web page provides links to numerous live SPARQL endpoints available over the web.
Courseworks are due on QM+ during the last week of term. Please check the date specified in the coursework area of the module page. Please do not be late with the coursework
The submission deadline is 26th March 2018, by the EECS electronic otherwise a late submission penalty is applied automatically. There is a final cut-off point
after one week. You should have about a month to complete this coursework.
submission system. This gives you over a month to complete your coursework. Do NOT wait until the last week to start. There will be no extensions.
Page 3 of 3