Frequently Asked Questions

Open Data is, according to Wikipedia, is the idea that certain data should be freely available to everyone to use and republish as they wish, without restrictions from copyright, patents or other mechanisms of control. In the last few years, this ideas has given origin to a big movement of people interested in getting access to data generated by a number of institutions, agencies, and entities, fostered by Freedom of Information legislation all over the world. Several national and local "Open Data portals" have been created, notably the United States', the United Kingdom's and the Greater London Authority's
A big drive in the creation of the Open Data movement has come from academic institutions, notably the University of Southampton. It's time that other institutions join the Open Data movement as data producers, and share the overwhelming amount of data they hold - from administrative and statistical datasets, to research raw data and outcomes.
There are several deliverables of this project:
  • the datasets and, possibly, an institutional ontology
  • an API
  • a SPARQL endpoint
  • documentation, metadata, and legal information.
An Academic API is, simply, an API returning a number of information about an academic institution. Developing an Academic API is the most speculative goal of this project. The idea behind an Academic API is that of seeking interoperability between the data services of different institution. We want to understand what are the most popular datasets, and how people link these datasets to generate derivative datasets. It makes sense, in this context, to provide the users with an easy to use, pre-developed API.
SPARQL (SPARQL Protocol and RDF Query Language) is a query language for RDF, that is a language to retrieve and manipulate data from a specific type of "database" using the Resource Description Framework format, i.e. graph-related data. It is one of the most important technologies used for the so called semantic web.
The Open Data service is an experimental project developed by Giuseppe Sollazzo in Computing Services. It aims to investigate what are the sources of Open Data within SGUL and what processes should be put in place to ensure their accurate and accessible publication.
Although we download and elaborate data directly from the authoritative sources, the datasets available on this portal should always be considered non-authoritative sources, at least while the service stays in beta.
That said, we aim at providing data with maximum accuracy and to mirror any update in the original sources.
Please also note that there is a separate FOI (Freedom Of Information) structure which offers direct answers.
We aim to update the data at least once a week, or more often if possible. The last update is reported on each dataset's page.
If you click on the "Datasets" link in the upper navigation bar, or go directly into a dataset on the left, you will see all available Metadata for a given dataset, with details on how to access the actual data.
We aim to provide:
  • Metadata (licence, update frequency, source of the data, external URL when available)
  • Linked data info, including vocabularies, classes, and predicates used in the dataset
  • Direct access links to the data, including dumps in several formats (JSON, CSV, RDF-XML, example SPARQL queries, and a list of the main API calls returning the chosen dataset).
Sure. Please go through the FAQs first, then you're welcome to get in touch with Giuseppe Sollazzo in Computing Services, who maintains these pages, at