All posts by Cripps P

Linking Geospatial Data 2014

Archaeogeomancy: Digital Heritage Specialists – archaeological geomatics – the majick of spatial data in archaeology – archaeological information systems for the digital age:

LGD14 Barcamp, featuring open plan space and beanbags.

LGD14 Barcamp, featuring open plan space and beanbags.

I was very pleased to attend this event co-organised by the World Wide Web Consortium (W3C) through the SmartOpenData project, the Open Geospatial Consortium (OGC), the UK Government (data.gov.uk), the Ordnance Survey (OS) and Google. Hosted by Google Campus London, the two day event comprised presentations, lightening talks and a barcamp, all focussing on the use of geospatial data within the world of Linked Data. It was refreshing to be amongst researchers, users, developers and commercial folk all working in this area; I for one picked up some good ideas to help with my research project and hopefully my contributions were of use.

It was certainly good to bring together the camps working in this area: the geospatial technologists on the one side and the web folks on the other (And people like me who have one foot in each camp, as well as limbs in other domains, my primary domain being digital cultural heritage of course). To make this stuff work it’s going to take both groups working together through their respective consortia, the W3C and OGC.

Highlights

I noted a number of specific highlights that really inspired and gave me food for thought. Some reinforced my own perceptions and others gave me some new ideas for application to my project. The extensive use of IRC and Twitter combined with fast internet access throughout the event made it possible to discuss and find out more whilst talks were ongoing. The format lent itself to interaction and I was impressed by the amount of progress made in such a short space of time, with new working groups forming and ideas for revisions to standards such as GeoSPARQL forthcoming.

Some of my favourite bits:

Ontologies and Linked Data

The discussion of the relationship of ontologies to Linked Data resources was informative. Whilst there is tendency in the world of the web to target the low hanging fruit, publish data and sort out issues later, it is my opinion that there needs to be robust semantics within our Linked Data resources. Otherwise we have a web of mess rather than semantically interoperable data. I noted a couple of points made by Tim Duffy (British Geological Survey) that resonated here:

Kerry Taylor (Commonwealth Scientific and Industrial Research Organisation) gave some examples of where ontological development can support but also restrict aims, showing how things can go wrong when trying to implement the various standards out there. This is an important point; ontologies need to be simple enough to work with but also suit the domain and applications.

GeoSPARQL and geometries

It was interesting to note that the use of Well Known Text (WKT) within GeoSPARQL can be problematic; hearing Lars G. Svensson (Deutsche Nationalbibliothek) talk about their experiences was reassuring given my experiences over the past few months!

Two crucial issues were raised by Raphaël Troncy (Eurecom) relating firstly to the use of coordinate systems and secondly to the way in which geometries are represented. I have often found the way in which geospatial data is used on the web to be problematic, with web developers focussing solely on location with only minimal respect for Coordinate Reference Systems (CRS), Spatial Reference System (SRS) or Spatial Reference Identifiers (SRID). In many cases, this is an acceptable way of working (if you just want features on maps in roughly the right place) but lack of clarity regarding spatial frameworks is problematic for any more detailed use of geospatial data. Being explicit about coordinate systems is essential for transforming between different coordinate systems and also takes into account factors such as tectonic plate movement. Put simply, assuming WGS84 is the only way to reference coordinates is a gross oversimplification.

Secondly, he went on to talk about the implementation of this within GeoSPARQL. The standard does support CRS (a good start) but the implementation is a little complex in my view. He suggested making CRS definitions simply part of the semantic model rather than being fudged into a geometry node as they currently are; a geometry node currently comprises up to three parts, the first being an (optional) SRID, the second being the geometry itself and the third being a literal describing the format of the geometry (eg a WKT or GML literal). It was suggested that these could better be stored as individual assertions relating to a geometry object and this was well received and may well appear in the next version of the standard: hurrah!

Versioning

A thorny issue if ever there was one. With heritage data in particular, it is important to know provenance of vocabularies. This topic came up a couple of times and it was pleasing to hear that a lightweight solution exists (current and then historical, versioned namespaces; bit clunky but doable) and versioning can be more fully supported using ontologies designed for the purpose.

Re-use

A key question with Linked Data is how do you know who is using your data? Does this matter? Arguably not, but as with anything, proper citation and accreditation is useful, polite and can be used to demonstrate impact (a good thing when looking for funding). Turns out that Adam Leadbetter (British Oceanographic Data Centre) and Dicky Allison (Woods Hole Oceanographic Institution) have both been using the Heritage Data vocabularies I blogged about previously, which is great stuff but this only came to light through seeing the inclusion of English Heritage as a provider on one of their slides!

Precision & Accuracy

Important concepts for heritage data are precision and accuracy. When working with historic maps in particular it is important to be able to record tolerances against which data has been captured. As with coordinate systems, this is an area often ignored in the world of Linked Data with coordinates expressed to spurious levels of precision (ten decimal places is a *seriously* precise measurement!) with no metadata to describe overall accuracy. Coming from a geospatial background where these are core items of metadata, the lack of proper support for this within current Linked Data standards is problematic. It took a speaker working with heritage data to make this point; nice one Rob Warren (Big Data Institute, Dalhousie University).

Time

There was talk of temporal aspects to data, most spatial data have some kind of temporal component to it. Interestingly, the data I work with is placed in archaeological time and rarely do we have any absolute temporal data; chronologies are typically relative and imprecise with occasional pegs to actual temporal classes generally used in Linked Data (timestamps, dates, etc). I think this makes for an interesting area to try out ideas and the way this is represented in cultural heritage ontologies such as the CIDOC CRM, whilst being a bit different to the norm, actually encapsulates some very powerful constructs for working with spatio-temporal data.

Cool stuff!

Last but not least, there was a liberal spread of really cool stuff.

Strabon

Strabon had a few mentions, with a point made that the GeoKnow report on platforms had evaluated an old version and actually Strabon is now a very capable and scalable system. Being a semantic spatio-temporal system built from the ground up rather than than adding semantic, spatial and temporal functionality to an existing system sounds promising. I will certainly be reviewing it in more detail as a result.

Sextant

Also, building on the Strabon system comes Sextant. This application is described as:

Sextant is a web-based system for the visualization and exploration of time-evolving linked geospatial data and the creation, sharing, and collaborative editing of `temporally-enriched’ thematic maps which are produced by combining different sources of such data and other geospatial information available in standard OGC file formats (e.g., KML).

This looks like a very interesting platform for mapping geosemantic data, one which I will definitely be investigating further.

RAGLD

An absolutely brilliant piece of work was presented by John Goodwin (Ordnance Survey) called entitled Rapid Assembly of Geo-centred Linked Data applications (RAGLD). A collaboration between the University of Southampton, Ordnance Survey and Seme4, this project provides a neat suite of developer tools (currently in beta) for working with Linked Geospatial Data. Massive +1 from me!

map4rdf

Another really interesting platform is map4rdf. This is described as:

map4rdf is a mapping and faceted browsing tool for exploring and visualizing RDF datasets enhanced with geometrical Information. map4rdf is an open source software. Just configure it to use your SPARQL endpoint and provide your users with a nice map-based visualization of your data.

Again, this is one I will be investigating further for my GSTAR project.

Campus London

Is just cool. Enough said. Love their displays of historical computer gear and of course the open plan, bean bag filled working space. Really tempted to join up and hang out there more (if only the trains to London didn’t require a mortgage…)

[flickr-gallery mode="photoset" photoset="72157642122872833"]

Summary

A brilliant event, well organised and some amazing ideas and discussion. Not only that, but an excellent forum for meeting people working in the same subject area; my Twitter peeps grew considerably as a result and I’ve added lots of new folks to my LinkedGeoData list.

Big thanks of course to John Goodwin and Phil Archer for leading on the organisation front.

Looking forward to LGD 15 :-)

The post Linking Geospatial Data 2014 appeared first on Archaeogeomancy: Digital Heritage Specialists.

GSTAR: investigation of methods for working with geosemantic data, integrating geospatial data with semantic data

Archaeogeomancy: Digital Heritage Specialists – archaeological geomatics – the majick of spatial data in archaeology – archaeological information systems for the digital age:

Mapping rubble by Brian Hoffman

Mapping rubble by Brian Hoffman

The first investigation in the GeoSemantic Technologies for Archaeological Research (GSTAR) research project is nearing completion, an assessment of approaches to the integration of geospatial archaeological data into a semantic framework to provide geosemantic capabilities.

The investigation draws on archaeological excavation data lodged with the Archaeology Data Service (ADS) and made available as Linked Data (LD) through the ADS’s Linked Data platform. The data relates to the Cobham Golf Course site and was produced by Oxford Archaeology (OA) as part of the Channel Tunnel Rail Link (CTRL) project then turned into a Linked Data resource through the Semantic Technologies Enhancing Links and Linked data for Archaeological Resources (STELLAR) project, undertaken by the Hypermedia Research Unit at the University of South Wales (USW).

Mapping a feature by Wessex Archaeology

Mapping a feature by Wessex Archaeology

The GSTAR literature review identified two strands of integration approaches within published literature. Emerging from the semantic web and Linked Data communities, an approach involving the direct inclusion of geospatial data within semantic resources, leveraging World Wide Web Consortium (W3C) standards for Resource Description Framework (RDF) and Open Geospatial Consortium (OGC) standards for Well Known Text (WKT, part of the Simple Features specification) and GeoSPARQL. Emerging from the Geographic Information Science (GISc) community, approaches involving the use of Web Feature Services (WFS) within broader Spatial Data Infrastructures (SDI) running in parallel and linked to to semantic resources.

This initial GSTAR investigation looked at both these strands with a view to assessing suitable modes for subsequent use in the next phases of the GSTAR project. A WISSKI installation has also been setup to allow for the minting of any additional URIs needed.

GeoSPARQL route

This involved creating geosemantic data aligned with the CRM-EH extension to the CIDOC CRM ontology, stored within the Oracle Spatial & Graph platform and accessed via GeoSPARQL using an Oracle WebLogic web server and the Jena Framework.

[code language=”xml”]
<owl:Class rdf:about="http://purl.org/crmeh#EHE0022_ContextDepiction">
<rdfs:isDefinedBy rdf:resource="http://purl.org/crmeh#CRMEH"/>
<rdfs:subClassOf rdf:resource="http://erlangen-crm.org/110404/E47_Spatial_Coordinates"/>
<rdfs:label>Context Depiction</rdfs:label>
<rdfs:comment>
The Spatial co-ordinates of a Context, defining the actual spatial extent of the context. Usually recorded at the time of excavation or other investigative work
</rdfs:comment>
</owl:Class>
[/code]
The OWL definition of the EHE0022 class used to describe depictions

Further triples were also added to describe the depiction using the GeoSPARQL ogc:hasGeometry and ogc:asWKT properties.

[code language=”xml”]
<owl:ObjectProperty rdf:about="#hasGeometry">
<rdfs:isDefinedBy rdf:resource=""/>
<rdfs:isDefinedBy rdf:resource="http://www.opengis.net/spec/geosparql/1.0"/>
<skos:prefLabel xml:lang="en">hasGeometry</skos:prefLabel>
<dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#date">2011-06-16</dc:date>
<dc:contributor>Matthew Perry</dc:contributor>
<dc:description xml:lang="en">
A spatial representation for a given feature.
</dc:description>
<rdfs:range rdf:resource="#Geometry"/>
<rdfs:comment xml:lang="en">
A spatial representation for a given feature.
</rdfs:comment>
<rdfs:domain rdf:resource="#Feature"/>
<rdfs:label xml:lang="en">hasGeometry</rdfs:label>
<dc:creator>OGC GeoSPARQL 1.0 Standard Working Group</dc:creator>
<skos:definition xml:lang="en">
A spatial representation for a given feature.
</skos:definition>
</owl:ObjectProperty>
[/code]

The OWL definition of the hasGeometry property

GIS Server route

A second approach used the same base platform and data but accessed the geospatial component via WFS provided by GeoServer, drawing on the Oracle database.

Next steps

The results of this stage and the GSTAR project in general will be presented at the Computer Applications and Quantitative Methods in Archaeology (CAA) conference to be held in Paris, France during April 2014.

Thanks are due to the University of South Wales for funding the GSTAR project and to the people and organisations responsible for the tools, technologies and data being used. Also my PhD supervisor (Prof. Douglas Tudhope; USW), advisors (Dr Mark Ware and Dr Alex Lohfink: USW) and fellow researchers Ceri Binding (USW), Dr Andreas Vlachidis (USW) and Michael Charno (ADS) for their input.

The post GSTAR: investigation of methods for working with geosemantic data, integrating geospatial data with semantic data appeared first on Archaeogeomancy: Digital Heritage Specialists.

First Birthday!

Archaeogeomancy: Digital Heritage Specialists – archaeological geomatics – the majick of spatial data in archaeology – archaeological information systems for the digital age:

Birthday cake by Will Clayton

Birthday cake by Will Clayton

Archaeogeomancy are pleased to be celebrating our first birthday this month! It’s hard to believe it has been a whole year since it all began, but LinkedIn confirms this with some lovely congratulatory messages, for which many thanks.

It’s been a busy year with a range of work successfully completed for a growing client base. 2014 is looking like it’s going to be a good year too with order books full for this quarter and a new and exciting venture just beginning.

Paul’s PhD is also progressing nicely with the three month review successfully completed and the first major case study in the final stages of being completed, ready for the next one and the transfer report.

So do follow us on Facebook and LinkedIn and contact us if you need some advice, training or have a project in mind.

The post First Birthday! appeared first on Archaeogeomancy: Digital Heritage Specialists.

‘Between the Monuments’ at Avebury

Archaeogeomancy: Digital Heritage Specialists – archaeological geomatics – the majick of spatial data in archaeology – archaeological information systems for the digital age:

The dig in the West Kennet Avenue, August 2013

The dig in the West Kennet Avenue, August 2013

It’s been almost a decade since I was last digging at Avebury as part of the ‘Negotiating Avebury‘ project, so it was lovely to be invited along with my Avebury Archaeological and Historical Research Group (AAHRG) colleagues to visit the latest excavations at this amazing place, part of the current ‘Between the Monuments‘ project and following the team’s 2012 geophysical survey. This latest project features two of the site directors from the Negotiating Avebury Project, Dr Josh Pollard (University of Southampton) and Dr Mark Gillings (University of Leicester), joined this time by Dr Nick Snashall (National Trust) as co-director.

Looking back in time: Alexander Keiller's trench reopened

Looking back in time: Alexander Keiller’s trench reopened

This years dig has opened up two trenches in the area of the West Kennet Avenue where Alexander Keiller identified what he described as a settlement site. Indeed, one of the trenches has been opened up over one of Keiller’s trenches to see what remains and how he dug the site. The excavations are being blogged by the project team as work progresses.

I was amazed to see the deposits in this section of the Avenue. There is a virtually untouched soil going right down to the chalk, soil which looks to have never been ploughed or otherwise interfered with. There are also no major cut features one might expect to find in the chalk around Avebury and elsewhere. Instead, artefacts reside pretty much where they were deposited, helped into their final resting places by the usual range of natural processes such as worm action.

The deposits being excavated

The deposits being excavated

The nature of the deposits has led to the adoption of a slightly different excavation strategy. Single context style recording is not ideally suited so the site has been gridded and then excavated in spits down through the fairly homogeneous soil to the flinty layer and ultimately the chalk beneath. This will allow for fine horizontal and vertical spatial resolution in the excavation data, ideal for the intended GIS based analysis.

The nature of these deposits also raises questions about the often employed strategy common in commercial fieldwork where the ‘topsoil’ is machined off to reveal cut features in the chalk below; such an approach used here would have revealed nothing yet the site is demonstrably rich in information.

Spatial Data

The level; essential bit of archaeological survey kit

The level; essential bit of archaeological survey kit

With Mark Gillings involved, there was always going to be extensive (and exemplary) use of GIS. To support this, other data is being gathered including photogrammetric data captured using pole mounted cameras (courtesy of Adam Stanford and his amazing Aerial Cam landrover) and Unmanned Aerial Vehicles (UAVs). The data it is possible to capture using UAVs combined with photogrammetric techniques is highly detailed and most useful for archaeological investigations.

GSTAR

Another reason for my visit was to discuss possibilities relating to my GSTAR project and given that the Between the Monuments project is right in the middle of my study area and data from this site will be extensively digitised, the activities indexed by the HER and Oasis and artefacts lodged with the Wiltshire Heritage Museum, my intention is to look at using some of the data for one of my case studies. This case study will involve the application of Linked Data techniques to (spatial) data from excavations, heritage inventories and museums; more on this to follow later this year, but for now, many thanks to the project directors and the aforementioned organisations for their kind permission to reuse their data.

Trench 2 being excavated, with the megalithic avenue in the background and the Aerial Cam landrover

Trench 2 being excavated, with the megalithic avenue in the background and the Aerial Cam landrover

Photos, Panoramas and PhotoSynths

As usual, I took quite a few photos during my visit! A selection of the best are shown below, taken from my Flickr photoset. I also took the opportunity to create a panoramic image from atop the spoil heap and used all the images to create a PhotoSynth.

Flickr Gallery

[flickr-gallery mode=”photoset” photoset=”72157634971425709″]

Panorama

Photosynth

The post ‘Between the Monuments’ at Avebury appeared first on Archaeogeomancy: Digital Heritage Specialists.

Day of Archaeology, 2013

Day of Archaeology, 2013

Day of Archaeology, 2013

For the 3rd year running, I’ve blogged for the Day of Archaeology project, which is an amazing project, recording a snapshot each year of what archaeologists were doing on a particular day, this year on Friday 26th July 2013.