CROSS Harmonization & HPC modelization of FOREST Datasets

DATA

The modules of the ontologies and datasets are published separately, so that potential users can select the modules they want to use. All data is either already published or will be published under the CC BY 4.0 license. It can currently be accessed at https://github.com/Cross-Forest/Data.

– Data from the Spanish National Forest Inventory (IFN) contains sampling information about trees and shrubs in Spain. IFN releases are scheduled every 10 years, although the ongoing 4th release will take more than 15 years to complete. Data collection for the 4th release is currently complete for half of the territory, so the 3rd version of the inventory (IFN3) is the focus of the transformation into RDF. The IFN3 data is split into 100 files (two for each province of the Spanish territory). These files are published as zipped SQL databases with an ad-hoc schema, in a proprietary format (Microsoft Access). We are working on the transformation of IFN3 into RDF and have so far obtained 55.1 million triples with a size of 2.7 GB. The RDF data is publicly available at https://github.com/Cross-Forest/Data/tree/master/ifn.

– Data from the Spanish Forest Map (MFE) contains cartographic information about the spatial distribution of forest land, described using polygons over the Spanish territory. Like the IFN, the MFE has a planned update cycle of 10 years, although at present an update takes around 15 years to complete for economic and social reasons. The 3rd version of the MFE (MFE25, at a scale of 1:25,000) is currently in the final stages of field data collection. As a result, the 2nd version of the MFE (MFE50, at a scale of 1:50,000) is the dataset that will be transformed into RDF in Cross-Forest, although the available MFE25 data will also be included, so that the most up-to-date version of the MFE is used for each Spanish province. MFE50 is split into 50 zipped GIS shapefiles (one for each province of the Spanish territory), in which each patch is characterized by a polygon. In addition, a dataset representing positions in a grid of cells is being developed, in order to provide a raster representation of positions and geometries over the Spanish territory. We have completed the transformation of MFE50 into RDF, which yields 142.2 million triples with a size of 7.5 GB. The RDF data is publicly available at https://github.com/Cross-Forest/Data/tree/master/mfe.

– Data from the Portuguese Carta de Uso e Ocupação do Solo (COS) contains information on land use and land cover in vector format, with a minimum cartographic unit of 1 ha, a minimum distance between lines of 20 m, and an equivalent scale of 1:25,000. COS is available for several years, with no fixed periodicity (e.g. 1990, 1995, 2007, 2010, 2015, 2018). The latest version contains more than 100,000 polygons. We have completed the transformation of COS data into RDF, obtaining 16 million triples with a size of 730 MB. The RDF data is publicly available at https://github.com/Cross-Forest/Data/tree/master/cos.

SPARQL endpoints expose the content of the Core, Alignment, and Grid modules. The SPARQL endpoint for forestry data can currently be accessed at https://forestexplorer.gsic.uva.es/sparql.
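As an illustration of how such an endpoint might be queried, the following is a minimal Python sketch using only the standard library. It assumes the endpoint follows the standard SPARQL 1.1 Protocol (a GET request with a `query` parameter) and can return results as SPARQL JSON; neither is confirmed here. The query is deliberately generic and uses no Cross-Forest vocabulary, and the helper names (`build_request`, `parse_bindings`, `run_query`) are hypothetical, chosen for this example only.

```python
import json
import urllib.parse
import urllib.request

# Endpoint URL as given in the text above.
ENDPOINT = "https://forestexplorer.gsic.uva.es/sparql"

# Generic query: fetch a handful of arbitrary triples.
# It does not rely on any Cross-Forest class or property names.
QUERY = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 5"


def build_request(endpoint, query):
    """Build a GET request that asks for SPARQL JSON results.

    Assumption: the endpoint accepts the query in a `query` URL parameter,
    as described by the SPARQL 1.1 Protocol.
    """
    url = endpoint + "?" + urllib.parse.urlencode({"query": query})
    headers = {"Accept": "application/sparql-results+json"}
    return urllib.request.Request(url, headers=headers)


def parse_bindings(payload):
    """Flatten a SPARQL JSON results document into a list of plain dicts."""
    rows = payload["results"]["bindings"]
    return [{var: cell["value"] for var, cell in row.items()} for row in rows]


def run_query(endpoint, query, timeout=30):
    """Send the query over the network and return the flattened rows."""
    request = build_request(endpoint, query)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return parse_bindings(json.load(response))


# Usage (performs a network call, so it is left commented out):
# for row in run_query(ENDPOINT, QUERY):
#     print(row["s"], row["p"], row["o"])
```

Keeping request construction and result parsing separate from the network call makes both easy to check offline. For real use, the generic query would be replaced by one written against the classes and properties defined in the published ontology modules.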