Citizen Science Archaeological Finds On The Semantic Web: The FindSampo Framework
Citizen Science Archaeological Finds On The Semantic Web: The FindSampo Framework
Citizen Science Archaeological Finds On The Semantic Web: The FindSampo Framework
Project Gallery
FindSampo fosters collecting, sharing, publishing and studying archaeological finds discovered by the public.
The framework includes the following: a mobile find-reporting system; a semantic portal for researchers, the
public and collection managers to use; and a Linked Open Data service for creating custom data analyses and
for application developers.
Keywords: Finland, citizen science, metal detecting, archaeology, linked open data, Semantic Web
Introduction
FindSampo develops a prototype framework system for supporting mobile finds data report-
ing in the field, and for studying archaeological artefacts discovered and reported by the pub-
lic. While it is unique in responding to the archaeological conditions in Finland, and in
providing solutions to its users’ needs (Wessman et al. 2019), the framework and its imple-
mentation are open source and can be replicated for use elsewhere. FindSampo sits within a
broader context of digitising Finnish heritage (Hyvönen 2020), the European Public Finds
Recording Network (2021) and the ARIADNEPlus (n.d.) infrastructure project.
Challenges
Metal-detectorists’ finds can contribute to archaeological knowledge and research. In Fin-
land, however, it has been laborious to access data regarding new metal-detector finds, espe-
cially from a researcher perspective, and there is a backlog in the cataloguing process at the
Finnish Heritage Agency preventing up-to-date research. Hence, a user-friendly tool for
reporting, viewing, browsing and researching metal-detector finds to access high-quality meta-
data in a timely manner was needed. We adopted a ‘citizen science’ approach, conducting
surveys, interviews and focus groups for future users to express their preferences.
FindSampo is based on the ‘Sampo model’ (Hyvönen 2020) using the “FAIR guiding prin-
ciples for scientific data management and stewardship” (GoFair n.d.). This model includes
three components. A business model for collating, aggregating and publishing heteroge-
neous, distributed data from different content providers based on a shared ontology infra-
structure. An approach to interface design, where the data can be re-used and accessed
independently from multiple application perspectives, while the data reside in a single
SPARQL (Protocol and RDF Query Language) endpoint (WC3 2015). A two-step model
for accessing and analysing the data, where the focus of interest is first filtered out using a
faceted semantic search, and then visualised and analysed by ready-to-use Digital Humanities
tools of the portal. Implementing user interfaces based on this model is supported by the
open source Sampo-UI framework (SeCo n.d.).
In FindSampo archaeological finds can be searched using the faceted search paradigm (Tun-
kelang 2009), allowing narrowing of the result set by making orthogonal category value selec-
tions, such as object type, material, time period and place, based on underlying ontologies
(Figure 1). Once a result set of interest has been found, ready-to-use data analytic tools and
visualisations can be applied to it with additional contextual information. For example, it is pos-
sible to visualise finds on maps at the same time as seeing protected archaeological sites. If the
question is about an individual find, its ‘home page’ can be studied further (Figure 2).
The FindSampo Data Service with its SPARQL endpoint (WC3 n.d.) and data download
facility can be used for custom-made analyses. Different software tools can be employed for
this. Figure 3 presents a matrix showing probabilities that two types of items of the same era,
here Iron Age, are found in the same area, here a municipality.
© The Author(s), 2021. Published by Cambridge University Press on behalf of Antiquity Publications Ltd.
2
https://doi.org/10.15184/aqy.2021.87 Published online by Cambridge University Press
Citizen science archaeological finds on the Semantic Web
Figure 1. Different views in FindSampo Reporter. From top left: Clustered Map view provides an aggregated view of
filtered finds on the map; HeatMap view visualises the filtered finds distribution in colours; Table view lists the finds in a
traditional way; Statistics view illustrates statistical distributions of the finds along different facet dimensions, here based
on the selected finds’ material (graphics by P. Hassanzadeh).
finds in terms of values taken from a set of (hierarchical) ontologies, such as object types,
materials and time periods. The ontologies collate heterogeneous data from different data
sources and are used to enrich the data by data linking to external data sources and by reason-
ing based on the Semantic Web logical standards (W3C n.d.). The shared ontology infra-
structure includes a new object type ontology of archaeological finds interlinked with the
MAO/TAO ontology for Museum Domain and Applied Arts (Finnish Thesaurus and
© The Author(s), 2021. Published by Cambridge University Press on behalf of Antiquity Publications Ltd.
3
https://doi.org/10.15184/aqy.2021.87 Published online by Cambridge University Press
https://doi.org/10.15184/aqy.2021.87 Published online by Cambridge University Press
© The Author(s), 2021. Published by Cambridge University Press on behalf of Antiquity Publications Ltd.
Figure 2. The novel timeline visualisation in the FindSampo portal’s user interface (after Anafi et al. 2020). The activated filters are shown on top of the facets in the faceted search
section, and the result of the filters is displayed on the activated tab in the results area. The timeline visualisation shows the distribution of the filtered finds over time. Finds are
grouped by material type, providing the user with a new perspective on the material distribution of the finds chronologically (graphics by B. Anafi).
Citizen science archaeological finds on the Semantic Web
Figure 3. Analysis and visualisation of co-occurring Iron Age object types found in the same municipality, made using
Python Matplotlib library and a Google Colab notebook. If coins are found then the probability for jewellery is 0.93, but
finding jewellery indicates coins with less probability, i.e. 0.41. Probability for co-occurrence of weapon and coin finds
seems low (graphics by H. Rantala).
Ontology Service n.d.) and the Art and Architecture Thesaurus of the Getty Research Centre
(Getty Research Institute n.d.), and a time-period ontology interlinked with the PeriodO
(n.d.) ontology, as recommended for international semantic interoperability in the ARIAD-
NEplus project.
Future visions
The surge of new metal-detected find records in Finland since the 2010s is rewriting our
understanding of material culture and associated fields in social, cultural and economic his-
tory. To actualise these developments, the FindSampo framework offers novel, ground-
breaking qualitative and quantitative research tools to advance digital humanities and citizen
science research. Furthermore, a new Marie Skłodowska-Curie project (CORDIS 2021),
which began in September 2020, will deploy FindSampo and other Finnish Heritage Agency
archaeological data to produce new analysis of large-scale and long-term development of
Finnish archaeological landscapes. To test the usability of the FindSampo framework for
other find datasets, we plan to apply it to the large Portable Antiquities Scheme (2021)
© The Author(s), 2021. Published by Cambridge University Press on behalf of Antiquity Publications Ltd.
5
https://doi.org/10.15184/aqy.2021.87 Published online by Cambridge University Press
Eero Hyvönen et al.
database managed by the British Museum (2021). These initiatives push towards a deeper
understanding of the agency of the public as creators of new knowledge about the past.
Acknowledgements
CSC—IT Center for Science, Finland has provided computational resources for the work.
Funding statement
This article is an output of the research project SuALT—The Finnish Archaeological Finds
Recording Linked Open Database (2017–2021), funded by the Academy of Finland (deci-
sion numbers 310854, 310859 and 310860). Thanks to AriadnePlus and the Marie
Skłodowska-Curie project DeepFIN (grant agreement 896044) for additional funding.
References
Anafi, B., M. Koho & E. Hyvönen. 2020. https://www.getty.edu/research/tools/
Temporal visualization and data analysis vocabularies/aat/ (accessed 4 June 2021).
of archaeological finds: case GoFair. n.d. FAIR principles. Available at:
FindSampo. Conference on Cultural https://www.go-fair.org/fair-principles (accessed
Heritage and New Technologies (CHNT 25). 4 June 2021).
Museum Stadt Archäologie Wien, November, Hassanzadeh, P., E. Hyvönen, E. Ikkala,
2020. Available at: J. Tuominen, S. Thomas, A. Wessman &
https://www.chnt.at/abstract-2020/ (accessed 4 V. Rohiola. 2020. FindSampo platform for
June 2021). reporting and studying archaeological finds using
AriadnePlus. n.d. Available at: citizen science, in A. Adamou, E. Daga &
https://ariadne-infrastructure.eu (accessed 4 June A. Meroño-Peñuela (ed.) Proceedings of the 3rd
2021) workshop on humanities in the Semantic Web
British Museum. 2021. Treasure and the Portable (WHiSe), Heraklion, Greece, June 2, 2020: 33–
Antiquities Scheme. Available at: 40. CEUR Workshop Proceedings. Available at:
https://www.britishmuseum.org/our-work/ http://ceur-ws.org/Vol-2695/ (accessed 4 June
national/treasure-and-portable-antiquities- 2021).
scheme (accessed 4 June 2021). Heath, T. & C. Bizer. 2011. Linked Data: evolving
CORDIS. 2021. Assessing archaeological deep time the web into a global data space. Palo Alto: Morgan
in Finland through spatial exploration 500 BCE– & Claypool.
1520 CE. Available at: https://doi.org/10.2200/S00334ED1V01Y20
https://cordis.europa.eu/project/id/896044 1102WBE001
(accessed 4 June 2021). Hyvönen, E. 2020. ‘Sampo’ model and semantic
European Public Finds Recording Network. 2021. portals for digital humanities on the Semantic
Available at: Web, in S. Reinsone, I. Skadiņ a, A. Baklāne &
https://www.helsinki.fi/en/networks/european- J. Daugavietis (ed.) DHN 2020: digital
public-finds-recording-network (accessed 4 June humanities in the Nordic countries. Proceedings of
2021). the digital humanities in the Nordic countries 5th
FindSampo. n.d. FindSampo portal. Available at: conference, Riga, Latvia, October 21–23, 2020:
https://findsampo.fi (accessed 4 June 2021) 373–78. CEUR Workshop Proceedings.
Finnish Thesaurus and Ontology Service. n.d. Available at: http://ceur-ws.org/Vol-2612/
Available at: https://finto.fi/maotao/fi/ (accessed 4 (accessed 4 June 2021).
June 2021). Hyvönen, E., J. Tuominen, M. Alonen &
Getty Research Institute. n.d. Art & architecture E. Mäkelä. 2014. Linked Data Finland: a 7-star
thesaurus® online. Available at: model and platform for publishing and re-using
© The Author(s), 2021. Published by Cambridge University Press on behalf of Antiquity Publications Ltd.
6
https://doi.org/10.15184/aqy.2021.87 Published online by Cambridge University Press
Citizen science archaeological finds on the Semantic Web
© The Author(s), 2021. Published by Cambridge University Press on behalf of Antiquity Publications Ltd.
7
https://doi.org/10.15184/aqy.2021.87 Published online by Cambridge University Press