{"id":34653,"date":"2025-02-11T18:59:02","date_gmt":"2025-02-11T17:59:02","guid":{"rendered":"https:\/\/www.vtei.cz\/2025\/02\/citovani-geografickych-datovych-sad-pilotni-studie-dibavod-systematicky-prehled-literatury-2\/"},"modified":"2025-02-12T21:35:46","modified_gmt":"2025-02-12T20:35:46","slug":"geographic-data-citations-case-study-of-dibavod-systematic-review","status":"publish","type":"post","link":"https:\/\/www.vtei.cz\/en\/2025\/02\/geographic-data-citations-case-study-of-dibavod-systematic-review\/","title":{"rendered":"Geographic data citations: case study of DIBAVOD \u2013 systematic review"},"content":{"rendered":"<h2>ABSTRACT<\/h2>\n<p>Society, mainly through state and public institutions, spends considerable funds on the\u00a0acquisition, management and sharing of\u00a0data acquired with public funds, including scientific data. Evaluating the\u00a0success of\u00a0an open data policy is very problematic. One possibility is to use the\u00a0citations of\u00a0these datasets to track the\u00a0use of\u00a0open data. Dataset citation is a\u00a0relatively new field and still faces a\u00a0number of\u00a0methodological and technical challenges, including little awareness in\u00a0the\u00a0scientific community of\u00a0the\u00a0positives of\u00a0dataset citation. Also problematic is the\u00a0low level of\u00a0skill in\u00a0citing datasets, which generally leads to different forms and ways of\u00a0citations. In\u00a0this study, an analysis was performed of\u00a0the\u00a0citations of\u00a0the\u00a0geographic database DIBAVOD, which is managed by T.\u00a0G.\u00a0Masaryk Water Research Institute. In\u00a0total, 122 citing documents were included in\u00a0the\u00a0study. The\u00a0study showed that the\u00a0forms and methods of\u00a0citation vary widely and do not show any discernible trends over time. Only the\u00a0number of\u00a0citations shows a\u00a0slightly increasing trend. Almost a\u00a0quarter of\u00a0the\u00a0papers then only mentioned the\u00a0use of\u00a0DIBAVOD without indicating the\u00a0source of\u00a0this data or citing it in\u00a0another form.<\/p>\n<h2>INTRODUCTION<\/h2>\n<h3>The\u00a0issue of\u00a0data citation<\/h3>\n<p>In\u00a0today\u2019s\u00a0digital era, data \u2013 including geographic data \u2013 play a\u00a0key role in\u00a0science. Both government and scientific institutions invest significant resources in\u00a0dataset creation, management, and access. As the\u00a0digital environment has developed and the\u00a0size of\u00a0datasets increased in\u00a0recent decades, the\u00a0cost of\u00a0this data has also steadily increased. Monitoring the\u00a0use of\u00a0datasets within\u00a0the\u00a0research community allows us to verify the\u00a0value of\u00a0the\u00a0resources invested in\u00a0creating datasets, their administration, and providing them to the\u00a0public and other users.<\/p>\n<p>Current public policies encourage or even stipulate that publicly funded datasets be shared and used for other types of\u00a0analysis. Costello [1] has mapped a\u00a0number of\u00a0positive aspects of\u00a0sharing research data, as well as the\u00a0concerns and arguments of\u00a0scientists who oppose it. The\u00a0willingness to share research data varies across disciplines, and in\u00a0addition to data repositories, personal websites of\u00a0individual scientists are still widely used\u00a0[2]. Zhao et al. [3] analysed 600 papers published in\u00a0the\u00a0journal PLoS One and concluded that scientists still prefer to create their own sets rather than analyse already collected data. In\u00a0VTEI journal, one can also find very few articles that are based on the\u00a0reuse of\u00a0already published data [4, 5].<\/p>\n<p>The\u00a0requirement to make research data available in\u00a0the\u00a0Czech Republic is enshrined in\u00a0Act No. 130\/2002 Coll., on the\u00a0support of\u00a0research and development from public funds and on amendments to certain\u00a0related acts (the\u00a0Act on the\u00a0Support of\u00a0Research and Development). Act No. 130\/2002 Coll. introduced in\u00a0Section 12a the\u00a0obligation to provide research data, including research data that is an annex to or part of\u00a0scientific publications, free of\u00a0charge upon request. The\u00a0data have to be provided no later than one year after the\u00a0end of\u00a0the\u00a0project public funding. Information on acquired research data is recorded through the\u00a0Research &amp;\u00a0Development Information System (R&amp;D IS). The\u00a0goal is that the\u00a0considerable resources provided for the\u00a0acquisition and management of\u00a0research data are demonstrably spent \u201cfor the\u00a0public good\u201d, i.e. so that other scientists can deal with other scientific tasks using already acquired datasets, whether based on individual datasets or\u00a0combinations of\u00a0multiple data sources\u00a0[6]. Of\u00a0course, Act No.\u00a0130\/2002 Coll. also introduces certain\u00a0exceptions that regulate when research data does not have to be provided.<\/p>\n<p>Citations are a\u00a0way of\u00a0appreciating the\u00a0work of\u00a0cited authors in\u00a0the\u00a0scientific community; in\u00a0the\u00a0contemporary world they are used as a\u00a0tool for evaluating science, which often serves for career advancement and as a\u00a0basis for allocating funding for science. However, data citation is not intended to replace citation of\u00a0relevant literature, but rather to provide verifiable and reusable information about the\u00a0availability of\u00a0research data that support published conclusions and claims. The\u00a0lack of\u00a0proper citation of\u00a0datasets makes peer-reviewed publications less transparent, jeopardizes reproducibility, and hinders open science [7].<\/p>\n<p>Citation of\u00a0the\u00a0dataset used is also necessary to comply with Act\u00a0No.\u00a0121\/2000\u00a0Coll., on copyright, rights related to copyright, and on amendments to certain\u00a0acts (Copyright Act). Datasets fall under copyright works. According to Section 31 of\u00a0the\u00a0Copyright Act, the\u00a0use of\u00a0a\u00a0copyrighted work for scientific purposes is permitted only \u201cif possible, the\u00a0name of\u00a0the\u00a0author, unless the\u00a0work is anonymous, or the\u00a0name of\u00a0the\u00a0person under whose name the\u00a0work is made public, as well as the\u00a0title of\u00a0the\u00a0work and the\u00a0source are stated\u201d.<\/p>\n<p>Data citation is therefore an important tool for acknowledging the\u00a0work of\u00a0data creators and curators and allowing them to track how their data is used. Data citation allows scientists and other users to easily find data that have been used in\u00a0a\u00a0particular piece of\u00a0research, and to replicate that research and verify its results. Without proper data citation, it would be difficult to achieve the\u00a0goals of\u00a0open science, which seeks to share data and scientific knowledge to accelerate scientific progress. Finally, data citation helps to ensure that data sharing is fair and that the\u00a0creators and curators of\u00a0datasets receive due credit for their work [8].<\/p>\n<p>Citing datasets faces a number of issues [9], such as the uniqueness and verifiability of the citation, i.e. how to cite datasets so that the citation allows for the precise identification of the dataset used and so that it can be verified that the data cited were used. Another issue is how to cite dynamic datasets that change over time, or whether to cite the dataset or the article that describes the dataset. Citing an article that describes the dataset contributes\u00a0to the\u00a0author\u2019s\u00a0H-index, which can be beneficial for their scientific career. Conversely, citing a\u00a0dataset, even if relevant to research, usually does not directly affect the\u00a0author\u2019s\u00a0H-index. This disparity in\u00a0impact on academic metrics may influence decisions about the\u00a0recommended form of\u00a0citation. Last but not least, the\u00a0scientific community is addressing the\u00a0question of\u00a0how we can track and evaluate the\u00a0use of\u00a0datasets.<\/p>\n<p>To address these issues, a\u00a0number of\u00a0standards and best practices for data citation have been developed. The\u00a0most well-known are the\u00a0Data Citation Principles, developed by FORCE11\u00a0[10]. Adherence to the\u00a0proposed standards contributes to increasing the\u00a0impact of\u00a0both the\u00a0cited and citing work\u00a0[11]. However, the\u00a0basic task of\u00a0a\u00a0data citation system is to guarantee the\u00a0permanence of\u00a0the\u00a0cited data and the\u00a0citations themselves\u00a0[12], i.e., to ensure that the\u00a0cited dataset remains available in\u00a0the\u00a0cited form in\u00a0the\u00a0future.<\/p>\n<p>In\u00a0recent years, a\u00a0number of\u00a0studies have been conducted to examine how datasets are cited. Gregory et al. [13] examined the\u00a0practices, preferences, and motivations for citing data; they distinguish three types of\u00a0dataset citation. The\u00a0first type is data citation in\u00a0the\u00a0references. This means that datasets are cited like any other bibliometric source, with an abbreviated citation in\u00a0the\u00a0text of\u00a0the\u00a0article and a\u00a0full citation in\u00a0the\u00a0reference list. This form of\u00a0citation allows for easy tracking of\u00a0citations using citation analysis tools and specialized citation services. The\u00a0second type of\u00a0data citation is a\u00a0simple mention of\u00a0the\u00a0data used in\u00a0the\u00a0text of\u00a0the\u00a0publication. The\u00a0last type is an indirect citation, where the\u00a0reference to the\u00a0data is given in\u00a0the\u00a0form of\u00a0a\u00a0citation of\u00a0another related publication (e.g. a\u00a0data article describing the\u00a0data or a\u00a0data document).<\/p>\n<p>Smith et al. [14] point out another issue with citation of\u00a0datasets, using the\u00a0Paleobiology Database as an example. The\u00a0problem is that collective works, such as large datasets composed of\u00a0contributions from many authors (and articles based on them), are cited more often than the\u00a0original data contributors to these large datasets.<\/p>\n<h3>Digital water management database<\/h3>\n<p>The\u00a0Fundamental Base of\u00a0Geographic Data of\u00a0the\u00a0Czech Republic (ZABAGED) is the\u00a0primary geographic data set in\u00a0the\u00a0Czech Republic. The\u00a0administrator of\u00a0ZABAGED is the\u00a0Land Survey Office, which administers and expands it in\u00a0the\u00a0public interest in\u00a0accordance with Act No. 200\/1994 Coll. The\u00a0financing of\u00a0the\u00a0ZABAGED administration is thus ensured from the\u00a0Czech state budget. In\u00a0addition to ZABAGED, there are other geographic datasets. In\u00a0the\u00a0field of\u00a0water management, this is mainly the\u00a0Digital Base of\u00a0Water Management Data (DIBAVOD). DIBAVOD is managed by the\u00a0T. G. Masaryk Water Research Institute (TGM WRI) public research institution and its administration is ensured from the\u00a0internal resources of\u00a0this institution.<\/p>\n<p>DIBAVOD is a\u00a0reference geographic database created primarily from the\u00a0corresponding ZABAGED layers. It is used to create thematic cartographic outputs in\u00a0the\u00a0field of\u00a0water management and water protection over the\u00a0base map of\u00a0the\u00a0Czech Republic 1\u00a0:\u00a010,000. DIBAVOD is used, for example, for spatial analyses in\u00a0the\u00a0geographic information systems (GIS) environment and for processing reporting data under the\u00a0Water Framework Directive 2000\/60\/EC in\u00a0the\u00a0field of\u00a0water policy.<\/p>\n<p>DIBAVOD can be characterized as a\u00a0dynamic database containing 75 different objects that describe water management elements for the\u00a0creation of\u00a0basic water management maps. The\u00a0objects are divided into ten focus groups:<\/p>\n<ol style=\"list-style-type: upper-alpha;\">\n<li>Basic phenomena of\u00a0surface and groundwater<\/li>\n<li>Focus classification of\u00a0surface and groundwater<\/li>\n<li>Protected areas<\/li>\n<li>Floodplains<\/li>\n<li>Surface water gauging and monitoring points<\/li>\n<li>Groundwater gauging and monitoring points<\/li>\n<li>Water use subsystem objects<\/li>\n<li>Abstraction and discharge points<\/li>\n<li>Objects in\u00a0streams<\/li>\n<li>Meteorological observation objects<\/li>\n<\/ol>\n<p>As part of\u00a0sharing individual objects with the\u00a0general public or interested parties, a\u00a0web map application was created on the\u00a0dibavod.cz website. This application is a\u00a0synoptic interactive tool for publishing data and services with online access via a\u00a0web browser. It can contain\u00a0raster and vector datasets and allows the\u00a0use of\u00a0analytical and publication tools.<\/p>\n<p>Currently, the\u00a0system-wide stable financing of\u00a0DIBAVOD is not ensured\u00a0[15], which leads to some objects being unavailable or not updated for a\u00a0long time. Information on the\u00a0use of\u00a0DIBAVOD is therefore an important aspect when deciding on further financing of\u00a0DIBAVOD administration. TGM WRI does not have detailed information on the\u00a0use of\u00a0this database by the\u00a0scientific community as the\u00a0DIBAVOD data can be downloaded for free from the\u00a0dibavod.cz portal. The\u00a0aim of\u00a0this study is therefore to map the\u00a0citation rate of\u00a0DIBAVOD and analyse the\u00a0types of\u00a0citations of\u00a0this dataset. On the\u00a0main\u00a0page of\u00a0the\u00a0dibavod.cz application, the\u00a0DIBAVOD authors themselves recommend citing the\u00a0DIBAVOD dataset in\u00a0the\u00a0form of\u00a0an indirect citation of\u00a0the\u00a0article GIS and Cartography at the\u00a0TGM WRI\u00a0[15], published in\u00a02022 in\u00a0the\u00a0VTEI journal.<\/p>\n<h2>DATA AND METHODS<\/h2>\n<p>A\u00a0systematic literature review was chosen as the\u00a0primary research method for this study. A\u00a0systematic literature review is a\u00a0specific type of\u00a0review that focuses on finding an answer to a\u00a0pre-formulated research question by analysing the\u00a0proof\u00a0collected in\u00a0the\u00a0literature search\u00a0[16]. The\u00a0fundamental difference compared to so-called \u201cnarrative\u201d literature reviews is the\u00a0limitation of\u00a0subjectivity through clearly defined rules for selecting and including literature in\u00a0the\u00a0review [17, 18]. Systematic literature reviews use the\u00a0PRISMA methodology\u00a0[19]. Due to their complexity, systematic literature reviews are suitable for cases where several dozen or a\u00a0few hundred contributions are analysed.<\/p>\n<p>The\u00a0bibliometric databases Dimensions.AI\u00a0[20], Scopus\u00a0[21], and Web of\u00a0Science [22] were selected for citation analysis. Data collection was carried out via the\u00a0web interface of\u00a0all three databases. Data collection was carried out on 7 March 2024 by searching for the\u00a0string DIBAVOD in\u00a0all fields and then repeated on 1 July 2024. A\u00a0total of\u00a0216 scientific publications were found in\u00a0the\u00a0Dimensions.AI database, their metadata were exported in\u00a0csv format and loaded into a\u00a0spreadsheet. A\u00a0total of\u00a047 scientific publications were found in\u00a0the\u00a0Scopus database, which were again\u00a0exported in\u00a0csv format and loaded into a\u00a0spreadsheet. Three articles were found in\u00a0the\u00a0Web of\u00a0Science\u00a0\u2013 Core collection database, and when the\u00a0query was expanded to all databases in\u00a0the\u00a0Web of\u00a0Science, five references were found to two datasets derived from DIBAVOD. Records for these datasets were not included in\u00a0the\u00a0analysis.<\/p>\n<p>In the first step, duplicates were eliminated, resulting in a list of 231 scientific publications for screening. As part of the screening, each document found was checked to see if it actually contained a DIBAVOD citation. 104 records that did not cite DIBAVOD were excluded from further analysis, as were five records\u00a0for which it was not possible to verify whether they cited DIBAVOD (e.g. due to the\u00a0unavailability of\u00a0the\u00a0paper for the\u00a0authors).<\/p>\n<p>The\u00a0citation analysis included 122 papers citing DIBAVOD. A\u00a0modified typology described by Gregory et al. [13] was used to monitor the\u00a0types of\u00a0citations. For each paper citing DIBAVOD, the\u00a0form of\u00a0citation and the\u00a0method of\u00a0citing the\u00a0source were checked. The\u00a0form of\u00a0citation was classified into one of\u00a0two categories \u2013 \u201ccitation in\u00a0the\u00a0text\u201d or \u201ccitation in\u00a0the\u00a0list of\u00a0references\u201d. In\u00a0the\u00a0case of\u00a0the\u00a0form of\u00a0citation in\u00a0the\u00a0references, categories of\u00a0the\u00a0method of\u00a0citing the\u00a0source were created: \u201cno source is cited\u201d, \u201ca\u00a0recommended article is cited\u201d (i.e. the\u00a0article GIS and cartography at TGM WRI [15] is cited), \u201cTGM\u00a0WRI is cited\u201d, \u201cthe\u00a0website dibavod.cz is cited\u201d). Based on the\u00a0analysis of\u00a0citations, a\u00a0new category \u201cTGM WRI Hydroecological Information System is cited\u201d (alias HEIS\u00a0TGM\u00a0WRI) was added. HEIS TGM WRI is another information system operated by TGM WRI, which provides attribute data on water management in\u00a0the\u00a0Czech Republic. In\u00a0the\u00a0case of\u00a0the\u00a0\u201ccitation in\u00a0text\u201d form, the\u00a0same categories of\u00a0source citation methods were chosen, but logically, \u201ca\u00a0recommended article is cited\u201d cannot appear in\u00a0this DIBAVOD citation form. Citation analysis was performed by both authors of\u00a0this study; the\u00a0second author was in\u00a0charge of\u00a0the\u00a0initial analyses, the\u00a0first author checked the\u00a0results and made decisions in\u00a0the\u00a0case of\u00a0unclear classifications.<\/p>\n<p>Subsequently, these data were statistically processed and the\u00a0content analysed.<\/p>\n<h2>RESULTS AND DISCUSSION<\/h2>\n<p>Citations according to individual categories of\u00a0citation form and method of\u00a0citing the\u00a0source are shown in\u00a0Fig. 1. A\u00a0total of\u00a0122 papers citing DIBAVOD were found. In\u00a0DIBAVOD citations, citations in\u00a0the\u00a0form of\u00a0links in\u00a0the\u00a0list of\u00a0references slightly predominate; there are 64 (i.e. 52.5 %). This can be considered a\u00a0good result, since in-text citations generally prevail over citations in\u00a0the\u00a0list of\u00a0references\u00a0[23]. However, the\u00a0presented results are difficult to generalize because the\u00a0number of\u00a0citing articles is low. As Rogers et al. [24] point out, samples of\u00a01,000 documents provide a\u00a0good guide for relative (but not absolute) citation analyses; studies with fewer than 200 documents suffer from high variability in\u00a0results.<\/p>\n<a href=\"https:\/\/www.vtei.cz\/wp-content\/uploads\/2025\/02\/Ansorge-fig-1.jpg\" rel=\"shadowbox[sbpost-34653];player=img;\"><img decoding=\"async\" class=\"alignnone wp-image-34687 size-full lazyload\" data-src=\"https:\/\/www.vtei.cz\/wp-content\/uploads\/2025\/02\/Ansorge-fig-1.jpg\" alt=\"\" width=\"800\" height=\"650\" data-srcset=\"https:\/\/www.vtei.cz\/wp-content\/uploads\/2025\/02\/Ansorge-fig-1.jpg 800w, https:\/\/www.vtei.cz\/wp-content\/uploads\/2025\/02\/Ansorge-fig-1-300x244.jpg 300w, https:\/\/www.vtei.cz\/wp-content\/uploads\/2025\/02\/Ansorge-fig-1-768x624.jpg 768w\" data-sizes=\"(max-width: 800px) 100vw, 800px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 800px; --smush-placeholder-aspect-ratio: 800\/650;\" \/><\/a>\n<p>&nbsp;<\/p>\n<h6>Fig. 1. Structure of DIBAVOD citation types<\/h6>\n<p>Citing in\u00a0the\u00a0form of\u00a0a\u00a0link in\u00a0the\u00a0list of\u00a0references is preferable from the\u00a0perspective of\u00a0dataset curators, as it allows for easier tracking of\u00a0the\u00a0use of\u00a0the\u00a0dataset using specialized bibliometric databases. However, this does not mean that citations of\u00a0the\u00a0dataset in\u00a0the\u00a0text of\u00a0the\u00a0citing document should be considered inappropriate. Data citation is still in\u00a0its infancy and authors are still learning how to use it. It is therefore important that citing the\u00a0datasets used becomes part of\u00a0general \u201ccitation skills\u201d and good publishing practices.<\/p>\n<p>A\u00a0somewhat unpleasant finding is the\u00a029 papers (23.8 %) that only mention DIBAVOD in\u00a0the\u00a0text without providing any acknowledgement to the\u00a0authors and curators of\u00a0DIBAVOD, or a\u00a0reference to this source. Although 23.8 % may not seem like a\u00a0high percentage, it is still a\u00a0manifestation of\u00a0ignorance or non-compliance with citation rules. Inaccurate, incomplete or careless citation, where it is not possible to identify the\u00a0cited source, is considered by most publication ethics manuals [e.g. 25] to be a\u00a0violation of\u00a0publication ethics, or plagiarism. Moreover, it is a\u00a0de facto violation of\u00a0Czech copyright law, which requires the\u00a0author and source to be cited in\u00a0addition to the\u00a0title of\u00a0the\u00a0paper used.<\/p>\n<p>In\u00a0this context, the\u00a0question arises whether currently not citing the\u00a0exact source can be considered a\u00a0violation of\u00a0publication ethics if the\u00a0(non-)citing work indicates that a\u00a0specific dataset was used that can be easily found on the\u00a0Internet. With regard to compliance with the\u00a0FAIR principles\u00a0[8], the\u00a0citation of\u00a0datasets should also contribute to finding the\u00a0dataset used, its accessibility, interoperability and reusability. Citations of\u00a0data play an important role in\u00a0ensuring their findability and accessibility, especially when persistent identifiers such as DOI are used in\u00a0the\u00a0citations. Groth et al. [26] discuss the\u00a0benefits of\u00a0citing datasets for their reuse. In\u00a0our view, more frequent data citations will also have an impact on their interoperability, as data with higher interoperability should be used and cited more. Recognizing the\u00a0importance of\u00a0data citations, for example by including data citations in\u00a0rating systems, will put pressure on data curators to ensure greater interoperability of\u00a0the\u00a0datasets they manage.<\/p>\n<p>A\u00a0total of\u00a054 papers referenced the\u00a0dibavod.cz website, which was the\u00a0most common way of\u00a0citing a\u00a0source in\u00a0the\u00a0DIBAVOD dataset. Of\u00a0these, 46 references to dibavod.cz were in\u00a0the\u00a0form of\u00a0a\u00a0link in\u00a0the\u00a0list of\u00a0references, while eight references to dibavod.cz were listed directly in\u00a0the\u00a0text of\u00a0the\u00a0citing article. The\u00a0descriptive article\u00a0[15], which is recommended to be cited by the\u00a0curators of\u00a0the\u00a0DIBAVOD database on the\u00a0dibavod.cz website, was cited only five times, which is a\u00a0very small number. This may be due to the\u00a0fact that the\u00a0article was written relatively recently (in\u00a02022), and also to the\u00a0fact that data articles are not yet widely used for citing datasets, but there is still a\u00a0steady increase in\u00a0citations of\u00a0data articles\u00a0[27]. However, the\u00a0overall citation of\u00a0datasets is still at a\u00a0very low level, regardless of\u00a0the\u00a0data repository from which the\u00a0data is uploaded [28].<\/p>\n<p>Two articles cited DIBAVOD as part of\u00a0the\u00a0HEIS TGM WRI. In\u00a0both cases, these were relatively old citations, the\u00a0first from 2009, the\u00a0second from 2021, but citing a\u00a0source from 1965. Two other articles cited the\u00a0HEIS TGM WRI; however, because both of\u00a0these articles also cited TGM WRI or dibavod.cz, they were included in\u00a0the\u00a0categories citing these sources.<\/p>\n<p>The\u00a0last way of\u00a0citing a\u00a0source is represented by citations stating that DIBAVOD is managed by the\u00a0TGM WRI. There were 32 such citations in\u00a0total, 21 of\u00a0which were in\u00a0the\u00a0form of\u00a0in-text citations and 11 in\u00a0the\u00a0form of\u00a0citations in\u00a0the\u00a0references. This method of\u00a0citation cannot be considered optimal; however, it at least acknowledges the\u00a0TGM WRI for the\u00a0management of\u00a0the\u00a0DIBAVOD dataset.<\/p>\n<p><em>Fig. 2<\/em> shows that a\u00a0certain\u00a0increasing trend can be seen in\u00a0the\u00a0total number of\u00a0citations, but not in\u00a0whether the\u00a0share of\u00a0citations in\u00a0the\u00a0form of\u00a0in-text citations and citations in\u00a0the\u00a0references is changing. Similarly, Fig. 3 shows that the\u00a0way of\u00a0citing the\u00a0source does not indicate any noticeable trend either, and the\u00a0individual categories are randomly represented in\u00a0individual years. This suggests that dataset citation is not yet widely practiced in\u00a0the\u00a0Czech scientific community. However, education in\u00a0the\u00a0field of\u00a0dataset citation is essential for supporting academic integrity, developing critical digital skills, and improving the\u00a0ethical and effective use of\u00a0data.<\/p>\n<p>&nbsp;<\/p>\n<h6><a href=\"https:\/\/www.vtei.cz\/wp-content\/uploads\/2025\/02\/Ansorge-fig-2.jpg\" rel=\"shadowbox[sbpost-34653];player=img;\"><img decoding=\"async\" class=\"alignnone wp-image-34685 size-full lazyload\" data-src=\"https:\/\/www.vtei.cz\/wp-content\/uploads\/2025\/02\/Ansorge-fig-2.jpg\" alt=\"\" width=\"800\" height=\"521\" data-srcset=\"https:\/\/www.vtei.cz\/wp-content\/uploads\/2025\/02\/Ansorge-fig-2.jpg 800w, https:\/\/www.vtei.cz\/wp-content\/uploads\/2025\/02\/Ansorge-fig-2-300x195.jpg 300w, https:\/\/www.vtei.cz\/wp-content\/uploads\/2025\/02\/Ansorge-fig-2-768x500.jpg 768w\" data-sizes=\"(max-width: 800px) 100vw, 800px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 800px; --smush-placeholder-aspect-ratio: 800\/521;\" \/><\/a><\/h6>\n<h6>Fig. 2. Development of DIBAVOD citations over time<\/h6>\n<a href=\"https:\/\/www.vtei.cz\/wp-content\/uploads\/2025\/02\/Ansorge-fig-3.jpg\" rel=\"shadowbox[sbpost-34653];player=img;\"><img decoding=\"async\" class=\"alignnone wp-image-34683 size-full lazyload\" data-src=\"https:\/\/www.vtei.cz\/wp-content\/uploads\/2025\/02\/Ansorge-fig-3.jpg\" alt=\"\" width=\"800\" height=\"399\" data-srcset=\"https:\/\/www.vtei.cz\/wp-content\/uploads\/2025\/02\/Ansorge-fig-3.jpg 800w, https:\/\/www.vtei.cz\/wp-content\/uploads\/2025\/02\/Ansorge-fig-3-300x150.jpg 300w, https:\/\/www.vtei.cz\/wp-content\/uploads\/2025\/02\/Ansorge-fig-3-768x383.jpg 768w\" data-sizes=\"(max-width: 800px) 100vw, 800px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 800px; --smush-placeholder-aspect-ratio: 800\/399;\" \/><\/a>\n<h6><\/h6>\n<h6>Fig. 3. Distribution of citation types over time<\/h6>\n<p>In\u00a0this study, we focused on citations in\u00a0scientific journals that are included in\u00a0citation databases, which allowed for relatively simple data processing. However, geographic datasets such as DIBAVOD or ZABAGED are created primarily with the\u00a0aim of\u00a0providing data for the\u00a0lay public. One of\u00a0the\u00a0challenges for systems for assessing the\u00a0reuse of\u00a0research data is the\u00a0way in\u00a0which these data are used by the\u00a0general professional public outside the\u00a0academic sphere. This includes in\u00a0particular citations in\u00a0the\u00a0so-called grey literature, in\u00a0strategic documents and policies, in\u00a0decisions of\u00a0administrative bodies, etc. However, these citation analyses are highly demanding because, unlike scientific publication production, there are no easily usable sources of\u00a0information for these types of\u00a0documents. The\u00a0easiest way to do this is to use web search engines such as Google. However, the\u00a0subsequent analysis of\u00a0the\u00a0search results is very difficult to automate. Citation of\u00a0datasets may not be the\u00a0only way to demonstrate the\u00a0use of\u00a0research data. Other options include download counts, usage agreements, etc. The\u00a0biggest complication here is again\u00a0the\u00a0lack of\u00a0readily available information on these types of\u00a0indicators.<\/p>\n<h2>CONCLUSIONS<\/h2>\n<p>The\u00a0analysis showed that citing the\u00a0DIBAVOD dataset cannot be considered optimal. Of\u00a0the\u00a0122 papers analysed, 58 only used the\u00a0reference in\u00a0the\u00a0text and, of\u00a0these, only 29 mentioned the\u00a0use of\u00a0DIBAVOD without more detailed information about the\u00a0dataset or its authors or originator. A\u00a0total of\u00a054 papers provided a\u00a0link to the\u00a0dibavod.cz website and 34 papers cited DIBAVOD in\u00a0the\u00a0form of\u00a0a\u00a0reference to the\u00a0originator, i.e. TGM WRI. Only five papers used the\u00a0recommended citation via the\u00a0article GIS and Cartography in\u00a0TGM WRI; however, this may be mainly due to the\u00a0fact that this recommended article is quite new. The\u00a0study thus demonstrated that citing water management datasets, such as DIBAVOD in\u00a0particular, is not widespread in\u00a0the\u00a0Czech Republic, and there is no established form and method of\u00a0citing these datasets. The\u00a0importance of\u00a0citing geographic data should therefore be emphasized both within\u00a0university study programmes and through public events and professional committees. Similarly, the\u00a0study demonstrated high heterogeneity in\u00a0the\u00a0form of\u00a0citations of\u00a0the\u00a0DIBAVOD dataset. Much more awareness-raising is needed in\u00a0this regard as well.<\/p>\n<p>All data used in\u00a0the\u00a0study can be obtained from Dimension.AI, Scopus, and Web of\u00a0Science databases using the\u00a0procedures described in\u00a0this study. The\u00a0source file in\u00a0MS Excel format in\u00a0which all analyses were performed is available upon request from the\u00a0corresponding author.<\/p>\n<h3>Acknowledgements<\/h3>\n<p><em>The\u00a0authors would like to thank both reviewers for their very insightful comments and recommendations.<\/em><\/p>\n<h3>Conflict of\u00a0interest declaration<\/h3>\n<p>The\u00a0corresponding author is part of\u00a0the\u00a0TGM WRI management, which publishes the\u00a0VTEI journal, and the\u00a0chairman of\u00a0the\u00a0VTEI journal Editorial Board. However, these facts had no influence on the\u00a0results of\u00a0the\u00a0presented study. TGM WRI did not provide any funds for the\u00a0preparation of\u00a0this study.<\/p>\n<p class=\"00TEXTbezodsazeni\">The\u00a0 Czech version of\u00a0 this article was peer-reviewed, the\u00a0 English version was\u00a0translated from the\u00a0Czech original by Environmental Translation Ltd.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Society, mainly through state and public institutions, spends considerable funds on the acquisition, management and sharing of data acquired with public funds, including scientific data. Evaluating the success of an open data policy is very problematic. One possibility is to use the citations of these datasets to track the use of open data. Dataset citation is a relatively new field and still faces a number of methodological and technical challenges, including little awareness in the scientific community of the positives of dataset citation. Also problematic is the low level of skill in citing datasets, which generally leads to different forms and ways of citations. In this study, an analysis was performed of the citations of the geographic database DIBAVOD, which is managed by T. G. Masaryk Water Research Institute. In total, 122 citing documents were included in the study. The study showed that the forms and methods of citation vary widely and do not show any discernible trends over time. Only the number of citations shows a slightly increasing trend. Almost a quarter of the papers then only mentioned the use of DIBAVOD without indicating the source of this data or citing it in another form.<\/p>\n","protected":false},"author":8,"featured_media":34566,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[88],"tags":[3730,2648,3731,279],"coauthors":[399,3710],"class_list":["post-34653","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-informatics-cartography-in-water-management","tag-data-citations","tag-dibavod","tag-geographic-database","tag-gis"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.vtei.cz\/en\/wp-json\/wp\/v2\/posts\/34653","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.vtei.cz\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.vtei.cz\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.vtei.cz\/en\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/www.vtei.cz\/en\/wp-json\/wp\/v2\/comments?post=34653"}],"version-history":[{"count":5,"href":"https:\/\/www.vtei.cz\/en\/wp-json\/wp\/v2\/posts\/34653\/revisions"}],"predecessor-version":[{"id":34737,"href":"https:\/\/www.vtei.cz\/en\/wp-json\/wp\/v2\/posts\/34653\/revisions\/34737"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.vtei.cz\/en\/wp-json\/wp\/v2\/media\/34566"}],"wp:attachment":[{"href":"https:\/\/www.vtei.cz\/en\/wp-json\/wp\/v2\/media?parent=34653"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.vtei.cz\/en\/wp-json\/wp\/v2\/categories?post=34653"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.vtei.cz\/en\/wp-json\/wp\/v2\/tags?post=34653"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.vtei.cz\/en\/wp-json\/wp\/v2\/coauthors?post=34653"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}