Access to this full-text is provided by Springer Nature.
Content available from Scientific Data
This content is subject to copyright. Terms and conditions apply.
1
SCIENTIFIC DATA | (2021) 8:192 | https://doi.org/10.1038/s41597-021-00981-0
www.nature.com/scientificdata
Data sharing practices and data
availability upon request dier
across scientic disciplines
Leho Tedersoo1,2 ✉ , Rainer Küngas1, Ester Oras1,3,4, Kajar Köster
1,5, Helen Eenmaa1,6,
Äli Leijen1,7, Margus Pedaste7, Marju Raju1,8, Anastasiya Astapova1,9, Heli Lukner1,10,
Karin Kogermann1,11 & Tuul Sepp
1,12
Data sharing is one of the cornerstones of modern science that enables large-scale analyses and
reproducibility. We evaluated data availability in research articles across nine disciplines in Nature and
Science magazines and recorded corresponding authors’ concerns, requests and reasons for declining
data sharing. Although data sharing has improved in the last decade and particularly in recent years,
data availability and willingness to share data still dier greatly among disciplines. We observed that
statements of data availability upon (reasonable) request are inecient and should not be allowed by
journals. To improve data sharing at the time of manuscript acceptance, researchers should be better
motivated to release their data with real benets such as recognition, or bonus points in grant and job
applications. We recommend that data management costs should be covered by funding agencies;
publicly available research data ought to be included in the evaluation of applications; and surveillance
of data sharing should be enforced by both academic publishers and funders. These cross-discipline
survey data are available from the plutoF repository.
Introduction
Technological advances and accumulation of case studies have led many research fields into the era of ‘big
data’ - the possibility to integrate data from various sources for secondary analysis, e.g. meta-studies and
meta-analyses1,2. Nearly half of the researchers commonly use data generated by other scientists3. Data sharing is
a scientic norm and an important part of research ethics in all disciplines, also increasingly endorsed by publish-
ers, funders and the scientic community4–6. Despite decades of argumentation7, much of the published data is
still essentially unavailable for integration into secondary data analysis and evaluation of reproducibility, a proxy
for reliability8–10. Furthermore, the deposited data may also be incomplete, sometimes intentionally11–14, e.g. in
cases these exhibit mismatching sample codes or lack information about important metadata such as sex and age
of studied organisms in biological and social sciences.
Although the vast majority of researchers prefer data sharing12,15, scientists tend to be concerned about losing
their priority in future publishing and potential commercial use of their work without their consent or partici-
pation12,16,17. Researchers working on human subjects may be bound by legal agreements not to reveal sensitive
data16,18. Across research elds, papers indicating available data are cited on average 25% more19. In research
using microarrays, papers with access to raw data accumulate on average 69% more citations compared with
other articles20. Unfortunately, higher citation rate has not motivated many researchers enough to release their
data, although referees and funding agencies account for bibliometrics when evaluating researchers and their
1Estonian Young Academy of Sciences, Kohtu 6, 10130, Tallinn, Estonia. 2Mycology and Microbiology Center,
University of Tartu, Ravila 14a, 50411, Tartu, Estonia. 3Institute of Chemistry, University of Tartu, Ravila 14a,
50411, Tartu, Estonia. 4Institute of History and Archaeology, University of Tartu, Jakobi 2, 51005, Tartu, Estonia.
5Department of Forest Sciences, University of Helsinki, PO Box 27 (Latokartanonkaari 7), Helsinki, FI-00014, Finland.
6School of Law, University of Tartu, Näituse 20, 50409, Tartu, Estonia. 7Institute of Education, University of Tartu,
Salme 1a, 50103, Tartu, Estonia. 8Department of Musicology, Music Pedagogy and Cultural Management, Estonian
Academy of Music and Theatre, Tatari 13, 10116, Tallinn, Estonia. 9Institute for Cultural Research and Fine Arts,
University of Tartu, Ülikooli 16, 51003, Tartu, Estonia. 10Institute of Physics, University of Tartu, W. Ostwaldi 1, 50411,
Tartu, Estonia. 11Institute of Pharmacy, University of Tartu, Nooruse 1, 50411, Tartu, Estonia. 12Institute of Ecology
and Earth Sciences, University of Tartu, Vanemuise 46, 51003, Tartu, Estonia. ✉e-mail: leho.tedersoo@ut.ee
ANALYSIS
OPEN
Content courtesy of Springer Nature, terms of use apply. Rights reserved
2
SCIENTIFIC DATA | (2021) 8:192 | https://doi.org/10.1038/s41597-021-00981-0
www.nature.com/scientificdata
www.nature.com/scientificdata/
proposals21. Multiple case studies have revealed high variation in data availability in dierent journals and disci-
plines, ranging from 9 to 76%8,11,13,19,22–24. Data requests to authors are successful in 27–59% of cases, whereas the
request is ignored in 14–41% cases based on previous research10,25–28. To promote access to data, many journals
have implemented mandatory data availability statements and require data storage in supplementary materials
or specic databases29,30. Because of poor enforcement, this has not always guaranteed access to published data
because of broken links, the lack of metadata or the authors’ lack of willingness to share upon request8,26.
is study aims to map and evaluate cross-disciplinary dierences in data sharing, authors’ concerns and
reasons for denying access to data, and whether these decisions are reected in article citations (Fig.1). We
selected the scholarly articles published in journals Nature and Science because of their multidisciplinary con-
tents, stringent data availability policies outlined in authors’ instructions, and high-impact conclusions derived
from the data of exceptional size, accuracy and/or novelty. We hypothesised that in spite of overall improvement
in data sharing culture, the actual data availability and reasons for declining the requests to share data depend
on scientic disciplines because of eld-specic ‘traditions’, ‘sensitivity’ of data, or their economic potential. Our
broader goal is to improve data sharing principles and policies among authors, academic publishers and research
foundations.
Results
Initial and nal data availability. We evaluated the availability of most critical data in 875 articles across
nine scientic disciplines (TableS1) published in Nature and Science over two 10-year intervals (2000–2009 and
2010–2019) and, in case these data were not available for access, we contacted the authors. e initial (pre-con-
tacting) full and at least partial data availability averaged at 54.2% (range across disciplines, 33.0–82.8%) and
71.8% (40.4–100.0%), respectively. Stepwise logistic regression models revealed that initial data availability
diered by research eld, type of data, journal and publishing period (no vs. full availability: n = 721; Somers’
D = 0.676; R2model = 0.476; P < 0.001). According to the best model (TableS2), the data were less readily avail-
able in materials for energy and catalysis (W = 68.0; β = −1.52 ± 0.19; P < 0.001), psychology (W = 55.6;
β = −1.11 ± 0.15; P < 0.001), optics and photonics (W = 18.8; β = −0.59 ± 0.14; P < 0.001) and forestry (W = 9.8;
β = −0.52 ± 0.19; P = 0.002) compared with other disciplines, especially humanities (Fig.2). Data availability was
relatively lower in the period of 2000–2009 (W = 82.5; β = −0.57 ± 0.10; P < 0.001) and when the most important
data were in the form of a dataset (relative to image/video and model; W = 41.5; β = −1.23 ± 0.19; P < 0.001;
Fig.3). Relatively less data were available for Nature (W = 32.7; β = −0.57 ± 0.19; P < 0.001), with striking sever-
al-fold dierences in optics and photonics (Fig.2).
Data
request
cle
Discipline
Journal
Age/ eperiod
Country/conent
No. corr. authors
Open access
Data
complexity
No response
Data declined
Data obtained
Requests &
concerns
Time to
obtain data
Recommenda fordata
sharingpolicies
Data sharing al
Data sharing final
Reminder1
Reminder2 Storage
op ons
Cit ons
Reason for
decline
Fig. 1 Schematic rationale of the study.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
3
SCIENTIFIC DATA | (2021) 8:192 | https://doi.org/10.1038/s41597-021-00981-0
www.nature.com/scientificdata
www.nature.com/scientificdata/
0
20
Data availability (%)
40
60
80 b
**
*
*
**
*
*
*
*
*
**
**
b
b
bcd
cd
bc
cd
a
{
{
{
{
{
{
{
{
{
{
{
{
{
{
{
{
{
d
Ecology
Forestry
Humanities
Materials
for energy
& catalysis
Biomaterials &
biotechnolog
y
Micro-
biology
Optics &
photonics
Psychology
Social
sciences
P1 P2 P1 P2
Science Nature P1 P2 P1 P2
Science Nature P1 P2 P1 P2
Science Nature P1 P2 P1 P2
Science Nature P1 P2 P1 P2
Science Nature P1 P2 P1 P2
Science Nature P1 P2 P1 P2
Science Nature P1 P2 P1 P2
Science Nature P1 P2 P1 P2
Science Nature
Fig. 2 Dierences in partial (grey) and full (black) data availability among disciplines depending on journal and
publishing period (P1, 2000–2009; P2, 2010–2019) before contacting the authors (n = 875). Letters above bars
indicate statistically signicant dierence groups among disciplines in full data availability compared to no data
availability. Asterisks show signicant dierences in full data availability between journals and publishing periods.
Data availability (%) Frequency of critical data types (%)
0
0
20
20
40
40
60
60
80
80
Ecology
Forestry
Humanities
Materials
for energy
& catalysis
Biomaterials &
biotechnology
Micro-
biology
Optics &
photonics
Psychology
Social
sciences
DS Img Mod DS Img Mod DS Img Mod DS Img Mod DS Img Mod DS Img Mod DS Img Mod DS Img Mod DS Img Mod
a
b
Fig. 3 Types of critical data (n = 875). (a) Distribution of data types among disciplines (blue, dataset; purple,
image; black, model); (b) Partial (light shades) and full (dark shades) data availability among disciplines
depending on the type of critical data (DS, dataset; Img, image; Mod, model) before contacting the author(s).
Content courtesy of Springer Nature, terms of use apply. Rights reserved
4
SCIENTIFIC DATA | (2021) 8:192 | https://doi.org/10.1038/s41597-021-00981-0
www.nature.com/scientificdata
www.nature.com/scientificdata/
Upon contacting the authors of 310 papers, the overall data availability was improved by 35.0%. Full and at
least partial availability averaged 69.5% (range across disciplines, 57.0–87.9%) and 83.2% (64.9–100.0%), respec-
tively (Fig.4), aer 60 days since contacting, a reasonable time frame4. e nal data availability (aer contacting
the authors) was best predicted by scientic discipline, data type and time lapse since publishing (no vs. full
availability: n = 580; D = 0.659; R2model = 0.336; Padj < 0.001; TableS2) but with no major changes in the ranking of
disciplines or data types compared with the initial data availability (Fig.4). It took a median of 15 days to receive
data from the authors (Fig.5), with a minimum time of 13 minutes. Four authors sent their data aer the 60-days
period since the initial request (max. 107 days). e rate of receiving data was unrelated to any studied parameter.
Authors’ responses to data requests. e data were obtained from the authors in 39.4% of data requests
on average, with a range of 27.9–56.1% among research elds. e likelihood of receiving data, the request being
declined or ignored depended mostly on the time period and eld of research. According to the best model
(n = 310; D = 0.300; R2model = 0.106; Padj < 0.001; TableS2), the data were obtained slightly less frequently for
the earlier time period (29.4% vs. 56.0%; W = 20.4; β = 0.56 ± 0.12; Padj < 0.001). Receiving data upon request
tended to be lowest in the eld of forestry (W = 3.6; β = −0.31 ± 0.16; Padj = 0.177), especially when compared
with microbiology (Fig.2).
Declining the data request averaged 19.4% and it differed most strongly among the research fields. The
best model (n = 310; D = 0.508; R2model = 0.221; Padj < 0.001) revealed that the data were not made available
Data availability (%)
Ecology Forestry Humanities Materials
for energy
& catalysis
Microbiology Psychology Social sciences
0
20
P1 P2 P1 P2 P1 P2
DS Image Model P1 P2 P1 P2 P1 P2
DS Image Model P1 P2 P1 P2 P1 P2
DS Image Model P1 P2 P1 P2 P1 P2
DS Image Model P1 P2 P1 P2 P1 P2
DS Image Model P1 P2 P1 P2 P1 P2
DS Image Model P1 P2 P1 P2 P1 P2
DS Image Model
*
40
60
80
b
b
b
b
a
b
ab
Fig. 4 Dierences in partial (grey) and full (black) data availability among disciplines aer data requests
(n = 672) depending on the type of critical data (DS, dataset; image; model) and publishing period (P1, 2000–
2009; P2, 2010–2019). Numbers above bars indicate statistically signicant dierence groups among disciplines
in full data availability.
01248163260 128
Data obtained upon request (days)
0
5
10
15
20
25
30
35
40
Number of datasets
Fig. 5 Histogram of time for receiving data from authors upon request within the 60-day reasonable time
period (blue bars) and beyond (purple bar; data excluded from analyses; n = 199 requests). Note the 2-base
logarithmic scale until 60 days.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
5
SCIENTIFIC DATA | (2021) 8:192 | https://doi.org/10.1038/s41597-021-00981-0
www.nature.com/scientificdata
www.nature.com/scientificdata/
upon request most likely in the elds of social sciences (W = 24.3; β = −1.09 ± 0.22; Padj < 0.001), psychology
(W = 20.0; β = −0.73 ± 0.20; Padj < 0.001) and humanities (W = 5.0; β = −0.67 ± 0.30; Padj = 0.078) compared
with natural sciences (Fig.2). Furthermore, the data request was more likely to be declined when the data com-
plexity was high (W = 9.8; β = −0.59 ± 0.19; Padj = 0.005), the paper was not open access in ISI Web of Science
(W = 4.0; β = −0.37 ± 0.18; Padj = 0.132) and published in Science rather than Nature (W = 4.6; β = −0.35 ± 0.16
Padj = 0.096), although these two latter gures are non-signicant when accounting for multiple testing.
We received no response to 41.3% of our data requests, including two biweekly reminders. Responding to the
data request diered most strongly among scientic disciplines and time periods (Fig.6). Altogether 28.9% and
49.0% of requests were ignored by the authors of earlier (2000–2009) and later (2010 to 2019) papers, respec-
tively. According to the best model (n = 310; D = 0.429; R2model = 0.200; Padj < 0.001; TableS2), articles from the
earlier time period (W = 9.3; β = 0.41 ± 0.13; Padj = 0.007) and the elds of forestry (W = 13.4; β = −0.57 ± 0.16;
Padj < 0.001) and ecology (W = 7.0; β = −0.53 ± 0.20; Padj = 0.024) had the greatest likelihood of no response,
whereas social scientists (W = 7.7; β = 0.87 ± 0.31; Padj = 0.016) answered most frequently.
In general, there was no residual eect of time since publication when the publication period was included
in the best model. Within the 2010–2019 period, we specically tested whether the authors publishing in 2019
and 2018 were less likely to share their data because of potential conicting publishing interests. is hypothesis
was not supported and a non-signicant reverse trend was observed as the proportion of data obtained from the
authors increased from 44% in 2010–2017 to 63% in 2018–2019. Accounting for time since publishing across
the entire survey period, the data availability upon request decayed at a rate 5.9% year−1 based on an exponential
model. is estimate was marginally higher than the 3.5% annual loss of publicly available data (Fig.7). e num-
ber of articles was insucient to test dierences in data decay rates among disciplines.
Authors’ concerns and reasons for declining data sharing. Upon contacting the authors, we recorded
and categorised their concerns and requests related to data sharing (n = 188 authors) and their reasons for decline
(n = 65). Altogether 22.9% of authors were concerned about certain aspects of our request (Fig.8). Authors
of non-open access publications (W = 4.6; β = 0.49 ± 0.23; Padj = 0.064) and the eld of humanities (W = 9.7;
β = 1.11 ± 0.36; Padj = 0.004) expressed any types of concerns or requests relatively more oen (TableS2). In par-
ticular, researchers in the elds of humanities (W = 15.2; β = 1.36 ± 0.35; Padj < 0.001), materials for energy and
catalysis (W = 6.4; β = 0.65 ± 0.26; Padj = 0.022) and ecology (W = 5.6; β = 0.81 ± 0.34; Padj = 0.036) were more
concerned about the study’s specic purpose than researchers on average.
Data sharing was declined by 33.0% of the 188 established contacts. When we specically inquired about the
reasons, the lack of time to search for data (29.2%), loss of data (27.7%) and privacy or legal concerns (23.1%)
were most commonly indicated by the authors (Fig.8), whereas no specic answer was provided by 10.8% of
authors. According to the best binomial models (TableS2), social scientists indicated data loss more commonly
than other researchers (W = 10.9; β = 1.04 ± 0.32; Padj = 0.003) and psychologists pointed most commonly to
legal or privacy issues (W = 4.9; β = 0.85 ± 0.38; Padj = 0.078). Data decline due to legal issues became increasingly
important in more recent publications (days since 01.01.2000: W = 7.2; β = 0.07 ± 0.03; Padj = 0.035). e lack of
time to search tended to be more common for older studies (W = 4.0; β = −0.73 ± 0.37; Padj = 0.135).
Data storage options and citations. e ways how the data were released diered greatly among disci-
plines (Fig.9), with most common storage options being the supplementary materials on the publisher’s web-
site (62.2% of articles), various data archives (22.3%) and upon request from corresponding authors (19.7%).
Although 29.8% articles declared depositing data in multiple sources, no source was indicated for 35.0% of arti-
cles. Declaring data availability upon request (n = 172) ranged from 1.0% in psychology to 52.0% in forestry, with
greater frequency in earlier (days back since 31.09.2019: W = 15.0; β = 0.016 ± 0.004; P < 0.001) studies and arti-
cles by non-North American corresponding authors (by primary aliation; W = 5.6; β = 0.23 ± 0.10; P = 0.018).
With a few exceptions (three datasets only commercially available, one removed during nal acceptance and one
0
10
20
30
40
50
60
70
80
Frequency (%)
a
ab
b
m
m
mn
mn
x
xy xy
xy
xy
xy
y
nn
n
b
b
b
ab
Ecology
Forestry
Humanities
Materials
for energy
& catalysi
s
Micro-
biology
Psychology
Social
sciences
Fig. 6 Authors’ response to data request (n = 199) depending on discipline (blue, declined; orange, ignored;
purple, obtained). Bars indicate 95% CI of Sison and Glaz51. Letters above bars indicate statistically signicant
dierence groups in frequency of data availability by each category based on Tukey post-hoc test and Bonferroni
correction.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
6
SCIENTIFIC DATA | (2021) 8:192 | https://doi.org/10.1038/s41597-021-00981-0
www.nature.com/scientificdata
www.nature.com/scientificdata/
homepage corrupt), all data were successfully located for other indicated data sources, but only 42.3% of data
could be obtained from the authors upon request in practice. is rate is comparable to articles with no such
statement (38.3%; Chi-square test: P = 0.501).
e number of citations to articles ranged from 0.0 to 692.9 per year (median, 23.1). In contrast to the hypoth-
esis that articles with available data accumulate more citations20, general linear modelling revealed no signicant
eect of initial or nal data availability on annual citations. e model demonstrated that the average number of
yearly citations was explained by research discipline (F8,855 = 11.2; R2 = 0.105; P < 0.001), data type (F2,855 = 7.0;
Initial availability: y=31.1+e ; R=0.804; P<0.001
Final availability: y=47.1+e ; R=0.820; P<0.001
Upon request availability: y=20.3+e ; R=0.670; P=0.004
0
2000 2002
0.0354
0.0590
0.0480
2004 2006 2008 2010
Year
2012 2014 2016 2018
2020
10
20
30
40
50
60
70
80
90
Data availability (%)
Fig. 7 Decay in critical data availability initially (blue circles; n = 672), at the end of a 60-day contacting period
(purple circles; n = 672) and upon request from the authors (black circles; n = 310).
040 80
Number of requests and concerns
Number of reasons for declining data sharing
120 16
0
a
b
none
purpose
seeing results
citing
privacy
authorship
sharing
04812 16 20
no time to search
data lost
data protected by agreements
not specified
privacy
person moved
purpose unclear
more work in progress
person retired
interpretation problematic
need a good reason
bad experience with sharing data
not shared with strangers
person dead
putting on web in progress
Fig. 8 Frequency distribution of authors’ (a) Concerns and requests (n = 199) and (b) reasons for declining
data sharing (n = 67). White bars indicate answers where no concerns or reasons were specied.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
7
SCIENTIFIC DATA | (2021) 8:192 | https://doi.org/10.1038/s41597-021-00981-0
www.nature.com/scientificdata
www.nature.com/scientificdata/
R2 = 0.016; P < 0.001), open access status (F1,855 = 4.5; R2 = 0.005; P = 0.034) and the interaction term between
open access and discipline (F8,855 = 2.94; R2 = 0.027; P = 0.003). Post-hoc tests indicated that articles with a data-
set as a critical data source were cited on average 6% more than those with an image or model, and open access
articles attracted 9% more citations than regular articles. Because of high variability in citation counts, it was not
possible to test the interaction terms with scientic discipline in the current dataset. We speculate that the articles
in Nature and Science are heavily cited on the basis of their key ndings and interpretations that may mask the few
extra citations raising from re-use of the data.
Discussion
Our study uniquely points to dierences among scientic disciplines in data availability as published along with
the article and upon request from the authors. We demonstrate that in several disciplines such as forestry, mate-
rials for energy and catalysis and psychology, critical data are still unavailable for re-analysis or meta-analysis
for more than half of the papers published in Nature and Science in the last decade. ese overall gures roughly
match those reported for other journals in various research elds8,11,13,22, but exceed the lowest reported val-
ues of around 10% available data13,23,24. Fortunately, data availability tends to improve, albeit slowly, in nearly
all disciplines (Figs.3, 7), which conrms recent implications from psychological and ecological journals13,31.
Furthermore, the reverse trend we observed in microbiology corroborates the declining metagenomics sequence
data availability22. Typically, such large DNA sequence data sets are used to publish tens of articles over many
years by the teams producing these data; hence releasing both raw data and datasets may jeopardise their expec-
tations of priority publishing. e weak discipline-specic dierences among Nature and Science (Fig.2) may be
related to how certain subject editors implemented and enforced stringent data sharing policies.
Aer rigorous attempts to contact the authors, data availability increased by one third on average across dis-
ciplines, with full and at least partial availability reaching 70% and 83%, respectively. ese gures are in the
top end of studies conducted thus far8,22 and indicate the relatively superior overall data availability in Science
and Nature compared with other journals. However, the relative rates of data retrieval upon request, decline
sharing data and ignoring the requests were on par with studies covering other journals and specic research
elds10,12,25,26,28. Across 20 years, we identied the overall loss of data at an estimated rate of 3.5% and 5.9% for
initially available data and data eectively available upon request, respectively. is rate of data decay is much less
than 17% year−1 previously reported in plant and animal sciences based on a comparable approach24.
While the majority of data are eventually available, it is alarming that less than a half of the data clearly stated
to be available upon request could be eectively obtained from the authors. Although there may be objective
reasons such as force majeure, these results suggest that many authors declaring data availability upon contacting
may have abused the publishers’ or funders’ policy that allows statements of data availability upon request as
the only means of data sharing. We nd that this infringes research ethics and disables fair competition among
research groups. Researchers hiding their own data may be in a power position compared with fair players in
situations of big data analysis, when they can access all data (including their own), while others have more limited
opportunities. Data sharing is also important for securing a possibility to re-analyse and re-interpret unexpected
results9,32 and detect scientic misconduct25,33. More rigorous control of data release would prevent manuscripts
with serious issues in sampling design or analytical procedures from being prepared, reviewed and eventually
accepted for publication.
Our study uniquely recorded the authors’ concerns and specic requests when negotiating data sharing.
Concerns and hesitations about data sharing are understandable because of potential drawbacks and misun-
derstandings related to data interpretation and priority of publishing17,34 that may outweigh the benets of rec-
ognition and passive participation in broader meta-studies. Nearly one quarter of researchers expressed various
concerns or had specic requests depending on the discipline, especially about the specic objectives of our
study. Previous studies with questionnaires about hypothetical data sharing unrelated to actual data sharing reveal
0
Relative frequency (%)
10
20
30
40
50
60
Ecology
Forestry
Humanities
Materials
for energy
& catalysis
Biomaterials &
biotechnolog
y
Micro-
biology
Optics &
photonics
Psychology
Social
sciences
Fig. 9 Preferred ways of data storage in articles (n = 875) representing dierent disciplines (blue, text and
supplement; purple, data archive; yellow, authors’ homepage; vermillion, previous publications; grey, museum;
black, upon (reasonable) request; white, none declared.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
8
SCIENTIFIC DATA | (2021) 8:192 | https://doi.org/10.1038/s41597-021-00981-0
www.nature.com/scientificdata
www.nature.com/scientificdata/
that nancial interests, priority of additional publishing and fear of challenging the interpretations aer data
re-analysis constitute the authors’ major concerns12,35,36. Another study indicated that two thirds of researchers
sharing biomedical data expected to be invited as co-authors upon use of their data37 although this does not
full the authorship criteria6,38. At least partly related to these issues, the reasons for declining data sharing dif-
fered among disciplines: while social scientists usually referred to the loss of data, psychologists most commonly
pointed out ethical/legal issues. Recently published data were, however, more commonly declined due to ethical/
legal issues, which indicates rising concerns about data protection and potential misuse. Although we oered a
possibility to share anonymised data sets, such trimmed data sets were never obtained from the authors, sug-
gesting that ethical issues were not the only reason for data decline. Because research elds strongly diered in
the frequency of no response to data requests, most unanswered requests can be considered declines that avoid
ocial replies, which may harm the authors’ reputation.
Because we did not sample randomly across journals, our interpretations are limited to the journals Nature
and Science. Our study across disciplines did not account for the particular academic editor, which may have
partly contributed to the dierences among research elds and journals. Not all combinations of disciplines,
journals and time periods received the intended 25 replicate articles because of the poor representation of cer-
tain research elds in the 2000–2009 period. is may have reduced our ability to detect statistically signicant
dierences among the disciplines. We also obtained estimates for the nal data availability for seven out of nine
disciplines. Although we excluded the remaining two disciplines from comparisons of initial and nal data avail-
ability, it may have slightly altered the overall estimates. e process of screening the potentially relevant articles
chronologically backwards resulted in overrepresentation of more recent articles in certain relatively popular
disciplines, which may have biased comparisons across disciplines. However, the paucity of residual year eect
and year x discipline interaction in overall models and residual time eect in separate analyses within research
elds indicate a minimal bias (FigureS1).
We recorded the concerns and requests of authors that had issues with initial data sharing. erefore, these
responses may be relatively more sceptic than the opinions of the majority of the scientic community publishing
in these journals. It is likely that the authors who did not respond may have concerns and reasons for declining
similar to those who refused data sharing.
Our experience shows that receiving data typically required long email exchanges with the authors, contacting
other referred authors or sending a reminder. Obtaining data took on average 15 days, representing a substantial
eort to both parties39. is could have been easily avoided by releasing data upon article acceptance. On the
other hand, we received tips for analysis, caution against potential pitfalls and the authors’ informed consent upon
contacting. According to our experience, more than two thirds of the authors need to be contacted for retrieving
important metadata, variance estimates or specifying methods for meta-analyses40. us, contacting the authors
may be commonly required to ll gaps in the data, but such extra specications are easier to provide compared
with searching and converting old datasets into a universally understandable format.
Due to various concerns and tedious data re-formatting and uploading, the authors should be better moti-
vated for data sharing41. Data formatting and releasing certainly benets from clear instructions and support
from funders, institutions and publishers. In certain cases, public recognition such as badges of open data for
articles following the best data sharing practices and increasing numbers of citations may promote data release by
an order of magnitude42. Citable data papers are certainly another way forward43,44, because these provide access
to a well-organised dataset and add to the authors’ publication record. Encouraging enlisting published data
sets with download and citation metrics in grant and job applications alongside with other bibliometric indica-
tors should promote data sharing. Relating released data in publicly available research accounts such as ORCID,
ResearcherID and Google Scholar would benet both authors, other researchers and evaluators. To account for
many authors’ fear of data the17 and to prioritise the publishing options of data owners, setting a reasonable
embargo period for third-party publishing may be needed in specic cases such as immediate data release follow-
ing data generation45 and dissertations.
All funders, research institutions, researchers, editors and publishers should collectively contribute to turn
data sharing into a win-win situation for all parties and the scientic endeavour in general. Funding agencies may
have a key role here due to the lack of conicting interests and a possibility of exclusive allocation to depositing
and publishing huge data les46. Funders have ecient enforcing mechanisms during reports periods, with an
option to refuse extensions or approving forthcoming grant applications. We advocate that funders should include
published data sets, if relevant, as an evaluation criterion besides other bibliometric information. Research insti-
tutions may follow the same principles when issuing institutional grants and employing research sta. Institutions
should also insist their employees on following open data policies45.
Academic publishers also have a major role in shaping data sharing policies. Although deposition and main-
tenance of data incur extra costs to commercial publishers, they should promote data deposition in their servers
or public repositories. An option is to hire specic data editors for evaluating data availability in supplementary
materials or online repositories and refusing nal publishing before the data are fully available in a relevant for-
mat47. For ecient handling, clear instructions and a machine-readable data availability statement option (with
a QR code or link to the data) should be provided. In non-open access journals, the data should be accessible
free of charge or at reduced price to unsubscribed users. Creating specic data journals or ‘data paper’ formats
may promote publishing and sharing data that would otherwise pile up in the drawer because of disappointing
results or the lack of time for preparing a regular article. e leading scientometrics platforms Clarivate Analytics,
Google Scholar and Scopus should index data journals equally with regular journals to motivate researchers
publishing their data. ere should be a possibility of article withdrawal by the publisher, if the data availability
statements are incorrect or the data have been removed post-acceptance30. Much of the workload should stay on
the editors who are paid by the supporting association, institution or publisher in most cases. e editors should
grant the referees access to these data during the reviewing process48, requesting them a second opinion about
Content courtesy of Springer Nature, terms of use apply. Rights reserved
9
SCIENTIFIC DATA | (2021) 8:192 | https://doi.org/10.1038/s41597-021-00981-0
www.nature.com/scientificdata
www.nature.com/scientificdata/
data availability and reasons for declining to do so. Similar stringent data sharing policies are increasingly imple-
mented by various journals26,30,47.
In conclusion, data availability in top scientic journals diers strongly by discipline, but it is improving in
most research elds. As our study exemplies, the ‘data availability upon request’ model is insucient to ensure
access to datasets and other critical materials. Considering the overall data availability patterns, authors’ concerns
and reasons for declining data sharing, we advocate that (a) data releasing costs ought to be covered by funders;
(b) shared data and the associated bibliometric records should be included in the evaluation of job and grant
applications; and (c) data sharing enforcement should be led by both funding agencies and academic publishers.
Materials and Methods
Data collection. To assess dierences in data availability in dierent research disciplines, we focused our
study on Nature and Science, two high-impact, general-interest journals that practise relatively stringent data
availability policies49. Because of major changes in the public attitude and journals’ policies about data shar-
ing, our survey was focused on two study periods, 2000–2009 and 2010–2019. We selected nine scientic dis-
ciplines as dened by the Springer Nature publishing group - biomaterials and biotechnology, ecology, forestry,
humanities, materials for energy and catalysis, microbiology, optics and photonics, psychology and social
sciences (see TableS1 for details) - for analysis based on their coverage in Nature and Science journals and data-
driven research. ese nine disciplines were selected based on the competence of our team and the objective
to cover as dierent research elds as possible including natural sciences, social sciences and humanities. e
articles were searched by discipline, keywords and/or manual browsing as follows. For Nature, our search was
rened as https://www.nature.com/search?order=date_desc&journal=nature&article_type=research&sub-
ject=microbiology&date_range=2010-2019 (italicised parts varied). For Science, the corresponding search
string was the following: https://search.sciencemag.org/?searchTerm=microbiology&order=newest&limit=−
textFields&pageSize=10&startDate=2010-01-01&endDate=2019-08-31&articleTypes=Research%20and%20
reviews&source=sciencemag%7CScience. In both journals, the articles were retrieved by browsing search results
chronologically backwards since September 2019 or September 2009 until reaching 25 articles matching the cri-
teria. When the number of suitable articles was insucient, we searched by using additional discipline-specic
keywords in the title and browsed all issues manually when necessary. In some research elds, 25 articles could
not be found for all journal and time period combinations and therefore, data availability was evaluated for 875
articles in total (TableS1). In each article, we identied a specic analysis or result that was critical for the main
conclusion of that study based on both the authors’ emphasis and our subjective assessment. We determined
whether the underlying data of these critical results - datasets, images (including videos), models (including
scripts, programs and analytical procedures) or physical items - are available in the main text, supplementary
materials or other indicated sources such as specic data repositories, authors’ homepages, museums, or upon
request to the corresponding author (FigureS2). When available, we downloaded these data, checked for relevant
metadata, identiers and other components, and evaluated whether it is theoretically possible to repeat these
specic analyses and include these materials in a eld-specic metastudy. For example, in the case of a dataset, we
evaluated the data table for the presence of relevant metadata and sample codes necessary to perform the analysis;
for any statistical procedure, the authors must have used such a data table in their original work. We considered
the data to be too raw if these either required a large amount of work (other than common data transformations)
to generate the data table or model, or we had doubts whether the same data table can be reproduced with the
methods described. Raw high-throughput sequencing data are typical examples of incomplete datasets, because
these usually lack necessary metadata and require a thorough bioinformatics analysis, with the output depending
on soware and selected options. For further examples, certain optical raw images or videos make no sense with-
out expert ltering, and computer scripts are of limited use without thorough instructions.
If these critical data were unavailable or only partly available (i.e., missing some integral metadata, instructions
or explanations), we contacted the rst corresponding author or a relevant author referred in relation to access
to the specic item, requesting the data for a meta-study by using a pre-dened format and an institutional email
address (Item S1). In the email, we carefully specied the materials required to produce a particular gure or table
to avoid confusion and upsetting the authors with a messy request. We indicated that the data are intended for a
metastudy in a related topic to test the authors’ willingness to share the data for actual use, not just their intention
to share for no reasonable purpose. We similarly evaluated the received data for integrity and requested further
information, if necessary, to meet the standards. We also recorded the responses of corresponding authors to data
requests, including any specic requests or concerns and reasons for declining (Item S1).
e authors were mostly contacted early in the week and two reminders were sent ca. 14 and 28 days later if
necessary (Item S1). e reminders were also addressed to other corresponding authors if relevant. If emails were
returned with an error message, we contacted other corresponding authors or used an updated email address
found from the internet or newer publications. We considered 60 days from sending the rst email a reasonable
time period for the authors to locate and send the requested data4.
For each article, we recorded the details of publishing (date printed, journal, discipline), corresponding
authors (number, country of rst aliation, acquaintance to the contact author) and data (availability, type, ways
of access)50. Data complexity was evaluated based on the authors’ relative amount of extra work to polish the
raw data (e.g. low-complexity data include raw DNA sequence data, raw images, artefacts; high-complexity data
include bioinformatics-treated molecular data sets, noise-removed images, models and scripts). As of 23.03.2020,
we recorded the open access status and number of citations for each article using searches in the ISI Web of
Science (https://apps.webonowledge.com/). e citation count was expressed as citations per year, discounting
the rst 90 days with initially less citations.
Content courtesy of Springer Nature, terms of use apply. Rights reserved
10
SCIENTIFIC DATA | (2021) 8:192 | https://doi.org/10.1038/s41597-021-00981-0
www.nature.com/scientificdata
www.nature.com/scientificdata/
Data analysis. e principal aim of this study was to determine the relative importance of scientic discipline
and time period on data availability and authors’ concerns in response to data sharing requests, by accounting
for multiple potentially important covariates (Fig.1). e response variables, i.e. initial and nal data availability
(none, partly or fully available), author’s responses (ignored, data shared or declined), concerns and reasons
for decline, exhibit multinomial distribution50 and were hence transformed to dummy variables. Similarly, the
multi-level explanatory variables (discipline, topic overlap, countries and continents of corresponding authors,
data type and complexity) were transformed to dummies, whereas continuous variables (linear time, number of
citations, time to obtain data, number of corresponding authors) were square root- or logarithm-transformed
where appropriate. All analyses were performed in STATISTICA 12 (StatSo Inc., Tulsa, OK, USA).
Data analysis of the dummy-transformed multinomial and binomial variables was performed using stepwise
logistic regression model selection with a binomial link function using corrected Akaike information criterion
(AICc) as a selection criterion, and Somers’ D statistic and model determination coecients (R2) as measures of
overall goodness of t. Determination coecients and Wald’s W statistic were used to estimate the relative impor-
tance of explanatory variables. We calculated 95% condence intervals for multiple proportions51 using the R
package multinomialCI (https://rdrr.io/cran/MultinomialCI/). Increasing false discovery rates related to multiple
comparisons were accounted for by using Bonferroni correction of P-values (expressed as Padj) where appropriate.
Models with continuous response variables (proportion of available data, annual citations, time to receive
data) were tested using general linear models in two steps. First, the model selection included only dummy and
continuous explanatory variables. Multilevel categorical predictors corresponding to signicant dummies as well
as signicant continuous variables were included in the nal model selection as based on forward selection. To
check for potential biases related to the article selection procedure in both periods, we tested the eect of disci-
pline, period and year and all their interaction terms on initial data availability by retaining all variables in the
model (FigureS1). Dierences in these factor levels were tested using Tukey post-hoc tests for unequal sample
size, which accounts for multiple testing issues.
Data availability
e entire dataset is available as in a spreadsheet format in plutoF data repository50.
Code availability
No specic code was generated for analysis of these data.
Received: 11 December 2020; Accepted: 29 June 2021;
Published: xx xx xxxx
References
1. Fan, J. et al. Challenges of big data analysis. Nat. Sci. Rev. 1, 293–314 (2014).
2. itchin, . e data revolution: Big data, open data, data infrastructures and their consequences. (Sage Publications, London, 2014).
3. Science Sta. Challenges and opportunities. Science 331, 692–693 (2011).
4. Cech, T. . et al. Sharing publication-related data and materials: responsibilities of authorship in the life sciences. National
Academies Press, Washington, D.C. (2003).
5. Fischer, B. A. & Zigmond, M. J. e essential nature of sharing in science. Sci. Engineer. Ethics 16, 783–799 (2010).
6. Due, C. S. & Porter, H. H. e ethics of data sharing and reuse in biology. BioScience 63, 483–489 (2013).
7. Fienberg, S. E. et al. Sharing esearch Data. National Academy Press, Washington, D.C. (1985).
8. Begley, C. G. & Ioannidis, J. P. eproducibility in science: improving the standard for basic and preclinical research. Circul. Res. 116,
116–126 (2015).
9. Open S cience Collaboration. Estimating the reproducibility of psychological science. Science 349, aac4716 (2015).
10. Hardwice, T. E. & Ioannidis, J. P. Populating the Data Ar: An attempt to retrieve, preserve, and liberate data from the most highly-
cited psychology and psychiatry articles. PLoS One 13, e0201856 (2018).
11. oche, D. G. et al. Public data archiving in ecology and evolution: how well are we doing? PLoS Biol. 13, e1002295 (2015).
12. Tenopir, C. et al. Changes in data sharing and data reuse practices and perceptions among scientists worldwide. PLoS One 10,
e0134826 (2015).
13. Hardwice, T. E. et al. Data availability, reusability, and analytic reproducibility: Evaluating the impact of a mandatory open data
policy at the journal Cognition. R. Soc. Open Sci. 5, 180448 (2018).
14. Witwer, . W. Data submission and quality in microarray-based microNA proling. Clin. Chem. 59, 392–400 (2013).
15. Stuart, D. et al. Whitepaper: Practical challenges for researchers in data sharing. gshare https://doi.org/10.6084/m9.gshare.5975011
(2018).
16. Borgman, C.L. Scholarship in the digital age: Information, infrastructure, and the Internet. MIT press, Cambridge (2010).
17. Longo, D. L. & Drazen, J. M. Data sharing. New England J. Med. 375, 276–277 (2016).
18. Lewandowsy, S. & Bishop, D. esearch integrity: Don’t let transparency damage science. Nature 529, 459–461 (2016).
19. Colavizza, G. et al. e citation advantage of lining publications to research data. PLoS One 15, e0230416 (2020).
20. Piwowar, H. A. et al. Sharing detailed research data is associated with increased citation rate. PLoS One 2, e308 (2007).
21. Hics, D. et al. Bibliometrics: the Leiden Manifesto for research metrics. Nature 520, 429–431 (2015).
22. Ecert, E. M. et al. Every h published metagenome is not available to science. PLoS Biol. 18, e3000698 (2020).
23. Sherry, C. et al. Assessment of transparent and reproducible research practices in the psychiatry literature. Preprint at https://osf.io/
jtcr/download (2019).
24. Vines, T. H. et al. e availability of research data declines rapidly with article age. Curr. Biol. 24, 94–97 (2014).
25. Wicherts, J. M. et al. e poor availability of psychological research data for reanalysis. Am. Psychol. 61, 726–728 (2006).
26. Vines, T. H. et al. Mandated data archiving greatly improves access to research data. FASEB J. 27, 1304–1308 (2013).
27. rawczy, M. & euben, E. (Un)available upon request: Field experiment on researchers’ willingness to share supplementary
materials. Account. Res. 19, 175–186 (2012).
28. Vanpaemel, W. et al. Are we wasting a good crisis? e availability of psychological research data aer the storm. Collabra 1, 1–5
(2015).
29. Grant, . & Hrynasziewicz, I. e impact on authors and editors of introducing data availability statements at Nature journals. Int.
J. Digit. Curat. 13, 195–203 (2018).
30. Hrynasziewicz, I. et al. Developing a research data policy framewor for all journals and publishers. Data Sci. J. 19, 5 (2020).
Content courtesy of Springer Nature, terms of use apply. Rights reserved
11
SCIENTIFIC DATA | (2021) 8:192 | https://doi.org/10.1038/s41597-021-00981-0
www.nature.com/scientificdata
www.nature.com/scientificdata/
31. Wallach, J. D. et al. eproducible research practices, transparency, and open access data in the biomedical literature, 2015–2017.
PLoS Biol. 16, e2006930 (2018).
32. raus, W. L. Do you see what I see? Quality, reliability, and reproducibility in biomedical research. Mol. Endocrinol. 28, 277–280
(2014).
33. Wicherts, J. M. et al. Willingness to share research data is related to the strength of the evidence and the quality of reporting of
statistical results. PLoS One 6, e26828 (2011).
34. Wallis, J. C., olando, E. & Borgman, C. L. If we share data, will anyone use them? Data sharing and reuse in the long tail of science
and technology. PLoS One 8, e67332 (2013).
35. Blumenthal, D. et al. Withholding research results in academic life science. JAMA 277, 1224–1228 (1997).
36. im, Y. & Stanton, J. M. Institutional and individual inuences on scientists’ data sharing practices. J. Comput. Sci. Edu. 3, 47–56
(2013).
37. Federer, L. M. et al. Biomedical data sharing and reuse: Attitudes and practices of clinical and scientic research sta. PLoS One 10,
e0129506 (2015).
38. Patience, G. S. et al. Intellectual contributions meriting authorship: Survey results from the top cited authors across all science
categories. PLoS One 14, e0198117 (2019).
39. Vol, C., Lucero, Y. & Barnas, . Why is data sharing in collaborative natural resource eorts so hard and what can we do to improve
it? Environ. Manage. 53, 883–893 (2014).
40. Tedersoo, L. et al. Towards global patterns in the diversity and community structure of ectomycorrhizal fungi. Mol. Ecol. 21,
4160–4170 (2012).
41. eichman, O. J. et al. Challenges and opportunities of open data in ecology. Science 331, 703–705 (2011).
42. idwell, M. C. et al. Badges to acnowledge open practices: A simple, low-cost, eective method for increasing transparency. PLoS
Biol. 14, e1002456 (2016).
43. Candela, L., Castelli, D., Manghi, P. & Tani, A. Data journals: a survey. J. Ass. Inform. Sci. Technol. 66, 1747–1762 (2015).
44. Callaghan, S. et al. Maing data a rst class scientic output: data citation and publication by NEC’s Environmental Data Centres.
Int. J. Digit. Curat. 7, 107–113 (2012).
45. Dye, S. O. & Hubbard, T. J. Developing and implementing an institute-wide data sharing policy. Genome Med. 3, 1–8 (2011).
46. Heidorn, P. B. Shedding light on the dar data in the long tail of science. Libr. Trends 57, 280–299 (2008).
47. Langille, M. G. et al. “Available upon request”: not good enough for microbiome data! Microbiome 6, 8 (2018).
48. Morey, . D. et al. e Peer eviewers’ Openness Initiative: incentivizing open research practices through peer review. R. Soc. Open
Sci. 3, 150547 (2016).
49. Alsheih-Ali, A. A. et al. Public availability of published research data in high-impact journals. PLoS One 6, e24357 (2011).
50. Tedersoo, L. et al. Data sharing across disciplines:’available upon request’ holds no promise. University of Tartu; Institute of Ecology
and Earth Sciences https://doi.org/10.15156/BIO/1359426 (2021).
51. Sison, C. P. & Glaz, J. Simultaneous condence intervals and sample size determination for multinomial proportions. J. Am. Stat. Ass.
90, 366–369 (1995).
Acknowledgements
We thank all authors who released their data along with their article or responded to our data request. Although
some of the obtained datasets are used in a series of meta-analyses or released by us upon agreement, we apologise
to the authors who spent a signicant amount of time to provide the data, which we cannot use for secondary
analyses. We thank A. Kahru, T. Soomere, Ü. Niinemets and J. Allik for their constructive comments on an earlier
version of the manuscript.
Author contributions
All authors contributed to study design, work with literature and writing. L.T. analysed data and led writing.
Competing interests
e authors declare no competing interests.
Additional information
Supplementary information e online version contains supplementary material available at https://doi.
org/10.1038/s41597-021-00981-0.
Correspondence and requests for materials should be addressed to L.T.
Reprints and permissions information is available at www.nature.com/reprints.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and
institutional aliations.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International
License, which permits use, sharing, adaptation, distribution and reproduction in any medium or
format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Cre-
ative Commons license, and indicate if changes were made. e images or other third party material in this
article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the
material. If material is not included in the article’s Creative Commons license and your intended use is not per-
mitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the
copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
© e Author(s) 2021
Content courtesy of Springer Nature, terms of use apply. Rights reserved
1.
2.
3.
4.
5.
6.
Terms and Conditions
Springer Nature journal content, brought to you courtesy of Springer Nature Customer Service Center GmbH (“Springer Nature”).
Springer Nature supports a reasonable amount of sharing of research papers by authors, subscribers and authorised users (“Users”), for small-
scale personal, non-commercial use provided that all copyright, trade and service marks and other proprietary notices are maintained. By
accessing, sharing, receiving or otherwise using the Springer Nature journal content you agree to these terms of use (“Terms”). For these
purposes, Springer Nature considers academic use (by researchers and students) to be non-commercial.
These Terms are supplementary and will apply in addition to any applicable website terms and conditions, a relevant site licence or a personal
subscription. These Terms will prevail over any conflict or ambiguity with regards to the relevant terms, a site licence or a personal subscription
(to the extent of the conflict or ambiguity only). For Creative Commons-licensed articles, the terms of the Creative Commons license used will
apply.
We collect and use personal data to provide access to the Springer Nature journal content. We may also use these personal data internally within
ResearchGate and Springer Nature and as agreed share it, in an anonymised way, for purposes of tracking, analysis and reporting. We will not
otherwise disclose your personal data outside the ResearchGate or the Springer Nature group of companies unless we have your permission as
detailed in the Privacy Policy.
While Users may use the Springer Nature journal content for small scale, personal non-commercial use, it is important to note that Users may
not:
use such content for the purpose of providing other users with access on a regular or large scale basis or as a means to circumvent access
control;
use such content where to do so would be considered a criminal or statutory offence in any jurisdiction, or gives rise to civil liability, or is
otherwise unlawful;
falsely or misleadingly imply or suggest endorsement, approval , sponsorship, or association unless explicitly agreed to by Springer Nature in
writing;
use bots or other automated methods to access the content or redirect messages
override any security feature or exclusionary protocol; or
share the content in order to create substitute for Springer Nature products or services or a systematic database of Springer Nature journal
content.
In line with the restriction against commercial use, Springer Nature does not permit the creation of a product or service that creates revenue,
royalties, rent or income from our content or its inclusion as part of a paid for service or for other commercial gain. Springer Nature journal
content cannot be used for inter-library loans and librarians may not upload Springer Nature journal content on a large scale into their, or any
other, institutional repository.
These terms of use are reviewed regularly and may be amended at any time. Springer Nature is not obligated to publish any information or
content on this website and may remove it or features or functionality at our sole discretion, at any time with or without notice. Springer Nature
may revoke this licence to you at any time and remove access to any copies of the Springer Nature journal content which have been saved.
To the fullest extent permitted by law, Springer Nature makes no warranties, representations or guarantees to Users, either express or implied
with respect to the Springer nature journal content and all parties disclaim and waive any implied warranties or warranties imposed by law,
including merchantability or fitness for any particular purpose.
Please note that these rights do not automatically extend to content, data or other material published by Springer Nature that may be licensed
from third parties.
If you would like to use or distribute our Springer Nature journal content to a wider audience or on a regular basis or in any other manner not
expressly permitted by these Terms, please contact Springer Nature at
onlineservice@springernature.com
Available via license: CC BY 4.0
Content may be subject to copyright.
Content uploaded by Leho Tedersoo
Author content
All content in this area was uploaded by Leho Tedersoo on Aug 02, 2021
Content may be subject to copyright.