A decade of empowering biodiversity science: celebrating 10 years of Biodiversity Data Journal

Together, we have redefined scientific communication, and we will continue to push the boundaries of knowledge.

Today, 16 September 2023, we are celebrating our tenth anniversary: an important milestone that has prompted us to reflect on the incredible journey that Biodiversity Data Journal (BDJ) has been through.

From the very beginning, our mission was clear: to revolutionise the way biodiversity data is shared, accessed, and harnessed. This journey has been one of innovation, collaboration, and a relentless commitment to making biodiversity data FAIR – Findable, Accessible, Interoperable, and Reusable.

Over the past 10 years, BDJ, under the auspices of our esteemed publisher Pensoft, has emerged as a trailblazing force in biodiversity science. Our open-access platform has empowered researchers from around the world to publish comprehensive papers that seamlessly blend text with morphological descriptions, occurrences, data tables, and more. This holistic approach has enriched the depth of research articles and contributed to the creation of an interconnected web of biodiversity information.

In addition, by utilising ARPHA Writing Tool and ARPHA Platform as our entirely online manuscript authoring and submission interface, we have simplified the integration of structured data and narrative, reinforcing our commitment to simplifying the research process.

One of our most significant achievements is democratising access to biodiversity data. By dismantling access barriers, we have catalysed the emergence of novel research directions, equipping scientists with the tools to combat critical global challenges such as biodiversity loss, habitat degradation, and climate fluctuations.

We firmly believe that data should be openly accessible to all, fostering collaboration and accelerating scientific discovery. By upholding the FAIR principles, we ensure that the datasets accompanying our articles are not only discoverable and accessible, but also easy to integrate and reusable across diverse fields.

As we reflect on the past decade, we are invigorated by the boundless prospects on the horizon. We will continue working on to steer the global research community towards a future where biodiversity data is open, accessible, and harnessed to tackle global challenges.

Ten years of biodiversity research

To celebrate our anniversary, we have curated some of our most interesting and memorable BDJ studies from the past decade.

  • Recently, news outlets were quick to cover a new species of ‘snug’ published in our journal.
  • This Golden Retriever trained to monitor hermit beetle larvae proved once again the incredible capabilities of our canine friends.
Teseo, the Golden Retriever monitoring hermit beetle larvae
  • Who could forget this tiny fly named after the former Governor of California?
  • Or this snail named after climate activist Greta Thunberg?
Craspedotropis gretathunbergae

New discoveries are always exciting, but some of our favourite research focuses on formerly lost species, back where they belong.

  • Like the griffon vulture, successfully reintroduced to Bulgaria after fifty years.

Citizen science has shown time and time again that it holds an important position in biodiversity research.

  • This group, for example, who found a beetle the size of a pinhead in Borneo.
“Life Beneath the Ice”, a short musical film about light and life beneath the Antarctic sea-ice by Dr. Emiliano Cimoli

We extend our heartfelt gratitude to our authors, reviewers, readers, and the entire biodiversity science community for being integral parts of this transformative journey. Together, we have redefined scientific communication, and we will continue to push the boundaries of knowledge.

Follow BDJ on social media:

Nanopublications tailored to biodiversity data

Novel nanopublication workflows and templates for associations between organisms, taxa and their environment are the latest outcome of the collaboration between Knowledge Pixels and Pensoft.

First off, why nanopublications?

Nanopublications complement human-created narratives of scientific knowledge with elementary, machine-actionable, simple and straightforward scientific statements that prompt sharing, finding, accessibility, citability and interoperability. 

By making it easier to trace individual findings back to their origin and/or follow-up updates, nanopublications also help to better understand the provenance of scientific data. 

With the nanopublication format and workflow, authors make sure that key scientific statements – the ones underpinning their research work – are efficiently communicated in both human-readable and machine-actionable manner in line with FAIR principles. Thus, their contributions to science are better prepared for a reality driven by AI technology.

The machine-actionability of nanopublications is a standard due to each assertion comprising a subject, an object and a predicate (type of relation between the subject and the object), complemented by provenance, authorship and publication information. A unique feature here is that each of the elements is linked to an online resource, such as a controlled vocabulary, ontology or standards. 

Now, what’s new?

As a result of the partnership between high-tech startup Knowledge Pixels and open-access scholarly publisher and technology provider Pensoft, authors in Biodiversity Data Journal (BDJ) can make use of three types of nanopublications:

  1. Nanopublications associated with a manuscript submitted to BDJ. This workflow lets authors add a Nanopublications section within their manuscript while preparing their submission in the ARPHA Writing Tool (AWT). Basically, authors ‘highlight’ and ‘export’ key points from their papers as nanopublications to further ensure the FAIRness of the most important findings from their publications.
  1. Standalone nanopublication related to any scientific publication, regardless of its author or source. This can be done via the Nanopublications page accessible from the BDJ website. The main advantage of standalone nanopublication is that straightforward scientific statements become available and FAIR early on, and remain ready to be added to a future scholarly paper.
  1. Nanopublications as annotations to existing scientific publications. This feature is available from several journals published on the ARPHA Platform, including BDJ. By attaching an annotation to the entire paper (via the Nanopublication tab) or a text selection (by first adding an inline comment, then exporting it as a nanopublication), a reader can evaluate and record an opinion about any article using a simple template based on the Citation Typing Ontology (CiTO).

Nanopublications for biodiversity data?

At Biodiversity Data Journal (BDJ), authors can now incorporate nanopublications within their manuscripts to future-proof the most important assertions on biological taxa and organisms or statements about associations of taxa or organisms and their environments

On top of being shared and archived by means of a traditional research publication in an open-access peer-reviewed journal, scientific statements using the nanopublication format will also remain ‘at the fingertips’ of automated tools that may be the next to come looking for this information, while mining the Web.

Using the nanopublication workflows and templates available at BDJ, biodiversity researchers can share assertions, such as:

So far, the available biodiversity nanopublication templates cover a range of associations, including those between taxa and individual organisms, as well as between those and their environments and nucleotide sequences. 

Nanopublication template customised for biodiversity research publications available from Nanodash.

As a result, those easy-to-digest ‘pixels of knowledge’ can capture and disseminate information about single observations, as well as higher taxonomic ranks. 

The novel domain-specific publication format was launched as part of the collaboration between Knowledge Pixels – an innovative startup tech company aiming to revolutionise scientific publishing and knowledge sharing and the open-access scholarly publisher Pensoft.

… so, what exactly is a nanopublication?

General structure of a nanopublication:

“the smallest unit of publishable information”,

as explained on nanopub.net.

Basically, a nanopublication – unlike a research article – is a tiny snippet of a precise and structured scientific finding (e.g. medication X treats disease Y), which exists as a reusable and cite-able pieces of a growing knowledge graph stored on a decentralised server network in a format that it is readable for humans, but also “understandable” and actionable for computers and their algorithms.

These semantic statements expressed in community-agreed terms, openly available through links to controlled vocabularies, ontologies and standards, are not only freely accessible to everyone in both human-readable and machine-actionable formats, but also easy-to-digest for computer algorithms and AI-powered assistants.

In short, nanopublications allow us to browse and aggregate such findings as part of a complex scientific knowledge graph. Therefore, nanopublications bring us one step closer to the next revolution in scientific publishing, which started with the emergence and increasing adoption of knowledge graphs. 

“As pioneers in the semantic open access scientific publishing field for over a decade now, we at Pensoft are deeply engaged with making research work actually available at anyone’s fingertips. What once started as breaking down paywalls to research articles and adding the right hyperlinks in the right places, is time to be built upon,”

said Prof. Lyubomir Penev, founder and CEO at Pensoft, which had published the very first semantically enhanced research article in the biodiversity domain back in 2010 in the ZooKeys journal.

Why are nanopublications necessary?

By letting computer algorithms access published research findings in a structured format, nanopublications allow for the knowledge snippets that they are intended to communicate to be fully understandable and actionable. With nanopublications, each of those fragments of scientific information is interconnected and traceable back to its author(s) and scientific evidence. 

A nanopublication is a tiny snippet of a precise and structured scientific finding (e.g. medication X treats disease Y), which exists within a growing knowledge graph stored on a decentralised server network in a format that it is readable for humans, but also “understandable” and actionable for computers and their algorithms. Illustration by Knowledge Pixels. 

By building on shared knowledge representation models, these data become Interoperable (as in the I in FAIR), so that they can be delivered to the right user, at the right time, in the right place , ready to be reused (as per the R in FAIR) in new contexts. 

Another issue nanopublications are designed to address is research scrutiny. Today, scientific publications are produced at an unprecedented rate that is unlikely to cease in the years to come, as scholarship embraces the dissemination of early research outputs, including preprints, accepted manuscripts and non-conventional papers.

A network of interlinked nanopublications could also provide a valuable forum for scientists to test, compare, complement and build on each other’s results and approaches to a common scientific problem, while retaining the record of their cooperation each step along the way. 

*** 

We encourage you to try the nanopublications workflow yourself when submitting your next biodiversity paper to Biodiversity Data Journal

Community feedback on this pilot project and suggestions for additional biodiversity-related nanopublication templates are very welcome!

This Nanopublications for biodiversity workflow was created with a partial support of the European Union’s Horizon 2020 BiCIKL project under grant agreement No 101007492 and in collaboration with Knowledge Pixels AG.The tool uses data and API services of ChecklistBank, Catalogue of Life, GBIF, GenBank/ENA, BOLD, Darwin Core, Environmental Ontology (ENVO), Relation Ontology (RO), NOMEN, ZooBank, Index Fungorum, MycoBank, IPNI, TreatmentBank, and other resources. 

*** 

On the journal website: https://bdj.pensoft.net/, you can find more about the unique features and workflows provided by the Biodiversity Data Journal (BDJ), including innovative research paper formats (e.g. Data Paper, OMICS Data Paper, Software Description, R Package, Species Conservation Profiles, Alien Species Profile), expert-provided data audit for each data paper submission, automated data export and more.

Don’t forget to also sign up for the BDJ newsletter via the Email alert form on the journal’s homepage and follow it on Twitter and Facebook.

***

Earlier this year, Knowledge Pixels and Pensoft presented several routes for readers and researchers to contribute to research outputs – either produced by themselves or by others – through nanopublications generated through and visualised in Pensoft’s cross-disciplinary Research Ideas and Outcomes (RIO) journal, which uses the same nanopublication workflows.

New way to browse interlinked biodiversity data: Biodiversity Knowledge Hub NOW ONLINE!

The Biodiversity Knowledge Hub is a one-stop portal that allows users to access FAIR and interlinked biodiversity data and services in a few clicks.

The Horizon 2020 BiCIKL Project is proud to announce that the Biodiversity Knowledge Hub (BKH) is now online.

BKH is a one-stop portal that allows users to access FAIR and interlinked biodiversity data and services in a few clicks. BKH was designed to support a new emerging community of users over time and across the entire biodiversity research cycle providing its services to anybody, anywhere and anytime.

The Knowledge Hub is the main product from our BiCIKL consortium, and we are delighted with the result!

BKH can easily be seen as the beginning of the major shift in the way we search interlinked biodiversity information.”

Biodiversity researchers, research infrastructures and publishers interested in fields ranging from taxonomy to ecology and bioinformatics can now freely use BKH as a compass to navigate the oceans of biodiversity data. BKH will do the linkages.

says Prof. Lyubomir Penev, BiCIKL’s Project coordinator and Founder of Pensoft Publishers
The BKH is designed to serve a new emerging community of users over time and across the entire biodiversity research cycle. 

We have invested our best energies and resources in the development of BKH and the Fair Data Place (FDP), which is the beating heart of the portal,”

BKH has been designed to support a new emerging community of users across the entire biodiversity research cycle.

Its purpose goes beyond the BiCIKL project itself: we are thrilled to say that BKH is meant to stay, aiming to reshape the way biodiversity knowledge is accessed and used.

says Dr Christos Arvanitidis, CEO of LifeWatch ERIC.

The BKH outlines how users can navigate and access the linked data, tools and services of the infrastructures cooperating in BiCIKL.

By revealing how they harvest, liberate and reuse data, these increasingly integrated sources enable researchers in the natural sciences to move more seamlessly between specimens and material samples, genomic and metagenomic data, scientific literature, and taxonomic names and units.

said Dr Joe Miller, Executive Secretary of GBIF—the Global Biodiversity Information Facility.

A training programme on how to best utilise the platform is currently being developed by the Consortium of European Taxonomic Facilities (CETAF), Pensoft PublishersPlaziMeise Botanic GardenEMBL’s European Bioinformatics Institute (EMBL-EBI), ELIXIR HubGBIF – the Global Biodiversity Information Facility, and LifeWatch ERIC and will be finalised in the coming months.

***

A detailed description of the BKH tools and services provided by its contributing organisations is available at: https://biodiversityknowledgehub.eu.

***

Find more information about the BiCIKL consortium partners on the project’s website.

***

Follow BiCIKL Project on Twitter and Facebook. Join the conversation on Twitter at #BiCIKL_H2020.

Eye for Detail: Papers in Pensoft journals sport a new look

As behaviours and needs of readers change, we strive to keep up with the times. Let’s run through what & why has changed to the PDF format.

Readers at some of the journals published by Pensoft, who have downloaded/printed a publication or ordered a physical copy of a journal issue over the last few weeks, might be in for a surprise concerning the layout of the PDF format of the articles. 

Research papers published in ZooKeys demonstrating the former (left) and the current (right) article layout seen in the PDF format. 

Even though it’s been years since online publishing has become the norm in how we are consuming information – including scientific publications – we understand that academia is still very much fond of traditional, often paper-based, article layout format: the one you use when accessing a PDF file or a print copy, rather than directly scrolling down through the HTML version of the article. 

Even if today large orders of printed volumes from overseas are the exception, rather than the rule, we know we have readers of ours who regularly print manuscripts at home or savе them on their devices. Trends like this have already led to many journals first abandoning the physical- for digital-first, then transitioning to digital-only publication format.

Meanwhile, it is true that needs and demands have fundamentally changed in recent times. 

As we speak, readers are accessing PDF files from much higher-quality desktops, in order to skim through as much content as possible. 

In the meantime, authors are relying on greater-quality cameras to document their discoveries, while using advanced computational tools capable of generating and analysing extra layers of precise data. While producing more exhaustive research, however, it is also of key importance that their manuscripts are processed and published as rapidly as possible.

So, let’s run through the updates and give you our reasoning for their added value to readers and authors.

Revised opening page

One of the major changes is the one to the format of the first page. By leaving some blank space on the left, we found a dedicated place for important article metadata, i.e. academic editor, date of manuscript submission / acceptance / publication, citation details and licence. As a result, we “cleaned up” the upper part of the page, so that it can better highlight the authors and their affiliations. 

Bottom line: The new layout provides a better structure to the opening page to let readers find key article metadata at a glance. 

Expand as much – or as little – as comfortable

As you might know, journals published by Pensoft have been coming in different formats and sizes. Now, we have introduced the standard A4 page size, where the text is laid in a single column that has been slightly indented to the right, as seen above. Whenever a figure or a table is used in a manuscript, however, it is expanded onto the whole width of the page.

Before giving our reasons why, let’s see what were the specific problems that we address.

Case study 1

Some of our signature journals, including ZooKeys, PhytoKeys and MycoKeys, have become quite recognisable with their smaller-than-average B5 format, widely appreciated by people who would often be seen carrying around a copy during a conference or an international flight.

However, in recent times, authors began to embrace good practices in research like open sharing of data and code, which resulted in larger and more complex tables. Similarly, their pocket-sized cameras would capture much higher-resolution photos capable of revealing otherwise minute morphological characters. Smaller page size would also mean that often there would be pages between an in-text reference of a figure or a table and the visual itself.

So, here we faced an obvious question: shall we deprive their readers from all those detailed insights into the published studies?

Case study 2

Meanwhile, other journals, such as Herpetozoa, Zoosystematics and Evolution and Deutsche Entomologische Zeitschrift, had long been operating in A4 size, thereby providing their readers with a full view of the figures in their publications. 

Yet, the A4 format brought up another issue: the lines were too long for the eye comfort of their readers. 

What they did was organise their pages into two-column format. While this sounds like a good and quite obvious decision, the format – best known from print newspapers – is pretty inconvenient when accessed digitally. Since the readers would like to zoom in on the PDF page or simply access the article on mobile, they will need to scroll up and down several times per page. 

In addition, the production of a two-column text is technologically more challenging, which results in extra production time.

Bottom line: The new layout allows journals to not sacrifice image quality for text readability and vice versa. As a bonus, authors enjoy faster publication for their papers.

Simplified font

If you have a closer look at the PDF file, you would notice that print-ready papers have also switched to a more simplistic – yet easier to the eye – font. Again, the update corresponds to today’s digital-native user behaviour, where readers often access PDF files from devices of various resolutions and skim through the text, as opposed to studying its content in detail.

In fact, the change is hardly new, since the same font has long been utilised for the webpages (HTML format) of the publications across all journals.

Bottom line: The slightly rounder and simplified font prompts readability, thereby allowing for faster and increased consumption of content. 

What’s the catch? How about characters and APCs?

While we have been receiving a lot of positive feedback from editors, authors and readers, there has been a concern that the updates would increase the publication charges, wherever these are estimated based on page numbers.

Having calculated the lines and characters in the new layout format, we would like to assure you that there is no increase in the numbers of characters or words between the former and current layout formats. In fact, due to the additional number of lines fitting in an A4 page as opposed to B5, authors might be even up for a deal.

________

* At the time of the writing, the new paper layout has not been rolled out at all journals published by Pensoft. However, most of the editorial boards have already confirmed they would like to incorporate the update.

________

For news from & about Pensoft and our journal portfolio, follow us on Twitter, Facebook and Linkedin.

Call for Expression of Interest for biodiversity data-related scientific projects from BiCIKL

The purpose of this call is to solicit, select and implement four to six biodiversity data-related scientific projects that will make use of the added value services developed by the leading Research Infrastructures that make the BiCIKL project.

The BiCIKL project invites submissions of Expression of Interest (EoI) to the First BiCIKL Open Call for projects. The purpose of this call is to solicit, select and implement four to six biodiversity data-related scientific projects that will make use of the added value services developed by the leading Research Infrastructures that make the BiCIKL project.

By opening this call, BiCIKL aims to better understand how it could support scientific questions that arise from across the biodiversity world in the future, while addressing specific scientific or technical biodiversity data challenges presented by the applicants.

We need and want to assess real-world problems and make the best possible use of our data and technical capabilities. This will greatly assist in defining the long-term development goals of the participating Research Infrastructures and improve the way they can technically and operationally work together to deliver greater scientific value.

explain the project partners.

The BiCIKL project – a Horizon 2020-funded project involving 14 European institutions, representing major global players in biodiversity research and natural history, and coordinated by Pensoft – establishes a European starting community of key research infrastructures, researchers, citizen scientists and other biodiversity and life sciences stakeholders based on open science practices through access to data, tools and services.

Find more about the Call and submit your Expression of Interest

***

Follow BiCIKL on Twitter and Facebook.

Join the conversation on Twitter via #BiCIKL_H2020.

Digitising the Natural History Museum London’s entire collection could contribute over £2 billion to the global economy

In a world first, the Natural History Museum, London, has collaborated with economic consultants, Frontier Economics Ltd, to explore the economic and societal value of digitising natural history collections and concluded that digitisation has the potential to see a seven to tenfold return on investment. Whilst significant progress is already being made at the Museum, additional investment is needed in order to unlock the full potential of the Museum’s vast collections – more than 80 million objects. The project’s report is published in the open science scientific journal Research Ideas and Outcomes (RIO Journal).

One of the Museum’s digitisers imaging a butterfly to join the 4.93 million specimens already available online. 
© The Trustees of the Natural History Museum, London

The societal benefits of digitising natural history collections extends to global advancements in food security, biodiversity conservation, medicine discovery, minerals exploration, and beyond. Brand new, rigorous economic report predicts investing in digitising natural history museum collections could also result in a tenfold return. The Natural History Museum, London, has so far made over 4.9 million digitised specimens available freely online – over 28 billion records have been downloaded over 429,000 download events over the past six years. 

Digitisation at the Natural History Museum, London 

Digitisation is the process of creating and sharing the data associated with Museum specimens. To digitise a specimen, all its related information is added to an online database. This typically includes where and when it was collected and who found it, and can include photographs, scans and other molecular data if available. Natural history collections are a unique record of biodiversity dating back hundreds of years, and geodiversity dating back millennia. Creating and sharing data this way enables science that would have otherwise been impossible, and we accelerate the rate at which important discoveries are made from our collections.  

The Natural History Museum’s collection of 80 million items is one of the largest and most historically and geographically diverse in the world. By unlocking the collection online, the Museum provides free and open access for global researchers, scientists, artists and more. Since 2015, the Museum has made 4.9 million specimens available on the Museum’s Data Portal, which have seen more than 28 billion downloads over 427,000 download events. 

This means the Museum has digitised  about 6% of its collections to date. Because digitisation is expensive, costing tens of millions of pounds, it is difficult to make a case for further investment without better understanding the value of this digitisation and its benefits. 

In 2021, the Museum decided to explore the economic impacts of collections data in more depth, and commissioned Frontier Economics to undertake modelling, resulting in this project report, now made publicly available in the open-science journal Research Ideas and Outcomes (RIO Journal), and confirming benefits in excess of £2 billion over 30 years. While the methods in this report are relevant to collections globally, this modelling focuses on benefits to the UK, and is intended to support the Museum’s own digitisation work, as well as a current scoping study funded by the Arts & Humanities Research Council about the case for digitising all UK natural science collections as a research infrastructure.

Sharing data from our collections can transform scientific research and help find solutions for nature and from nature. Our digitised collections have helped establish the baseline plant biodiversity in the Amazon, find wheat crops that are more resilient to climate change and support research into potential zoonotic origins of Covid-19. The research that comes from sharing our specimens has immense potential to transform our world and help both people and the planet thrive,

says Helen Hardy, Science Digital Programme Manager at the Natural History Museum.

How digitisation impacts scientific research?

The data from museum collections accelerates scientific research, which in turn creates benefits for society and the economy across a wide range of sectors. Frontier Economics Ltd have looked at the impact of collections data in five of these sectors: biodiversity conservation, invasive species, medicines discovery, agricultural research and development and mineral exploration. 

The Natural History Museum’s collection is a real treasure trove which, if made easily accessible to scientists all over the world through digitisation, has the potential to unlock ground-breaking research in any number of areas. Predicting exactly how the data will be used in future is clearly very uncertain. We have looked at the potential value that new research could create in just five areas focussing on a relatively narrow set of outcomes. We find that the value at stake is extremely large, running into billions,”

says Dan Popov, Economist at Frontier Economics Ltd.

The new analyses attempt to estimate the economic value of these benefits using a range of approaches, with the results in broad agreement that the benefits of digitisation are at least ten times greater than the costs. This represents a compelling case for investment in museum digital infrastructure without which the many benefits will not be realised.

This new analysis shows that the data locked up in our collections has significant societal and economic value, but we need investment to help us release it,

adds Professor Ken Norris, Head of the Life Sciences Department at the Natural History Museum.

Other benefits could include improvements to the resilience of agricultural crops by better understanding their wild relatives, research into invasive species which can cause significant damage to ecosystems and crops, and improving the accuracy of mining.  

Finally, there are other impacts that such work could have on how science is conducted itself. The very act of digitising specimens means that researchers anywhere on the planet can access these collections, saving time and money that may have been spent as scientists travelled to see specific objects.

The value of research enabled by digitisation of natural history collections can be estimated by looking at specific areas where the Museum’s collections contribute towards scientific research and subsequently impact the wider economy. 
© Frontier Economics Ltd.

Original source: 

Popov D, Roychoudhury P, Hardy H, Livermore L, Norris K (2021) The Value of Digitising Natural History Collections. Research Ideas and Outcomes 7: e78844. https://doi.org/10.3897/rio.7.e78844